Spaces:
Sleeping
Sleeping
Tuo Nome commited on
Commit Β·
8075ff4
1
Parent(s): c63c513
Add Important Notes section to help users understand results
Browse files
app.py
CHANGED
|
@@ -298,6 +298,27 @@ with gr.Blocks(
|
|
| 298 |
By using entropy as a complexity signal, Adaptive-K allocates compute where it's needed.
|
| 299 |
""")
|
| 300 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 301 |
gr.Examples(
|
| 302 |
examples=EXAMPLES,
|
| 303 |
inputs=[text_input, threshold_low, threshold_high],
|
|
|
|
| 298 |
By using entropy as a complexity signal, Adaptive-K allocates compute where it's needed.
|
| 299 |
""")
|
| 300 |
|
| 301 |
+
with gr.Accordion("π Important Notes", open=True):
|
| 302 |
+
gr.Markdown("""
|
| 303 |
+
### Understanding the Results
|
| 304 |
+
|
| 305 |
+
**π‘ About Compute Savings:**
|
| 306 |
+
- Savings are calculated vs **fixed K=4** (the baseline)
|
| 307 |
+
- When K=4 is selected β **0% savings** (this is expected! Complex queries need all experts)
|
| 308 |
+
- When K=1 is selected β **75% savings** (simple queries save the most)
|
| 309 |
+
- Try "What is 2+2?" to see maximum savings!
|
| 310 |
+
|
| 311 |
+
**π― About Expert Selection:**
|
| 312 |
+
- This demo uses **keyword heuristics** to simulate router behavior
|
| 313 |
+
- In production with real MoE models (Mixtral, DeepSeek), the neural router makes more accurate selections
|
| 314 |
+
- The concept remains the same: entropy guides K selection
|
| 315 |
+
|
| 316 |
+
**π§ͺ Try These Examples:**
|
| 317 |
+
- Simple: "What is 2+2?" β Low entropy β K=1 β **75% savings**
|
| 318 |
+
- Medium: "Translate hello to Italian" β Medium entropy β K=2 β **50% savings**
|
| 319 |
+
- Complex: Multi-domain questions β High entropy β K=4 β **0% savings** (but accuracy preserved!)
|
| 320 |
+
""")
|
| 321 |
+
|
| 322 |
gr.Examples(
|
| 323 |
examples=EXAMPLES,
|
| 324 |
inputs=[text_input, threshold_low, threshold_high],
|