Tuo Nome commited on
Commit
8075ff4
Β·
1 Parent(s): c63c513

Add Important Notes section to help users understand results

Browse files
Files changed (1) hide show
  1. app.py +21 -0
app.py CHANGED
@@ -298,6 +298,27 @@ with gr.Blocks(
298
  By using entropy as a complexity signal, Adaptive-K allocates compute where it's needed.
299
  """)
300
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
301
  gr.Examples(
302
  examples=EXAMPLES,
303
  inputs=[text_input, threshold_low, threshold_high],
 
298
  By using entropy as a complexity signal, Adaptive-K allocates compute where it's needed.
299
  """)
300
 
301
+ with gr.Accordion("πŸ“ Important Notes", open=True):
302
+ gr.Markdown("""
303
+ ### Understanding the Results
304
+
305
+ **πŸ’‘ About Compute Savings:**
306
+ - Savings are calculated vs **fixed K=4** (the baseline)
307
+ - When K=4 is selected β†’ **0% savings** (this is expected! Complex queries need all experts)
308
+ - When K=1 is selected β†’ **75% savings** (simple queries save the most)
309
+ - Try "What is 2+2?" to see maximum savings!
310
+
311
+ **🎯 About Expert Selection:**
312
+ - This demo uses **keyword heuristics** to simulate router behavior
313
+ - In production with real MoE models (Mixtral, DeepSeek), the neural router makes more accurate selections
314
+ - The concept remains the same: entropy guides K selection
315
+
316
+ **πŸ§ͺ Try These Examples:**
317
+ - Simple: "What is 2+2?" β†’ Low entropy β†’ K=1 β†’ **75% savings**
318
+ - Medium: "Translate hello to Italian" β†’ Medium entropy β†’ K=2 β†’ **50% savings**
319
+ - Complex: Multi-domain questions β†’ High entropy β†’ K=4 β†’ **0% savings** (but accuracy preserved!)
320
+ """)
321
+
322
  gr.Examples(
323
  examples=EXAMPLES,
324
  inputs=[text_input, threshold_low, threshold_high],