EAGLE Speculator

#2
by Mitaka-D - opened

Thank you for this model variant!

If you use this model with the provided EAGLE speculator ( https://huggingface.co/mistralai/Mistral-Medium-3.5-128B-EAGLE), it won't "accept" the drafted tokens.
According to AI ( I'm no expert πŸ˜… ), the problem is in the activation quantization.

Fix should be to skip it:

"input_activations": null   // ← no activation quantization
Owner

Hi,

Thanks for pointing out!
ATM I don’t have hardware to test speculative setup, but if this fix works for you feel free to open PR πŸ™‚
I’ll verify and merge it

Looks like only changing the config.json won't fix this ...

Sign up or log in to comment