tested UD-Q4_K_XL - it is terrible vs DS4F

#9
by alexbi29 - opened

On my math/physics/coding test:

Model Correct Failures Total completion tokens matching correct answers
MiniMax-M3 Q4 BF16KV 93/120 27 TO 42,354
DeepSeek V4 Flash CUTLASS 120/120 0 28,667

either is a bad quant or minimax m3 is hard to quantize.
Note: most of the failures are token limits.

I also have problem with the endless reasoning. I'm using Q4_K_XL.

I think that q4_k_xl is busted.

2026-06-22-002525_hyprshot

Sign up or log in to comment