Looking forward to the mxfp4 quantization format.

by markxixihaha - opened Apr 24

Apr 24

It would be even better if there were mxfp4, as this quantization format generates the fastest speed and smallest size on my Mac.

TheCluster

Owner Apr 24

Here - https://huggingface.co/TheCluster/Qwen3.6-35B-A3B-Heretic-MLX-mxfp4.

According to a research by Unsloth, the Qwen 3.5 (and likely 3.6) hybrid models degrade significantly starting with mxfp4, which is why I didn't include that option.

markxixihaha

Apr 24

Thanks for your hard work. I'll keep the points you mentioned in mind.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment