MXFP4 MOE quantized model for MTP support llama.cpp https://github.com/ggml-org/llama.cpp/pull/22673

Downloads last month
31
GGUF
Model size
36B params
Architecture
qwen35moe
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Marker689/Qwen3.6-35B-A3B-GGUF-MTP

Quantized
(4)
this model