Qwythos-9B-Claude-Mythos-5-1M-mxfp4-mlx

MLX quantization of empero-ai/Qwythos-9B-Claude-Mythos-5-1M for Apple Silicon.

Note — text tower only. The source model is a Qwen3.5-VL multimodal model (Qwen3_5ForConditionalGeneration, with a vision encoder). This MLX conversion contains only the text/language tower — the vision encoder weights are not included, so this is a text-only model and does not accept image or video input. The text reasoning the original is benchmarked for (GSM8K, MMLU) is unaffected.

Variant: Block float MX FP4
Disk size: 4557 MB
Quantized by: sahilchachra

Benchmark results

Evaluated on Apple M5 Pro with MLX. Model loaded once; performance and quality measured in a single pass.

Performance

This model FP16 baseline
Decode tok/s (avg, long traces) 60.03 N/A
Peak memory (GB) 5.245 N/A
Disk size (MB) 4557 17969

Quality

Benchmark This model FP16 baseline n
GSM8K (math, accuracy) 92.0% N/A 50
MMLU (knowledge, accuracy) 74.0% N/A 50

Context scaling (decode tok/s)

Context length Decode tok/s
~128 tokens 60.9
~256 tokens 60.6
~512 tokens 60.4
~1024 tokens 60.6

Usage

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("sahilchachra/Qwythos-9B-Claude-Mythos-5-1M-mxfp4-mlx")
response = generate(model, tokenizer, prompt="Your prompt here", max_tokens=256, verbose=True)

All variants in this collection

Model Variant
sahilchachra/Qwythos-9B-Claude-Mythos-5-1M-mxfp4-mlx Block float MX FP4 ← this model
sahilchachra/Qwythos-9B-Claude-Mythos-5-1M-mxfp8-mlx Block float MX FP8
sahilchachra/Qwythos-9B-Claude-Mythos-5-1M-optiq-5bpw-mlx OptiQ mixed-precision (target 5.0 bpw)

Notes

Original model

See empero-ai/Qwythos-9B-Claude-Mythos-5-1M for full model details and intended use.

Downloads last month
80
Safetensors
Model size
2B params
Tensor type
U8
·
U32
·
BF16
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for sahilchachra/Qwythos-9B-Claude-Mythos-5-1M-mxfp4-mlx

Finetuned
Qwen/Qwen3.5-9B
Quantized
(12)
this model

Collection including sahilchachra/Qwythos-9B-Claude-Mythos-5-1M-mxfp4-mlx