A1-Q2_K-imatrix

A static, imatrix-calibrated Q2_K GGUF quantization of InternScience/Agents-A1.

To run it with llama.cpp, pulling the file directly from the Hugging Face Hub:

llama-cli -hf Chungulus/A1-Q2_K-imatrix-GGUF -p "Write a Python sorting function" -n 160

File

File                  Size       SHA-256
A1-Q2_K-imatrix.gguf  12.05 GiB  58d9f6e1de32a960c4dca0dca932241956303687b3a7436f8852a69812864067
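
One way to check a downloaded copy against the SHA-256 listed above is to hash the file in chunks, so the 12 GiB file never has to fit in memory. This is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_expected` and the local path are illustrative assumptions, not part of this repository.

```python
import hashlib
from pathlib import Path

# Digest copied from the File section of this card.
EXPECTED_SHA256 = "58d9f6e1de32a960c4dca0dca932241956303687b3a7436f8852a69812864067"


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA-256 of a file, reading it in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's digest equals the expected digest (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical local path; adjust to wherever the file was downloaded.
    model_path = Path("A1-Q2_K-imatrix.gguf")
    if model_path.exists():
        print("checksum OK" if matches_expected(model_path) else "checksum MISMATCH")
    else:
        print(f"{model_path} not found")
```

A mismatch usually points to an interrupted or corrupted download, in which case fetching the file again is the simplest fix.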

Quality Snapshot

F16 baseline mini accuracy: 89.58%. F16 baseline PPL on KL holdout: 13.0194.

Metric             Value
Mini accuracy      87.50%
Retention vs F16   97.67%
Mean KLD vs F16    0.128242
Same top p         81.75%
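
To make these metrics concrete, the sketch below shows how figures of this kind are commonly computed: retention as the ratio of quantized to baseline accuracy, mean KLD as the average per-position KL divergence of the quantized distribution from the F16 one, and top-token agreement as the share of positions where both models rank the same token first. The function names and the toy distributions are illustrative assumptions. This card does not state the exact evaluation script, and the direction of the KL divergence (baseline relative to quantized) is assumed here.

```python
import math


def retention(quant_accuracy, baseline_accuracy):
    """Quantized accuracy expressed as a percentage of the baseline accuracy."""
    return 100.0 * quant_accuracy / baseline_accuracy


def kl_divergence(p, q):
    """KL(p || q) in nats for two discrete distributions over the same tokens.

    Assumes every q[i] is strictly positive wherever p[i] is positive.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def mean_kld(baseline_dists, quant_dists):
    """Average per-position KL divergence of the quantized model from the baseline."""
    values = [kl_divergence(p, q) for p, q in zip(baseline_dists, quant_dists)]
    return sum(values) / len(values)


def same_top_rate(baseline_dists, quant_dists):
    """Percentage of positions where both models pick the same most likely token."""
    hits = sum(
        max(range(len(p)), key=p.__getitem__) == max(range(len(q)), key=q.__getitem__)
        for p, q in zip(baseline_dists, quant_dists)
    )
    return 100.0 * hits / len(baseline_dists)


if __name__ == "__main__":
    # Accuracy figures taken from this card (already rounded to two decimals).
    print(f"retention: {retention(87.50, 89.58):.2f}%")

    # Toy next-token distributions over a 3-token vocabulary, made up for illustration.
    f16 = [[0.7, 0.2, 0.1], [0.1, 0.6, 0.3]]
    q2k = [[0.6, 0.3, 0.1], [0.2, 0.3, 0.5]]
    print(f"mean KLD: {mean_kld(f16, q2k):.6f}")
    print(f"same top token: {same_top_rate(f16, q2k):.2f}%")
```

Because the accuracies in the card are already rounded, recomputing retention from them lands within about 0.01 percentage points of the reported 97.67%; the small gap is presumably a rounding effect.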

Notes

  • Calibration source: Glint-Research/Fable-5-traces
  • MTP is not included because the downloaded checkpoint did not contain MTP tensors.
  • Static imatrix GGUF, not Unsloth Dynamic 2.0 / UD2.
  • Format: GGUF
  • Model size: 35B params
  • Architecture: qwen35moe