lordx64 commited on
Commit
4bac31f
·
verified ·
1 Parent(s): b7b0abc

GGUF table: replace approximate sizes with exact upload sizes

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -85,9 +85,9 @@ Quantized GGUF weights for `llama.cpp`, `LM Studio`, and `Ollama` are published
85
 
86
  | Quant | Approx size | Use case | File |
87
  |---|---|---|---|
88
- | **IQ4_XS** | ~19 GB | Smallest — fits on a single 24 GB consumer GPU; LM Studio default | [`Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.IQ4_XS.gguf`](https://huggingface.co/lordx64/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-GGUF/blob/main/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.IQ4_XS.gguf) |
89
- | **Q5_K_M** | ~25 GB | Balanced quality / size, recommended sweet spot | [`Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q5_K_M.gguf`](https://huggingface.co/lordx64/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-GGUF/blob/main/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q5_K_M.gguf) |
90
- | **Q8_0** | ~35 GB | Near-lossless, closest to bf16 quality | [`Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q8_0.gguf`](https://huggingface.co/lordx64/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-GGUF/blob/main/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q8_0.gguf) |
91
 
92
  #### LM Studio
93
 
 
85
 
86
  | Quant | Approx size | Use case | File |
87
  |---|---|---|---|
88
+ | **IQ4_XS** | 18.94 GB | Smallest — fits on a single 24 GB consumer GPU; LM Studio default | [`Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.IQ4_XS.gguf`](https://huggingface.co/lordx64/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-GGUF/blob/main/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.IQ4_XS.gguf) |
89
+ | **Q5_K_M** | 24.73 GB | Balanced quality / size, recommended sweet spot | [`Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q5_K_M.gguf`](https://huggingface.co/lordx64/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-GGUF/blob/main/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q5_K_M.gguf) |
90
+ | **Q8_0** | 36.90 GB | Near-lossless, closest to bf16 quality | [`Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q8_0.gguf`](https://huggingface.co/lordx64/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled-GGUF/blob/main/Qwen3.6-35B-A3B-Kimi-K2.6-Reasoning-Distilled.Q8_0.gguf) |
91
 
92
  #### LM Studio
93