Update callout on GGUF format.
Browse files
README.md
CHANGED
|
@@ -11,7 +11,14 @@ tags: [text-to-image, diffusion, flow-matching, quantization, gguf, q8_0, ideogr
|
|
| 11 |
|
| 12 |
A **GGUF Q8_0** (8.5 bits/weight) quantization of the Ideogram 4 DiT.
|
| 13 |
|
| 14 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
|
| 16 |
## Why this one
|
| 17 |
Q8_0 is **quality-neutral** vs the FP8 reference (Pick 18.71 vs ceiling 18.71) — a clean,
|
|
|
|
| 11 |
|
| 12 |
A **GGUF Q8_0** (8.5 bits/weight) quantization of the Ideogram 4 DiT.
|
| 13 |
|
| 14 |
+
⚠️ **Not a llama.cpp / stable-diffusion.cpp file.** Despite the `.gguf` extension, this
|
| 15 |
+
loads **only** via the included PyTorch `gguf_loader.py` + the `ideogram4` pipeline. It is
|
| 16 |
+
**not** compatible with llama.cpp, stable-diffusion.cpp, Ollama, etc.
|
| 17 |
+
|
| 18 |
+
ℹ️ **Quantized DiT only.** This checkpoint is the DiT (both CFG branches). To generate you
|
| 19 |
+
also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8)
|
| 20 |
+
and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4).
|
| 21 |
+
The quantization recipe and loader are included **in this repo** (`recipe-q8_0.json`, `gguf_loader.py`).
|
| 22 |
|
| 23 |
## Why this one
|
| 24 |
Q8_0 is **quality-neutral** vs the FP8 reference (Pick 18.71 vs ceiling 18.71) — a clean,
|