transformerlab/ideogram-4-gguf-q8_0

dadmobile committed 5 days ago

Commit 12c2eda · verified · 1 Parent(s): 5babb16

Update callout on GGUF format.

Files changed (1)

README.md +8 -1

@@ -11,7 +11,14 @@ tags: [text-to-image, diffusion, flow-matching, quantization, gguf, q8_0, ideogr
 A **GGUF Q8_0** (8.5 bits/weight) quantization of the Ideogram 4 DiT.
-> **Note:** this checkpoint is the **quantized DiT only** (both CFG branches). To run it you also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8) and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4). The quantization recipe and loader are included **in this repo** (`recipe-q8_0.json`, `gguf_loader.py`).
+⚠️ **Not a llama.cpp / stable-diffusion.cpp file.** Despite the `.gguf` extension, this
+loads **only** via the included PyTorch `gguf_loader.py` + the `ideogram4` pipeline. It is
+**not** compatible with llama.cpp, stable-diffusion.cpp, Ollama, etc.
+ℹ️ **Quantized DiT only.** This checkpoint is the DiT (both CFG branches). To generate you
+also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8)
+and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4).
+The quantization recipe and loader are included **in this repo** (`recipe-q8_0.json`, `gguf_loader.py`).
 ## Why this one
 Q8_0 is **quality-neutral** vs the FP8 reference (Pick 18.71 vs ceiling 18.71) — a clean,
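
Reviewer note on the "8.5 bits/weight" figure in the context line above: a minimal sketch of where that number comes from, assuming this file uses the standard ggml Q8_0 block layout (32 int8 weights sharing one fp16 scale). The function name `bits_per_weight` and its parameters are hypothetical and are not part of `gguf_loader.py` or this repo.

```python
def bits_per_weight(block_size: int = 32,
                    weight_bits: int = 8,
                    scale_bits: int = 16) -> float:
    """Effective storage cost per weight for a block-quantized format.

    Defaults assume the standard ggml Q8_0 layout: each block stores
    32 int8 weights plus one fp16 scale (34 bytes per block in total).
    """
    total_bits = block_size * weight_bits + scale_bits
    return total_bits / block_size


if __name__ == "__main__":
    # (32 * 8 + 16) / 32
    print(bits_per_weight())
```

Under that assumption the per-block scale adds 0.5 bits on top of the 8-bit weights, which is consistent with the 8.5 bits/weight stated in the README.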