dadmobile commited on
Commit
12c2eda
·
verified ·
1 Parent(s): 5babb16

Update callout on GGUF format.

Browse files
Files changed (1) hide show
  1. README.md +8 -1
README.md CHANGED
@@ -11,7 +11,14 @@ tags: [text-to-image, diffusion, flow-matching, quantization, gguf, q8_0, ideogr
11
 
12
  A **GGUF Q8_0** (8.5 bits/weight) quantization of the Ideogram 4 DiT.
13
 
14
- > **Note:** this checkpoint is the **quantized DiT only** (both CFG branches). To run it you also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8) and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4). The quantization recipe and loader are included **in this repo** (`recipe-q8_0.json`, `gguf_loader.py`).
 
 
 
 
 
 
 
15
 
16
  ## Why this one
17
  Q8_0 is **quality-neutral** vs the FP8 reference (Pick 18.71 vs ceiling 18.71) — a clean,
 
11
 
12
  A **GGUF Q8_0** (8.5 bits/weight) quantization of the Ideogram 4 DiT.
13
 
14
+ ⚠️ **Not a llama.cpp / stable-diffusion.cpp file.** Despite the `.gguf` extension, this
15
+ loads **only** via the included PyTorch `gguf_loader.py` + the `ideogram4` pipeline. It is
16
+ **not** compatible with llama.cpp, stable-diffusion.cpp, Ollama, etc.
17
+
18
+ ℹ️ **Quantized DiT only.** This checkpoint is the DiT (both CFG branches). To generate you
19
+ also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8)
20
+ and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4).
21
+ The quantization recipe and loader are included **in this repo** (`recipe-q8_0.json`, `gguf_loader.py`).
22
 
23
  ## Why this one
24
  Q8_0 is **quality-neutral** vs the FP8 reference (Pick 18.71 vs ceiling 18.71) — a clean,