Upload README.md with huggingface_hub
README.md
CHANGED
@@ -11,7 +11,7 @@ tags: [text-to-image, diffusion, flow-matching, quantization, gguf, q4_k, ideogr
 
 A **GGUF Q4_K** (4.5 bits/weight) quantization of the Ideogram 4 DiT, for consumer GPUs.
 
-> **Note:** this checkpoint is the **quantized DiT only** (both CFG branches). To run it you also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8) and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4).
+> **Note:** this checkpoint is the **quantized DiT only** (both CFG branches). To run it you also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8) and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4). The quantization recipe and loader are included **in this repo** (`recipe-q4_k.json`, `gguf_loader.py`).
 
 ## Why this one
 Q4_K is the **Pareto winner** on the quality-vs-memory frontier: at **10.4 GB** (the same
@@ -47,7 +47,9 @@ Files here:
 - `download_deps.py`, `usage.py` — setup + a minimal generation example.
 - `recipe-q4_k.json` — the exact quantization recipe / tensor layout.
 
-> `gguf_loader.py` is a **reference**: the dequant math is validated bit-exact
+> `gguf_loader.py` is a **reference**: the dequant math is validated bit-exact, but the
+> standalone loader hasn't been GPU-tested end-to-end yet — verify before production use.
+> This is **not** a llama.cpp / stable-diffusion.cpp file; it loads only via this PyTorch
 > path + the `ideogram4` pipeline.
 
 ## License
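The "4.5 bits/weight" figure quoted in the README can be reproduced from the usual ggml Q4_K super-block layout. The sketch below assumes this checkpoint uses that standard layout unchanged (256 weights per super-block, two fp16 header values, 6-bit sub-block scales and mins); the authoritative tensor layout is whatever `recipe-q4_k.json` specifies. The names `Q4KBlockLayout` and `q4k_tensor_bytes` are illustrative only and do not come from `gguf_loader.py`.

```python
from dataclasses import dataclass

# Weights per Q4_K super-block in the standard ggml layout (assumed here).
QK_K = 256


@dataclass(frozen=True)
class Q4KBlockLayout:
    """Byte budget of one Q4_K super-block (hypothetical helper, not from the repo)."""

    d_bytes: int = 2            # fp16 super-block scale
    dmin_bytes: int = 2         # fp16 super-block minimum
    scales_bytes: int = 12      # 8 sub-block scales + 8 mins at 6 bits each = 96 bits
    qs_bytes: int = QK_K // 2   # 256 four-bit quants, packed two per byte

    @property
    def block_bytes(self) -> int:
        return self.d_bytes + self.dmin_bytes + self.scales_bytes + self.qs_bytes

    @property
    def bits_per_weight(self) -> float:
        return 8 * self.block_bytes / QK_K


def q4k_tensor_bytes(n_weights: int, layout: Q4KBlockLayout = Q4KBlockLayout()) -> int:
    """Storage for a tensor whose weight count is a multiple of the super-block size."""
    if n_weights % QK_K != 0:
        raise ValueError(f"n_weights must be a multiple of {QK_K}")
    return (n_weights // QK_K) * layout.block_bytes


if __name__ == "__main__":
    layout = Q4KBlockLayout()
    print(layout.block_bytes, layout.bits_per_weight)
    print(q4k_tensor_bytes(4096 * 4096))
```

Under these assumptions one super-block occupies 144 bytes, which is where 4.5 bits/weight comes from. The on-disk total of a real checkpoint also depends on any tensors the recipe keeps at higher precision, so this arithmetic alone should not be expected to reproduce the 10.4 GB figure.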