deep1401 committed
Commit 73f42c6 · verified · 1 Parent(s): a4f9c83

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +4 -2
README.md CHANGED
@@ -11,7 +11,7 @@ tags: [text-to-image, diffusion, flow-matching, quantization, gguf, q4_k, ideogr
 
  A **GGUF Q4_K** (4.5 bits/weight) quantization of the Ideogram 4 DiT, for consumer GPUs.
 
- > **Note:** this checkpoint is the **quantized DiT only** (both CFG branches). To run it you also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8) and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4). Quantization recipe + loader: see `recipe*.json`.
 
  ## Why this one
  Q4_K is the **Pareto winner** on the quality-vs-memory frontier: at **10.4 GB** (the same
@@ -47,7 +47,9 @@ Files here:
  - `download_deps.py`, `usage.py` — setup + a minimal generation example.
  - `recipe-q4_k.json` — the exact quantization recipe / tensor layout.
 
- > `gguf_loader.py` is a **reference**: the dequant math is validated bit-exact; it loads only via this PyTorch
  > path + the `ideogram4` pipeline.
 
  ## License
 
 
  A **GGUF Q4_K** (4.5 bits/weight) quantization of the Ideogram 4 DiT, for consumer GPUs.
 
+ > **Note:** this checkpoint is the **quantized DiT only** (both CFG branches). To run it you also need the **Qwen3-VL text encoder and VAE** from the base repo [`ideogram-ai/ideogram-4-fp8`](https://huggingface.co/ideogram-ai/ideogram-4-fp8) and the custom inference code at [`github.com/ideogram-oss/ideogram4`](https://github.com/ideogram-oss/ideogram4). The quantization recipe and loader are included **in this repo** (`recipe-q4_k.json`, `gguf_loader.py`).
 
  ## Why this one
  Q4_K is the **Pareto winner** on the quality-vs-memory frontier: at **10.4 GB** (the same
 
  - `download_deps.py`, `usage.py` — setup + a minimal generation example.
  - `recipe-q4_k.json` — the exact quantization recipe / tensor layout.
 
+ > `gguf_loader.py` is a **reference**: the dequant math is validated bit-exact, but the
+ > standalone loader hasn't been GPU-tested end-to-end yet — verify before production use.
+ > This is **not** a llama.cpp / stable-diffusion.cpp file; it loads only via this PyTorch
  > path + the `ideogram4` pipeline.
 
  ## License
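
Review note on the "4.5 bits/weight" figure quoted in the README line above: the number is consistent with the standard ggml `block_q4_K` super-block layout. The sketch below only reproduces that arithmetic. It assumes the checkpoint uses the stock layout (256 weights per super-block, two fp16 scalars, 12 bytes of packed 6-bit scales and mins, and 4-bit quants); nothing here is read from `recipe-q4_k.json` or `gguf_loader.py`, and the constant and function names are illustrative.

```python
# Assumption: standard ggml block_q4_K super-block layout, not taken from this repo.
QK_K = 256             # weights per super-block
D_BYTES = 2            # fp16 super-block scale
DMIN_BYTES = 2         # fp16 super-block minimum
SCALES_BYTES = 12      # 8 sub-block scales + 8 sub-block mins, 6 bits each, packed
QS_BYTES = QK_K // 2   # 4-bit quantized weights, two per byte


def q4_k_block_bytes() -> int:
    """Size in bytes of one Q4_K super-block under the assumed layout."""
    return D_BYTES + DMIN_BYTES + SCALES_BYTES + QS_BYTES


def q4_k_bits_per_weight() -> float:
    """Average storage cost per weight, including scale/min overhead."""
    return q4_k_block_bytes() * 8 / QK_K


if __name__ == "__main__":
    print(q4_k_block_bytes(), "bytes per super-block")
    print(q4_k_bits_per_weight(), "bits per weight")
```

The extra 0.5 bit over the nominal 4 bits is the per-block scale and minimum overhead. Any tensors the recipe keeps at higher precision would push the file-level average above this per-block figure.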