ideogram-4-gguf / README.md
rectangleworm's picture
upload files
7b353b1
metadata
language:
  - en
  - ru
  - zh
base_model:
  - ideogram-ai/ideogram-4-fp8
  - ideogram-ai/ideogram-4-nf4
tags:
  - gguf
  - text-to-image
  - image-generation
  - comfyui
  - stable-diffusion.cpp
pipeline_tag: text-to-image
license: other
license_name: ideogram-non-commercial-model-agreement
license_link: https://huggingface.co/ideogram-ai/ideogram-4-fp8/blob/main/LICENSE.md

Ideogram4 GGUF quantized files

.
β”œβ”€β”€ diffusion/
β”‚   β”œβ”€β”€ cond/
β”‚   β”‚   β”œβ”€β”€ ideogram4_Q4_0.gguf
β”‚   β”‚   β”œβ”€β”€ ideogram4_Q4_1.gguf
β”‚   β”‚   β”œβ”€β”€ ideogram4-Q4_K.gguf
β”‚   β”‚   β”œβ”€β”€ ideogram4-Q5_0.gguf
β”‚   β”‚   β”œβ”€β”€ ideogram4_Q5_1.gguf
β”‚   β”‚   β”œβ”€β”€ ideogram4_Q5_K.gguf
β”‚   β”‚   β”œβ”€β”€ ideogram4-Q6_K.gguf
β”‚   β”‚   └── ideogram4-Q8_0.gguf
β”‚   └── uncond/
β”‚       β”œβ”€β”€ ideogram4_unconditional_Q4_0.gguf
β”‚       β”œβ”€β”€ ideogram4_unconditional_Q4_1.gguf
β”‚       β”œβ”€β”€ ideogram4_unconditional_Q4_K.gguf
β”‚       β”œβ”€β”€ ideogram4_unconditional_Q5_0.gguf
β”‚       β”œβ”€β”€ ideogram4_unconditional_Q5_1.gguf
β”‚       β”œβ”€β”€ ideogram4_unconditional_Q5_K.gguf
β”‚       β”œβ”€β”€ ideogram4_unconditional_Q6_K.gguf
β”‚       └── ideogram4_unconditional-Q8_0.gguf
β”œβ”€β”€ text_encoder/
β”‚   β”œβ”€β”€ Qwen3-VL-8B-Q4_0.gguf
β”‚   β”œβ”€β”€ Qwen3-VL-8B-Q4_1.gguf
β”‚   β”œβ”€β”€ Qwen3-VL-8B-Q4_K_S.gguf
β”‚   β”œβ”€β”€ Qwen3-VL-8B-Q4_K_M.gguf
β”‚   β”œβ”€β”€ Qwen3-VL-8B-Q5_K_S.gguf
β”‚   β”œβ”€β”€ Qwen3-VL-8B-Q5_K_M.gguf
β”‚   β”œβ”€β”€ Qwen3-VL-8B-Q6_K.gguf
β”‚   └── Qwen3-VL-8B-Q8_0.gguf
└── vae/
β”‚   β”œβ”€β”€ flux2-vae.safetensors
β”‚   └── flux2-hdr-vae.safetensors
└── lora/
    β”œβ”€β”€ realism_engine_v3.safetensors
    β”œβ”€β”€ big_boobs.safetensors
    β”œβ”€β”€ cum.safetensors
    β”œβ”€β”€ innie_vulva_x.safetensors
    β”œβ”€β”€ vintage_beauties_womans.safetensors
    β”œβ”€β”€ missionary_sex.safetensors
    β”œβ”€β”€ 80s_anime.safetensors
    β”œβ”€β”€ penis.safetensors
    └── penix.safetensors

Model Selection & Quantization Guide

To balance generation quality, memory usage, and inference speed, we recommend the following quantization choices for each component:

1. Conditional Diffusion Model (diffusion/cond/)

  • Recommended: Q6_K or Q8_0
  • Since this model handles the main conditional generation pass, keeping a higher quantization level is key to preserving detail and prompt adherence.

2. Unconditional Diffusion Model (diffusion/uncond/)

  • Recommended: Q4_K or Q5_K
  • Note: Using Q6_K or Q8_0 for the unconditional model is generally unnecessary (overkill) and may slow down generation without providing a noticeable improvement in quality.

3. Text Encoder (text_encoder/)

  • Recommended: Q5_K_M or Q4_K_M
  • These medium-sized "K-measure" quants offer a good trade-off, retaining the text encoder's comprehension capabilities while fitting within reasonable memory limits.

General Recommendations for Quantization Types

If you are optimizing for inference speed or trying to fit a specific model entirely into VRAM/RAM, keep these rules of thumb in mind:

  • Prefer _K variants over _0 and _1: When choosing between Q4 or Q5 options, always prefer the _K variants (e.g., Q4_K_M, Q5_K_M, or standard _K).
  • Avoid _0 and _1 if possible: The older _0 and _1 quants (like Q4_0 or Q4_1) perform worse in terms of quality loss. While they are marginally smaller, the minor size reduction rarely justifies the drop in generation quality compared to _K equivalents.