---
license: other
license_name: krea-2-community-license
license_link: https://huggingface.co/krea/Krea-2-Turbo/blob/main/LICENSE
pipeline_tag: text-to-image
tags:
  - image-generation
  - diffusion
  - flow-matching
  - dit
  - krea2
  - quantization
  - comfyui
library_name: gguf
base_model:
  - krea/Krea-2-Raw
  - krea/Krea-2-Turbo
---

# Krea 2 GGUF

Quantized GGUF diffusion transformer weights for [Krea 2](https://huggingface.co/krea/Krea-2-Turbo),
converted from the original BF16 releases for use with ComfyUI GGUF loader nodes.

This repository provides GGUF files for two checkpoints of the Krea 2 model family:

- `krea2_raw_bf16-*.gguf` — converted from [krea/Krea-2-Raw](https://huggingface.co/krea/Krea-2-Raw), the base release checkpoint.
- `krea2_turbo_bf16-*.gguf` — converted from [krea/Krea-2-Turbo](https://huggingface.co/krea/Krea-2-Turbo), the post-trained checkpoint with additional fine-tuning and distillation.

Krea 2 is a 12-billion parameter Diffusion Transformer with a novel architecture featuring layerwise and refiner text-fusion blocks. It is not based on Flux or any prior open-weight architecture.

These files are not a complete standalone Krea 2 package. Your workflow still needs the text encoder and VAE components.

## ComfyUI Support

Use these models with the ComfyUI nodes from [molbal/ComfyUI-GGUF](https://github.com/molbal/ComfyUI-GGUF).
Install that custom node repository into your ComfyUI `custom_nodes` folder, then restart ComfyUI.

> **Important:** This repository requires [molbal/ComfyUI-GGUF](https://github.com/molbal/ComfyUI-GGUF),
> which is a fork of [city96/ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) with added support
> for the Krea 2 architecture. The original city96 plugin does **not** support these files. (as of 2026-06-24)

Place the downloaded `.gguf` files in one of ComfyUI's diffusion model folders:

```
ComfyUI/models/diffusion_models/
ComfyUI/models/unet/
```

Load the file with `Unet Loader (GGUF)` in a Krea 2 workflow. Krea 2 uses a single transformer
(unlike Ideogram 4, there is no separate unconditional transformer component).

## Files

| Quant | Raw (base) | Turbo | Size |
|-------|-----------|-------------------|------|
| Q4_0 | krea2_raw_bf16-Q4_0.gguf | krea2_turbo_bf16-Q4_0.gguf | 7.74 GB |
| Q4_1 | krea2_raw_bf16-Q4_1.gguf | krea2_turbo_bf16-Q4_1.gguf | 8.47 GB |
| Q5_0 | krea2_raw_bf16-Q5_0.gguf | krea2_turbo_bf16-Q5_0.gguf | 9.20 GB |
| Q5_1 | krea2_raw_bf16-Q5_1.gguf | krea2_turbo_bf16-Q5_1.gguf | 9.93 GB |
| Q8_0 | krea2_raw_bf16-Q8_0.gguf | krea2_turbo_bf16-Q8_0.gguf | 13.56 GB |

Choose either the Raw or Turbo variant depending on your workflow; they are not paired with each other.

## Which Checkpoint to Use

| Checkpoint | Steps | CFG | Notes |
|------------|-------|-----|-------|
| **Turbo** | 4–8 | 0.0 | Distilled; CFG-free. Fast, good for most use cases. |
| **Raw** | 20–30 | 3.0–7.0 | Full CFG; more controllable, higher inference cost. |

The Turbo checkpoint has been post-trained with distillation and runs well at 8 steps with `CFG=1`. The Raw checkpoint behaves like a standard flow-matching DiT and benefits from more steps and positive CFG.

## When GGUFs Are Worth the Tradeoff

The BF16 source weights for Krea 2 are 26.6 GB each — far beyond what most consumer GPUs can hold entirely in VRAM. GGUFs make sense when:

- **Limited VRAM:** Q4_0 at 7.74 GB fits entirely in an 8 GB GPU; Q5_1 at 9.93 GB targets 10–12 GB cards. Running BF16 on these GPUs would require heavy CPU offloading and become impractically slow.
- **CPU offload workflows:** If you are already offloading model layers to RAM, GGUF reduces the RAM footprint proportionally alongside VRAM, which is often the actual bottleneck.
- **Acceptable quality loss at Q5+:** At Q5_0 and above the visual output of Krea 2 is very close to BF16. Q4 levels show mild softening on fine detail but remain usable for most creative tasks.

GGUFs are generally **not** worth it if you have a 24 GB+ GPU and want maximum fidelity — load the FP8 or BF16 source directly in that case.

## Download

Download the file you want from the Files tab, or use the Hugging Face CLI. For example:

```
huggingface-cli download molbal/krea2-gguf krea2_turbo_bf16-Q5_1.gguf --local-dir ComfyUI/models/diffusion_models
```

## Compatibility Notes

These are non-K GGUF quantizations intended for PyTorch dequantization in ComfyUI. K-quants are not included because this ComfyUI loading path does not use fused quantized linear kernels.

Krea 2 GGUF support requires ComfyUI to have the `krea2` architecture registered in its model detection system. If your ComfyUI installation does not recognise the checkpoint, update ComfyUI core and [molbal/ComfyUI-GGUF](https://github.com/molbal/ComfyUI-GGUF) to their latest versions.

## License

These files are derived from [krea/Krea-2-Raw](https://huggingface.co/krea/Krea-2-Raw) and [krea/Krea-2-Turbo](https://huggingface.co/krea/Krea-2-Turbo) and follow the [Krea 2 Community License](https://huggingface.co/krea/Krea-2-Turbo/blob/main/LICENSE).