---
title: Zeta-Chroma Test Bench
emoji: 🎨
colorFrom: purple
colorTo: indigo
sdk: gradio
sdk_version: 5.33.0
app_file: app.py
pinned: false
hardware: zero-a10g
tags:
- ml-intern
---

# 🎨 Zeta-Chroma Test Bench

Generates images from [lodestones/Zeta-Chroma](https://huggingface.co/lodestones/Zeta-Chroma), a **pixel-space** diffusion model based on the Z-Image architecture.

This Space runs a **custom inference pipeline** re-implemented from the [ComfyUI source code](https://github.com/Comfy-Org/ComfyUI/blob/main/comfy/ldm/lumina/model.py), since this model variant is not yet supported by the `diffusers` library.

## How it works

- **Model:** `NextDiTPixelSpace`, a 6.5B-parameter pixel-space diffusion transformer with a `dec_net` decoder
- **Text encoder:** Qwen3 4B (layer -2, chat template) from [Tongyi-MAI/Z-Image](https://huggingface.co/Tongyi-MAI/Z-Image)
- **Scheduler:** Custom Euler sampler with AuraFlow shift=8.0 and beta sigmas (α=0.6, β=0.6)
- **No VAE:** the model generates RGB pixels directly (patch_size=32)
- **Free:** runs on ZeroGPU and uses the visitor's own daily GPU quota
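
The scheduler bullet above can be illustrated with a small, dependency-free sketch. It assumes the shift is the usual flow-matching time shift `shift * s / (1 + (shift - 1) * s)` and that the model predicts a velocity pointing from image to noise. The function names are hypothetical, and the sigmas here are uniformly spaced rather than beta-distributed, so this is not the Space's actual sampler.

```python
from typing import Callable, List

Vector = List[float]


def shift_sigma(sigma: float, shift: float = 8.0) -> float:
    """Assumed time shift: for shift > 1, sigmas are pushed toward the noisy end."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def make_sigmas(steps: int, shift: float = 8.0) -> List[float]:
    """Uniform sigmas from 1 (pure noise) down to 0 (clean image), then shifted.

    Simplification: the Space uses beta-distributed spacing, not reproduced here.
    """
    return [shift_sigma(1.0 - i / steps, shift) for i in range(steps + 1)]


def euler_sample(
    velocity_fn: Callable[[Vector, float], Vector],
    x: Vector,
    sigmas: List[float],
) -> Vector:
    """Plain Euler integration: x <- x + (sigma_next - sigma_cur) * v(x, sigma_cur)."""
    for s_cur, s_next in zip(sigmas[:-1], sigmas[1:]):
        v = velocity_fn(x, s_cur)
        x = [xi + (s_next - s_cur) * vi for xi, vi in zip(x, v)]
    return x


if __name__ == "__main__":
    # Toy "model": a constant velocity from the target image toward the noise.
    noise, target = [1.0, -2.0], [0.25, 0.5]
    velocity = [n - t for n, t in zip(noise, target)]
    sigmas = make_sigmas(steps=4)
    print(sigmas)
    print(euler_sample(lambda x, s: velocity, noise, sigmas))
```

With a constant velocity the Euler steps land exactly on the target, which makes the toy case easy to check. A real model's velocity depends on both `x` and the current sigma.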

## Features

- **🧪 Test Suite:** 4 challenging furry art prompts (hyper-realism → toony) with fixed seeds for cross-version comparison
- **🎨 Custom Prompt:** full control over prompt, negative prompt, resolution, steps, CFG, shift, and seed
- **🔖 Checkpoint tracking:** displays the model's commit SHA so you know which training checkpoint is active
- **📡 Webhook:** auto-triggers on model updates (configure at [webhook settings](https://huggingface.co/settings/webhooks))
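
As a rough illustration of the checkpoint-tracking feature, the sketch below reads the model repository's latest commit SHA with `huggingface_hub` and compares it to the one currently loaded. The helper names (`short_sha`, `fetch_model_sha`, `has_new_checkpoint`) are hypothetical, and this is an assumption about how such tracking could work, not the code in `app.py`.

```python
def short_sha(sha: str, length: int = 7) -> str:
    """Abbreviate a full commit SHA for display (7 characters by default)."""
    return sha[:length]


def fetch_model_sha(repo_id: str = "lodestones/Zeta-Chroma") -> str:
    """Return the latest commit SHA of a model repo on the Hugging Face Hub."""
    # Imported lazily so the pure helpers work without the dependency installed.
    from huggingface_hub import HfApi

    info = HfApi().model_info(repo_id)
    if info.sha is None:
        raise RuntimeError(f"No commit SHA reported for {repo_id}")
    return info.sha


def has_new_checkpoint(loaded_sha: str, latest_sha: str) -> bool:
    """True when the Hub reports a different commit than the one in memory."""
    return loaded_sha != latest_sha


if __name__ == "__main__":
    latest = fetch_model_sha()  # requires network access
    print(f"Active checkpoint: {short_sha(latest)}")
```

A webhook handler could call `has_new_checkpoint` with the SHA from the incoming event and reload weights only when it returns `True`.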

## Resolution presets

All resolutions are multiples of the model's patch_size=32:

| Ratio | Landscape | Portrait |
|-------|-----------|----------|
| 1:1 | 512², 768², 1024² | n/a |
| 4:3 | 1024×768 | 768×1024 |
| 16:9 | 1024×576 | 576×1024 |
| 5:3 | 1280×768 | 768×1280 |
| 5:4 | 1280×1024 | 1024×1280 |
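
To make the patch-size constraint concrete, here is a minimal sketch that snaps an arbitrary dimension to a multiple of 32 and counts the patches a resolution yields. The helper names are hypothetical, and treating each 32×32 patch as one token is an assumption about the model, not something confirmed from the pipeline source.

```python
from typing import Tuple

PATCH_SIZE = 32  # from the model description: patch_size=32


def snap_to_patch(value: int, patch: int = PATCH_SIZE) -> int:
    """Round to the nearest multiple of `patch`, never below one patch."""
    return max(patch, (value + patch // 2) // patch * patch)


def patch_grid(width: int, height: int, patch: int = PATCH_SIZE) -> Tuple[int, int]:
    """Return (columns, rows) of patches; reject sizes that are not multiples."""
    if width % patch or height % patch:
        raise ValueError(f"{width}x{height} is not a multiple of {patch}")
    return width // patch, height // patch


def patch_count(width: int, height: int, patch: int = PATCH_SIZE) -> int:
    """Total number of patches for an image of the given size."""
    cols, rows = patch_grid(width, height, patch)
    return cols * rows


if __name__ == "__main__":
    for w, h in [(512, 512), (1024, 576), (1280, 1024)]:
        print(f"{w}x{h}: grid {patch_grid(w, h)}, {patch_count(w, h)} patches")
    print(snap_to_patch(1000))  # nearest multiple of 32
```

Patch count grows with pixel area, so the 1280×1024 preset processes five times as many patches as 512×512.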

## Privacy

All generations are private to your browser session. No images are stored server-side or shared between users. GPU time is billed to *your* account quota, not the Space owner's.

## ⚠️ WIP Model

This is a work-in-progress training checkpoint. Image quality and prompt adherence will improve as training progresses. The webhook will automatically pick up new checkpoint versions.