WaveCut commited on
Commit
ae0561e
·
verified ·
1 Parent(s): d937520

Update component card with Comfy test and benchmark grid

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ images/anima_original_uint4_int8_grid_5x3_1024x1024_1to1.jpg filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,23 +1,29 @@
1
  ---
2
  license: other
3
- base_model: circlestone-labs/Anima
 
 
 
4
  tags:
 
 
5
  - sdnq
6
- - int8
7
- - diffusion-single-file
8
- - text-to-image
9
  - cosmos
 
 
10
  ---
11
 
12
- # Anima SDNQ INT8 Diffusers Transformer
13
 
14
- SDNQ INT8 quantization of the `circlestone-labs/Anima` Preview3 diffusion transformer checkpoint (`split_files/diffusion_models/anima-preview3-base.safetensors`).
15
 
16
- Quantization was performed with `sdnq==0.1.8` using static INT8 weights and quantized matmul enabled. The resulting transformer weights are saved in standard diffusers component format with `quantization_config.json`.
17
 
18
- Important: the upstream Anima checkpoint contains additional `llm_adapter.*` weights used by the native ComfyUI loader. Current `diffusers.CosmosTransformer3DModel.from_single_file` ignores those adapter weights, so this repo is a diffusers-compatible SDNQ transformer artifact, not a full ComfyUI-native split checkpoint replacement.
19
 
20
- ## Load Test
21
 
22
  ```python
23
  import torch
@@ -25,9 +31,74 @@ import sdnq
25
  from diffusers import CosmosTransformer3DModel
26
 
27
  transformer = CosmosTransformer3DModel.from_pretrained(
28
- "WaveCut/Anima-SDNQ-int8",
29
  torch_dtype=torch.bfloat16,
30
- )
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
31
  ```
32
 
33
- Original model: https://huggingface.co/circlestone-labs/Anima
 
 
 
 
 
 
 
1
  ---
2
  license: other
3
+ license_name: circlestone-labs-non-commercial-license
4
+ base_model:
5
+ - circlestone-labs/Anima
6
+ library_name: diffusers
7
  tags:
8
+ - diffusers
9
+ - safetensors
10
  - sdnq
11
+ - transformer-component
12
+ - anima
 
13
  - cosmos
14
+ - text-to-image
15
+ - int8
16
  ---
17
 
18
+ # Anima Preview 3 SDNQ INT8 Transformer Component
19
 
20
+ 8-bit int8 dynamic SDNQ quantization of the Anima Preview 3 diffusion transformer component from `circlestone-labs/Anima` (`split_files/diffusion_models/anima-preview3-base.safetensors`). This repo is the fastest measured option in the companion full-pipeline benchmark.
21
 
22
+ This is a **Diffusers transformer component repo**, not a standalone text-to-image pipeline and not a ComfyUI-native single-file checkpoint. It contains `config.json`, SDNQ quantized safetensors shards, and `quantization_config.json` for `diffusers.CosmosTransformer3DModel`.
23
 
24
+ For a runnable full Diffusers pipeline with Anima's text encoder, VAE, custom `pipeline.py`, and `llm_adapter`, use the companion repo: [`WaveCut/Anima-Preview-3-SDNQ-int8-diffusers`](https://huggingface.co/WaveCut/Anima-Preview-3-SDNQ-int8-diffusers).
25
 
26
+ ## Component Load Test
27
 
28
  ```python
29
  import torch
 
31
  from diffusers import CosmosTransformer3DModel
32
 
33
  transformer = CosmosTransformer3DModel.from_pretrained(
34
+ "WaveCut/Anima-Preview-3-SDNQ-int8",
35
  torch_dtype=torch.bfloat16,
36
+ ).to("cuda")
37
+ ```
38
+
39
+ Component-only smoke test on RTX 5090 32GB:
40
+
41
+ | Component | Size | Load time | VRAM after load | Peak VRAM while loading |
42
+ | --- | ---: | ---: | ---: | ---: |
43
+ | Original Anima Preview 3 diffusion model | 3.89 GiB | not measured here | not measured here | not measured here |
44
+ | SDNQ UINT4 component | 1.06 GiB (-72.8%) | 2.20s | 1611 MiB | 1611 MiB |
45
+ | SDNQ INT8 component | 1.85 GiB (-52.6%) | 12.18s | 2437 MiB | 2437 MiB |
46
+
47
+ Raw component load data: [`benchmarks/component_load_tests.json`](benchmarks/component_load_tests.json).
48
+
49
+ ## ComfyUI Test
50
+
51
+ Native original Anima ComfyUI baseline was verified with ComfyUI commit `8505abf52e42f4441d9d53baf4c31a2ec7123400` using:
52
+
53
+ - `UNETLoader`: `anima-preview3-base.safetensors`
54
+ - `CLIPLoader`: `qwen_3_06b_base.safetensors`
55
+ - `VAELoader`: `qwen_image_vae.safetensors`
56
+ - `ModelSamplingAuraFlow`: shift `3.0`
57
+ - `KSampler`: `er_sde`, `simple`, 24 steps, CFG 4.0
58
+ - Resolution: `1024x1024`
59
+
60
+ Original ComfyUI baseline on the same five prompt/seed pairs: mean `6.53s/img`, peak generation VRAM `26519 MiB`. ComfyUI keeps the model mostly lazy/offloaded after loader nodes, so the meaningful memory number is the generation peak. Raw data: [`benchmarks/comfy_original_baseline_1024.json`](benchmarks/comfy_original_baseline_1024.json).
61
+
62
+ Direct ComfyUI loading of this component repo was also tested through `comfyui-sdnq`'s `SDNQSampler` custom path. It is **not directly loadable** there because that node expects a full Diffusers pipeline directory with `model_index.json`; this repo is only the transformer component. ComfyUI core `UNETLoader` also expects a single diffusion model file and Anima detection requires `llm_adapter.*` weights, which are not present in this component repo. Test log: [`benchmarks/comfy_direct_load_tests.json`](benchmarks/comfy_direct_load_tests.json).
63
+
64
+ ## Full-Pipeline Generation Benchmark
65
+
66
+ The generation benchmark below uses the companion full Diffusers checkpoints, where these transformer components are combined with Anima's original Qwen3 text encoder, VAE, and learned LLM adapter. This is the runnable comparison against the original full BF16 Diffusers conversion.
67
+
68
+ ![Anima Original BF16 vs SDNQ UINT4 and INT8 1024x1024 grid](images/anima_original_uint4_int8_grid_5x3_1024x1024_1to1.jpg)
69
+
70
+ The source JPEG is `3572x5576`; every generated cell is exactly `1024x1024` and pasted 1:1 with no resizing. Five prompt/seed pairs are listed in the grid's left column. Raw benchmark JSON: [`benchmarks/full_diffusers_benchmark_results_1024.json`](benchmarks/full_diffusers_benchmark_results_1024.json).
71
+
72
+ Measured on RTX 5090 32GB with `torch 2.8.0+cu128`, `diffusers 0.38.0`, `transformers 5.8.1`, `sdnq 0.1.8`, `torch.bfloat16`, 24 steps, CFG 4.0, and 1024x1024 output. Network download excluded; one warm-up image discarded; VRAM sampled with `nvidia-smi` every 50 ms.
73
+
74
+ | Model | Repo | Size | Load time | Mean generation | Speed vs original | VRAM after load | Peak VRAM while generating |
75
+ | --- | --- | ---: | ---: | ---: | ---: | ---: | ---: |
76
+ | Original BF16 | `CalamitousFelicitousness/Anima-Preview-3-sdnext-diffusers` | 5.3 GiB | 10.04s | 6.37s/img | 1.00x | 6005 MiB | 10759 MiB |
77
+ | SDNQ UINT4 full pipeline | `WaveCut/Anima-Preview-3-SDNQ-uint4-diffusers` | 2.7 GiB (-49.1%) | 11.96s | 6.13s/img | 1.04x (+3.9%) | 3285 MiB (-45.3%) | 8157 MiB (-24.2%) |
78
+ | SDNQ INT8 full pipeline | `WaveCut/Anima-Preview-3-SDNQ-int8-diffusers` | 3.5 GiB (-34.1%) | 22.41s | 4.60s/img | 1.38x (+38.4%) | 4111 MiB (-31.5%) | 8961 MiB (-16.7%) |
79
+
80
+ Quant-to-quant tradeoff in the full-pipeline run: UINT4 is 22.7% smaller than INT8 and uses 826 MiB less VRAM after load plus 804 MiB less peak generation VRAM. INT8 is 1.33x faster than UINT4 on this RTX 5090 setup.
81
+
82
+ ## Prompting
83
+
84
+ Anima was trained on Danbooru-style tags, natural language captions, and mixtures of both. The upstream Anima Preview 3 card recommends about 1MP generation, for example `1024x1024`, `896x1152`, or `1152x896`, with roughly 30-50 steps and CFG 4-5.
85
+
86
+ Recommended positive prefix:
87
+
88
+ ```text
89
+ masterpiece, best quality, score_7, safe,
90
+ ```
91
+
92
+ Recommended negative prompt:
93
+
94
+ ```text
95
+ worst quality, low quality, score_1, score_2, score_3, artist name
96
  ```
97
 
98
+ Use lowercase tags with spaces instead of underscores, except score tags such as `score_7`. For artist tags, prefix the artist with `@`.
99
+
100
+ ## Notes
101
+
102
+ The original Anima split checkpoint is a ComfyUI-native model with a Qwen3 text encoder and a learned LLM adapter. Earlier transformer-only exports that load the checkpoint directly as `CosmosTransformer3DModel` ignore the `llm_adapter.*` weights; this component repo intentionally only stores the quantized transformer. Use the companion full Diffusers checkpoint for generation.
103
+
104
+ License follows the upstream Anima/CircleStone non-commercial license and the NVIDIA Cosmos derivative terms referenced by the upstream model card.
benchmarks/comfy_direct_load_tests.json ADDED
@@ -0,0 +1,101 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "comfyui_commit": "8505abf52e42f4441d9d53baf4c31a2ec7123400",
3
+ "comfyui_sdnq_commit": "c0e01f4b05a14414e518e2e1e70d241c13ffaee9",
4
+ "tests": [
5
+ {
6
+ "repo": "WaveCut/Anima-Preview-3-SDNQ-uint4",
7
+ "node": "comfyui-sdnq SDNQSampler custom path",
8
+ "result": "not directly loadable",
9
+ "error": "ValueError: Invalid model directory: /root/anima-comfy-work/models/Anima-Preview-3-SDNQ-uint4",
10
+ "reason": "The custom node expects a full Diffusers pipeline directory with model_index.json. This repo is a transformer component directory with config.json and quantized shards."
11
+ },
12
+ {
13
+ "repo": "WaveCut/Anima-Preview-3-SDNQ-int8",
14
+ "node": "comfyui-sdnq SDNQSampler custom path",
15
+ "result": "not directly loadable",
16
+ "error": "ValueError: Invalid model directory: /root/anima-comfy-work/models/Anima-Preview-3-SDNQ-int8",
17
+ "reason": "The custom node expects a full Diffusers pipeline directory with model_index.json. This repo is a transformer component directory with config.json and quantized shards."
18
+ }
19
+ ],
20
+ "native_original_comfy_baseline": {
21
+ "key": "original_comfy",
22
+ "title": "Original BF16 ComfyUI",
23
+ "repo": "circlestone-labs/Anima",
24
+ "hardware": "NVIDIA GeForce RTX 5090 32GB",
25
+ "comfyui_commit": "8505abf52e42f4441d9d53baf4c31a2ec7123400",
26
+ "width": 1024,
27
+ "height": 1024,
28
+ "steps": 24,
29
+ "cfg": 4.0,
30
+ "sampler": "er_sde",
31
+ "scheduler": "simple",
32
+ "negative_prompt": "worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, artist name",
33
+ "baseline_vram_mib": 507,
34
+ "load_seconds": 3.8163154299982125,
35
+ "vram_after_load_mib": 507,
36
+ "vram_load_peak_mib": 507,
37
+ "warmup_seconds": 7.7854839060019,
38
+ "vram_generation_peak_mib": 26519,
39
+ "torch_peak_allocated_mib": 25550,
40
+ "runs": [
41
+ {
42
+ "prompt_id": "fern",
43
+ "seed": 424242,
44
+ "seconds": 6.791207475005649,
45
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/fern_original_comfy_seed_424242_1024x1024.png"
46
+ },
47
+ {
48
+ "prompt_id": "city",
49
+ "seed": 424243,
50
+ "seconds": 6.512618430002476,
51
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/city_original_comfy_seed_424243_1024x1024.png"
52
+ },
53
+ {
54
+ "prompt_id": "witch",
55
+ "seed": 424244,
56
+ "seconds": 6.485916037010611,
57
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/witch_original_comfy_seed_424244_1024x1024.png"
58
+ },
59
+ {
60
+ "prompt_id": "mecha",
61
+ "seed": 424245,
62
+ "seconds": 6.408086219991674,
63
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/mecha_original_comfy_seed_424245_1024x1024.png"
64
+ },
65
+ {
66
+ "prompt_id": "garden",
67
+ "seed": 424246,
68
+ "seconds": 6.437578155004303,
69
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/garden_original_comfy_seed_424246_1024x1024.png"
70
+ }
71
+ ],
72
+ "mean_generation_seconds": 6.527081263402943,
73
+ "prompts": [
74
+ {
75
+ "id": "fern",
76
+ "seed": 424242,
77
+ "prompt": "masterpiece, best quality, score_7, safe, 1girl, fern (sousou no frieren), purple hair, purple eyes, black robe, white dress, butterfly on hand, simple background, looking at viewer"
78
+ },
79
+ {
80
+ "id": "city",
81
+ "seed": 424243,
82
+ "prompt": "masterpiece, best quality, score_7, safe, anime screenshot, 1girl, short black hair, red jacket, standing on a rainy neon city street at night, reflections, cinematic lighting"
83
+ },
84
+ {
85
+ "id": "witch",
86
+ "seed": 424244,
87
+ "prompt": "masterpiece, best quality, score_7, safe, 1girl, witch hat, silver hair, blue eyes, starry sky, floating books, glowing magic circle, detailed illustration"
88
+ },
89
+ {
90
+ "id": "mecha",
91
+ "seed": 424245,
92
+ "prompt": "masterpiece, best quality, score_7, safe, 1boy, pilot suit, white mecha in the background, sunset hangar, dramatic rim light, anime key visual"
93
+ },
94
+ {
95
+ "id": "garden",
96
+ "seed": 424246,
97
+ "prompt": "masterpiece, best quality, score_7, safe, 2girls, summer dresses, flower garden, butterflies, warm sunlight, soft watercolor anime style"
98
+ }
99
+ ]
100
+ }
101
+ }
benchmarks/comfy_original_baseline_1024.json ADDED
@@ -0,0 +1,81 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "key": "original_comfy",
3
+ "title": "Original BF16 ComfyUI",
4
+ "repo": "circlestone-labs/Anima",
5
+ "hardware": "NVIDIA GeForce RTX 5090 32GB",
6
+ "comfyui_commit": "8505abf52e42f4441d9d53baf4c31a2ec7123400",
7
+ "width": 1024,
8
+ "height": 1024,
9
+ "steps": 24,
10
+ "cfg": 4.0,
11
+ "sampler": "er_sde",
12
+ "scheduler": "simple",
13
+ "negative_prompt": "worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, artist name",
14
+ "baseline_vram_mib": 507,
15
+ "load_seconds": 3.8163154299982125,
16
+ "vram_after_load_mib": 507,
17
+ "vram_load_peak_mib": 507,
18
+ "warmup_seconds": 7.7854839060019,
19
+ "vram_generation_peak_mib": 26519,
20
+ "torch_peak_allocated_mib": 25550,
21
+ "runs": [
22
+ {
23
+ "prompt_id": "fern",
24
+ "seed": 424242,
25
+ "seconds": 6.791207475005649,
26
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/fern_original_comfy_seed_424242_1024x1024.png"
27
+ },
28
+ {
29
+ "prompt_id": "city",
30
+ "seed": 424243,
31
+ "seconds": 6.512618430002476,
32
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/city_original_comfy_seed_424243_1024x1024.png"
33
+ },
34
+ {
35
+ "prompt_id": "witch",
36
+ "seed": 424244,
37
+ "seconds": 6.485916037010611,
38
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/witch_original_comfy_seed_424244_1024x1024.png"
39
+ },
40
+ {
41
+ "prompt_id": "mecha",
42
+ "seed": 424245,
43
+ "seconds": 6.408086219991674,
44
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/mecha_original_comfy_seed_424245_1024x1024.png"
45
+ },
46
+ {
47
+ "prompt_id": "garden",
48
+ "seed": 424246,
49
+ "seconds": 6.437578155004303,
50
+ "image": "/root/anima-comfy-work/comfy_benchmark/images/garden_original_comfy_seed_424246_1024x1024.png"
51
+ }
52
+ ],
53
+ "mean_generation_seconds": 6.527081263402943,
54
+ "prompts": [
55
+ {
56
+ "id": "fern",
57
+ "seed": 424242,
58
+ "prompt": "masterpiece, best quality, score_7, safe, 1girl, fern (sousou no frieren), purple hair, purple eyes, black robe, white dress, butterfly on hand, simple background, looking at viewer"
59
+ },
60
+ {
61
+ "id": "city",
62
+ "seed": 424243,
63
+ "prompt": "masterpiece, best quality, score_7, safe, anime screenshot, 1girl, short black hair, red jacket, standing on a rainy neon city street at night, reflections, cinematic lighting"
64
+ },
65
+ {
66
+ "id": "witch",
67
+ "seed": 424244,
68
+ "prompt": "masterpiece, best quality, score_7, safe, 1girl, witch hat, silver hair, blue eyes, starry sky, floating books, glowing magic circle, detailed illustration"
69
+ },
70
+ {
71
+ "id": "mecha",
72
+ "seed": 424245,
73
+ "prompt": "masterpiece, best quality, score_7, safe, 1boy, pilot suit, white mecha in the background, sunset hangar, dramatic rim light, anime key visual"
74
+ },
75
+ {
76
+ "id": "garden",
77
+ "seed": 424246,
78
+ "prompt": "masterpiece, best quality, score_7, safe, 2girls, summer dresses, flower garden, butterflies, warm sunlight, soft watercolor anime style"
79
+ }
80
+ ]
81
+ }
benchmarks/component_load_tests.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "key": "uint4",
4
+ "path": "/root/anima-comfy-work/models/Anima-Preview-3-SDNQ-uint4",
5
+ "class": "CosmosTransformer3DModel",
6
+ "baseline_vram_mib": 509,
7
+ "load_seconds": 2.2016136780002853,
8
+ "vram_after_load_mib": 1611,
9
+ "vram_load_peak_mib": 1611
10
+ },
11
+ {
12
+ "key": "int8",
13
+ "path": "/root/anima-comfy-work/models/Anima-Preview-3-SDNQ-int8",
14
+ "class": "CosmosTransformer3DModel",
15
+ "baseline_vram_mib": 509,
16
+ "load_seconds": 12.180104692990426,
17
+ "vram_after_load_mib": 2437,
18
+ "vram_load_peak_mib": 2437
19
+ }
20
+ ]
benchmarks/full_diffusers_benchmark_results_1024.json ADDED
@@ -0,0 +1,216 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "hardware": "NVIDIA GeForce RTX 5090 32GB",
3
+ "software": {
4
+ "torch": "2.8.0+cu128",
5
+ "diffusers": "0.38.0",
6
+ "transformers": "5.8.1",
7
+ "sdnq": "0.1.8"
8
+ },
9
+ "benchmark_note": "Network download excluded. One 1024x1024 warm-up generation per model, then five measured 1024x1024 generations. VRAM sampled with nvidia-smi every 50 ms in an isolated process per model.",
10
+ "width": 1024,
11
+ "height": 1024,
12
+ "steps": 24,
13
+ "guidance_scale": 4.0,
14
+ "negative_prompt": "worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, artist name",
15
+ "prompts": [
16
+ {
17
+ "id": "fern",
18
+ "seed": 424242,
19
+ "prompt": "masterpiece, best quality, score_7, safe, 1girl, fern (sousou no frieren), purple hair, purple eyes, black robe, white dress, butterfly on hand, simple background, looking at viewer"
20
+ },
21
+ {
22
+ "id": "city",
23
+ "seed": 424243,
24
+ "prompt": "masterpiece, best quality, score_7, safe, anime screenshot, 1girl, short black hair, red jacket, standing on a rainy neon city street at night, reflections, cinematic lighting"
25
+ },
26
+ {
27
+ "id": "witch",
28
+ "seed": 424244,
29
+ "prompt": "masterpiece, best quality, score_7, safe, 1girl, witch hat, silver hair, blue eyes, starry sky, floating books, glowing magic circle, detailed illustration"
30
+ },
31
+ {
32
+ "id": "mecha",
33
+ "seed": 424245,
34
+ "prompt": "masterpiece, best quality, score_7, safe, 1boy, pilot suit, white mecha in the background, sunset hangar, dramatic rim light, anime key visual"
35
+ },
36
+ {
37
+ "id": "garden",
38
+ "seed": 424246,
39
+ "prompt": "masterpiece, best quality, score_7, safe, 2girls, summer dresses, flower garden, butterflies, warm sunlight, soft watercolor anime style"
40
+ }
41
+ ],
42
+ "models": [
43
+ {
44
+ "key": "original",
45
+ "title": "Original BF16",
46
+ "path": "/root/anima-transformers-convert/original-full",
47
+ "repo": "CalamitousFelicitousness/Anima-Preview-3-sdnext-diffusers",
48
+ "hardware": "NVIDIA GeForce RTX 5090 32GB",
49
+ "dtype": "torch.bfloat16",
50
+ "width": 1024,
51
+ "height": 1024,
52
+ "steps": 24,
53
+ "guidance_scale": 4.0,
54
+ "negative_prompt": "worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, artist name",
55
+ "baseline_vram_mib": 511,
56
+ "load_seconds": 10.04116036000778,
57
+ "vram_after_load_mib": 6005,
58
+ "vram_load_peak_mib": 6005,
59
+ "vram_generation_peak_mib": 10759,
60
+ "torch_peak_allocated_mib": 9669,
61
+ "runs": [
62
+ {
63
+ "prompt_id": "fern",
64
+ "seed": 424242,
65
+ "seconds": 6.371356149989879,
66
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/fern_original_seed_424242_1024x1024.png"
67
+ },
68
+ {
69
+ "prompt_id": "city",
70
+ "seed": 424243,
71
+ "seconds": 6.3718316220038105,
72
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/city_original_seed_424243_1024x1024.png"
73
+ },
74
+ {
75
+ "prompt_id": "witch",
76
+ "seed": 424244,
77
+ "seconds": 6.374521128003835,
78
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/witch_original_seed_424244_1024x1024.png"
79
+ },
80
+ {
81
+ "prompt_id": "mecha",
82
+ "seed": 424245,
83
+ "seconds": 6.371869497001171,
84
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/mecha_original_seed_424245_1024x1024.png"
85
+ },
86
+ {
87
+ "prompt_id": "garden",
88
+ "seed": 424246,
89
+ "seconds": 6.372184988998924,
90
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/garden_original_seed_424246_1024x1024.png"
91
+ }
92
+ ],
93
+ "mean_generation_seconds": 6.372352677199524,
94
+ "relative_to_original_speedup": 1.0,
95
+ "vram_after_load_delta_vs_original_mib": 0,
96
+ "vram_generation_peak_delta_vs_original_mib": 0
97
+ },
98
+ {
99
+ "key": "uint4",
100
+ "title": "SDNQ UINT4",
101
+ "path": "/root/anima-transformers-convert/full/Anima-SDNQ-uint4-diffusers",
102
+ "repo": "WaveCut/Anima-Preview-3-SDNQ-uint4-diffusers",
103
+ "hardware": "NVIDIA GeForce RTX 5090 32GB",
104
+ "dtype": "torch.bfloat16",
105
+ "width": 1024,
106
+ "height": 1024,
107
+ "steps": 24,
108
+ "guidance_scale": 4.0,
109
+ "negative_prompt": "worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, artist name",
110
+ "baseline_vram_mib": 511,
111
+ "load_seconds": 11.955643722001696,
112
+ "vram_after_load_mib": 3285,
113
+ "vram_load_peak_mib": 3181,
114
+ "vram_generation_peak_mib": 8157,
115
+ "torch_peak_allocated_mib": 6971,
116
+ "runs": [
117
+ {
118
+ "prompt_id": "fern",
119
+ "seed": 424242,
120
+ "seconds": 6.849568051999086,
121
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/fern_uint4_seed_424242_1024x1024.png"
122
+ },
123
+ {
124
+ "prompt_id": "city",
125
+ "seed": 424243,
126
+ "seconds": 5.868479846001719,
127
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/city_uint4_seed_424243_1024x1024.png"
128
+ },
129
+ {
130
+ "prompt_id": "witch",
131
+ "seed": 424244,
132
+ "seconds": 6.189502780995099,
133
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/witch_uint4_seed_424244_1024x1024.png"
134
+ },
135
+ {
136
+ "prompt_id": "mecha",
137
+ "seed": 424245,
138
+ "seconds": 5.836763394996524,
139
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/mecha_uint4_seed_424245_1024x1024.png"
140
+ },
141
+ {
142
+ "prompt_id": "garden",
143
+ "seed": 424246,
144
+ "seconds": 5.911209135010722,
145
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/garden_uint4_seed_424246_1024x1024.png"
146
+ }
147
+ ],
148
+ "mean_generation_seconds": 6.13110464180063,
149
+ "relative_to_original_speedup": 1.0393482169190384,
150
+ "vram_after_load_delta_vs_original_mib": -2720,
151
+ "vram_generation_peak_delta_vs_original_mib": -2602
152
+ },
153
+ {
154
+ "key": "int8",
155
+ "title": "SDNQ INT8",
156
+ "path": "/root/anima-transformers-convert/full/Anima-SDNQ-int8-diffusers",
157
+ "repo": "WaveCut/Anima-Preview-3-SDNQ-int8-diffusers",
158
+ "hardware": "NVIDIA GeForce RTX 5090 32GB",
159
+ "dtype": "torch.bfloat16",
160
+ "width": 1024,
161
+ "height": 1024,
162
+ "steps": 24,
163
+ "guidance_scale": 4.0,
164
+ "negative_prompt": "worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, artist name",
165
+ "baseline_vram_mib": 511,
166
+ "load_seconds": 22.4127801930008,
167
+ "vram_after_load_mib": 4111,
168
+ "vram_load_peak_mib": 4049,
169
+ "vram_generation_peak_mib": 8961,
170
+ "torch_peak_allocated_mib": 7798,
171
+ "runs": [
172
+ {
173
+ "prompt_id": "fern",
174
+ "seed": 424242,
175
+ "seconds": 4.61064092599554,
176
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/fern_int8_seed_424242_1024x1024.png"
177
+ },
178
+ {
179
+ "prompt_id": "city",
180
+ "seed": 424243,
181
+ "seconds": 4.606765301999985,
182
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/city_int8_seed_424243_1024x1024.png"
183
+ },
184
+ {
185
+ "prompt_id": "witch",
186
+ "seed": 424244,
187
+ "seconds": 4.597769348009024,
188
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/witch_int8_seed_424244_1024x1024.png"
189
+ },
190
+ {
191
+ "prompt_id": "mecha",
192
+ "seed": 424245,
193
+ "seconds": 4.587051768990932,
194
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/mecha_int8_seed_424245_1024x1024.png"
195
+ },
196
+ {
197
+ "prompt_id": "garden",
198
+ "seed": 424246,
199
+ "seconds": 4.616055713006062,
200
+ "image": "/root/anima-transformers-convert/benchmark_1024/images/garden_int8_seed_424246_1024x1024.png"
201
+ }
202
+ ],
203
+ "mean_generation_seconds": 4.603656611600309,
204
+ "relative_to_original_speedup": 1.3841937431089992,
205
+ "vram_after_load_delta_vs_original_mib": -1894,
206
+ "vram_generation_peak_delta_vs_original_mib": -1798
207
+ }
208
+ ],
209
+ "grid": "/root/anima-transformers-convert/benchmark_1024/anima_original_uint4_int8_grid_5x3_1024x1024_1to1.jpg",
210
+ "grid_size": {
211
+ "width": 3572,
212
+ "height": 5576,
213
+ "cell_width": 1024,
214
+ "cell_height": 1024
215
+ }
216
+ }
images/anima_original_uint4_int8_grid_5x3_1024x1024_1to1.jpg ADDED

Git LFS Details

  • SHA256: d908c9ac7a81c3decd66b86aaec1eff4405ab774e1d9e8884e3f2ab1de07c909
  • Pointer size: 132 Bytes
  • Size of remote file: 4.02 MB