siyah1 committed on
Commit aefe899 · verified · 1 Parent(s): b9e2f7d

Add NF4 quantized EchoMimicV3 transformer weights

README.md ADDED
@@ -0,0 +1,55 @@
+ ---
+ language:
+ - en
+ - zh
+ tags:
+ - human-animation
+ - talking-head
+ - video-generation
+ - nf4
+ - quantized
+ - echomimic
+ base_model: BadToBest/EchoMimicV3
+ library_name: diffusers
+ license: apache-2.0
+ ---
+
+ # EchoMimicV3 — NF4 Quantized
+
+ This is an **NF4 (NormalFloat4) quantized** version of [EchoMimicV3](https://huggingface.co/BadToBest/EchoMimicV3).
+
+ ## Quantization Details
+
+ | Parameter | Value |
+ |-----------|-------|
+ | Quantization Type | NF4 (NormalFloat4) |
+ | Double Quantization | Yes |
+ | Compute Dtype | float16 |
+ | Library | bitsandbytes |
+ | VRAM Reduction | ~65% (5 GB → ~1.7 GB) |
+
+ ## Usage
+
+ ```python
+ from transformers import BitsAndBytesConfig
+ import torch
+
+ nf4_config = BitsAndBytesConfig(
+     load_in_4bit=True,
+     bnb_4bit_quant_type="nf4",
+     bnb_4bit_use_double_quant=True,
+     bnb_4bit_compute_dtype=torch.float16
+ )
+ ```
+
+ ## Citation
+
+ ```bibtex
+ @misc{meng2025echomimicv3,
+   title={EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation},
+   author={Rang Meng and Yan Wang and Weipeng Wu and Ruobing Zheng and Yuming Li and Chenguang Ma},
+   year={2025},
+   eprint={2507.03905},
+   archivePrefix={arXiv}
+ }
+ ```
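Reviewer note: the "~65%" in the README's VRAM Reduction row can be checked directly from the two figures given next to it. The sketch below is only that arithmetic; the function name `vram_reduction` is hypothetical, and the 5 GB and 1.7 GB inputs are taken from the table as stated, not measured here.

```python
def vram_reduction(before_gb: float, after_gb: float) -> float:
    """Fraction of memory saved when going from before_gb to after_gb."""
    return 1.0 - after_gb / before_gb


# Figures quoted in the README table (5 GB -> ~1.7 GB); not measured here.
reduction = vram_reduction(5.0, 1.7)
print(f"{reduction:.0%}")
```

With the quoted figures this comes out at roughly two thirds, consistent with the table's approximate value.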
quantization_config.json ADDED
@@ -0,0 +1,10 @@
+ {
+   "quant_type": "nf4",
+   "load_in_4bit": true,
+   "bnb_4bit_quant_type": "nf4",
+   "bnb_4bit_use_double_quant": true,
+   "bnb_4bit_compute_dtype": "float16",
+   "quantized_by": "bitsandbytes",
+   "base_model": "BadToBest/EchoMimicV3",
+   "quantization_date": "2025"
+ }
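Reviewer note: this JSON mixes the four settings used in the README's `BitsAndBytesConfig` call with descriptive metadata (`quant_type`, `quantized_by`, `base_model`, `quantization_date`). A minimal sketch of separating the two follows, assuming the file is read as plain JSON. The helper name `bnb_kwargs` and the key filter are illustrative assumptions, not something this commit defines.

```python
import json

# Contents of quantization_config.json as added in this commit.
CONFIG_TEXT = """
{
  "quant_type": "nf4",
  "load_in_4bit": true,
  "bnb_4bit_quant_type": "nf4",
  "bnb_4bit_use_double_quant": true,
  "bnb_4bit_compute_dtype": "float16",
  "quantized_by": "bitsandbytes",
  "base_model": "BadToBest/EchoMimicV3",
  "quantization_date": "2025"
}
"""


def bnb_kwargs(config_text: str) -> dict:
    """Keep only the keys that match the arguments used in the README snippet."""
    raw = json.loads(config_text)
    return {
        key: value
        for key, value in raw.items()
        if key == "load_in_4bit" or key.startswith("bnb_4bit_")
    }


kwargs = bnb_kwargs(CONFIG_TEXT)
print(kwargs)
```

The resulting dictionary has the same four keys as the README's `nf4_config`. Note that the compute dtype is the string `"float16"` here, so a caller may need to map it to `torch.float16` before constructing the config.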
transformer/diffusion_pytorch_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c3a7762147d228a55cf5963642c54e89f91070ebb9c8c0795c6f2dca0be6a215
+ size 3414540448
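Reviewer note: the three lines above are a Git LFS pointer, not the weights themselves. They record the hash algorithm, the digest, and the byte size of the real file. The sketch below shows one way to check a downloaded copy against the pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path passed to `matches_pointer` would be wherever the file was saved.

```python
import hashlib
from pathlib import Path

# Pointer text as added in this commit.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:c3a7762147d228a55cf5963642c54e89f91070ebb9c8c0795c6f2dca0be6a215
size 3414540448
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line and break the oid into algorithm and digest."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at path has the size and digest the pointer records."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so a multi-gigabyte file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(f"{pointer['size'] / 1e9:.2f} GB")  # prints "3.41 GB"
```

Checking the size first is a cheap way to reject a truncated download before hashing the whole file.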
transformer/quantization_metadata.json ADDED
The diff for this file is too large to render. See raw diff