ZeyuLing commited on
Commit
3cbf65c
·
verified ·
1 Parent(s): 9f53eba

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: hftrainer
4
+ pipeline_tag: text-to-motion
5
+ tags:
6
+ - motion-generation
7
+ - text-to-motion
8
+ - diffusion
9
+ - humanml3d
10
+ - mdm
11
+ ---
12
+
13
+ # MDM — Human Motion Diffusion Model (hftrainer reproduction)
14
+
15
+ Self-contained, `ref_repo`-independent reproduction of **MDM** (Tevet et al.,
16
+ ICLR 2023), packaged as an [hftrainer](https://github.com/) `ModelBundle`
17
+ artifact. The vendored network + Gaussian diffusion are **bit-identical** to the
18
+ released checkpoint.
19
+
20
+ - **Task:** Text-to-Motion · **Representation:** HumanML3D-263 (20 fps, 22 joints)
21
+ - **Text encoder:** CLIP ViT-B/32 (frozen, reloaded by name — not stored here)
22
+ - **Paper:** [arXiv:2209.14916](https://arxiv.org/abs/2209.14916) · **Code:** https://github.com/GuyTevet/motion-diffusion-model
23
+
24
+ ## Usage
25
+
26
+ ```python
27
+ from hftrainer.models.mdm import MDMBundle
28
+ from hftrainer.pipelines.mdm import MDMPipeline
29
+
30
+ bundle = MDMBundle.from_pretrained("ZeyuLing/hftrainer-mdm-humanml3d")
31
+ pipe = MDMPipeline(bundle, device="cuda")
32
+ motions = pipe.infer_t2m(["a person walks forward then sits down"], [120]) # list of (T, 263)
33
+ ```
34
+
35
+ ## Metrics (official HumanML3D-263 protocol, n=3970)
36
+
37
+ | FID ↓ | Diversity → | R-Prec Top-3 ↑ | MM-Dist ↓ |
38
+ |---|---|---|---|
39
+ | **0.509** (paper 0.544) | **9.563** (paper 9.559) | 0.711 | 3.681 |
40
+
41
+ Files: `model.safetensors` (no CLIP) · `mdm_config.json` · `Mean.npy` / `Std.npy`
42
+ (HumanML3D training stats, embedded so the checkpoint is self-contained).