gafiatulin
/

vibevoice-7b-coreai

Model card Files Files and versions

gafiatulin commited on 8 days ago

Commit

6266a74

·

verified ·

1 Parent(s): b9210b3

Update README.md

Files changed (1) hide show

README.md +5 -3

README.md CHANGED Viewed

@@ -10,13 +10,15 @@ tags:
 - on-device
 - m4-max
 - text-to-speech
 ---
 # VibeVoice 7B — multi-speaker TTS (Core AI)
-Highest-quality multi-speaker TTS, 7B Qwen2 backbone. Bundles its own σ-VAE codec (shared with 1.5b) so the repo is self-contained. cfg 2.0 fused sampler avoids long-form static buildup.
-**On-device performance (M4 Max, Core AI):** 2.37× RTF, WER 0%.
 > ⚠️ **Beta artifacts.** These `.aimodel` bundles are compiled for macOS 27 / Xcode 27 beta (Core AI). They may need re-export on the GA toolchain. The original weights are Microsoft VibeVoice (see upstream for the model license).
@@ -82,4 +84,4 @@ Resolve assets by role via `manifest.json` (`default` = recommended variant):
   "decode_compute": "gpu",
   "sem_compute": "gpu"
 }
-```

 - on-device
 - m4-max
 - text-to-speech
+base_model:
+- vibevoice/VibeVoice-7B
 ---
 # VibeVoice 7B — multi-speaker TTS (Core AI)
+High-quality multi-speaker TTS, 7B Qwen2 backbone. cfg 2.0 fused sampler avoids long-form static buildup.
+**On-device performance (M4 Max, Core AI):** 2.37× RTF.
 > ⚠️ **Beta artifacts.** These `.aimodel` bundles are compiled for macOS 27 / Xcode 27 beta (Core AI). They may need re-export on the GA toolchain. The original weights are Microsoft VibeVoice (see upstream for the model license).
   "decode_compute": "gpu",
   "sem_compute": "gpu"
 }
+```