vocence_miner_v8

A naturalness-first prompt-driven TTS, built on top of magma90909/vocence_miner_v7. Two things distinguish this checkpoint:

  • British English coverage. Phrasings like "A man with a British English accent", "A Scottish woman, conversational", "a Welsh narrator" land on a real distribution rather than slipping back to neutral US English.
  • Conversational subtlety. Tuned for everyday delivery โ€” "speaking warmly", "softly sad", "with a touch of anger, controlled" โ€” rather than theatrical intensity. The model deliberately steps back when you don't ask for drama.

24 kHz mono WAV output, single forward call, no reference audio, no PEFT runtime. Everything ships in this repo.

Downloads last month
4
Safetensors
Model size
2B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support