grit_v4 / README.md
matthewliu0302's picture
adapt to new guideline
09a0f63
|
Raw
History Blame Contribute Delete
978 Bytes
metadata
license: cc-by-nc-sa-4.0
base_model: magma90909/vocence_miner_v7
pipeline_tag: text-to-speech
library_name: transformers
language:
  - en
tags:
  - tts
  - prompttts
  - qwen3-tts
  - voice-design
  - vocence
  - british-english
  - uk-accent

vocence_miner_v8

A naturalness-first prompt-driven TTS, built on top of magma90909/vocence_miner_v7. Two things distinguish this checkpoint:

  • British English coverage. Phrasings like "A man with a British English accent", "A Scottish woman, conversational", "a Welsh narrator" land on a real distribution rather than slipping back to neutral US English.
  • Conversational subtlety. Tuned for everyday delivery — "speaking warmly", "softly sad", "with a touch of anger, controlled" — rather than theatrical intensity. The model deliberately steps back when you don't ask for drama.

24 kHz mono WAV output, single forward call, no reference audio, no PEFT runtime. Everything ships in this repo.