Text-to-Speech
Transformers
Safetensors
Qwen3-TTS
English
text-generation
tts
prompttts
qwen3-tts
voice-design
vocence
british-english
uk-accent
Instructions to use matthewliu0302/grit_v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use matthewliu0302/grit_v1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="matthewliu0302/grit_v1")# Load model directly from transformers import AutoModelForSeq2SeqLM model = AutoModelForSeq2SeqLM.from_pretrained("matthewliu0302/grit_v1", dtype="auto") - Notebooks
- Google Colab
- Kaggle
File size: 2,921 Bytes
4236e3c 1a78361 4236e3c | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 | ---
license: cc-by-nc-sa-4.0
base_model: magma90909/vocence_miner_v7
pipeline_tag: text-to-speech
library_name: transformers
language:
- en
tags:
- tts
- prompttts
- qwen3-tts
- voice-design
- vocence
- british-english
- uk-accent
---
# vocence_miner_v8
A naturalness-first prompt-driven TTS, built on top of `magma90909/vocence_miner_v8`.
## Generate
```bash
pip install qwen-tts transformers torch soundfile
```
```python
from qwen_tts import Qwen3TTSModel
import soundfile as sf
m = Qwen3TTSModel.from_pretrained("magma90909/vocence_miner_v8")
wavs, sr = m.generate_voice_design(
text="The train to Edinburgh departs from platform four.",
instruct="A man with a British English accent, calm and natural.",
language="english",
)
sf.write("out.wav", wavs[0], sr)
```
`demo.py` walks through three preset prompts.
## How to write `instruct`
The model responds best to **subtle, conversational** language — not intensifiers like *"intensely sad"* or *"nearly shouting"*. Stack these elements freely:
| Layer | Phrasings |
|-------|-----------|
| Accent / region | *British English*, *Scottish*, *Welsh*, *Northern Irish*, *Irish*, *unspecified* |
| Gender | *a man*, *a woman*, *a British woman* |
| Mood | *speaking warmly*, *softly sad*, *quietly pleased*, *with a touch of anger* |
| Persona | *bedtime storyteller, soft and warm*; *news anchor, professional and neutral*; *meditation guide, soft and serene* |
| Pace | *unhurried*, *brisk steady*, *naturally measured* |
Some example prompts that work well:
```
A British man speaks calmly and naturally.
A woman with a Scottish accent, in an everyday speaking tone.
A man, softly sad, calm and unhurried.
A British news anchor, professional and neutral, at a brisk steady pace.
A clear, neutral voice reading the sentence.
```
## Best-fit and not-fit
**Best at:**
* Natural, everyday English — both US and UK
* Bedtime storyteller / news anchor / meditation guide style reads
* Conversational sadness, warmth, mild anger, gentle pleasure
**Less suited for:**
* Theatrical / caricatured delivery (loud anger, shouted joy, dramatic sadness)
* Extreme intensifier prompts ("nearly shouting", "intensely sad") — the model intentionally tones these down
* Languages other than English
CC BY-NC-SA 4.0 — research and non-commercial use only.
## Files
```
model.safetensors # merged Talker weights (3.6 GB)
speech_tokenizer/ # Qwen3 12 Hz audio codec (~650 MB)
tokenizer.json + ... # text tokenizer
config.json + ... # model configs
miner.py # Vocence engine
chute_config.yml # Chutes build (TEE / pro_6000)
vocence_config.yaml # runtime knobs
demo.py # quick smoke test
```
The Vocence files make this repo deployable on **Bittensor SN78 (Vocence)** via the canonical Vocence/Chutes wrapper without modification.
|