keithito/lj_speech
Updated • 1.17k • 62
How to use zero0303/qwen3-tts-ljspeech-finetuned with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="zero0303/qwen3-tts-ljspeech-finetuned") # Load model directly
from transformers import AutoModelForSeq2SeqLM
model = AutoModelForSeq2SeqLM.from_pretrained("zero0303/qwen3-tts-ljspeech-finetuned", dtype="auto")This model is a fine-tuned version of Qwen/Qwen3-TTS-12Hz-1.7B-Base trained on the LJSpeech dataset.
The model produces speech in the voice of Linda Johnson, featuring:
| Parameter | Value |
|---|---|
| Epochs | 3 |
| Batch Size | 1 (gradient accumulation: 4) |
| Learning Rate | 5e-6 |
| Mixed Precision | bf16 |
| Starting Loss | 20.4 |
| Final Loss | ~10.7 |
Base model
Qwen/Qwen3-TTS-12Hz-1.7B-Base