Add utility scripts: voice tools, chunking, batch generation, space restart 7135eb0 Running hf4uwho commited on May 12
Upload voices/celeb/jack_nicholson.wav with huggingface_hub 1f982cc verified hf4uwho commited on May 11
Upload samples/jack_nicholson_story.ogg with huggingface_hub b011c43 verified hf4uwho commited on May 11
Upload samples/scarlett_johansson_story.ogg with huggingface_hub a5794f5 verified hf4uwho commited on May 11
Add 7 celebrity voices: DiCaprio, Nicholson, Pesci, De Niro, Pacino, Gyllenhaal, Johansson 341df3d hf4uwho commited on May 11
Add POST /tts endpoint — no URL length limits, send full text in body 1334893 hf4uwho commited on May 11
Revert to one-shot generation. Chunked crossfade didn't fix stuttering. d65d8c9 hf4uwho commited on May 10
Chunked TTS: split long text into sentences, generate each, crossfade joins to remove abrupt cuts 7be824c hf4uwho commited on May 10
Comprehensive README: API docs, 78 voices, architecture, and full debugging notes d5bd886 hf4uwho commited on May 10
Add 24 character voices from chandypants/ollie-pocket-tts (78 total) 24c96a2 hf4uwho commited on May 10
FIX: use get_state_for_audio_prompt from Nymbo WAV files (actually produces speech) 721b988 hf4uwho commited on May 10
FIX: load english_2026-04 model WITH voice cloning (default loads without-voice-cloning = noise) 8674286 hf4uwho commited on May 10
Fix: convert flat v3/v2 safetensors keys to nested dict format for voice state da26835 hf4uwho commited on May 10
Fix: handle v3/v2 pre-computed KV cache embeddings (no audio_prompt key) c76f7f1 hf4uwho commited on May 10
Fix: pass HF_TOKEN to huggingface_hub for gated model embedding download d343afd hf4uwho commited on May 10
Switch to Docker SDK: FastAPI server with direct WAV/OGG output, no Gradio 1dd1d1c hf4uwho commited on May 10