Local Voice LLM

Client-side VAD, speech recognition, language model, and Supertonic speech synthesis.

VAD Idle
STT Idle
LLM Idle
TTS Idle

Input

Mic off
Waiting for speech.
Input level
0%

Assistant

Audio idle
Load the models, start the microphone, and speak naturally.

Events

    Benchmarks

    No benchmark runs yet.
    Run Stack ASR STT WER VAD close Prompt 1st token TTS queued TTS synth 1st audio End → audio Audio done Decode LLM OK Transcript Output
    No benchmark runs yet.