How to use from the Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-to-speech", model="mlx-community/VibeVoice-Realtime-0.5B-4bit")
# Load model directly
from transformers import VibeVoiceStreamingForConditionalGenerationInference
model = VibeVoiceStreamingForConditionalGenerationInference.from_pretrained("mlx-community/VibeVoice-Realtime-0.5B-4bit", dtype="auto")

mlx-community/VibeVoice-Realtime-0.5B-4bit

This model was converted to MLX format from microsoft/VibeVoice-Realtime-0.5B using mlx-audio version 0.2.6. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-audio
python -m mlx_audio.tts.generate --model mlx-community/VibeVoice-Realtime-0.5B-4bit --text "Hello, this is VibeVoice real-time 0.5B model." --voice en-Emma_woman
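
The same command can be driven from a Python script. The sketch below is a thin wrapper around the CLI call shown above, using only the `--model`, `--text`, and `--voice` flags from that command. The helper names `build_tts_command` and `run_tts` are hypothetical and not part of mlx-audio; the sketch assumes mlx-audio is installed in the same interpreter environment.

```python
import subprocess
import sys

MODEL_ID = "mlx-community/VibeVoice-Realtime-0.5B-4bit"


def build_tts_command(text, model=MODEL_ID, voice="en-Emma_woman"):
    """Return the argument list for the mlx_audio.tts.generate CLI call.

    Hypothetical helper: it only assembles the flags used in the
    command above and does not call any mlx-audio Python API.
    """
    return [
        sys.executable, "-m", "mlx_audio.tts.generate",
        "--model", model,
        "--text", text,
        "--voice", voice,
    ]


def run_tts(text, **kwargs):
    """Run the CLI as a subprocess (assumes `pip install -U mlx-audio`)."""
    # check=True raises CalledProcessError if the CLI exits with a non-zero status.
    return subprocess.run(build_tts_command(text, **kwargs), check=True)


# Example call (left commented out because it downloads the model and synthesizes audio):
# run_tts("Hello, this is VibeVoice real-time 0.5B model.")
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when the text contains quotes or other shell metacharacters.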
Model size: 0.2B params (Safetensors, MLX)
Tensor types: F16, U32