facebook/multilingual_librispeech
Viewer • Updated • 1.49M • 51.3k • 179
How to use sanchit-gandhi/whisper-medium-es-5k-1e-5-bs-64 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="sanchit-gandhi/whisper-medium-es-5k-1e-5-bs-64") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("sanchit-gandhi/whisper-medium-es-5k-1e-5-bs-64")
model = AutoModelForSpeechSeq2Seq.from_pretrained("sanchit-gandhi/whisper-medium-es-5k-1e-5-bs-64")This model is a fine-tuned version of openai/whisper-medium on the Multilingual LibriSpeech dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.1438 | 0.2 | 1000 | 0.1414 | 6.4317 |
| 0.1294 | 0.4 | 2000 | 0.1139 | 4.7176 |
| 0.2289 | 0.6 | 3000 | 0.1048 | 4.3266 |
| 0.1164 | 1.11 | 4000 | 0.0964 | 3.9164 |
| 0.1609 | 1.31 | 5000 | 0.0969 | 4.0193 |