cdli/ugandan_english_nonstandard_speech_v1.0
Viewer • Updated • 7.25k • 37
How to use KasuleTrevor/cdli-whisper-en-ug-ke-openai-large-v3-full-a40-1500-fresh with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="KasuleTrevor/cdli-whisper-en-ug-ke-openai-large-v3-full-a40-1500-fresh") # Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM
processor = AutoProcessor.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-ke-openai-large-v3-full-a40-1500-fresh")
model = AutoModelForMultimodalLM.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-ke-openai-large-v3-full-a40-1500-fresh")This model is a fine-tuned version of openai/whisper-large-v3 on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.5387 | 1.6967 | 500 | 0.6719 | 0.2632 | 0.1803 |
| 0.5173 | 3.3908 | 1000 | 0.6613 | 0.2599 | 0.1791 |
| 0.412 | 5.0850 | 1500 | 0.6603 | 0.2533 | 0.1741 |
Base model
openai/whisper-large-v3