Instructions to use LokaalHub/nemotron-3.5-sv-SE with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use LokaalHub/nemotron-3.5-sv-SE with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("LokaalHub/nemotron-3.5-sv-SE") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
sv-SE-asr-streaming-0.6b
A streaming Swedish (sv-SE) ASR model, fine-tuned from
nvidia/nemotron-3.5-asr-streaming-0.6b on
LokaalHub/sv-SE-asr-cv.
Community fine-tune, not an NVIDIA model. A derivative of NVIDIA's Nemotron 3.5 ASR. NVIDIA did not produce, endorse, or review this model. "Nemotron" is a trademark of NVIDIA, used here only to identify the base model.
TL;DR
Swedish (sv-SE) is a supported locale of the base model, but its out-of-the-box accuracy on Common Voice is modest (~40.49% WER). A single full fine-tune on ~26.1h brings it to ~18.57% WER. Prompt slot used during fine-tuning: sv-SE (own slot).
Results
| Condition | Base | Fine-tuned | Rel. improvement |
|---|---|---|---|
WER (offline, full-context, normalized) on LokaalHub/sv-SE-asr-cv test |
40.49% | 18.57% | 54.2% |
Offline (full-context) WER via NeMo
transcribe_speech.py. Cache-aware streaming WER (the condition NVIDIA headlines) was not measured for this release.
Usage
import nemo.collections.asr as nemo_asr
m = nemo_asr.models.ASRModel.restore_from("model.nemo") # from this repo
m.transcribe(["audio.wav"]) # target_lang prompt: sv-SE
Training
Single full fine-tune (init_from_nemo_model), bf16, NoamAnnealing. Data:
LokaalHub/sv-SE-asr-cv (~26.1h train).
Built and trained by the asr-loop pipeline.
Limitations
Low-resource fine-tune on read speech (Common Voice). Evaluated on a 2.0h speaker-disjoint test subset — not directly comparable to published full-Common-Voice-test numbers.
- Downloads last month
- 30
Model tree for LokaalHub/nemotron-3.5-sv-SE
Base model
nvidia/nemotron-3.5-asr-streaming-0.6bDataset used to train LokaalHub/nemotron-3.5-sv-SE
Evaluation results
- WER (offline / full-context, normalized) on LokaalHub/sv-SE-asr-cv (test)test set self-reported18.570