Automatic Speech Recognition
NeMo
PyTorch
English
speech
audio
CTC
Citrinet
Transformer
hf-asr-leaderboard
Riva
Eval Results (legacy)
How to use nvidia/stt_en_citrinet_1024_gamma_0_25 with NeMo:

    import nemo.collections.asr as nemo_asr
    asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/stt_en_citrinet_1024_gamma_0_25")
    transcriptions = asr_model.transcribe(["file.wav"])

Update README.md
README.md
CHANGED

    @@ -140,7 +140,8 @@ wget https://dldata-public.s3.us-east-2.amazonaws.com/2086-149220-0033.wav
     ```
     Then simply do:
     ```
    -asr_model.transcribe(['2086-149220-0033.wav'])
    +output = asr_model.transcribe(['2086-149220-0033.wav'])
    +print(output[0].text)
     ```
     
     ### Transcribing many audio files

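The updated README line prints `output[0].text`, which implies that `transcribe` returns objects carrying a `text` attribute rather than bare strings. As far as I know, older NeMo releases returned plain strings for CTC models, so the following is a minimal sketch of reading the result under either behavior. The helper names `hypothesis_texts` and `transcribe_file` are hypothetical and not part of NeMo, and the assumption that each result is either a `str` or an object with `.text` is mine, not something this commit states.

```python
from typing import Any, List


def hypothesis_texts(outputs: List[Any]) -> List[str]:
    """Return plain transcript strings from a list of transcribe() results.

    Assumption: each item is either a str (older behavior) or an object
    exposing a `.text` attribute (the behavior the README change relies on).
    """
    texts = []
    for item in outputs:
        if isinstance(item, str):
            texts.append(item)
        else:
            texts.append(item.text)
    return texts


def transcribe_file(path: str) -> str:
    """Hypothetical wrapper: load the model and transcribe one audio file.

    NeMo is imported lazily so that importing this module does not require
    the toolkit to be installed.
    """
    import nemo.collections.asr as nemo_asr

    asr_model = nemo_asr.models.ASRModel.from_pretrained(
        "nvidia/stt_en_citrinet_1024_gamma_0_25"
    )
    output = asr_model.transcribe([path])
    return hypothesis_texts(output)[0]
```

With NeMo installed and the sample file downloaded, `print(transcribe_file("2086-149220-0033.wav"))` should print the transcript whichever of the two return types the installed version uses.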