Automatic Speech Recognition
Transformers
PyTorch
speech-encoder-decoder
speech
xls_r
xls_r_translation
Instructions to use facebook/wav2vec2-xls-r-2b-21-to-en with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use facebook/wav2vec2-xls-r-2b-21-to-en with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="facebook/wav2vec2-xls-r-2b-21-to-en")# Load model directly from transformers import AutoTokenizer, AutoModelForMultimodalLM tokenizer = AutoTokenizer.from_pretrained("facebook/wav2vec2-xls-r-2b-21-to-en") model = AutoModelForMultimodalLM.from_pretrained("facebook/wav2vec2-xls-r-2b-21-to-en") - Notebooks
- Google Colab
- Kaggle
Adding generation config file(s)
Browse files- generation_config.json +13 -0
generation_config.json
ADDED
|
@@ -0,0 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"_from_model_config": true,
|
| 3 |
+
"bos_token_id": 0,
|
| 4 |
+
"decoder_start_token_id": 2,
|
| 5 |
+
"early_stopping": true,
|
| 6 |
+
"eos_token_id": 2,
|
| 7 |
+
"forced_bos_token_id": 250004,
|
| 8 |
+
"forced_eos_token_id": 2,
|
| 9 |
+
"max_length": 200,
|
| 10 |
+
"num_beams": 5,
|
| 11 |
+
"pad_token_id": 1,
|
| 12 |
+
"transformers_version": "4.27.0.dev0"
|
| 13 |
+
}
|