Automatic Speech Recognition
Transformers
PyTorch
TensorBoard
speech-encoder-decoder
Generated from Trainer
Instructions to use speech-seq2seq/wav2vec2-2-roberta-large with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use speech-seq2seq/wav2vec2-2-roberta-large with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="speech-seq2seq/wav2vec2-2-roberta-large")# Load model directly from transformers import AutoTokenizer, AutoModelForSpeechSeq2Seq tokenizer = AutoTokenizer.from_pretrained("speech-seq2seq/wav2vec2-2-roberta-large") model = AutoModelForSpeechSeq2Seq.from_pretrained("speech-seq2seq/wav2vec2-2-roberta-large") - Notebooks
- Google Colab
- Kaggle
| { | |
| "do_normalize": true, | |
| "feature_extractor_type": "Wav2Vec2FeatureExtractor", | |
| "feature_size": 1, | |
| "padding_side": "right", | |
| "padding_value": 0.0, | |
| "return_attention_mask": true, | |
| "sampling_rate": 16000 | |
| } | |