Automatic Speech Recognition
Transformers
PyTorch
Estonian
wav2vec2
audio
hf-asr-leaderboard
Eval Results (legacy)
Instructions to use TalTechNLP/xls-r-300m-et with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use TalTechNLP/xls-r-300m-et with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="TalTechNLP/xls-r-300m-et")# Load model directly from transformers import AutoProcessor, AutoModelForCTC processor = AutoProcessor.from_pretrained("TalTechNLP/xls-r-300m-et") model = AutoModelForCTC.from_pretrained("TalTechNLP/xls-r-300m-et") - Notebooks
- Google Colab
- Kaggle
XLS-R-300m-ET
This is a XLS-R-300M model facebook/wav2vec2-xls-r-300m finetuned on around 800 hours of diverse Estonian data.
Model description
This is a general-purpose Estonian ASR model trained in the Lab of Language Technology at TalTech. It consists of only the CTC-based end-to-end model, no language model is currently provided.
Intended uses & limitations
This model is intended for general-purpose speech recognition, such as broadcast conversations, interviews, talks, etc.
How to use
TODO
Limitations and bias
Since this model was trained on mostly broadcast speech and texts from the web, it might have problems correctly decoding the following:
- Speech containing technical and other domain-specific terms
- Children's speech
- Non-native speech
- Speech recorded under very noisy conditions or with a microphone far from the speaker
- Very spontaneous and overlapping speech
Training data
Acoustic training data:
| Type | Amount (h) |
|---|---|
| Broadcast speech | 591 |
| Spontaneous speech | 53 |
| Elderly speech corpus | 53 |
| Talks, lectures | 49 |
| Parliament speeches | 31 |
| Total | 761 |
Training procedure
Finetuned using Fairseq.
Evaluation results
WER
| Dataset | WER |
|---|---|
| jutusaated.devset | 7.9 |
| jutusaated.testset | 6.1 |
| Common Voice 6.1 | 12.5 |
| Common Voice 8.0 | 13.4 |
- Downloads last month
- 432
Model tree for TalTechNLP/xls-r-300m-et
Evaluation results
- Test WER on Common Voiceself-reported12.520
- Test CER on Common Voiceself-reported2.709
- Test WER on Common Voice 8self-reported13.384
- Test CER on Common Voice 8self-reported2.982