Instructions to use SpireLab/spire_respin_baselines_espnet with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- ESPnet
How to use SpireLab/spire_respin_baselines_espnet with ESPnet:
from espnet2.bin.asr_inference import Speech2Text model = Speech2Text.from_pretrained( "SpireLab/spire_respin_baselines_espnet" ) speech, rate = soundfile.read("speech.wav") text, *_ = model(speech)[0] - Notebooks
- Google Colab
- Kaggle
ESPnet ASR Models in 9 Indian languages trained on RESPIN-S1.0 Small Train sets
This repository contains E-Branchformer-based ESPnet2 ASR models trained on the RESPIN-S1.0 small train splits.
π Demo: How to use in ESPnet2
Follow the ESPnet installation instructions if you haven't done that already.
cd espnet
pip install -e .
cd egs2/respin_small/asr1
./run.sh --skip_data_prep true --skip_train true --download_model SpireLab/spire_respin_baselines_espnet
π Results (CER/WER from RESULTS.md)
| Language | Model Name | CER (%) | WER (%) |
|---|---|---|---|
| bh | exp_small/exp_bh/asr_bh_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 4.4 | 15.2 |
| bn | exp_small/exp_bn/asr_bn_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 4.1 | 15.0 |
| ch | exp_small/exp_ch/asr_ch_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 3.1 | 10.6 |
| hi | exp_small/exp_hi/asr_hi_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 3.1 | 9.9 |
| kn | exp_small/exp_kn/asr_kn_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 4.6 | 24.5 |
| mg | exp_small/exp_mg/asr_mg_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 6.0 | 20.4 |
| mr | exp_small/exp_mr/asr_mr_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 3.1 | 14.5 |
| mt | exp_small/exp_mt/asr_mt_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 5.0 | 17.9 |
| te | exp_small/exp_te/asr_te_ebf_size256_mlp1024_lin1024_e8_mactrue_bs6M_gacc1 | 4.1 | 21.6 |
π Citation
@inproceedings{watanabe2018espnet,
author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Yalta and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
title={{ESPnet}: End-to-End Speech Processing Toolkit},
year={2018},
booktitle={Proc. Interspeech},
pages={2207--2211},
doi={10.21437/Interspeech.2018-1456},
url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
}
- Downloads last month
- 5