Running on CPU Upgrade Agents Featured 1.33k Open ASR Leaderboard 🏆 1.33k Explore and compare speech‑recognition model benchmarks
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 5 items • Updated 7 days ago • 43
Running 14 Defeating the trainer-generator precision mismatch in TRL 🎯 14 Download research PDF (Pro access required)
Canary ASR/AST Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 6 items • Updated 7 days ago • 34
Parakeet ASR Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 7 days ago • 68