Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 6 items • Updated 3 days ago • 30
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 3 days ago • 54
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 3 days ago • 37
nvidia/parakeet-ctc-0.6b-Vietnamese Automatic Speech Recognition • Updated about 7 hours ago • 809 • 56
Running on CPU Upgrade Featured 1.21k Open ASR Leaderboard 🏆 1.21k Explore speech recognition model benchmarks and request evaluations
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 9 days ago • 11.6k • 467
nvidia/diar_streaming_sortformer_4spk-v2 Automatic Speech Recognition • Updated Dec 31, 2025 • 22.5k • 100
nvidia/multitalker-parakeet-streaming-0.6b-v1 Automatic Speech Recognition • Updated 11 days ago • 812 • 76