Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 6 items • Updated 1 day ago • 30
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 1 day ago • 54
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 1 day ago • 37
Running on CPU Upgrade Featured 1.21k Open ASR Leaderboard 🏆 1.21k Compare and evaluate speech recognition model performance across multiple benchmarks
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 8 days ago • 11.5k • 465
nvidia/diar_streaming_sortformer_4spk-v2 Automatic Speech Recognition • Updated Dec 31, 2025 • 21.7k • 100
nvidia/multitalker-parakeet-streaming-0.6b-v1 Automatic Speech Recognition • Updated 9 days ago • 856 • 76