audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim Audio Classification • 0.2B • Updated Sep 19, 2024 • 821k • 158
Open ASR Leaderboard 🏆 • Explore and compare speech-to-text model benchmarks • Running on CPU Upgrade • 1.32k
Vietnamese speech dataset • Collection for any speech-related task, including but not limited to speech-to-text, text-to-speech, speech classification, and speaker verification • 31 items • Updated 26 days ago • 43
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 331k • 1.58k