ibm-granite/granite-4.0-1b-speech • Automatic Speech Recognition • 2B • Updated about 8 hours ago • 62.2k • 211
Qwen3 Voice Embedding • Collection • Standalone ECAPA-TDNN x-vector speaker encoders extracted from Qwen3-TTS. 1024-dim (0.6B) and 2048-dim (1.7B). • 4 items • Updated Feb 27 • 28
Text-to-Speech (TTS) models • Collection • A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now! • 16 items • Updated 22 days ago • 27