openai/whisper-large-v3 Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 5.39M • 5.78k
HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit Zero-Shot Image Classification • 0.9B • Updated Mar 7, 2024 • 170 • 53
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B Text Generation • 31B • Updated Oct 10, 2025 • 35.9k • 814
Running on CPU Upgrade • Featured • The Smol Training Playbook • 3.2k • The secrets to building world-class LLMs
Running • The Ultra-Scale Playbook • 3.87k • The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 788k • 1.52k
Running • Scaling test-time compute • 600 • Boost LLM answers with flexible test-time search strategies