XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models Paper • 2411.15100 • Published Nov 22, 2024 • 7
view article Article Google Released Gemma-4 Four Days Ago. We Already Made It 1.72× Faster. lujangusface • Apr 7 • 2
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 29 days ago • 1.39M • 321
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 176k • • 2.86k