nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation β’ 124B β’ Updated 12 minutes ago β’ 1.08M β’ 217
HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive Image-Text-to-Text β’ 35B β’ Updated 1 day ago β’ 723k β’ 1.2k
Emu3.5 Collection Native Multimodal Models are World Learners π β’ 4 items β’ Updated Feb 4 β’ 76
FG-CLIP 2 Collection FG-CLIP 2 is the foundation model for fine-grained vision-language understanding in both English and Chinese. β’ 10 items β’ Updated Nov 6, 2025 β’ 5