AutoGaze 👀 • Running on Zero • Agents • 20 • Generate gaze pattern and reconstruction videos from any video
unsloth/gemma-3n-E2B-it-litert-preview-GGUF • Image-Text-to-Text • 4B • Updated Jul 17, 2025 • 1.96k • 2
SmolVLM realtime WebGPU ⚡ • Running • Featured • 161 • Ask questions about your webcam view and get text answers
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models • Paper • 2407.07895 • Published Jul 10, 2024 • 42
FastSAM 🐠 • Runtime error • Agents • Featured • 234 • Segment images using texts, points, or everything mode