facebook/vjepa2-vitl-fpc64-256 Video Classification β’ 0.3B β’ Updated Aug 11, 2025 β’ 74k β’ 172
ibm-granite/granite-docling-258M Image-Text-to-Text β’ 0.3B β’ Updated Sep 23, 2025 β’ 202k β’ 1.07k
Runtime error 36 Multimodal RAG with Granite Vision π 36 RAG example using Granite [vision, embedding, instruct]
Running on Zero Featured 260 granite-docling-258M demo π 260 Convert images to structured text and answer questions
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ 0.3B β’ Updated Sep 17, 2025 β’ 48.6k β’ 1.6k
Running on A100 223 Omnilingual ASR Media Transcription π 223 Transcribe audio or video into text in any language