inference-net/ClipTagger-12b
Image-Text-to-Text
Safetensors
English
gemma3
VLM
video-understanding
image-captioning
gemma
json-mode
structured-output
video-analysis
conversational
Eval Results
compressed-tensors
License: apache-2.0
Commit History: ClipTagger-12b/assets (branch main)
Create README.md (fb437aa), samhogan committed on Aug 14, 2025