OpenGVLab/ViCLIP-B-16-hf
0.1B
•
Updated
•
106
•
1
Computer Vision
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs