OpenGVLab

community

https://github.com/opengvlab

Activity Feed Request to join this org

AI & ML interests

Computer Vision

Recent Activity

yangxue authored a paper 16 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Kaining authored a paper 24 days ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

cuierfei authored a paper 27 days ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

View all activity

Papers

RIVER: A Real-Time Interaction Benchmark for Video LLMs

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

View all Papers

OpenGVLab 's models 286

OpenGVLab/InternVL3_5-241B-A28B-Pretrained

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 43 • 1

OpenGVLab/InternVL3_5-241B-A28B-Instruct

Image-Text-to-Text • 241B • Updated Aug 29, 2025 • 32 • 15

OpenGVLab/InternVL3_5-38B-MPO

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 36 • 2

OpenGVLab/InternVL3_5-38B-Pretrained

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 32 • 2

OpenGVLab/InternVL3_5-38B

Image-Text-to-Text • Updated Aug 29, 2025 • 18.7k • 44

OpenGVLab/InternVL3_5-30B-A3B-MPO

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 16 • 4

OpenGVLab/InternVL3_5-30B-A3B

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 105k • 42

OpenGVLab/InternVL3_5-30B-A3B-Pretrained

Image-Text-to-Text • 31B • Updated Aug 29, 2025 • 19 • 1

OpenGVLab/InternVL3_5-38B-Instruct

Image-Text-to-Text • 38B • Updated Aug 29, 2025 • 1.26k • 6

OpenGVLab/InternVL3_5-14B-MPO

Image-Text-to-Text • 15B • Updated Aug 29, 2025 • 51 • 3

OpenGVLab/InternVL3_5-14B

Image-Text-to-Text • Updated Aug 29, 2025 • 18.8k • 27

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • 8B • Updated Aug 4, 2025 • 10.9k • 89

OpenGVLab/ScaleCUA_Env

Updated Jul 31, 2025 • 2

OpenGVLab/InternVideo2-Stage2_6B-224p-f4

Updated Jul 30, 2025 • 6

OpenGVLab/Mono-InternVL-2B

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 10.5k • 37

OpenGVLab/Mono-InternVL-2B-S1-3

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 10 • 1

OpenGVLab/Mono-InternVL-2B-S1-2

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 10 • 1

OpenGVLab/Mono-InternVL-2B-S1-1

Image-Text-to-Text • 3B • Updated Jul 22, 2025 • 10

OpenGVLab/Docopilot-8B

Image-Text-to-Text • 8B • Updated Jul 20, 2025 • 16 • 3

OpenGVLab/Docopilot-2B

Image-Text-to-Text • 2B • Updated Jul 20, 2025 • 11 • 8

OpenGVLab/ZeroGUI-OSWorld-7B

Image-Text-to-Text • 8B • Updated Jun 20, 2025 • 20 • 7

OpenGVLab/InternVideo1.0

Video Classification • Updated Jun 10, 2025 • 1

OpenGVLab/ZeroGUI-AndroidLab-7B

Image-Text-to-Text • 8B • Updated May 30, 2025 • 28 • 5

OpenGVLab/InternVL3-9B-Instruct

Image-Text-to-Text • 9B • Updated May 29, 2025 • 135 • 4

OpenGVLab/InternVL3-9B

Image-Text-to-Text • 9B • Updated May 29, 2025 • 11.7k • 25

OpenGVLab/VisualPRM-8B-v1_1

Image-Text-to-Text • 8B • Updated May 29, 2025 • 129 • 9

OpenGVLab/InternVideo2_CLIP_S

0.4B • Updated May 22, 2025 • 1.06k • 2

OpenGVLab/VideoChat-Flash-Qwen2_5-7B-1M_res224

Video-Text-to-Text • 8B • Updated May 16, 2025 • 33 • 2

OpenGVLab/InternVL_2_5_HiCo_R64

Video-Text-to-Text • 8B • Updated May 13, 2025 • 78 • 3

OpenGVLab/VisualPRM-8B

Image-Text-to-Text • 8B • Updated May 6, 2025 • 145 • 18