Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought!
YM Qin
Wakals
AI & ML interests
Computer Vision, Vision-language Model, Generative Model
Recent Activity
upvoted a collection 1 day ago
Qwen3.5 liked a dataset 3 days ago
VisGym/visgym_data liked a dataset 15 days ago
VisGym/inference-datasetOrganizations
None yet