Rui Sun PRO
ThreeSR
AI & ML interests
Vision and Language Multimodal Learning, CV, NLP, LLM
Recent Activity
upvoted a paper about 5 hours ago
OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence
upvoted a paper about 5 hours ago
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web
upvoted a paper about 5 hours ago
Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models