Rui Sun's picture

Rui Sun PRO

ThreeSR

·

https://threesr.github.io/

AI & ML interests

Vision and Language Multimodal Learning, CV, NLP, LLM

Recent Activity

upvoted a paper 1 day ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

upvoted a paper 1 day ago

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

updated a collection 10 days ago

View all activity

Organizations

upvoted 2 papers 1 day ago

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 2 days ago • 33

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published 3 days ago • 57

updated a collection 10 days ago

New Papers

96 items • Updated 10 days ago • 1

upvoted a paper 10 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 22 days ago • 330

upvoted a paper 11 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 17 days ago • 95

upvoted a paper 25 days ago

MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization

Paper • 2603.12743 • Published 29 days ago • 3

updated a collection about 1 month ago

New Papers

96 items • Updated 10 days ago • 1

upvoted a paper about 2 months ago

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Paper • 2307.00862 • Published Jul 3, 2023 • 1

updated a collection 2 months ago

New Papers

96 items • Updated 10 days ago • 1