- Unified Reward Model for Multimodal Understanding and Generation
  Paper • 2503.05236 • Published • 124
- Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
  Paper • 2505.03318 • Published • 93
- CodeGoat24/UnifiedReward-Think-qwen35-9b
  9B • Updated • 25
- CodeGoat24/UnifiedReward-Think-qwen35-27b
  3.05M • Updated • 24
SII-Yibin Wang
CodeGoat24
AI & ML interests
I'm part of the Shanghai Innovation Institute, focusing on multimodal RL and generation.
Recent Activity
authored a paper • 1 minute ago
MeepleLM: A Virtual Playtester Simulating Diverse Subjective Experiences
authored a paper • 1 minute ago
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
authored a paper • 6 minutes ago
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space