-
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
Paper • 2601.00423 • Published • 8 -
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper • 2601.05242 • Published • 191 -
Motion Attribution for Video Generation
Paper • 2601.08828 • Published • 65
Jongmin Kim
jmkim0309
AI & ML interests
None yet
Recent Activity
liked
a model
about 16 hours ago
kakaocorp/kanana-2-30b-a3b-thinking-2601
updated
a collection
3 days ago
paper_seminar_260121
updated
a collection
5 days ago
paper_seminar_260121
Organizations
None yet