arxiv:2512.10430
Ramil Latypov
kylecr4ne
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
Trust-Region Behavior Blending for On-Policy Distillation
upvoted a paper 4 months ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
liked a model 5 months ago
t-tech/T-lite-it-2.1