Hejian Sang
pb09204048
AI & ML interests
None yet
Recent Activity
upvoted a paper 23 days ago
On-Policy Self-Distillation for Reasoning Compression submitted a paper 24 days ago
On-Policy Self-Distillation for Reasoning Compression authored a paper 26 days ago
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning