Kaiyan Zhang's picture

Kaiyan Zhang

iseesaw

·

https://iseesaw.github.io/

AI & ML interests

Large Reasoning Models, Reinforcement Learning, Agent

Recent Activity

authored a paper 4 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

upvoted a paper 6 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

upvoted a paper 3 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

View all activity

Organizations

iseesaw 's models

None public yet