QuanjianSong
AI & ML interests
None yet
Recent Activity
upvoted a paper about 13 hours ago
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models
upvoted a paper 3 days ago
CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization
upvoted a paper 4 days ago
Stress-Testing the Reasoning Competence of LLMs With Proofs Under Minimal Formalism
Organizations
None yet