arxiv:2605.16882
Wenjun Wang
juezhi
AI & ML interests
None yet
Recent Activity
upvoted a paper about 20 hours ago
LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards upvoted a paper about 20 hours ago
Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation upvoted a paper 14 days ago
E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring