arxiv:2604.07023
Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted a paper about 13 hours ago
Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories
upvoted a paper 3 days ago
Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces
upvoted a paper 3 days ago
CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents