What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom • Paper • 2602.01334 • Published 15 days ago • 2
daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently • Paper • 2602.02619 • Published 14 days ago • 50
daVinci-Dev: Agent-native Mid-training for Software Engineering • Paper • 2601.18418 • Published 21 days ago • 124
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts • Paper • 2601.11044 • Published Jan 16 • 34
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling • Paper • 2601.03111 • Published Jan 6 • 10
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation • Paper • 2512.23576 • Published Dec 29, 2025 • 65
MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation • Paper • 2406.05690 • Published Jun 9, 2024
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models • Paper • 2407.01046 • Published Jul 1, 2024
Understanding Reference Policies in Direct Preference Optimization • Paper • 2407.13709 • Published Jul 18, 2024 • 17