PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 2 days ago • 27
Query-focused and Memory-aware Reranker for Long Context Processing Paper • 2602.12192 • Published 14 days ago • 44
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published 14 days ago • 57
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math Paper • 2602.06291 • Published 21 days ago • 23
World in a Frame: Understanding Culture Mixing as a New Challenge for Vision-Language Models Paper • 2511.22787 • Published Nov 27, 2025 • 10
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models Paper • 2511.18890 • Published Nov 24, 2025 • 35
GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation Paper • 2512.01801 • Published Dec 1, 2025 • 25
CaptionQA: Is Your Caption as Useful as the Image Itself? Paper • 2511.21025 • Published Nov 26, 2025 • 28