Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published Mar 3 • 106
DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning Paper • 2605.25604 • Published 10 days ago • 134
The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management Paper • 2508.21433 • Published Aug 29, 2025 • 9
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories Paper • 2606.03979 • Published 1 day ago • 11
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 3 days ago • 34
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper • 2603.27490 • Published Mar 29 • 20
GrepSeek: Training Search Agents for Direct Corpus Interaction Paper • 2605.29307 • Published 7 days ago • 95
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism Paper • 2606.00408 • Published 6 days ago • 51
Article MiniMax Goes Sparse: Decoding M3's Attention from a Single Diagram AtlasCloud-AI • 5 days ago • 5
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning Paper • 2605.30260 • Published 7 days ago • 38
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 7 days ago • 74
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 28 days ago • 233
MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI Paper • 2605.08678 • Published 26 days ago • 9
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 8 days ago • 70
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 16 days ago • 185
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 23 days ago • 195
BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese Paper • 2504.19314 • Published Apr 27, 2025 • 8