Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity Paper • 2602.10585 • Published 22 days ago • 2
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published 20 days ago • 93
lkevincc0/Step-3.5-Flash-REAP-128B-A11B Text Generation • 121B • Updated 22 days ago • 217 • 9
Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data Paper • 2601.22141 • Published Jan 29 • 2
TTCS: Test-Time Curriculum Synthesis for Self-Evolving Paper • 2601.22628 • Published Jan 30 • 35
ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas Paper • 2601.21558 • Published Jan 29 • 59
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published Dec 8, 2025 • 78