$OneMillion-Bench: How Far are Language Agents from Human Experts? Paper • 2603.07980 • Published 1 day ago • 19
DreamCAD: Scaling Multi-modal CAD Generation using Differentiable Parametric Surfaces Paper • 2603.05607 • Published 5 days ago • 2
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation Paper • 2603.05494 • Published 5 days ago • 1
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 8 days ago • 137
Causal Motion Diffusion Models for Autoregressive Motion Generation Paper • 2602.22594 • Published 13 days ago • 7
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published 28 days ago • 198
Visual Memory Injection Attacks for Multi-Turn Conversations Paper • 2602.15927 • Published 21 days ago • 3
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published Feb 5 • 346
Flavors of Moonshine Collection A suite of tiny automatic speech recognition (ASR) models specialized for a range of underrepresented languages. • 6 items • Updated Sep 11, 2025 • 1
NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models Paper • 2602.06694 • Published Feb 6 • 15
Reliable and Responsible Foundation Models: A Comprehensive Survey Paper • 2602.08145 • Published Feb 4 • 8
Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration? Paper • 2602.07055 • Published Feb 4 • 22