Entropy Sentinel: Continuous LLM Accuracy Monitoring from Decoding Entropy Traces in STEM Paper • 2601.09001 • Published 7 days ago • 12
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models Paper • 2601.11404 • Published 4 days ago • 20
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs Paper • 2601.11000 • Published 4 days ago • 24
Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale Paper • 2601.10338 • Published 5 days ago • 4
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper • 2601.10129 • Published 5 days ago • 10
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published 5 days ago • 23
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Paper • 2601.10657 • Published 5 days ago • 18
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 5 days ago • 26
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 5 days ago • 25
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published 5 days ago • 27
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering Paper • 2601.10402 • Published 5 days ago • 35
Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning Paper • 2601.07641 • Published 8 days ago • 43
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 6 days ago • 78
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 5 days ago • 151