Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics Paper • 2605.12178 • Published 3 days ago • 58
EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents Paper • 2605.13841 • Published 2 days ago • 58
Eureka-Audio: Triggering Audio Intelligence in Compact Language Models Paper • 2602.13954 • Published Feb 15 • 4
EpochX: Building the Infrastructure for an Emergent Agent Civilization Paper • 2603.27304 • Published Mar 28 • 47
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published Feb 9 • 290
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published Mar 10 • 75
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published Mar 3 • 104
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published Mar 26 • 132
Parakeet ASR Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 6 days ago • 72
Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens Paper • 2602.16687 • Published Feb 18 • 5
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis Paper • 2505.02625 • Published May 5, 2025 • 23