M Saad Salman

MSS444

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

upvoted a paper about 9 hours ago

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

upvoted a paper about 9 hours ago

Neural Computers

Organizations

None yet

upvoted 6 papers about 9 hours ago

Graph-Based Chain-of-Thought Pruning for Reducing Redundant Reflections in Reasoning LLMs

Paper • 2604.05643 • Published 8 days ago • 12

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

Paper • 2604.04247 • Published 10 days ago • 29

Neural Computers

Paper • 2604.06425 • Published 8 days ago • 27

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published 7 days ago • 31

MARS: Enabling Autoregressive Models Multi-Token Generation

Paper • 2604.07023 • Published 7 days ago • 36

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 8 days ago • 61

upvoted 2 papers 1 day ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 6 days ago • 273

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 7 days ago • 308

upvoted 11 papers 7 days ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published 13 days ago • 40

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 9 days ago • 39

Memory Intelligence Agent

Paper • 2604.04503 • Published 9 days ago • 55

Vero: An Open RL Recipe for General Visual Reasoning

Paper • 2604.04917 • Published 9 days ago • 30

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

Paper • 2604.04804 • Published 9 days ago • 31

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published 10 days ago • 36

LightThinker++: From Reasoning Compression to Memory Management

Paper • 2604.03679 • Published 11 days ago • 33

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 9 days ago • 106

upvoted a paper 9 days ago

Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines

Paper • 2604.01029 • Published 13 days ago • 7
