3 60 19

Haoxiang Zhang

IPF

https://isaacghx.github.io/about

AI & ML interests

None yet

Recent Activity

commentedon a paper about 3 hours ago

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

upvoted a paper about 6 hours ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

upvoted a paper about 6 hours ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

View all activity

Organizations

upvoted 2 papers about 6 hours ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 106

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 10 days ago • 134

upvoted a paper about 7 hours ago

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

Paper • 2508.21433 • Published Aug 29, 2025 • 9

upvoted a paper about 8 hours ago

Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

Paper • 2606.03979 • Published 1 day ago • 11

upvoted a paper about 13 hours ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 3 days ago • 34

upvoted 3 papers 1 day ago

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published Mar 29 • 20

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published 7 days ago • 95

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

Paper • 2606.00408 • Published 6 days ago • 51

upvoted an article 2 days ago

Article

MiniMax Goes Sparse: Decoding M3's Attention from a Single Diagram

AtlasCloud-AI

•

5 days ago

• 5

upvoted a paper 3 days ago

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Paper • 2605.30260 • Published 7 days ago • 38

upvoted 6 papers 5 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 28 days ago • 233

MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI

Paper • 2605.08678 • Published 26 days ago • 9

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 8 days ago • 70

upvoted 2 papers 6 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 16 days ago • 185

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 23 days ago • 195

upvoted a paper 13 days ago

Code as Agent Harness

Paper • 2605.18747 • Published 17 days ago • 212

upvoted a paper 15 days ago

BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese

Paper • 2504.19314 • Published Apr 27, 2025 • 8

Haoxiang Zhang

AI & ML interests

Recent Activity

Organizations

IPF's activity

MiniMax Goes Sparse: Decoding M3's Attention from a Single Diagram