KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving Paper • 2605.13734 • Published 13 days ago • 11
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 150 items • Updated about 3 hours ago • 29
Forecasting Downstream Performance of LLMs With Proxy Metrics Paper • 2605.18607 • Published 8 days ago • 11
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published 5 days ago • 26
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps Paper • 2605.16928 • Published 10 days ago • 89
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows Paper • 2605.14678 • Published 7 days ago • 97
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? Paper • 2605.22109 • Published 5 days ago • 164
TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation Paper • 2605.22355 • Published 5 days ago • 171
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 6 days ago • 200
The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation Paper • 2605.21856 • Published 5 days ago • 3
HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents Paper • 2605.17873 • Published 8 days ago • 4
The Expense of Seeing: Attaining Trustworthy Multimodal Reasoning Within the Monolithic Paradigm Paper • 2604.20665 • Published 5 days ago • 5
Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR Paper • 2605.19282 • Published 7 days ago • 6
LatentUMM: Dual Latent Alignment for Unified Multimodal Models Paper • 2605.17766 • Published 8 days ago • 6
LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws Paper • 2605.23901 • Published 4 days ago • 9
SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research Paper • 2605.22878 • Published 6 days ago • 48