Test-Time Training with KV Binding Is Secretly Linear Attention • Paper • 2602.21204 • Published about 20 hours ago • 8
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • Paper • 2602.11451 • Published 14 days ago • 15
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior • Paper • 2512.20757 • Published Dec 23, 2025 • 18