Convergent Evolution: How Different Language Models Learn Similar Number Representations Paper • 2604.20817 • Published 30 days ago • 7
Pre-trained Large Language Models Use Fourier Features to Compute Addition Paper • 2406.03445 • Published Jun 5, 2024 • 1