The Fragility of Chain-of-Thought Monitoring Across Typologically Diverse Languages Paper • 2605.27901 • Published 2 days ago • 6
Towards Understanding the Robustness of Sparse Autoencoders Paper • 2604.18756 • Published Apr 20 • 10
SAM: The Sensitivity of Attribution Methods to Hyperparameters Paper • 2003.08754 • Published Mar 4, 2020