CE-Bench: Towards a Reliable Contrastive Evaluation Benchmark of Interpretability of Sparse Autoencoders Paper • 2509.00691 • Published Aug 31