Created on December 12, 2025
2025
Two spotlight papers at the Mechanistic Interpretability workshop, NeurIPS 2025