Announcement_4
Our paper examining how mechanistic knowledge can predict vulnerabilities was accepted to the Mechanistic Interpretability workshop at ICML 2026.