Announcement_1
Presenting the paper "Manipulating Feature Visualizations with Gradient Slingshots" at the ICML 2024 Workshop on Mechanistic Interpretability.