news
Jun 13, 2025 | A major update of the Manipulating Feature Visualizations with Gradient Slingshots pre-print is now available on arXiv. |
---|---|
Apr 28, 2025 | Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers presented at the Workshop on Representational Alignment at ICLR 2025 in Singapore! |
Dec 14, 2024 | 🐼 quanda presented at the 2nd Workshop on Attributing Model Behavior at Scale at NeurIPS 2024! |
Dec 06, 2024 | 🏆 I am honored to have been awarded the Rolf Niedermeier Prize for an outstanding final thesis by the Faculty of Electrical Engineering and Computer Science at the Technical University of Berlin. |
Oct 09, 2024 | 🐼 quanda library release on GitHub and paper release on arXiv! |
Aug 07, 2024 | 🐣 Started this website AND learned how to ride a bike! |
Jul 27, 2024 | Presenting the paper Manipulating Feature Visualizations with Gradient Slingshots at the ICML 2024 Workshop on Mechanistic Interpretability. |
Jul 26, 2024 | Presenting the paper Manipulating Feature Visualizations with Gradient Slingshots at the NextGenAISafety workshop at ICML 2024. |
Jun 18, 2024 | First-authored paper Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression presented at the Safe Artificial Intelligence for All Domains workshop at CVPR 2024 (unable to attend due to US visa issues). |