news | Dilyara Bareeva

Oct 03, 2025	✨ Our work Manipulating Feature Visualizations with Gradient Slingshots will appear at NeurIPS 2025!
Oct 03, 2025	✨ Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers has been accepted to NeurIPS 2025!
Jun 13, 2025	A major update of the Manipulating Feature Visualizations with Gradient Slingshots preprint is now available on arXiv.
Apr 28, 2025	Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers presented at the Workshop on Representational Alignment at ICLR 2025 in Singapore!
Dec 14, 2024	🐼 quanda presented at the 2nd Workshop on Attributing Model Behavior at Scale at NeurIPS 2024!
Dec 06, 2024	🏆 I am honored to have been awarded the Rolf Niedermeier Prize for an outstanding final thesis by the Faculty of Electrical Engineering and Computer Science at the Technical University of Berlin.
Oct 09, 2024	🐼 quanda library release on GitHub and paper release on arXiv!
Jul 27, 2024	Presenting the paper Manipulating Feature Visualizations with Gradient Slingshots at the ICML 2024 Workshop on Mechanistic Interpretability.
Jul 26, 2024	Presenting the paper Manipulating Feature Visualizations with Gradient Slingshots at the NextGenAISafety workshop at ICML 2024.
Jun 18, 2024	First-authored paper Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression presented at the Safe Artificial Intelligence for All Domains workshop at CVPR 2024 (unable to attend due to US visa issues).