Interpretability

5 items tagged

Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers

Conference on Computer Vision and Pattern Recognition (CVPR) 2026 (highlight, top 3%)
Binxu Wang, Jingxuan Fan, Xu Pan

Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry

International Conference on Learning Representations (ICLR) 2026
Thomas Fel, Binxu Wang, Michael A. Lepori, Matthew Kowal, Andrew Lee, Randall Balestriero, Stella Joseph, Ekdeep S. Lubana, Talia Konkle, Demba E. Ba, Martin Wattenberg

Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

International Conference on Machine Learning (ICML) 2025 (arXiv:2502.12892)
Thomas Fel, Ekdeep Singh Lubana, Jacob S Prince, Matthew Kowal, Victor Boutin, Isabel Papadimitriou, Binxu Wang, Martin Wattenberg, Demba Ba, Talia Konkle
