Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

Jan 1, 2025 · Thomas Fel, Ekdeep Singh Lubana, Jacob S Prince, Matthew Kowal, Victor Boutin, Isabel Papadimitriou, Binxu Wang, Martin Wattenberg, Demba Ba, Talia Konkle
Type
Publication
ICML 2025 (arXiv:2502.12892)