Mechanistic Interpretability with Sparse Autoencoder Neural Operators
arXiv:2509.03738v3 Announce Type: replace Abstract: We introduce sparse autoencoder neural operators (SAE-NOs), a new class of sparse autoencoders that operate directly in infinite-dimensional function spaces....