Bayes optimal learning of attention-indexed models
arXiv:2506.01582v3 Announce Type: replace-cross Abstract: We introduce the attention-indexed model (AIM), a theoretical framework for analyzing learning in deep attention layers. Inspired by multi-index models,...