Skip to yearly menu bar Skip to main content


Transformers Learn Nonlinear Features In Context: Nonconvex Mean-field Dynamics on the Attention Landscape

Juno Kim · Taiji Suzuki

Abstract

Chat is not available.