Skip to yearly menu bar Skip to main content


Uncovering Mesa-Optimization Algorithms in Transformers

Johannes von Oswald ⋅ Eyvind Niklasson ⋅ Maximilian Schlegel ⋅ Alexander Meulemans ⋅ Seijin Kobayashi ⋅ Nicolas Zucchet ⋅ Nino Scherrer ⋅ Nolan Miller ⋅ Mark Sandler ⋅ Blaise Aguera y Arcas ⋅ Max Vladymyrov ⋅ Razvan Pascanu ⋅ Joao Sacramento

Abstract

Chat is not available.