Do Transformers Use Their Depth Adaptively? Evidence from a Relational Reasoning Task

Alicia Curth ⋅ Rachel Lawrence ⋅ Sushrut Karmalkar ⋅ Niranjani Prasad

Abstract