Skip to yearly menu bar Skip to main content


Optimization, Not Architecture, Governs Vision Transformer Generalization in Small-Data Regimes

Divyanshu Gupta

Abstract

Chat is not available.