Skip to yearly menu bar Skip to main content


TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers

Peng Cheng ⋅ Jiucheng Zang ⋅ Qingnan Li ⋅ Liheng Ma ⋅ Yufei CUI ⋅ Yingxue Zhang ⋅ Boxing Chen ⋅ Ming Jian ⋅ Wen Tong

Abstract

Chat is not available.