Poster

Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models

Zeman Li · Xinwei Zhang · Peilin Zhong · Yuan Deng · Meisam Razaviyayn · Vahab Mirrokni
2025 Poster

Abstract
