

Game-Theoretic Regularized Self-Play Alignment of Large Language Models

Xiaohang Tang ⋅ Sangwoong Yoon ⋅ Seongho Son ⋅ Rina Hughes ⋅ Quanquan Gu ⋅ Ilija Bogunovic

Abstract
