Skip to yearly menu bar Skip to main content


Poster

Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction

Yusong Wu · Stephen Brade · Teng Ma · Tia-Jane Fowler · Enning Yang · Berker Banar · Aaron Courville · Natasha Jaques · Anna Huang

Abstract

Log in and register to view live content