Poster

Reinforcement Learning from Dynamic Critic Feedback for Free-Form Generations

Mian Wu · Gavin Zhang · Sewon Min · Sergey Levine · Aviral Kumar

Abstract
