Skip to yearly menu bar Skip to main content


Poster

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Dhruva Tirumala ⋅ Arun Ahuja ⋅ Martin Riedmiller ⋅ Jack Rae ⋅ Hubert Soyer ⋅ Seb Noury ⋅ Nicolas Heess ⋅ Jost Tobias Springenberg ⋅ Francis Song ⋅ SIQI LIU ⋅ Abbas Abdolmaleki ⋅ Aidan Clark ⋅ Dan Belov ⋅ Matthew Botvinick

Abstract

Chat is not available.