In-Person Poster presentation / poster accept

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences

Alan Chan ⋅ Hugo Silva ⋅ Sungsu Lim ⋅ Tadashi Kozuno ⋅ A. Rupam Mahmood ⋅ Martha White
2023
JMLR

