Poster

Actor-critic is implicitly biased towards high entropy optimal policies

Yuzheng Hu ⋅ Ziwei Ji ⋅ Matus Telgarsky
2022 Poster

Abstract

