Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets

Han Zhong ⋅ Wei Xiong ⋅ Jiyuan Tan ⋅ Liwei Wang ⋅ Tong Zhang ⋅ Zhaoran Wang ⋅ Zhuoran Yang

Abstract
