62 Results
Poster | Wed 2:30 | Collaborative Pure Exploration in Kernel Bandit | Yihan Du · Wei Chen · Yuko Kuroki · Longbo Huang
Poster | Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game | Wei Xiong · Han Zhong · Chengshuai Shi · Cong Shen · Liwei Wang · Tong Zhang