firstbacksecondback
15 Results
Poster
|
Tue 2:30 |
Dichotomy of Control: Separating What You Can Control from What You Cannot Sherry Yang · Dale Schuurmans · Pieter Abbeel · Ofir Nachum |
|
Oral
|
Tue 2:00 |
Dichotomy of Control: Separating What You Can Control from What You Cannot Sherry Yang · Dale Schuurmans · Pieter Abbeel · Ofir Nachum |
|
Poster
|
Learning Kernelized Contextual Bandits in a Distributed and Asynchronous Environment Chuanhao Li · Huazheng Wang · Mengdi Wang · Hongning Wang |
||
Oral
|
Mon 6:40 |
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions Kevin Frans · Phillip Isola |
|
Poster
|
Mon 7:30 |
Powderworld: A Platform for Understanding Generalization via Rich Task Distributions Kevin Frans · Phillip Isola |
|
Poster
|
Tue 2:30 |
Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning Dianbo Liu · Vedant Shah · Oussama Boussif · Cristian Meo · Anirudh Goyal · Tianmin Shu · Michael Mozer · Nicolas Heess · Yoshua Bengio |
|
Poster
|
Free Lunch for Domain Adversarial Training: Environment Label Smoothing YiFan Zhang · xue wang · Jian Liang · Zhang Zhang · Liang Wang · Rong Jin · Tieniu Tan |
||
Poster
|
Tue 7:30 |
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning Mikayel Samvelyan · Akbir Khan · Michael Dennis · Minqi Jiang · Jack Parker-Holder · Jakob Foerster · Roberta Raileanu · Tim Rocktaeschel |
|
Poster
|
GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation Ming Zhang · Shenghan Zhang · Zhenjie Yang · Lekai Chen · Jinliang Zheng · yang chao · Chuming Li · Hang Zhou · Yazhe Niu · Yu Liu |
||
Poster
|
Wed 7:30 |
Learning in temporally structured environments Matt Jones · Tyler Scott · Mengye Ren · Gamaleldin Elsayed · Katherine Hermann · David Mayo · Michael Mozer |
|
Poster
|
Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments Kaixin Wang · Kuangqi Zhou · Bingyi Kang · Jiashi Feng · shuicheng YAN |
||
Poster
|
Tue 7:30 |
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection Jiajun Fan · Yuzheng Zhuang · Yuecheng Liu · Jianye HAO · Bin Wang · Jiangcheng Zhu · Hao Wang · Shu-Tao Xia |