Generalizable Policy Learning in the Physical World

Workshop

Generalizable Policy Learning in the Physical World

Young Min Kim · Sergey Levine · Ming Lin · Tongzhou Mu · Ashvin Nair · Hao Su

Fri 29 Apr, 8 a.m. PDT

[ Abstract ] Workshop Website

While the study of generalization has played an essential role in many application domains of machine learning (e.g., image recognition and natural language processing), it did not receive the same amount of attention in common frameworks of policy learning (e.g., reinforcement learning and imitation learning) at the early stage for reasons such as policy optimization is difficult and benchmark datasets are not quite ready yet. Generalization is particularly important when learning policies to interact with the physical world. The spectrum of such policies is broad: the policies can be high-level, such as action plans that concern temporal dependencies and causalities of environment states; or low-level, such as object manipulation skills to transform objects that are rigid, articulated, soft, or even fluid.In the physical world, an embodied agent can face a number of changing factors such as \textbf{physical parameters, action spaces, tasks, visual appearances of the scenes, geometry and topology of the objects}, etc. And many important real-world tasks involving generalizable policy learning, e.g., visual navigation, object manipulation, and autonomous driving. Therefore, learning generalizable policies is crucial to developing intelligent embodied agents in the real world. Though important, the field is very much under-explored in a systematic way.Learning generalizable policies in the physical world requires deep synergistic efforts across fields of vision, learning, and robotics, and poses many interesting research problems. This workshop is designed to foster progress in generalizable policy learning, in particular, with a focus on the tasks in the physical world, such as visual navigation, object manipulation, and autonomous driving. We envision that the workshop will bring together interdisciplinary researchers from machine learning, computer vision, and robotics to discuss the current and future research on this topic.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Fri 8:00 a.m. - 8:10 a.m.	Introduction and Opening Remarks ( Introduction ) >	Hao Su 🔗
Fri 8:10 a.m. - 8:35 a.m.	Invited Talk (Danica Kragic): Learning for contact rich tasks ( Invited Talk ) > SlidesLive Video	Danica Kragic 🔗
Fri 8:35 a.m. - 8:40 a.m.	Q&A for Invited Talk (Danica Kragic) ( Q&A ) >	Danica Kragic 🔗
Fri 8:40 a.m. - 9:05 a.m.	Invited Talk (Peter Stone): Grounded Simulation Learning for Sim2Real ( Invited Talk ) > SlidesLive Video	Peter Stone 🔗
Fri 9:05 a.m. - 9:10 a.m.	Q&A for Invited Talk (Peter Stone) ( Q&A ) >	Peter Stone 🔗
Fri 9:10 a.m. - 9:20 a.m.	Break	🔗
Fri 9:20 a.m. - 10:15 a.m.	Poster Session 1 ( Poster Session ) > link Link	🔗
Fri 10:15 a.m. - 11:15 a.m.	Panel Discussion ( Panel Discussion ) >	Young Min Kim · Peter Stone · Nadia Figueroa · Hao Su · Mrinal Kalakrishnan · Xiaolong Wang · Deepak Pathak · Ming Lin · Danfei Xu 🔗
Fri 11:15 a.m. - 11:23 a.m.	ManiSkill Challenge Winner Presentation (Zhutian Yang & Aidan Curtis) ( Contributed Talk ) > SlidesLive Video	Zhutian Yang 🔗
Fri 11:23 a.m. - 11:31 a.m.	ManiSkill Challenge Winner Presentation (Fattonny) ( Contributed Talk ) > SlidesLive Video	Kun Wu 🔗
Fri 11:31 a.m. - 1:00 p.m.	Lunch Break	🔗
Fri 1:00 p.m. - 1:10 p.m.	Contributed Talk (Sim-to-Lab-to-Real: Safe RL with Shielding and Generalization Guarantees) ( Contributed Talk ) > SlidesLive Video	Kai-Chieh Hsu 🔗
Fri 1:10 p.m. - 1:35 p.m.	Invited Talk (Shuran Song): Iterative Residual Policy for Generalizable Dynamic Manipulation of Deformable Objects ( Invited Talk ) > SlidesLive Video	Shuran Song 🔗
Fri 1:35 p.m. - 1:40 p.m.	Q&A for Invited Talk (Shuran Song) ( Q&A ) >	Shuran Song 🔗
Fri 1:40 p.m. - 2:05 p.m.	Invited Talk (Nadia Figueroa): Towards Safe and Efficient Learning and Control for Physical Human Robot Interaction ( Invited Talk ) > SlidesLive Video	Nadia Figueroa 🔗
Fri 2:05 p.m. - 2:10 p.m.	Q&A for Invited Talk (Nadia Figueroa) ( Q&A ) >	Nadia Figueroa 🔗
Fri 2:10 p.m. - 2:18 p.m.	ManiSkill Challenge Winner Presentation (EPIC Lab) ( Contributed Talk ) > SlidesLive Video	Weikang Wan 🔗
Fri 2:18 p.m. - 2:30 p.m.	Break	🔗
Fri 2:30 p.m. - 2:40 p.m.	Contributed Talk (Know Thyself: Transferable Visual Control Policies Through Robot-Awareness) ( Contributed Talk ) > SlidesLive Video	Edward Hu 🔗
Fri 2:40 p.m. - 3:05 p.m.	Invited Talk (Mrinal Kalakrishnan): Robot Learning & Generalization in the Real World ( Invited Talk ) > SlidesLive Video	Mrinal Kalakrishnan 🔗
Fri 3:05 p.m. - 3:10 p.m.	Q&A for Invited Talk (Mrinal Kalakrishnan) ( Q&A ) >	Mrinal Kalakrishnan 🔗
Fri 3:10 p.m. - 3:35 p.m.	Invited Talk (Xiaolong Wang): Generalizing Dexterous Manipulation by Learning from Humans ( Invited Talk ) > SlidesLive Video	Xiaolong Wang 🔗
Fri 3:35 p.m. - 3:40 p.m.	Q&A for Invited Talk (Xiaolong Wang) ( Q&A ) >	Xiaolong Wang 🔗
Fri 3:40 p.m. - 3:48 p.m.	ManiSkill Challenge Winner Presentation (Silver-Bullet-3D) ( Contributed Talk ) > SlidesLive Video	Yingwei Pan 🔗
Fri 3:48 p.m. - 3:50 p.m.	Break	🔗
Fri 3:50 p.m. - 4:45 p.m.	Poster Session 2 ( Poster Session ) > link Link	🔗
Fri 4:45 p.m. - 5:30 p.m.	ManiSkill Challenge Award Ceremony ( Challenge Award Ceremony ) >	13 presenters Hao Su · Weikang Wan · Hao Shen · He Wang · Yingwei Pan · Zhutian Yang · Fabian Dubois · Tom Sonoda · Kun Wu · Kangqi Ma · Liu Kun · Jilei Hou · Tongzhou Mu 🔗
Fri 5:30 p.m. - 6:30 p.m.	Closing Remarks ( Closing Remarks ) >	🔗
-	PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations ( Poster ) > link Link	Sang Tong · Hongyao Tang · Yi Ma · Jianye HAO · YAN ZHENG · Zhaopeng Meng · Boyan Li · Zhen Wang 🔗
-	Imitation Learning for Generalizable Self-driving Policy with Sim-to-real Transfer ( Poster ) > link Link	Zoltán Lőrincz · Márton Szemenyei · Robert Moni 🔗
-	FlexiBiT: Flexible Inference in Sequential Decision Problems via Bidirectional Transformers ( Poster ) > link Link	11 presenters Micah Carroll · Jessy Lin · Orr Paradise · Raluca Georgescu · Mingfei Sun · David Bignell · Stephanie Milani · Katja Hofmann · Matthew Hausknecht · Anca Dragan · Sam Devlin 🔗
-	Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations ( Poster ) > link Link	Hao Shen · Weikang Wan · He Wang 🔗
-	A Study of Off-Policy Learning in Environments with Procedural Content Generation ( Poster ) > link Link	Andrew Ehrenberg · Robert Kirk · Minqi Jiang · Edward Grefenstette · Tim Rocktaeschel 🔗
-	Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space ( Poster ) > link Link	Kuan Fang · Patrick Yin · Ashvin Nair · Sergey Levine 🔗
-	Learning Transferable Policies By Inferring Agent Morphology ( Poster ) > link Link	Brandon Trabucco · mariano Phielipp · Glen Berseth 🔗
-	Using Deep Learning to Bootstrap Abstractions for Robot Planning ( Poster ) > link Link	Naman Shah · Siddharth Srivastava 🔗
-	Don't Freeze Your Embedding: Lessons from Policy Finetuning in Environment Transfer ( Poster ) > link Link	Victoria Dean · Daniel Toyama · Doina Precup · Victoria Dean 🔗
-	Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning ( Poster ) > link Link	Ugo Lecerf · Christelle Yemdji-Tchassi · Pietro Michiardi 🔗
-	Separating the World and Ego Models for Self-Driving ( Poster ) > link SlidesLive Video Link	Vlad Sobal · Alfredo Canziani · Nicolas Carion · Kyunghyun Cho · Yann LeCun 🔗
-	Multi-objective evolution for Generalizable Policy Gradient Algorithms ( Poster ) > link Link	Juan Jose Garau-Luis · Yingjie Miao · John Co-Reyes · Aaron Parisi · Jie Tan · Esteban Real · Aleksandra Faust 🔗
-	ShiftNorm: On Data Efficiency in Reinforcement Learning with Shift Normalization ( Poster ) > link SlidesLive Video Link	Sicong Liu · Xi Zhang · Yushuo Li · Yifan Zhang · Jian Cheng 🔗
-	Improving performance on the ManiSkill Challenge via Super-convergence and Multi-Task Learning ( Poster ) > link Link	Fabian Dubois · Eric Platon · Tom Sonoda 🔗
-	Multi-task Reinforcement Learning with Task Representation Method ( Poster ) > link Link	Myungsik Cho · Whiyoung Jung · Youngchul Sung 🔗
-	Deep Sequenced Linear Dynamical Systems for Manipulation Policy Learning ( Poster ) > link Link	Mohammad Nomaan Qureshi · Ben Eisner · David Held 🔗
-	Learning Robust Task Context with Hypothetical Analogy-Making ( Poster ) > link Link	Shinyoung Joo · Sang Wan Lee 🔗
-	Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation ( Poster ) > link Link	Yingwei Pan · Yehao Li · Yiheng Zhang · Qi Cai · Fuchen Long · Zhaofan Qiu · Ting Yao · Tao Mei 🔗
-	Zero-Shot Reward Specification via Grounded Natural Language ( Poster ) > link Link	Parsa Mahmoudieh · Deepak Pathak · trevor darrell 🔗
-	Reinforcement Learning for Location-Aware Warehouse Scheduling ( Poster ) > link Link	Stelios Stavroulakis · Biswa Sengupta 🔗
-	A Probabilistic Perspective on Reinforcement Learning via Supervised Learning ( Poster ) > link Link	Alexandre Piche · Rafael Pardinas · David Vazquez · Chris J Pal 🔗
-	Prompts and Pre-Trained Language Models for Offline Reinforcement Learning ( Poster ) > link Link	Denis Tarasov · Vladislav Kurenkov · Sergey Kolesnikov 🔗
-	Compositional Multi-Object Reinforcement Learning with Linear Relation Networks ( Poster ) > link Link	Davide Mambelli · Frederik Träuble · Stefan Bauer · Bernhard Schoelkopf · Francesco Locatello 🔗
-	Density Estimation For Conservative Q-Learning ( Poster ) > link Link	Paul Daoudi · Ludovic Dos Santos · Merwan Barlier · Aladin Virmaux 🔗
-	Control of Two-way Coupled Fluid Systems with Differentiable Solvers ( Poster ) > link SlidesLive Video Link	Brener Ramos · Felix Trost · Nils Thuerey 🔗
-	One-Shot Imitation with Skill Chaining using a Goal-Conditioned Policy in Long-Horizon Control ( Poster ) > link Link	Hayato Watahiki · Yoshimasa Tsuruoka 🔗
-	Versatile Offline Imitation Learning via State-Occupancy Matching ( Poster ) > link Link	Yecheng Jason Ma · Andrew Shen · Dinesh Jayaraman · Osbert Bastani 🔗
-	Let’s Handle It: Generalizable Manipulation of Articulated Objects ( Poster ) > link Link	Zhutian Yang · Aidan Curtis 🔗
-	Revisiting Model-based Value Expansion ( Poster ) > link Link	Daniel Palenicek · Michael Lutter · Jan Peters 🔗
-	An Empirical Study and Analysis of Learning Generalizable Manipulation Skill in the SAPIEN Simulator ( Poster ) > link Link	Liu Kun · Huiyuan Fu · Zheng Zhang · huanpu yin 🔗
-	Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning ( Poster ) > link Link	Denis Yarats · David Brandfonbrener · Hao Liu · Michael Laskin · Pieter Abbeel · Alessandro Lazaric · Lerrel Pinto 🔗
-	Learning Generalizable Dexterous Manipulation from Human Grasp Affordance ( Poster ) > link Link	Yueh-Hua Wu · Jiashun Wang · Xiaolong Wang 🔗
-	Continuous Control on Time ( Poster ) > link Link	Tianwei Ni · Eric Jang · Tianwei Ni 🔗
-	A Minimalist Ensemble Method for Generalizable Offline Deep Reinforcement Learning ( Poster ) > link Link	Kun Wu · Yinuo Zhao · Zhiyuan Xu · Zhen Zhao · Pei Ren · Zhengping Che · Chi Liu · Feifei Feng · Jian Tang 🔗
-	Know Thyself: Transferable Visual Control Policies Through Robot-Awareness ( Poster ) > link Link	Edward Hu · Kun Huang · Oleh Rybkin · Dinesh Jayaraman 🔗
-	Sim-to-Lab-to-Real: Safe RL with Shielding and Generalization Guarantees ( Poster ) > link Link	Kai-Chieh Hsu · Allen Z. Ren · Duy Nguyen · Anirudha Majumdar · Jaime Fernández Fisac 🔗