Autonomous Driving (AD) is a complex research area. Through the aid of Machine Learning (ML) techniques, AD systems have rapidly developed, yet there are several challenges involved in deploying these ML algorithms to systems in production. Solving these challenges is beyond the capability of any single company or institution, necessitating large-scale communication and collaboration. Our goal is to provide a platform for this at ICLR, and promote the real-world impact of ML research toward self-driving technology. To this end, we propose a workshop titled “Scene Representations for Autonomous Driving” (SR4AD). We have invited a diverse span of keynote speakers (different regions, academic/industry, junior/senior) to contribute to the workshop. In addition, our program includes a call for contributed papers and a call for competition participation around two new and exciting benchmark tasks for mapping and planning. Finally, to encourage broad discussions, we conclude with a panel debate regarding promising future directions.
Thu 11:50 p.m. - 12:00 a.m.
|
Opening Remarks
SlidesLive Video » |
🔗 |
Fri 12:00 a.m. - 12:30 a.m.
|
Scene Understanding beyond the Visible
(
Awesome Talk
)
SlidesLive Video » |
Hang Qiu 🔗 |
Fri 12:30 a.m. - 1:00 a.m.
|
Vision-Centric Autonomous Driving: Perception, Prediction and Mapping
(
Awesome Talk
)
SlidesLive Video » |
Hang Zhao 🔗 |
Fri 1:00 a.m. - 2:00 a.m.
|
Contributions
(
Oral Presentation
)
|
🔗 |
Fri 2:00 a.m. - 2:30 a.m.
|
Coffee Break
|
🔗 |
Fri 2:30 a.m. - 3:00 a.m.
|
Robust Visual Perception for All Domains: Domain Synthesis, Adaptation, and Generalization
(
Awesome Talk
)
SlidesLive Video » Intelligent safety-critical systems such as autonomous cars operate in the complex open world; as such, they must not only deliver excellent performance in their operational design domain, but also be robust to all unexpected inputs caused by extreme weather and lighting conditions, changes of operation domains, and rare but potentially catastrophic situations. In this talk, I will present novel approaches covering the full spectrum of settings for this challenge, ranging from domain synthesis to unsupervised domain adaptation, to test-time domain adaptation, and domain generalization. Our approaches have achieved state-of-the-art performance for a wide variety of domain changes, e.g. cross-datasets, normal-to-adverse, and synthetic-to-real. At the end of the talk, I will briefly present two benchmarks that we have developed for robust visual perception. |
Dengxin Dai 🔗 |
Fri 3:00 a.m. - 3:30 a.m.
|
Towards Generative Photorealistic Simulation
(
Awesome Talk
)
SlidesLive Video » A simulator with diverse photorealistic 3D assets is essential for autonomous driving. While reconstructing real-world scenes is promising for photorealistic simulation, generative models further enable the effortless creation of novel objects and scenes. In this talk, I will present our recent progress in 3D-aware image synthesis of objects, humans, and scenes toward generative simulation. I will introduce several 3D-aware generative models, including VoxGRAF for efficient object generation, VeRi3D for controllable human generation, and UrbanGIRAFFE for compositional and controllable urban scene generation. |
Yiyi Liao 🔗 |
Fri 3:30 a.m. - 4:30 a.m.
|
Lunch Break
|
🔗 |
Fri 4:30 a.m. - 5:00 a.m.
|
Learning a Globally Scalable Driving Intelligence
(
Awesome Talk
)
SlidesLive Video » |
Jamie Shotton 🔗 |
Fri 5:00 a.m. - 6:00 a.m.
|
Contributions
(
Oral Presentation
)
|
🔗 |
Fri 6:00 a.m. - 6:30 a.m.
|
Coffee Break
|
🔗 |
Fri 6:30 a.m. - 7:00 a.m.
|
Optimizing Internal Network Representations for Geometric and Semantic Perception
(
Awesome Talk
)
SlidesLive Video » Dense visual perception of the geometry and semantics of the surrounding scene differs from basic global tasks such as classification in that it requires to distinguish fine spatial details besides aggregating context across the extent of the input. These two conflicting goals are typically pursued through encoder-decoder architectures, which involve an information bottleneck at the interface of the encoder and the decoder. Motivated by the fact that this internal bottleneck marks the limit between context aggregation and fine-grained parsing, we operate on the respective internal representations and optimize them in conjunction with the output representations by introducing dedicated modules and losses in various geometric and semantic perception settings. The talk will include a thorough review of a geometric case and two semantic cases of the above internal representation optimization paradigm. The former case consists in our internal discretization method for dense regression, which has general application on geometric tasks such as supervised monocular depth estimation and surface normal estimation. The latter cases consist in our condition-invariant methods for unsupervised domain adaptation of semantic segmentation models, which introduce feature invariance and cross-domain contrastive losses on the internal network representations. |
Christos Sakaridis 🔗 |
Fri 7:00 a.m. - 7:30 a.m.
|
Secure and Safe Autonomous Driving in Adversarial Environments
(
Awesome Talk
)
SlidesLive Video » Advances in machine learning have led to the rapid and widespread deployment of ML algorithms in safety-critical applications, such as autonomous driving and medical healthcare. Standard machine learning systems, however, assume that training and test data follow the same, or similar, distributions, without explicitly considering active adversaries manipulating either distribution. For instance, our recent work has demonstrated that motivated adversaries can circumvent anomaly detection or other machine learning models at test-time through evasion attacks, or can inject well-crafted malicious instances into training data to induce errors during inference through poisoning attacks. In this talk, I will describe different perspectives of security and safety in machine learning, such as robustness, privacy, generalization, and their underlying interconnections. I will focus on a certifiably robust learning approach based on statistical learning with logical reasoning as an example, and then discuss the principles towards designing and developing practical trustworthy machine learning systems with guarantees, by considering these trustworthiness perspectives in a holistic view. I will also introduce our unified platform SefeBench which generates diverse safety-critical autonomous driving scenarios for safety tests for autonomous vehicles. |
Bo Li 🔗 |
Fri 7:30 a.m. - 8:30 a.m.
|
Panel Discussion
SlidesLive Video » |
🔗 |