12 Results
Type | Time | Title | Authors
Poster | Thu 9:00 | Variational Information Bottleneck for Effective Low-Resource Fine-Tuning | Rabeeh Karimi Mahabadi · Yonatan Belinkov · James Henderson
Poster | Wed 9:00 | Evaluation of Neural Architectures Trained With Square Loss vs Cross-Entropy in Classification Tasks | Like Hui · Misha Belkin
Spotlight | Wed 19:25 | Large Scale Image Completion via Co-Modulated Generative Adversarial Networks | Shengyu Zhao · Jonathan Cui · Yilun Sheng · Yue Dong · Xiao Liang · Eric Chang · Yan Xu
Poster | Thu 17:00 | Large Scale Image Completion via Co-Modulated Generative Adversarial Networks | Shengyu Zhao · Jonathan Cui · Yilun Sheng · Yue Dong · Xiao Liang · Eric Chang · Yan Xu
Poster | Wed 1:00 | An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | Alexey Dosovitskiy · Lucas Beyer · Alexander Kolesnikov · Dirk Weissenborn · Xiaohua Zhai · Thomas Unterthiner · Mostafa Dehghani · Matthias Minderer · Georg Heigold · Sylvain Gelly · Jakob Uszkoreit · Neil Houlsby
Oral | Wed 3:00 | An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | Alexey Dosovitskiy · Lucas Beyer · Alexander Kolesnikov · Dirk Weissenborn · Xiaohua Zhai · Thomas Unterthiner · Mostafa Dehghani · Matthias Minderer · Georg Heigold · Sylvain Gelly · Jakob Uszkoreit · Neil Houlsby
Poster | Mon 17:00 | MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training | Beidi Chen · Zichang Liu · Binghui Peng · Zhaozhuo Xu · Jonathan L Li · Tri Dao · Zhao Song · Anshumali Shrivastava · Christopher Re
Oral | Tue 21:18 | MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training | Beidi Chen · Zichang Liu · Binghui Peng · Zhaozhuo Xu · Jonathan L Li · Tri Dao · Zhao Song · Anshumali Shrivastava · Christopher Re
Poster | Wed 9:00 | Witches' Brew: Industrial Scale Data Poisoning via Gradient Matching | Jonas Geiping · Liam H Fowl · Ronny Huang · Wojciech Czaja · Gavin Taylor · Michael Moeller · Tom Goldstein
Poster | Mon 17:00 | MixKD: Towards Efficient Distillation of Large-scale Language Models | Kevin Liang · Weituo Hao · Dinghan Shen · Yufan Zhou · Weizhu Chen · Changyou Chen · Lawrence Carin
Poster | Thu 1:00 | What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study | Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem
Oral | Thu 3:00 | What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study | Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem