firstbacksecondback
20 Results
Poster
|
Thu 9:00 |
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks Nikunj Umesh Saunshi · Sadhika Malladi · Sanjeev Arora |
|
Poster
|
Mon 9:00 |
Predicting Inductive Biases of Pre-Trained Models Charles Lovering · Rohan Jha · Tal Linzen · Ellie Pavlick |
|
Poster
|
Wed 17:00 |
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation Jungo Kasai · Nikolaos Pappas · Hao Peng · James Cross · Noah Smith |
|
Poster
|
Mon 17:00 |
MixKD: Towards Efficient Distillation of Large-scale Language Models Kevin Liang · Weituo Hao · Dinghan Shen · Yufan Zhou · Weizhu Chen · Changyou Chen · Lawrence Carin |
|
Poster
|
Tue 17:00 |
Discovering Non-monotonic Autoregressive Orderings with Variational Inference Xuanlin Li · Brandon Trabucco · Dong Huk Park · Michael Luo · Sheng Shen · trevor darrell · Yang Gao |
|
Poster
|
Mon 17:00 |
Taking Notes on the Fly Helps Language Pre-Training Qiyu Wu · Chen Xing · Yatao Li · Guolin Ke · Di He · Tie-Yan Liu |
|
Poster
|
Wed 17:00 |
Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System Jianhong Wang · Yuan Zhang · Tae-Kyun Kim · Yunjie Gu |
|
Poster
|
Thu 9:00 |
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data Jonathan Pilault · Amine EL hattami · Chris J Pal |