Poster
|
Mon 1:00 |
Towards Robustness Against Natural Language Word Substitutions Xinshuai Dong · Anh Tuan Luu · Rongrong Ji · Hong Liu |
|
Poster
|
Mon 9:00 |
Rethinking Embedding Coupling in Pre-trained Language Models Hyung Won Chung · Thibault Fevry · Henry Tsai · Melvin Johnson · Sebastian Ruder |
|
Poster
|
Mon 9:00 |
Learning from others' mistakes: Avoiding dataset biases without modeling them Victor Sanh · Thomas Wolf · Yonatan Belinkov · Alexander M Rush |
|
Poster
|
Mon 9:00 |
Predicting Inductive Biases of Pre-Trained Models Charles Lovering · Rohan Jha · Tal Linzen · Ellie Pavlick |
|
Poster
|
Mon 17:00 |
Rethinking Positional Encoding in Language Pre-training Guolin Ke · Di He · Tie-Yan Liu |
|
Poster
|
Mon 17:00 |
MixKD: Towards Efficient Distillation of Large-scale Language Models Kevin Liang · Weituo Hao · Dinghan Shen · Yufan Zhou · Weizhu Chen · Changyou Chen · Lawrence Carin |
|
Poster
|
Mon 17:00 |
Deberta: Decoding-Enhanced Bert With Disentangled Attention Pengcheng He · Xiaodong Liu · Jianfeng Gao · Weizhu Chen |
|
Poster
|
Mon 17:00 |
Taking Notes on the Fly Helps Language Pre-Training Qiyu Wu · Chen Xing · Yatao Li · Guolin Ke · Di He · Tie-Yan Liu |
|
Poster
|
Tue 1:00 |
Monte-Carlo Planning and Learning with Language Action Value Estimates Youngsoo Jang · Seokin Seo · Jongmin Lee · Kee-Eung Kim |
|
Poster
|
Tue 9:00 |
Mapping the Timescale Organization of Neural Language Models Hsiang-Yun Sherry Chien · Jinhan Zhang · Christopher Honey |
|
Poster
|
Tue 17:00 |
Discovering Non-monotonic Autoregressive Orderings with Variational Inference Xuanlin Li · Brandon Trabucco · Dong Huk Park · Michael Luo · Sheng Shen · trevor darrell · Yang Gao |
|
Poster
|
Wed 17:00 |
Efficient Conformal Prediction via Cascaded Inference with Expanded Admission Adam Fisch · Tal Schuster · Tommi Jaakkola · Regina Barzilay |