firstbacksecondback
14 Results
Poster
|
Tue 9:00 |
On the Dynamics of Training Attention Models Haoye Lu · Yongyi Mao · Amiya Nayak |
|
Poster
|
Tue 17:00 |
Attentional Constellation Nets for Few-Shot Learning Weijian Xu · Yifan Xu · Huaijin Wang · Zhuowen Tu |
|
Poster
|
Mon 9:00 |
LambdaNetworks: Modeling long-range Interactions without Attention Irwan Bello |
|
Spotlight
|
Wed 12:48 |
LambdaNetworks: Modeling long-range Interactions without Attention Irwan Bello |
|
Poster
|
Tue 1:00 |
Hyperbolic Neural Networks++ Ryohei Shimizu · YUSUKE Mukuta · Tatsuya Harada |
|
Poster
|
Mon 17:00 |
Deberta: Decoding-Enhanced Bert With Disentangled Attention Pengcheng He · Xiaodong Liu · Jianfeng Gao · Weizhu Chen |
|
Poster
|
Wed 17:00 |
BERTology Meets Biology: Interpreting Attention in Protein Language Models Jesse Vig · Ali Madani · Lav R Varshney · Caiming Xiong · Richard Socher · Nazneen Rajani |
|
Spotlight
|
Thu 20:15 |
Random Feature Attention Hao Peng · Nikolaos Pappas · Dani Yogatama · Roy Schwartz · Noah Smith · Lingpeng Kong |
|
Poster
|
Mon 17:00 |
Random Feature Attention Hao Peng · Nikolaos Pappas · Dani Yogatama · Roy Schwartz · Noah Smith · Lingpeng Kong |
|
Poster
|
Thu 9:00 |
Initialization and Regularization of Factorized Neural Layers Mikhail Khodak · Neil Tenenholtz · Lester Mackey · Nicolo Fusi |
|
Poster
|
Wed 17:00 |
Is Attention Better Than Matrix Decomposition? Zhengyang Geng · Meng-Hao Guo · Hongxu None Chen · Xia Li · Ke Wei · Zhouchen Lin |
|
Poster
|
Mon 1:00 |
MELR: Meta-Learning via Modeling Episode-Level Relationships for Few-Shot Learning Nanyi Fei · Zhiwu Lu · Tao Xiang · Songfang Huang |