6 Results
Type | Time | Title | Authors
Oral | Tue 4:08 | Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator | Max B Paulus · Chris Maddison · Andreas Krause
Poster | Tue 1:00 | Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator | Max B Paulus · Chris Maddison · Andreas Krause
Oral | Wed 3:15 | Rethinking Attention with Performers | Krzysztof Choromanski · Valerii Likhosherstov · David Dohan · Xingyou Song · Georgiana-Andreea Gane · Tamas Sarlos · Peter Hawkins · Jared Q Davis · Afroz Mohiuddin · Lukasz Kaiser · David Belanger · Lucy J Colwell · Adrian Weller
Poster | Tue 9:00 | Rethinking Attention with Performers | Krzysztof Choromanski · Valerii Likhosherstov · David Dohan · Xingyou Song · Georgiana-Andreea Gane · Tamas Sarlos · Peter Hawkins · Jared Q Davis · Afroz Mohiuddin · Lukasz Kaiser · David Belanger · Lucy J Colwell · Adrian Weller
Poster | Wed 1:00 | Knowledge distillation via softmax regression representation learning | Jing Yang · Brais Martinez · Adrian Bulat · Georgios Tzimiropoulos
Poster | Tue 9:00 | Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime | Andrea Agazzi · Jianfeng Lu