firstbacksecondback
155 Results
Poster
|
Fri 7:30 |
Boosting Vanilla Lightweight Vision Transformers via Re-parameterization Zhentao Tan · Xiaodan Li · Yue Wu · Qi Chu · Le Lu · Nenghai Yu · Jieping Ye |
|
Poster
|
Wed 7:30 |
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Zhiyuan Li · Hong Liu · Denny Zhou · Tengyu Ma |
|
Poster
|
Thu 7:30 |
Learning the greatest common divisor: explaining transformer predictions François Charton |
|
Poster
|
Wed 1:45 |
Diagnosing Transformers: Illuminating Feature Spaces for Clinical Decision-Making Aliyah Hsu · Yeshwanth Cherapanamjeri · Briton Park · Tristan Naumann · Anobel Odisho · Bin Yu |
|
Poster
|
Tue 7:30 |
Vision Transformers Need Registers Timothée Darcet · Maxime Oquab · Julien Mairal · Piotr Bojanowski |
|
Poster
|
Thu 1:45 |
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips Man Yao · Jiakui Hu · Tianxiang Hu · Yifan Xu · Zhaokun Zhou · Yonghong Tian · Bo XU · Guoqi Li |
|
Poster
|
Thu 1:45 |
RingAttention with Blockwise Transformers for Near-Infinite Context Hao Liu · Matei Zaharia · Pieter Abbeel |
|
Poster
|
Fri 7:30 |
Can Transformers Capture Spatial Relations between Objects? Chuan Wen · Dinesh Jayaraman · Yang Gao |
|
Poster
|
Tue 1:45 |
Equivariant Scalar Fields for Molecular Docking with Fast Fourier Transforms Bowen Jing · Tommi Jaakkola · Bonnie Berger |
|
Oral
|
Tue 7:00 |
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions Satwik Bhattamishra · Arkil Patel · Phil Blunsom · Varun Kanade |
|
Oral
|
Fri 1:00 |
Small-scale proxies for large-scale Transformer training instabilities Mitchell Wortsman · Peter Liu · Lechao Xiao · Katie Everett · Alexander Alemi · Ben Adlam · John Co-Reyes · Izzeddin Gur · Abhishek Kumar · Roman Novak · Jeffrey Pennington · Jascha Sohl-Dickstein · Kelvin Xu · Jaehoon Lee · Justin Gilmer · Simon Kornblith |
|
Poster
|
Tue 1:45 |
Linear attention is (maybe) all you need (to understand Transformer optimization) Kwangjun Ahn · Xiang Cheng · Minhak Song · Chulhee Yun · Ali Jadbabaie · Suvrit Sra |