firstbacksecondback
36 Results
Oral
|
Wed 6:50 |
DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion Qitian Wu · Chenxiao Yang · Wentao Zhao · Yixuan He · David Wipf · Junchi Yan |
|
Poster
|
Wed 7:30 |
DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion Qitian Wu · Chenxiao Yang · Wentao Zhao · Yixuan He · David Wipf · Junchi Yan |
|
Poster
|
Making Better Decision by Directly Planning in Continuous Control Jinhua Zhu · Yue Wang · Lijun Wu · Tao Qin · Wengang Zhou · Tie-Yan Liu · Houqiang Li |
||
Oral
|
Wed 1:40 |
Model-based Causal Bayesian Optimization Scott Sussex · Anastasia Makarova · Andreas Krause |
|
Poster
|
Wed 2:30 |
Model-based Causal Bayesian Optimization Scott Sussex · Anastasia Makarova · Andreas Krause |
|
Workshop
|
Thu 4:00 |
Why Can GPT Learn In-Context? Language Models Implicitly Perform Gradient Descent as Meta-Optimizers Damai Dai · Yutao Sun · Li Dong · Yaru Hao · Shuming Ma · Zhifang Sui · Furu Wei |
|
Poster
|
Re-parameterizing Your Optimizers rather than Architectures Xiaohan Ding · Honghao Chen · Xiangyu Zhang · Kaiqi Huang · Jungong Han · Guiguang Ding |
||
Poster
|
Wed 2:30 |
An Adaptive Policy to Employ Sharpness-Aware Minimization Weisen JIANG · Hansi Yang · Yu Zhang · James Kwok |
|
Poster
|
Alternating Differentiation for Optimization Layers Haixiang Sun · Ye Shi · Jingya Wang · Hoang Tuan · H. Vincent Poor · Dacheng Tao |
||
Poster
|
Wed 2:30 |
SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication Marco Bornstein · Tahseen Rabbani · Evan Wang · Amrit Bedi · Furong Huang |
|
Workshop
|
Thu 4:00 |
Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs George Pu · Anirudh Jain · Jihan Yin · Russell Kaplan |
|
Poster
|
Mon 2:30 |
Understanding DDPM Latent Codes Through Optimal Transport Valentin Khrulkov · Gleb Ryzhakov · Andrei Chertkov · Ivan Oseledets |