Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

36 Results

<<   <   Page 1 of 3   >   >>
Oral
Wed 6:50 DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion
Qitian Wu · Chenxiao Yang · Wentao Zhao · Yixuan He · David Wipf · Junchi Yan
Poster
Wed 7:30 DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion
Qitian Wu · Chenxiao Yang · Wentao Zhao · Yixuan He · David Wipf · Junchi Yan
Poster
Making Better Decision by Directly Planning in Continuous Control
Jinhua Zhu · Yue Wang · Lijun Wu · Tao Qin · Wengang Zhou · Tie-Yan Liu · Houqiang Li
Oral
Wed 1:40 Model-based Causal Bayesian Optimization
Scott Sussex · Anastasia Makarova · Andreas Krause
Poster
Wed 2:30 Model-based Causal Bayesian Optimization
Scott Sussex · Anastasia Makarova · Andreas Krause
Workshop
Thu 4:00 Why Can GPT Learn In-Context? Language Models Implicitly Perform Gradient Descent as Meta-Optimizers
Damai Dai · Yutao Sun · Li Dong · Yaru Hao · Shuming Ma · Zhifang Sui · Furu Wei
Poster
Re-parameterizing Your Optimizers rather than Architectures
Xiaohan Ding · Honghao Chen · Xiangyu Zhang · Kaiqi Huang · Jungong Han · Guiguang Ding
Poster
Wed 2:30 An Adaptive Policy to Employ Sharpness-Aware Minimization
Weisen JIANG · Hansi Yang · Yu Zhang · James Kwok
Poster
Alternating Differentiation for Optimization Layers
Haixiang Sun · Ye Shi · Jingya Wang · Hoang Tuan · H. Vincent Poor · Dacheng Tao
Poster
Wed 2:30 SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication
Marco Bornstein · Tahseen Rabbani · Evan Wang · Amrit Bedi · Furong Huang
Workshop
Thu 4:00 Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs
George Pu · Anirudh Jain · Jihan Yin · Russell Kaplan
Poster
Mon 2:30 Understanding DDPM Latent Codes Through Optimal Transport
Valentin Khrulkov · Gleb Ryzhakov · Andrei Chertkov · Ivan Oseledets