Poster
|
Fri 1:45
|
AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference
Xuanlei Zhao · Shenggan Cheng · Guangyang LU · Haotian Zhou · Bin Jia · Yang You
|
|
Workshop
|
|
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Jiawei Zhao · Zhenyu Zhang · Beidi Chen · Zhangyang Wang · anima anandkumar · Yuandong Tian
|
|
Poster
|
Tue 1:45
|
ALAM: Averaged Low-Precision Activation for Memory-Efficient Training of Transformer Models
Sunghyeon Woo · SunWoo Lee · Dongsuk Jeon
|
|
Workshop
|
|
Addax: Memory-Efficient Fine-Tuning of Language Models with a Combination of Forward-Backward and Forward-Only Passes
Zeman Li · Xinwei Zhang · Meisam Razaviyayn
|
|
Workshop
|
|
On the Representation Gap Between Modern RNNs and Transformers: The Curse of Memory Efficiency and the Fix of In-Context Retrieval
Kaiyue Wen · Xingyu Dang · Kaifeng Lyu
|
|
Poster
|
Tue 7:30
|
Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning
Na Li · Yuchen Jiao · Hangguan Shan · Shefeng Yan
|
|
Workshop
|
|
DEFT: FLASH TREE-ATTENTION WITH IO-AWARENESS FOR EFFICIENT TREE-SEARCH-BASED LLM INFERENCE
Jinwei Yao · Kexun Zhang · Kaiqi Chen · Jiaxuan You · Zeke Wang · Binhang Yuan · Tao Lin
|
|
Affinity Workshop
|
Thu 7:30
|
Analog In-Memory Computing with Uncertainty Quantification for Efficient Edge-based Medical Imaging Segmentation
Imane Hamzaoui · Hadjer Benmeziane · Zayneb Cherif · Kaoutar El Maghraoui
|
|
Affinity Workshop
|
|
Analog In-Memory Computing with Uncertainty Quantification for Efficient Edge-based Medical Imaging Segmentation
Imane Hamzaoui · Hadjer Benmeziane · Zayneb Cherif · Kaoutar El Maghraoui
|
|
Workshop
|
|
On the Representation Gap Between Modern RNNs and Transformers: The Curse of Memory Efficiency and the Fix of In-Context Retrieval
Kaiyue Wen · Xingyu Dang · Kaifeng Lyu
|
|
Oral
|
Wed 6:45
|
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na · Yunkyeong Seo · Il-chul Moon
|
|
Poster
|
Thu 7:30
|
On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks
Zi Wang · Bin Hu · Aaron Havens · Alexandre Araujo · Yang Zheng · Yudong Chen · Somesh Jha
|
|