Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

14 Results

<<   <   Page 1 of 2   >   >>
Poster
Fri 1:45 AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference
Xuanlei Zhao · Shenggan Cheng · Guangyang LU · Haotian Zhou · Bin Jia · Yang You
Workshop
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Jiawei Zhao · Zhenyu Zhang · Beidi Chen · Zhangyang Wang · anima anandkumar · Yuandong Tian
Poster
Tue 1:45 ALAM: Averaged Low-Precision Activation for Memory-Efficient Training of Transformer Models
Sunghyeon Woo · SunWoo Lee · Dongsuk Jeon
Workshop
Addax: Memory-Efficient Fine-Tuning of Language Models with a Combination of Forward-Backward and Forward-Only Passes
Zeman Li · Xinwei Zhang · Meisam Razaviyayn
Workshop
On the Representation Gap Between Modern RNNs and Transformers: The Curse of Memory Efficiency and the Fix of In-Context Retrieval
Kaiyue Wen · Xingyu Dang · Kaifeng Lyu
Poster
Tue 7:30 Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning
Na Li · Yuchen Jiao · Hangguan Shan · Shefeng Yan
Workshop
DEFT: FLASH TREE-ATTENTION WITH IO-AWARENESS FOR EFFICIENT TREE-SEARCH-BASED LLM INFERENCE
Jinwei Yao · Kexun Zhang · Kaiqi Chen · Jiaxuan You · Zeke Wang · Binhang Yuan · Tao Lin
Affinity Workshop
Thu 7:30 Analog In-Memory Computing with Uncertainty Quantification for Efficient Edge-based Medical Imaging Segmentation
Imane Hamzaoui · Hadjer Benmeziane · Zayneb Cherif · Kaoutar El Maghraoui
Affinity Workshop
Analog In-Memory Computing with Uncertainty Quantification for Efficient Edge-based Medical Imaging Segmentation
Imane Hamzaoui · Hadjer Benmeziane · Zayneb Cherif · Kaoutar El Maghraoui
Workshop
On the Representation Gap Between Modern RNNs and Transformers: The Curse of Memory Efficiency and the Fix of In-Context Retrieval
Kaiyue Wen · Xingyu Dang · Kaifeng Lyu
Oral
Wed 6:45 Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na · Yunkyeong Seo · Il-chul Moon
Poster
Thu 7:30 On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks
Zi Wang · Bin Hu · Aaron Havens · Alexandre Araujo · Yang Zheng · Yudong Chen · Somesh Jha