Skip to yearly menu bar Skip to main content


(6 events)   Timezone:  
Show all
The 2026 schedule is still incomplete
Toggle Poster Visibility
Oral
Fri Apr 24 11:15 AM -- 11:25 AM (PDT) None
ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models
Akshat Ramachandran · Marina Neseem · Charbel Sakr · Rangharajan Venkatesan · Brucek Khailany · Tushar Krishna
[ OpenReview
Oral
Fri Apr 24 11:27 AM -- 11:37 AM (PDT) None
MrRoPE: Mixed-radix Rotary Position Embedding
Qingyuan Tian · Wenhong Zhu · Xiaoran Liu · Xiaofeng Wang · Rui Wang
[ OpenReview
Oral
Fri Apr 24 11:39 AM -- 11:49 AM (PDT) None
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Ang Lv · Jin Ma · Yiyuan Ma · Siyuan Qiao
[ Slides [ OpenReview
Oral
Fri Apr 24 11:51 AM -- 12:01 PM (PDT) None
FlashRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models
Federico Danieli · Pau Rodriguez · Miguel Sarabia · Xavier Suau · Luca Zappella
[ OpenReview
Oral
Fri Apr 24 12:03 PM -- 12:13 PM (PDT) None
Mamba-3: Improved Sequence Modeling using State Space Principles
Aakash Sunil Lahoti · Kevin Li · Berlin Chen · Caitlin Wang · Aviv Bick · Zico Kolter · Tri Dao · Albert Gu
[ OpenReview
Oral
Fri Apr 24 12:15 PM -- 12:25 PM (PDT) None
Energy-Based Transformers are Scalable Learners and Thinkers
Alexi Gladstone · Ganesh Nanduru · Md Mofijul Islam · Peixuan Han · Hyeonjeong Ha · Aman Chadha · Yilun Du · Heng Ji · Jundong Li · Tariq Iqbal
[ OpenReview