Skip to yearly menu bar Skip to main content


Poster Fri, Apr 24, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 3 P3-#513

Universal Model Routing for Efficient LLM Inference

Wittawat Jitkrittum ⋅ Harikrishna Narasimhan ⋅ Ankit Singh Rawat ⋅ Jeevesh Juneja ⋅ Congchao Wang ⋅ Zifeng Wang ⋅ Alec Go ⋅ Chen-Yu Lee ⋅ Pradeep Shenoy ⋅ Rina Panigrahy ⋅ Aditya Krishna Menon ⋅ Sanjiv Kumar

Abstract

Log in and register to view live content