Skip to yearly menu bar Skip to main content


RLMedusa: Reinforcement Learning for Multiple Decoding Heads to Accelerate LLM Inference

Aadit Juneja · Parsa Idehpour

Abstract

Chat is not available.