Oral in Workshop: Workshop on Logical Reasoning of Large Language Models
Sun, Apr 26, 2026 • 8:10 AM – 8:30 AM PDT

Learning Reasoning Reward Models from Expert Demonstration via Inverse Reinforcement Learning

Claudio Fanconi ⋅ Nicolás Astorga ⋅ Mihaela van der Schaar

