Skip to yearly menu bar Skip to main content


Poster Thu, Apr 23, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 4 P4-#4804

Towards High Data Efficiency in Reinforcement Learning with Verifiable Reward

Xinyu Tang ⋅ Zhenduo Zhang ⋅ Yurou Liu ⋅ Xin Zhao ⋅ zujie wen ⋅ Zhiqiang Zhang ⋅ JUN ZHOU

Abstract

Log in and register to view live content