Poster Fri, Apr 24, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 4 P4-#4915

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Yimeng Zhang ⋅ Tian Wang ⋅ Jiri Gesi ⋅ Ziyi Wang ⋅ Yuxuan Lu ⋅ Jiacheng Lin ⋅ Simon Zhan ⋅ Vianne Gao ⋅ Ruochen Jiao ⋅ Junze Liu ⋅ Kun Qian ⋅ Yuxin Tang ⋅ Ran Xue ⋅ Houyu Zhang ⋅ Qingjun Cui ⋅ Yufan Guo ⋅ Dakuo Wang

Abstract
