Poster Sat, Apr 25, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#4706

Text2Grad: Reinforcement Learning from Natural Language Feedback

Hanyang Wang ⋅ Lu Wang ⋅ Chaoyun Zhang ⋅ Tianjun Mao ⋅ Si Qin ⋅ Qingwei Lin ⋅ Saravan Rajmohan ⋅ Dongmei Zhang

Abstract
