firstbacksecondback
22 Results
Poster
|
Thu 7:30 |
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning Tianbao Xie · Siheng Zhao · Chen Henry Wu · Yitao Liu · Qian Luo · Victor Zhong · Yanchao Yang · Tao Yu |
|
Poster
|
Thu 7:30 |
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models Kyuyoung Kim · Jongheon Jeong · Minyong An · Mohammad Ghavamzadeh · Krishnamurthy Dvijotham · Jinwoo Shin · Kimin Lee |
|
Poster
|
Fri 1:45 |
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community Arman Isajanyan · Artur Shatveryan · David Kocharian · Zhangyang Wang · Humphrey Shi |
|
Poster
|
Tue 7:30 |
Confronting Reward Model Overoptimization with Constrained RLHF Ted Moskovitz · Aaditya Singh · DJ Strouse · Tuomas Sandholm · Ruslan Salakhutdinov · Anca Dragan · Stephen McAleer |
|
Poster
|
Fri 7:30 |
Reward-Free Curricula for Training Robust World Models Marc Rigter · Minqi Jiang · Ingmar Posner |
|
Poster
|
Thu 1:45 |
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning Fan-Ming Luo · Tian Xu · Xingchen Cao · Yang Yu |
|
Workshop
|
West-of-N: Synthetic Preference Generation for Improved Reward Modeling Alizée Pace · Jonathan Mallinson · Eric Malmi · Sebastian Krause · Aliaksei Severyn |
||
Poster
|
Tue 7:30 |
Eureka: Human-Level Reward Design via Coding Large Language Models Yecheng Jason Ma · William Liang · Guanzhi Wang · De-An Huang · Osbert Bastani · Dinesh Jayaraman · Yuke Zhu · Jim Fan · anima anandkumar |
|
Poster
|
Fri 7:30 |
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning Juan Rocamonde · Victoriano Montesinos · Elvis Nava · Ethan Perez · David Lindner |
|
Poster
|
Fri 1:45 |
Incentive-Aware Federated Learning with Training-Time Model Rewards Zhaoxuan Wu · Mohammad Mohammadi Amiri · Ramesh Raskar · Bryan Kian Hsiang Low |