firstbacksecondback
68 Results
Workshop
|
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Yifei Zhou · Andrea Zanette · Jiayi Pan · Aviral Kumar · Sergey Levine |
||
Workshop
|
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal · Tarun Chiruvolu · Devendra Chaplot · Ruslan Salakhutdinov |
||
Poster
|
Tue 7:30 |
Eureka: Human-Level Reward Design via Coding Large Language Models Yecheng Jason Ma · William Liang · Guanzhi Wang · De-An Huang · Osbert Bastani · Dinesh Jayaraman · Yuke Zhu · Jim Fan · anima anandkumar |
|
Poster
|
Thu 7:30 |
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo Haque Ishfaq · Qingfeng Lan · Pan Xu · A. Rupam Mahmood · Doina Precup · anima anandkumar · Kamyar Azizzadenesheli |
|
Poster
|
Thu 1:45 |
In-context Exploration-Exploitation for Reinforcement Learning Zhenwen Dai · Federico Tomasi · Sina Ghiassian |
|
Poster
|
Wed 7:30 |
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning Zihan Ding · Chi Jin |
|
Poster
|
Wed 7:30 |
TD-MPC2: Scalable, Robust World Models for Continuous Control Nicklas Hansen · Hao Su · Xiaolong Wang |
|
Poster
|
Fri 1:45 |
Proper Laplacian Representation Learning Diego Gomez · Michael Bowling · Marlos C. Machado |
|
Poster
|
Wed 1:45 |
Reward Design for Justifiable Sequential Decision-Making Aleksa Sukovic · Goran Radanovic |
|
Poster
|
Fri 7:30 |
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes Ruiquan Huang · Yuan Cheng · Jing Yang · Vincent Tan · Yingbin Liang |
|
Poster
|
Thu 1:45 |
Skill or Luck? Return Decomposition via Advantage Functions Hsiao-Ru Pan · Bernhard Schoelkopf |
|
Workshop
|
LEAGUE++: EMPOWERING CONTINUAL ROBOT LEARNING THROUGH GUIDED SKILL ACQUISITION WITH LARGE LANGUAGE MODELS Zhaoyi Li · Kelin Yu · Shuo Cheng · Danfei Xu |