Skip to yearly menu bar Skip to main content


(6 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Apr 23 06:30 AM -- 06:40 AM (PDT) @ Amphitheater None
Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling
Shuyang Jiang ⋅ Yusheng Liao ⋅ Ya Zhang ⋅ Yanfeng Wang ⋅ Yu Wang
[ Slides [ OpenReview
Oral
Thu Apr 23 06:42 AM -- 06:52 AM (PDT) @ Amphitheater None
Reducing Belief Deviation in Reinforcement Learning for Active Reasoning of LLM Agents
Deyu Zou ⋅ Yongqiang Chen ⋅ Jianxiang Wang ⋅ Garry YANG ⋅ Mufei Li ⋅ Qing Da ⋅ James Cheng ⋅ Pan Li ⋅ Yu Gong
[ OpenReview
Oral
Thu Apr 23 06:54 AM -- 07:04 AM (PDT) @ Amphitheater None
MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Hongli Yu ⋅ Tinghong Chen ⋅ Jiangtao Feng ⋅ Jiangjie Chen ⋅ Weinan Dai ⋅ Qiying Yu ⋅ Ya-Qin Zhang ⋅ Wei-Ying Ma ⋅ Jingjing Liu ⋅ Mingxuan Wang ⋅ Hao Zhou
[ OpenReview
Oral
Thu Apr 23 07:06 AM -- 07:16 AM (PDT) @ Amphitheater None
Verifying Chain-of-Thought Reasoning via Its Computational Graph
Zheng Zhao ⋅ Yeskendir Koishekenov ⋅ Xianjun Yang ⋅ Naila Murray ⋅ Nicola Cancedda
[ OpenReview
Oral
Thu Apr 23 07:18 AM -- 07:28 AM (PDT) @ Amphitheater None
Revela: Dense Retriever Learning via Language Modeling
Fengyu Cai ⋅ Tong Chen ⋅ Xinran Zhao ⋅ Sihao Chen ⋅ Hongming Zhang ⋅ Sherry Wu ⋅ Iryna Gurevych ⋅ Heinz Koeppl
[ OpenReview
Oral
Thu Apr 23 07:30 AM -- 07:40 AM (PDT) @ Amphitheater None
RAIN-Merging: A Gradient-Free Method to Enhance Instruction Following in Large Reasoning Models with Preserved Thinking Format
Zhehao Huang ⋅ Yuhang Liu ⋅ Baijiong Lin ⋅ Yixin Lou ⋅ Zhengbao He ⋅ Hanling Tian ⋅ Tao Li ⋅ Xiaolin Huang
[ OpenReview