Skip to yearly menu bar Skip to main content


(6 events)   Timezone:  
Show all
The 2026 schedule is still incomplete
Toggle Poster Visibility
Oral
Thu Apr 23 06:30 AM -- 06:40 AM (PDT) None
Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling
Shuyang Jiang · Yusheng Liao · Ya Zhang · Yanfeng Wang · Yu Wang
[ Slides [ OpenReview
Oral
Thu Apr 23 06:42 AM -- 06:52 AM (PDT) None
$\mathbf{T^3}$: Reducing Belief Deviation in Reinforcement Learning for Active Reasoning
Deyu Zou · Yongqiang Chen · Jianxiang Wang · Garry YANG · Mufei Li · James Cheng · Yu Gong · Pan Li · Qing Da
[ OpenReview
Oral
Thu Apr 23 06:54 AM -- 07:04 AM (PDT) None
MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
Hongli Yu · Tinghong Chen · Jiangtao Feng · Jiangjie Chen · Weinan Dai · Qiying Yu · Ya-Qin Zhang · Wei-Ying Ma · Jingjing Liu · Mingxuan Wang · Hao Zhou
[ OpenReview
Oral
Thu Apr 23 07:06 AM -- 07:16 AM (PDT) None
Verifying Chain-of-Thought Reasoning via its Computational Graph
Zheng Zhao · Yeskendir Koishekenov · Xianjun Yang · Naila Murray · Nicola Cancedda
[ OpenReview
Oral
Thu Apr 23 07:18 AM -- 07:28 AM (PDT) None
Revela: Dense Retriever Learning via Language Modeling
Fengyu Cai · Tong Chen · Xinran Zhao · Sihao Chen · Hongming Zhang · Sherry Wu · Iryna Gurevych · Heinz Koeppl
[ OpenReview
Oral
Thu Apr 23 07:30 AM -- 07:40 AM (PDT) None
RAIN-Merging: A Gradient-Free Method to Enhance Instruction Following in Large Reasoning Models with Preserved Thinking Format
Zhehao Huang · Yuhang Liu · Baijiong Lin · Yixin Lou · Zhengbao He · Hanling Tian · Tao Li · Xiaolin Huang
[ OpenReview