Skip to yearly menu bar Skip to main content


Poster

AutoTool: Automatic Scaling of Tool-Use Capabilities in RL via Decoupled Entropy Constraints

Yirong Zeng · Xiao Ding · Yufei Liu · Yuxian Wang · Qunyao Du · Yutai Hou · Wu Ning · Haonan Song · Duyu Tang · Dandan Tu · Bing Qin · Ting Liu

Abstract

Log in and register to view live content