ProcessThinker: Enhancing Multi-modal Large Language Models Reasoning via Rollout-based Process Reward

Jingpei Wu ⋅ Xiao Han ⋅ Weixiang Shen ⋅ Boer Zhang ⋅ Zifeng Ding ⋅ Volker Tresp

Abstract
