Skip to yearly menu bar Skip to main content


Poster

DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage

Haowen Gao · zhenyu zhang · Liang Pang · Fangda Guo · hongjian dou · Guannan Lv · ShaoGuo Liu · Tingting Gao · Huawei Shen · Xueqi Cheng

Abstract

Log in and register to view live content