Poster

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Shengyuan Ding ⋅ Xinyu Fang ⋅ Ziyu Liu ⋅ Yuhang Zang ⋅ Yuhang Cao ⋅ Xiangyu Zhao ⋅ Haodong Duan ⋅ Xiaoyi Dong ⋅ Jianze Liang ⋅ Bin Wang ⋅ Conghui He ⋅ Dahua Lin ⋅ Jiaqi Wang

Abstract
