Skip to yearly menu bar Skip to main content


Poster

Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models

Gengwei Zhang ⋅ Jie Peng ⋅ Zhen Tan ⋅ Mufan Qiu ⋅ Hossein Nourkhiz Mahjoub ⋅ Vaishnav Tadiparthi ⋅ Kwonjoon Lee ⋅ Yanyong Zhang ⋅ Tianlong Chen

Abstract

Log in and register to view live content