Poster

Stable and Efficient Single-Rollout RL for Multimodal Reasoning

Rui Liu ⋅ Dian Yu ⋅ Lei Ke ⋅ Haolin Liu ⋅ Yujun Zhou ⋅ Zhenwen Liang ⋅ Haitao Mi ⋅ Pratap Tokekar ⋅ Dong Yu

Abstract
