Skip to yearly menu bar Skip to main content


Poster

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Qi Yang ⋅ Bolin Ni ⋅ Shiming Xiang ⋅ Houwen Peng

Abstract

Log in and register to view live content