Poster

Why Does RL Generalize Better Than SFT? A Data-Centric Perspective on VLM Post-Training

Aojun Lu ⋅ Tao Feng ⋅ Hangjie Yuan ⋅ Wei Li ⋅ Yanan Sun

Abstract
