Poster

MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models

Dohwan Ko ⋅ Jinyoung Park ⋅ Seoung Choi ⋅ Sanghyeok Lee ⋅ Seohyun Lee ⋅ Hyunwoo J. Kim

Abstract
