

Poster

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Chubin Chen ⋅ Sujie Hu ⋅ Jiashu Zhu ⋅ Meiqi Wu ⋅ Jintao Chen ⋅ Yanxun Li ⋅ Nisha Huang ⋅ Chengyu Fang ⋅ Jiahong Wu ⋅ Xiangxiang Chu ⋅ Xiu Li

Abstract
