Skip to yearly menu bar Skip to main content


Poster

Where Does Vision Meet Language? Understanding and Refining Visual Fusion in MLLMs via Contrastive Attention

Shezheng Song ⋅ Shasha Li ⋅ Shan Zhao ⋅ Xiaopeng Li ⋅ Qian Wan ⋅ Chengyu Wang ⋅ Tianwei Yan ⋅ Ma Jun ⋅ Jie Yu

Abstract

Log in and register to view live content