Skip to yearly menu bar Skip to main content


Poster

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

Tao Chen ⋅ Kun Zhang ⋅ Qiong Wu ⋅ Xiao Chen ⋅ Chao Chang ⋅ Xiaoshuai Sun ⋅ Yiyi Zhou ⋅ Rongrong Ji

Abstract

Log in and register to view live content