Skip to yearly menu bar Skip to main content


Poster

TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding

Boshen Xu ⋅ Zihan Xiao ⋅ Jiaze Li ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Jian Luan ⋅ Qin Jin

Abstract

Log in and register to view live content