Skip to yearly menu bar Skip to main content


Poster

Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining

Weijun Zhuang ⋅ Yuqing Huang ⋅ Weikang Meng ⋅ Xin Li ⋅ Ming Liu ⋅ Xiaopeng Hong ⋅ Yaowei Wang ⋅ Wangmeng Zuo

Abstract

Log in and register to view live content