Poster

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

Antoine Yang ⋅ Arsha Nagrani ⋅ Paul Hongsuck Seo ⋅ Antoine Miech ⋅ Jordi Pont-Tuset ⋅ Ivan Laptev ⋅ Josef Sivic ⋅ Cordelia Schmid
2023 Poster

Abstract
