Poster Sat, Jun 14, 2025 • 3:00 PM – 5:00 PM PDT

ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

Ali Athar · Xueqing Deng · Liang-Chieh Chen

Abstract
