Skip to yearly menu bar Skip to main content


Poster Sat, Jun 14, 2025 • 3:00 PM – 5:00 PM PDT

VideoGLaMM : A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Shehan Munasinghe · Hanan Gani · Wenqi Zhu · Jiale Cao · Eric P. Xing · Fahad Shahbaz Khan · Salman Khan

Abstract

Chat is not available.