Skip to yearly menu bar Skip to main content


Poster

SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs

Mohamad Alansari ⋅ Naufal Suryanto ⋅ Divya Velayudhan ⋅ Sajid Javed ⋅ Naoufel Werghi ⋅ Muzammal Naseer

Abstract

Log in and register to view live content