Skip to yearly menu bar Skip to main content


Poster

HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models

Khalequzzaman Chowdhury Sayem ⋅ Mubarrat Chowdhury ⋅ Yihalem Yimolal Tiruneh ⋅ Muneeb Ahmed Khan ⋅ Muhammad Salman Ali ⋅ Binod Bhattarai ⋅ Seungryul Baek

Abstract

Log in and register to view live content