Skip to yearly menu bar Skip to main content


Poster

SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models

Ruosen Zhao ⋅ Zhikang Zhang ⋅ Jialei Xu ⋅ Jiahao Chang ⋅ Dong Chen ⋅ Lingyun Li ⋅ Weijian Sun ⋅ Zizhuang Wei

Abstract

Log in and register to view live content