Thinking with Blueprints: Assisting Vision–Language Models in Spatial Reasoning via Structured Object Representation
Weijian Ma, Shizhao Sun, Tianyu Yu, Ruiyu Wang, Tat-Seng Chua, Jiang Bian
Keywords:
Vision, Language, and Reasoning
Successful Page Load