Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis
Helu Zhi, Jingjing Huang, Wang Xu, Yangbin Xu, Yibin Huang, Wanyue Zhang, Baoyang Jiang, Shirui Deng, Liang Zhu, FangFang Li, Tiejun Zhao, Yankai Lin, Yuan Yao
Keywords:
Vision, Language, and Reasoning
Successful Page Load