Skip to yearly menu bar Skip to main content


Poster

Thinking in Dynamics: How Multimodal Large Language Models Perceive, Track, and Reason Dynamics in Physical 4D World

Yuzhi Huang ⋅ Kairun Wen ⋅ Rongxin Gao ⋅ Dongxuan Liu ⋅ Yibin Lou ⋅ Jie Wu ⋅ Jing Xu ⋅ Jian Zhang ⋅ Zheng Yang ⋅ yunlong lin ⋅ Chenxin Li ⋅ Panwang Pan ⋅ Junbin Lu ⋅ Jingyan Jiang ⋅ Xinghao Ding ⋅ Yue Huang ⋅ Zhi Wang

Abstract

Log in and register to view live content