The Road to Convergence: Evolution of Unified Multimodal Models
Jindong Wang · Hao Chen · Jiakui Hu · Zhaolong Su · Sharon Li
Abstract
We trace the evolution of multimodal AI from isolated, task-specific expertise to Unified Multimodal Models (UMMs). We introduce the core motivations driving unification, particularly the mutual reinforcement between understanding and generation, and provide a rigorous definition of UMMs.