Tutorial
Recent Advances in Vision Foundation Models
Zhengyuan Yang
Abstract:
This tutorial covers cutting-edge developments in vision foundation models. Topics include multimodal understanding and generation, scaling test-time compute, and applications for physical and virtual agents. The session will provide insights into the design and future directions of vision-based foundation models.
Chat is not available.
Successful Page Load