Tutorial
Recent Advances in Vision Foundation Models
Zhengyuan Yang
401 AB
Abstract:
This tutorial covers cutting-edge developments in vision foundation models. Topics include multimodal understanding and generation, scaling test-time compute, and applications for physical and virtual agents. The session will provide insights into the design and future directions of vision-based foundation models.
Live content is unavailable. Log in and register to view live content