Tutorial

Recent Advances in Vision Foundation Models

Zhengyuan Yang

2025 Tutorial

Project Page

Abstract

This tutorial covers cutting-edge developments in vision foundation models. Topics include multimodal understanding and generation, scaling test-time compute, and applications for physical and virtual agents. The session will provide insights into the design and future directions of vision-based foundation models.

Chat is not available.