Skip to yearly menu bar Skip to main content


Tutorial

Recent Advances in Vision Foundation Models

Zhengyuan Yang

401 AB
[ ] [ Project Page ]
Thu 12 Jun 1 p.m. CDT — 5 p.m. CDT

Abstract:

This tutorial covers cutting-edge developments in vision foundation models. Topics include multimodal understanding and generation, scaling test-time compute, and applications for physical and virtual agents. The session will provide insights into the design and future directions of vision-based foundation models.

Chat is not available.