Tutorial Schedule CVPR 2025

The 2nd Point Cloud Tutorial: All You Need To Know About 3D Point Cloud	Full day 6/11
From Video Generation to World Model	Full day 6/11
Scalable Generative Models in Computer Vision	Full day 6/11
Volumetric Video in the Real World	Full day 6/11
Cognitive AI for the Future: Agentic Multimodal Models and RAG for Vision Language Applications, from Training to Deployment	AM 6/11
Foundations of Interpretable AI	AM 6/11
Tackling 3D Deep Learning, Gaussian Splats and Physics Simulation with NVIDIA Kaolin Library, a Hands-On Lab	AM 6/11
Evaluations and Benchmarks in Context of Multimodal LLM	PM 6/11
Multimodal Mathematical Reasoning: Frontiers in Integrating Vision, Language, and Symbolic Representations	PM 6/11
Evaluating Large Multi-modal Models: Challenges and Methods	PM 6/11
Robotics 101: An Odyssey from A Vision Perspective	Full day 6/12
Geospatial Computer Vision and Artificial Intelligence for Large-Scale Earth Observation Data	Full Day 6/12
Sense, Perceive, Interact & Render on Android XR	Full Day 6/12
3D Shape Analysis: From Classical Optimization to Learning-based Matching	Full Day 6/12
Efficient Text-to-Image/Video modeling	AM 6/12
Continuous Data Cycle via Foundation Models	AM 6/12
Edge AI in Action: Technologies and Applications	AM 6/12
Animal re-identification	AM 6/12
Multi-Modal Computer Vision and Foundation Models In Agriculture in conjunction with IEEE CVPR 2025	AM 6/12
Computer Vision over Homomorphically Encrypted Data	AM 6/12
Intelligent Healthcare based on Cameras and Wireless Sensors	PM 6/12
Recent Advances in Vision Foundation Models	PM 6/12
Identifying Structure in Data: All you need to know about Dimensionality Reduction, Clustering and more	PM 6/12
Power-efficient neural networks using low-precision data types and quantization	PM 6/12
Full-Stack, GPU-based Acceleration of Deep Learning and Foundation Models	PM 6/12