

Tutorial Schedule CVPR 2025

 

The 2nd Point Cloud Tutorial: All You Need To Know About 3D Point Cloud Full day 6/11
From Video Generation to World Model Full day 6/11
Scalable Generative Models in Computer Vision Full day 6/11
Volumetric Video in the Real World Full day 6/11
Cognitive AI for the Future: Agentic Multimodal Models and RAG for Vision Language Applications, from Training to Deployment AM 6/11
Foundations of Interpretable AI AM 6/11
Tackling 3D Deep Learning, Gaussian Splats and Physics Simulation with NVIDIA Kaolin Library, a Hands-On Lab AM 6/11
Evaluations and Benchmarks in Context of Multimodal LLM PM 6/11
Multimodal Mathematical Reasoning: Frontiers in Integrating Vision, Language, and Symbolic Representations PM 6/11
Evaluating Large Multi-modal Models: Challenges and Methods PM 6/11
Robotics 101: An Odyssey from A Vision Perspective Full day 6/12
Geospatial Computer Vision and Artificial Intelligence for Large-Scale Earth Observation Data Full day 6/12
Sense, Perceive, Interact & Render on Android XR Full day 6/12
3D Shape Analysis: From Classical Optimization to Learning-based Matching Full day 6/12
Efficient Text-to-Image/Video modeling AM 6/12
Continuous Data Cycle via Foundation Models AM 6/12
Edge AI in Action: Technologies and Applications AM 6/12
Animal re-identification AM 6/12
Multi-Modal Computer Vision and Foundation Models In Agriculture in conjunction with IEEE CVPR 2025 AM 6/12
Computer Vision over Homomorphically Encrypted Data AM 6/12
Intelligent Healthcare based on Cameras and Wireless Sensors PM 6/12
Recent Advances in Vision Foundation Models PM 6/12
Identifying Structure in Data: All you need to know about Dimensionality Reduction, Clustering and more PM 6/12
Power-efficient neural networks using low-precision data types and quantization PM 6/12
Full-Stack, GPU-based Acceleration of Deep Learning and Foundation Models PM 6/12