| The 2nd Point Cloud Tutorial: All You Need To Know About 3D Point Cloud |
Full day 6/11 |
| From Video Generation to World Model |
Full day 6/11 |
| Scalable Generative Models in Computer Vision |
Full day 6/11 |
| Volumetric Video in the Real World |
Full day 6/11 |
| Cognitive AI for the Future: Agentic Multimodal Models and RAG for Vision Language Applications, from Training to Deployment |
AM 6/11 |
| Foundations of Interpretable AI |
AM 6/11 |
| Tackling 3D Deep Learning, Gaussian Splats and Physics Simulation with NVIDIA Kaolin Library, a Hands-On Lab |
AM 6/11 |
| Evaluations and Benchmarks in Context of Multimodal LLM |
PM 6/11 |
| Multimodal Mathematical Reasoning: Frontiers in Integrating Vision, Language, and Symbolic Representations |
PM 6/11 |
| Evaluating Large Multi-modal Models: Challenges and Methods |
PM 6/11 |
| Robotics 101: An Odyssey from A Vision Perspective |
Full day 6/12 |
| Geospatial Computer Vision and Artificial Intelligence for Large-Scale Earth Observation Data |
Full Day 6/12 |
| Sense, Perceive, Interact & Render on Android XR |
Full Day 6/12 |
| 3D Shape Analysis: From Classical Optimization to Learning-based Matching |
Full Day 6/12 |
| Efficient Text-to-Image/Video modeling |
AM 6/12 |
| Continuous Data Cycle via Foundation Models |
AM 6/12 |
| Edge AI in Action: Technologies and Applications |
AM 6/12 |
| Animal re-identification |
AM 6/12 |
| Multi-Modal Computer Vision and Foundation Models In Agriculture in conjunction with IEEE CVPR 2025 |
AM 6/12 |
| Computer Vision over Homomorphically Encrypted Data |
AM 6/12 |
| Intelligent Healthcare based on Cameras and Wireless Sensors |
PM 6/12 |
| Recent Advances in Vision Foundation Models |
PM 6/12 |
| Identifying Structure in Data: All you need to know about Dimensionality Reduction, Clustering and more |
PM 6/12 |
| Power-efficient neural networks using low-precision data types and quantization |
PM 6/12 |
| Full-Stack, GPU-based Acceleration of Deep Learning and Foundation Models |
PM 6/12 |