The 2nd Point Cloud Tutorial: All You Need To Know About 3D Point Cloud |
Full day 6/11 |
From Video Generation to World Model |
Full day 6/11 |
Scalable Generative Models in Computer Vision |
Full day 6/11 |
Volumetric Video in the Real World |
Full day 6/11 |
Cognitive AI for the Future: Agentic Multimodal Models and RAG for Vision Language Applications, from Training to Deployment |
AM 6/11 |
Foundations of Interpretable AI |
AM 6/11 |
Tackling 3D Deep Learning, Gaussian Splats and Physics Simulation with NVIDIA Kaolin Library, a Hands-On Lab |
AM 6/11 |
Evaluations and Benchmarks in Context of Multimodal LLM |
PM 6/11 |
Multimodal Mathematical Reasoning: Frontiers in Integrating Vision, Language, and Symbolic Representations |
PM 6/11 |
Evaluating Large Multi-modal Models: Challenges and Methods |
PM 6/11 |
Robotics 101: An Odyssey from A Vision Perspective |
Full day 6/12 |
Geospatial Computer Vision and Artificial Intelligence for Large-Scale Earth Observation Data |
Full Day 6/12 |
Sense, Perceive, Interact & Render on Android XR |
Full Day 6/12 |
3D Shape Analysis: From Classical Optimization to Learning-based Matching |
Full Day 6/12 |
Efficient Text-to-Image/Video modeling |
AM 6/12 |
Continuous Data Cycle via Foundation Models |
AM 6/12 |
Edge AI in Action: Technologies and Applications |
AM 6/12 |
Animal re-identification |
AM 6/12 |
Multi-Modal Computer Vision and Foundation Models In Agriculture in conjunction with IEEE CVPR 2025 |
AM 6/12 |
Computer Vision over Homomorphically Encrypted Data |
AM 6/12 |
Intelligent Healthcare based on Cameras and Wireless Sensors |
PM 6/12 |
Recent Advances in Vision Foundation Models |
PM 6/12 |
Identifying Structure in Data: All you need to know about Dimensionality Reduction, Clustering and more |
PM 6/12 |
Power-efficient neural networks using low-precision data types and quantization |
PM 6/12 |
Full-Stack, GPU-based Acceleration of Deep Learning and Foundation Models |
PM 6/12 |