Workshops
The 1st Workshop on Deployment of Foundation Models for Embodied AI
Burhan Yaman ⋅ Xin Ye
View full details
Third Joint Egocentric Vision (EgoVis) Workshop
Siddhant Bansal ⋅ Tushar Nagarajan
View full details
The 3rd AI for Visual Arts Workshop and Challenges
Deblina Bhattacharjee ⋅ Bahar Aydemir
View full details
Generative AI for XR and Identity-based Applications
Brendan David-John ⋅ Chris Thomas
View full details
AI for Creative Visual Content Generation, Editing and Understanding
Ozgur Kara ⋅ Junho Kim
View full details
Bridging Vision, Language, and Action: What’s Missing in Actionable Visual Perception for Robotics
Jiawei Ma ⋅ Chengzhi Mao
View full details
The 2nd Workshop on Multimodal Spatial Intelligence
Juil Koo ⋅ Phillip Y. Lee
View full details
4th Workshop on Vision Based Industrial Inspection
Shancong Mou ⋅ Hao Yan
View full details
The 2nd International Workshop & Challenge on Subtle Visual Computing @CVPR 2026
Zitong Yu ⋅ Xun Lin
View full details
LatinX in Computer Vision Research Workshop
Ana Maria Quintero ⋅ William de Lima
View full details
Autonomous Understanding Through Open-world Perception and Integrated Language models for On-road Tasks
Ali AlShami ⋅ Ryan Rabinowitz
View full details
The 5th Workshop on Federated Learning for Computer Vision
Chen Chen ⋅ Guangyu Sun
View full details
From Lab Demos to Daily Tasks: Embodied Intelligence in the Wild
Huijie Wang ⋅ Hongyang Li
View full details
22nd Workshop on Perception Beyond the Visible Spectrum
Riad I. Hammoud ⋅ Yi Ding
View full details
Sense of Space: Multi-Sensory Modeling for Embodied Intelligence
Rao Fu ⋅ Li Guan
View full details
Workshop on World Models Meet Active Sensing and Closed-Loop Planning
Jieneng Chen ⋅ Alan Yuille
View full details
AERO-HPR: Human Perception and Recognition in Aerial Surveillance
Kien Nguyen Thanh ⋅ Arun Ross
View full details
2nd Workshop on Photorealistic 3D Head Avatars
Tobias Kirschstein ⋅ Simon Giebenhain
View full details
PHAROS AI Factory for Medical Imaging & Healthcare
Stefanos Kollias ⋅ Xujiong Ye
View full details
Workshop on Vision-based Assistants in the Real-World
Apratim Bhattacharyya ⋅ Fadime Sener
View full details
Proposal for 12th Workshop on Medical Computer Vision, CVPR 2026
Zongwei Zhou ⋅ Yucheng Tang
View full details
The 3rd Workshop on Foundation Models for Medical Vision
Jun Ma ⋅ Yuyin Zhou
View full details
The 3rd Workshop on AI for Content Generation, Quality Enhancement and Streaming
Marcos V. Conde ⋅ Radu Timofte
View full details
Multimodal Foundation Models for Biomedicine: Challenges and Opportunities
Yuhui Zhang ⋅ Xiaohan Wang
View full details
1st Workshop on Video World Models: Interaction, Memory, and Efficiency
Jiwen Yu ⋅ Xihui Liu
View full details
The 5th Explainable AI for Computer Vision (XAI4CV) Workshop
Miguel-Ángel Fernández-Torres
Computer vision for high-stakes, real-world applications necessitates robust explanation and transparency to ensure trust, accountability, and ethical deployment. Celebrating its 5th Anniversary, the Explainable AI for Computer Vision (XAI4CV) workshop provides a premier forum for the entire spectrum of XAI research, from interpretable-by-design models to challenges in multimodal foundational models. The program includes invited talks, spotlight papers, a poster session, and a tutorial. XAI4CV accepts paper and demo submissions to define the future of trustworthy visual AI.
Show more
Multimodal Alignment for a Pluralistic Society
Perampalli Shravan Nayak ⋅ Aishwarya Agrawal
View full details
AI4RWC: The 2nd International Workshop on Vision Intelligence for Real-world Challenges
Daqian Shi ⋅ Xiaolei Diao
View full details
GRAIL-V: Grounded Retrieval & Agentic Intelligence for Vision-Language
Amit Agarwal ⋅ Vivek. Gupta
View full details
Urban Scene Modeling: Structured, Semantic, and Synthetic 3D Habitats
Jack Langerman ⋅ Ruisheng Wang
View full details
The 3rd Workshop on Human Motion Generation - New Perspective on Simulation, Animation, and VR applications
Chuan Guo ⋅ Yuxuan Mu
View full details
The Second CVPR Workshop on Foundation and Large Vision Models in Remote Sensing (MORSE)
Saurabh Prasad ⋅ Jocelyn Chanussot
View full details
13th Workshop on Fine-grained Visual Categorization
Nico Lang ⋅ Lukas Picek
View full details
Workshop Proposal: AI-assisted Long Video Creation
Yudong Jiang ⋅ Lisai Zhang
View full details
3rd Workshop on Efficient and On-Device Generation (EDGE), CVPR 2026
Felix Juefei-Xu ⋅ Tingbo Hou
View full details
Auto-Annotation with Expert-Crafted Guidelines
Shu Kong ⋅ Sara Beery
Machine-learned visual systems are transforming numerous fields such as autonomous driving, biodiversity assessment, and ecological monitoring, but they hunger for vast, high-quality annotated data. Asking domain experts to manually annotate large-scale data is unrealistic; the current paradigm to scale up data annotation is to have domain experts craft annotation guidelines using visual examples and descriptions for non-expert annotators to apply. This paradigm is commonly adopted by companies which provide data labeling services. Lacking domain knowledge, ordinary annotators often produce annotations that are erroneous, subjective, biased, and inconsistent. Further, this process is labor-intensive, tedious, and costly. This workshop aims to pioneer auto-annotation, developing AI agents that can interpret expert-crafted annotation guidelines and generate labels automatically. In essence, we seek to replace ordinary human annotators with AI.
Show more
The 5th Workshop on “What is Next in Multimodal Foundation Models?”
Edson Araujo ⋅ Roei Herzig
View full details
The 2nd 3D-LLM/VLA Workshop: Bridging Language, Vision and Action in 3D Environments
Yining Hong ⋅ Wenbo hu
View full details
Workshop on Multimodal Human Motion Analysis
Olivia Nocentini ⋅ Rishabh Dabral
View full details
The 2nd Workshop on Test-time Scaling for Computer Vision
Yinpeng Dong ⋅ Yichi Zhang
View full details
GigaBrain Challenge 2026: Workshop on World Models Empowering Vision Language Action Model
Zheng Zhu ⋅ Xiaofeng Wang
View full details
DataMFM: Emerging Directions in Data for Multimodal Foundation Models
Pengyuan Li ⋅ Zihan Wang
View full details
2nd Workshop on Multimodal Sign Language Recognition
Raffaele Mineo ⋅ Hamzah Luqman
MSLR 2026 is the second edition of a rapidly growing venue on multimodal sign language recognition and translation. The program combines invited talks, a peer-reviewed track published in CVPR Workshops, and the SignEval Challenge featuring updated datasets for isolated LIS and continuous SLR. We emphasize privacy-preserving sensing (e.g., radar), healthcare accessibility, and inclusive practices with sign interpreters. Building on the success at ICCV 2025, MSLR 2026 will consolidate a global, interdisciplinary community spanning computer vision, linguistics, healthcare, and Deaf studies.
Show more
10th Affective & Behavior Analysis in-the-wild
Dimitrios Kollias ⋅ Panagiotis Tzirakis
View full details
The 5th Workshop on Transformers for Vision and Multimodal AI
Gedas Bertasius ⋅ Zhiding Yu
View full details
Synthetic & Adversarial ForEnsics
Josué Martínez-Martínez ⋅ Pooya Khorrami
View full details
Cognitive Foundations for Multimodal Models
Aditya Chinchure ⋅ Sahithya Ravi
View full details
Authenticity & Provenance in the age of Generative AI
Shruti Agarwal ⋅ Sarah Barrington
View full details
Computer Vision with Small Data: Beyond Scale -- Toward Data-Efficient Dynamically-Aware Video Intelligence
Sarah Ostadabbas ⋅ Shayda Moezzi
View full details
The 7th International Workshop and CVML Challenge on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture
Chris Padwick ⋅ Ripudaman Arora
View full details
1st Workshop on Multi-Agent Robotic Systems: Scaling with Compositional Intelligence
Yiran Qin ⋅ Zhenfei Yin
View full details
Sixth Workshop on Neural Architecture Search
Stephen McGough ⋅ Amir Atapour-Abarghouei
View full details
The 1st Workshop on Vision for Intelligent Task Assistants
Ehsan Elhamifar ⋅ Jason J. Corso
View full details
3rd Workshop on ScanNet++ Novel View Synthesis and 3D Semantic Understanding Challenge
Angela Dai ⋅ Matthias Nießner
View full details
The 1st Workshop on Monitoring the World through an Imperfect Lens
Miriam Cha ⋅ Greg Angelides
View full details
Second Workshop on Foundation and Generative Models in Biometrics
Hatef Otroshi Shahreza ⋅ Vitomir Struc
View full details
Rediscovering Intelligence: Can AI Still Learn from Humans?
Xi Wang ⋅ Yen-Ling Kuo
View full details
Spatial Intelligence for Cultural Heritage
Marina Paolanti ⋅ Roberto Pierdicca
View full details
OpenSUN3D: 6th Workshop on Open-World 3D Scene Understanding with Foundation Models
Francis Engelmann ⋅ Anna-Maria Halacheva
View full details
The Eighth Workshop on Precognition: Seeing through the Future
Khoa Luu ⋅ Nemanja Djuric
View full details
Unified Robotic Vision with Cross-Modal Sensing and Alignment
Zongwei Wu ⋅ Christos Sakaridis
View full details
From Perception to Persuasion: Challenges and Advances in Misinformation Detection in Society
PRIYANKA SINGH ⋅ Xue Li
View full details
The 8th UG2+ Workshop and Challenge: Bridging the Gap between Computational Photography and Visual Perception
Alex Wong ⋅ Dong Lao
View full details
Video Generative Models: Benchmarks and Evaluation
Shuo Xing ⋅ Mingyang Wu
View full details
6th Workshop on 3D Scene Understanding for Vision, Graphics, and Robotics
Yixin Chen ⋅ Shaofei Wang
View full details
2nd Workshop on Human-Interactive Generation and Editing
Jinbo Xing ⋅ Xi Chen
View full details
SPAR-3D: Security, Privacy, and Adversarial Robustness in 3D Generative Vision Models
Nicole Meng ⋅ Yingjie Lao
View full details
Open-World Vision
Shu Kong ⋅ Neehar Peri
Open-World Vision (OWV) emphasizes realistic opportunities and challenges in developing and deploying computer vision systems in the dynamic, vast, and unpredictable real open world, which offers abundant data that can benefit training and challenge testing. It contrasts the traditional "closed-world" paradigm of visual learning and inference, which assumes fixed, known data distributions and categorical labels. Models developed under such closed-world assumptions tend to be brittle when encountering ever-changing and novel scenarios in the real open world. Modern visual learning has shifted towards an open-world paradigm, such as pretraining foundation models on massive data sourced from the open world (e.g., web-sourced data). While these models show unprecedented performance and strong adaptability to downstream tasks, they inherit biases from their open-world pretraining data and can still fail in truly novel or underrepresented scenarios during deployment. This workshop aims not only to uncover current limitations, potential risks, emerging opportunities, and unresolved challenges of open-world vision, but also to solicit solutions that advance the field toward more robust, fair, and adaptable visual systems.
Show more
The Second Workshop on the Evaluation of the Generative Foundation Models
Wisdom Ikezogwo ⋅ Maria Zontak
View full details
PhysHuman: Physically Grounded Human Perception and Modeling
Feng Liu ⋅ Youngjoong Kwon ⋅ Cheng Zhang
View full details
The Seventh Annual Embodied Artificial Intelligence Workshop
Anthony Francis ⋅ David Hall
View full details
2nd Workshop on Agents in Interaction, from Humans to Robots
Yufei Ye ⋅ Homanga Bharadhwaj
View full details
6th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling
Tuan-Anh Vu ⋅ Isla Duporge
View full details
2nd Workshop on Knowledge-Intensive Multimodal Reasoning
Arman Cohan ⋅ Yilun Zhao
View full details
4th Workshop on Generative Models for Computer Vision
Adam Kortylewski ⋅ Fangneng Zhan
View full details
11th New Trends in Image Restoration and Enhancement Workshop and Challenges
Radu Timofte ⋅ Zongwei Wu
View full details
Third Workshop for Learning 3D with Multi-View Supervision
Abdullah J Hamdi ⋅ Silvio Giancola
View full details
Safe Artificial Intelligence for All Domains
Oliver Wasenmüller ⋅ Markus Enzweiler
View full details
EarthVision: Large Scale Computer Vision for Remote Sensing Imagery
Ronny Haensch ⋅ Devis Tuia
View full details
9th International Workshop on Visual Odometry and Computer Vision Applications Based on Location Clues
Guoyu Lu ⋅ Friedrich Fraundorfer
View full details
Mobile AI workshop and associated challenges, 6th edition
Andrey Ignatov ⋅ Radu Timofte
View full details
The 5th Workshop on Computer Vision in the Wild: Towards Unified Multimodal Agents For Reasoning in the Wild
Reuben Tan ⋅ Zhengyuan Yang
View full details
The 1st Workshop on Low‑Level Vision Frontiers with Generative AI, Preference Optimization, and Agentic Systems
Xin Li ⋅ Yeying Jin
View full details
12th IEEE International Workshop on Computer Vision in Sports
Rikke Gade ⋅ Silvio Giancola
View full details
The 6th Workshop of Adversarial Machine Learning on Computer Vision: Safety of Vision-Language Agents
Aishan Liu ⋅ Jiakai Wang
View full details
11th Workshop on Computer Vision and Multimodal Microscopy Image Analysis
Steve Finkbeiner ⋅ Mei Chen
View full details
Trustworthy, Robust, Uncertainty-Aware, and Explainable Visual Intelligence and Beyond
Tsui-Wei Weng ⋅ Nghia Hoang
View full details
VizWiz Grand Challenge: Interpreting Images and Videos Taken by Blind People
Danna Gurari ⋅ Neelima Prasad
View full details
Geometry-Free Novel View Synthesis and Controllable Video Models
Andrea Tagliasacchi
View full details
Multi-Agent Embodied Intelligent Systems Meet Agentic-AI era: Opportunities, Challenges and Futures
Xiangbo Gao ⋅ Yuheng Wu
View full details
9th Multimodal Learning and Applications Workshop
Paolo Rota ⋅ Michael Ying Yang
View full details
6th Omnidirectional Computer Vision Workshop
Pierre Moulon ⋅ Guillaume Caron
View full details
3D Geometry Generation for Scientific Computing (2nd Edition)
Wuyang Chen ⋅ Michael Mahoney
View full details
The 3rd Workshop on New Trends in AI-Generated Media and Security
Shu Hu ⋅ Xin Wang
View full details
Embodied Reasoning in Action: Workshop and Challenge on Embodied Reasoning for Robotic Manipulation
Jiafei Duan ⋅ Jason Ren
View full details
1st Workshop on Generative 3D Reconstruction
Daniel Barath ⋅ Fabian Manhardt
View full details
The Third Workshop on Anomaly Detection with Foundation Models
Kuan-Chuan Peng ⋅ Ying Zhao
View full details
4D Digital Twins: Real-to-Sim-to-Real for Physical AI
Amrita Mazumdar ⋅ Tianye Li
View full details
Visual Anomaly and Novelty Detection - 4th Edition
Philipp Seeböck ⋅ Latha Pemula
View full details
1st Workshop on Journey to the Awards: Generative AI for Movie-Grade Video Production (J2A)
Felix Juefei-Xu ⋅ Stephane Grabl
View full details
4D World Models: Bridging Generation and Reconstruction
Aayush Prakash ⋅ Aashish Rai
View full details
The 2nd Workshop on Multi-Modal Reasoning for AI Agents
Yijiang Li ⋅ Zhenfei Yin
View full details
ScaleBot: The First Workshop on Scalable Robot Learning Systems
Sijin Chen ⋅ Yuxiang Lu
View full details
The 3rd Workshop on Synthetic Data for Computer Vision
Jieyu Zhang ⋅ Zixian Ma
View full details
Computer Vision × Education: Building a Cross‑Community Agenda for Multimodal Vision in Classrooms
Ekta Sood ⋅ Joyces H Fonteles
View full details
Bridging AI and Medical Reality: Computer Vision for Real-world Clinical Translation
Yicheng Wu ⋅ Yutong Xie
View full details
Pixel-level Video Understanding in the Wild Challenge
Henghui Ding ⋅ Nikhila Ravi
View full details
The 7th International Workshop on Eye and Gaze in Computer Vision
Yihua Cheng ⋅ Seonwook Park ⋅ Hyung Jin Chang
View full details
Eighth Workshop on Image Matching: Local Features and Beyond
Dmytro Mishkin ⋅ Eduard Trulls
View full details
8th International Workshop on Large Scale Holistic Video Understanding
Ali Diba ⋅ Mohsen Fayyaz
View full details
Third Workshop on Simulation for Autonomous Driving
Yiyi Liao ⋅ Maximilian Igl
View full details
Second Workshop on Skilled Activity Understanding, Assessment & Feedback Generation
Paritosh Parmar ⋅ Brendan Morris
Imagine a world where computer vision-based systems can analyze a video of an athlete, a surgeon, a patient, or a factory worker and instantly provide expert-level actionable feedback---correcting techniques, identifying inefficiencies, and helping people refine their skills in real time. Thanks to rapid progress in video understanding, this vision is becoming reality. AI-powered systems can now analyze complex human activities, assess performance, and generate intelligent feedback, unlocking new possibilities in sports, healthcare, manufacturing, education, rehabilitation, and beyond. Through Expert Keynotes and Invited Contributions, this CVPR 2026 workshop will explore the cutting edge of skilled activity understanding, assessment, and feedback generation, bridging research and real-world applications.
As AI systems become more capable of understanding human expertise, the implications are profound---empowering individuals with personalized coaching, democratized skill development, and scalable training solutions. We invite researchers, industry leaders, and practitioners to join us in shaping the future of AI-powered skill understanding. Whether working on foundational research, applied solutions, or real-world deployment, this workshop is an opportunity and forum to learn about and push the boundaries of how AI perceives, evaluates, and enhances human ability.
Show more
As AI systems become more capable of understanding human expertise, the implications are profound---empowering individuals with personalized coaching, democratized skill development, and scalable training solutions. We invite researchers, industry leaders, and practitioners to join us in shaping the future of AI-powered skill understanding. Whether working on foundational research, applied solutions, or real-world deployment, this workshop is an opportunity and forum to learn about and push the boundaries of how AI perceives, evaluates, and enhances human ability.
The 2nd CVPR Workshop Proposal on Foundation Models Meet Embodied Agents
Manling Li ⋅ Qineng Wang
View full details
6th International Workshop on Long-form Video Understanding, Generation and Action
Mike Zheng Shou ⋅ Gedas Bertasius
View full details
Domain Generalization: Evolution, Breakthroughs, and Future Horizons (2nd Edition)
Muhammad Haris Khan ⋅ Rishabh Lalla
View full details
Medical Reasoning with Vision Language Foundation Models
Anas Zafar ⋅ Muhammad Waqas
View full details
2nd Workshop on 4D Vision: Modeling the Dynamic World
Jiahui Lei ⋅ Shangzhe Wu
View full details
See the World in a Different Light: Physical Appearance Modeling and Relighting in the Age of Generative AI
Xilong Zhou ⋅ Marc Habermann
View full details
Successful Page Load