CVPR 2026 Workshops
| Short Name | Contact | Day | Time | Room | |
|---|---|---|---|---|---|
3D from multi-view and sensors | |||||
| 2nd Workshop on 4D Vision: Modeling the Dynamic World | 4DVision | Jiahui Lei | |||
| 3rd Workshop on ScanNet++ Novel View Synthesis and 3D Semantic Understanding Challenge | ScanNet++ | Angela Dai | |||
| Eighth Workshop on Image Matching: Local Features and Beyond | IMW | Dmytro Mishkin | |||
| SPAR-3D: Security, Privacy, and Adversarial Robustness in 3D Generative Vision Models | SPAR-3D | Nicole Meng | |||
| Third Workshop for Learning 3D with Multi-View Supervision | 3DMV | Abdullah J Hamdi | |||
| Urban Scene Modeling: Structured, Semantic, and Synthetic 3D Habitats | USM3D | Jack Langerman | |||
3D from multi-view and sensors, Deep learning architectures and techniques, Foundation models (LLM, VLM, VLA, etc.), Generative models, Multimodal learning, Vision for XR, AR, VR | |||||
| Spatial Intelligence for Cultural Heritage | SINT4CH | Marina Paolanti | |||
Adversarial attack and defense | |||||
| Synthetic & Adversarial ForEnsics | SAFE | Josué Martínez-Martínez | |||
Adversarial attack and defense, Embodied vision: Active agents, simulation, Transparency, safety, fairness, accountability, and ethics in vision | |||||
| The 6th Workshop of Adversarial Machine Learning on Computer Vision: Safety of Vision-Language Agents | 6thAdvML@CV | Aishan Liu | |||
Affinity groups | |||||
| LatinX in Computer Vision Research Workshop | LXCV | Ana Maria Quintero | |||
Autonomous driving | |||||
| Autonomous Understanding Through Open-world Perception and Integrated Language models for On-road Tasks | AUTOPILOT | Ali AlShami | |||
| Foundation Models for V2X-Based Cooperative Autonomous Driving | DriveX | Walter Zimmer | |||
| The 1st Workshop on Deployment of Foundation Models for Embodied AI | WDFM-EA | Burhan Yaman | |||
| Third Workshop on Simulation for Autonomous Driving | SAD | Yiyi Liao | |||
| Workshop on Autonomous Driving | WAD | Vincent Casser | |||
Autonomous driving, Embodied vision: Active agents, simulation, Foundation models (LLM, VLM, VLA, etc.) | |||||
| Multi-Agent Embodied Intelligent Systems Meet Agentic-AI era: Opportunities, Challenges and Futures | MEIS | Xiangbo Gao | |||
Autonomous driving, Generative models, Multimodal learning, Reinforcement learning and reasoning, Robot perception, Scene analysis and understanding, Video: Action and event understanding, Vision for | |||||
| The Eighth Workshop on Precognition: Seeing through the Future | Precognition | Khoa Luu | |||
Autonomous driving, Robot perception, Vision for XR, AR, VR | |||||
| 9th International Workshop on Visual Odometry and Computer Vision Applications Based on Location Clues | VOCVALC | Guoyu Lu | |||
Biometrics | |||||
| AERO-HPR: Human Perception and Recognition in Aerial Surveillance | AERO-HPR | Kien Nguyen Thanh | |||
| CVPR 2026 Biometrics Workshop | BIOM2026 | Bir Bhanu | |||
| Second Workshop on Foundation and Generative Models in Biometrics | FoundGen-BIO | Hatef Otroshi Shahreza | |||
Biometrics, Computer vision theory, Vision for healthcare, Vision for societal good, Vision applications and systems | |||||
| The 2nd International Workshop & Challenge on Subtle Visual Computing @CVPR 2026 | SVC | Zitong Yu | |||
Computational imaging | |||||
| 11th New Trends in Image Restoration and Enhancement Workshop and Challenges | NTIRE | Radu Timofte | |||
| Computational Cameras and Displays | CCD | Vishwanath Saragadam | |||
| DEgraded X-ray image Tomography, Enhancement, and Reconstruction | DEXTER | Scott McCloskey | |||
| The 8th UG2+ Workshop and Challenge: Bridging the Gap between Computational Photography and Visual Perception | UG2+ | Alex Wong | |||
Deep learning architectures and techniques | |||||
| Sixth Workshop on Neural Architecture Search | CVPR-NAS26 | Stephen McGough | |||
| The 5th Workshop on Federated Learning for Computer Vision | FedVision-2026 | Chen Chen | |||
| The 5th Workshop on Transformers for Vision and Multimodal AI | T4V | Gedas Bertasius | |||
| Women in Computer Vision | WiCV | Karen Sanchez | |||
Efficient, edge, and scalable vision | |||||
| 3rd Workshop on Efficient and On-Device Generation (EDGE), CVPR 2026 | EDGE | Felix Juefei-Xu | |||
| Efficient Deep Learning for Computer Vision | ECV | Shuai Zhang | |||
| Mobile AI workshop and associated challenges, 6th edition | MAI 2026 | Andrey Ignatov | |||
| On Sensor Vision Workshop | OSV | Andrew J. Davison | |||
| The 22th Embedded Vision Workshop | EVW | Matteo Poggi | |||
Egocentric vision | |||||
| Third Joint Egocentric Vision (EgoVis) Workshop | EgoVis@CVPR2026 | Siddhant Bansal | |||
Embodied vision: Active agents, simulation | |||||
| 1st Workshop on Multi-Agent Robotic Systems: Scaling with Compositional Intelligence | MARS | Yiran Qin | |||
| 4D Digital Twins: Real-to-Sim-to-Real for Physical AI | 4DDT | Amrita Mazumdar | |||
| Bridging Vision, Language, and Action: What’s Missing in Actionable Visual Perception for Robotics | ActiVis | Jiawei Ma | |||
| IPA: Interactive Physical AI Workshop | IPA | Seonwook Park | |||
Embodied vision: Active agents, simulation, Foundation models (LLM, VLM, VLA, etc.), Reinforcement learning and reasoning, Vision, language and reasoning | |||||
| Embodied Reasoning in Action: Workshop and Challenge on Embodied Reasoning for Robotic Manipulation | ERA | Jiafei Duan | |||
Embodied vision: Active agents, simulation, Human modeling & understanding: Face, body, pose, gesture, movement, Robot perception | |||||
| 2nd Workshop on Agents in Interaction, from Humans to Robots | H2R | Yufei Ye | |||
Emerging topics - other | |||||
| 2nd Workshop on Human-Interactive Generation and Editing | HiGen | Jinbo Xing | |||
| Auto-Annotation with Expert-Crafted Guidelines | autoExpert | Shu Kong | |||
| Exploring the Next Generation of Data | NeXD | Nadine Chang | |||
| From Lab Demos to Daily Tasks: Embodied Intelligence in the Wild | EmbodiedAIinLife | Huijie Wang | |||
| Humans of Generative AI | HuG | Jaron Mink | |||
| Multimodal Alignment for a Pluralistic Society | MAPS | Perampalli Shravan Nayak | |||
| Rediscovering Intelligence: Can AI Still Learn from Humans? | ReLearn | Xi Wang | |||
| Second Workshop on Skilled Activity Understanding, Assessment & Feedback Generation | SAUAFG | Paritosh Parmar | |||
| Sense of Space: Multi-Sensory Modeling for Embodied Intelligence | Sense of Space | Rao Fu | |||
| Workshop on "Bitter Lessons" | BitterLessonsCV | Anand Bhattad | |||
Explainable computer vision | |||||
| Safe Artificial Intelligence for All Domains | SAIAD | Oliver Wasenmüller | |||
| The 5th Explainable AI for Computer Vision (XAI4CV) Workshop | XAI4CV | Miguel-Ángel Fernández-Torres | |||
Explainable computer vision, Emerging topics - other | |||||
| How Do Vision Models Work? | HOW | Tamar Rott Shaham | |||
Foundation models (LLM, VLM, VLA, etc.) | |||||
| 2nd Workshop on Video Large Language Models | VidLLMs | Rohit Gupta | |||
| DataMFM: Emerging Directions in Data for Multimodal Foundation Models | DataMFM | Pengyuan Li | |||
| GigaBiran Challenge 2026: Workshop on World Models Empowering Vision Language Action Model | GigaBiran Challenge | Zheng Zhu | |||
| ScaleBot: The First Workshop on Scalable Robot Learning Systems | ScaleBot | Sijin Chen | |||
| The 2nd 3D-LLM/VLA Workshop: Bridging Language, Vision and Action in 3D Environments | 3D-LLM/VLA | Yining Hong | |||
| The 2nd CVPR Workshop Proposal on Foundation Models Meet Embodied Agents | FMEA | Manling Li | |||
| The 2nd Workshop on Multimodal Spatial Intelligence | MUSI | Juil Koo | |||
| The 5th Workshop on Computer Vision in the Wild: Towards Unified Multimodal Agents For Reasoning in the Wild | CVinW | Reuben Tan | |||
| Visual General Intelligence | VGI | Hirokatsu Kataoka | |||
Foundation models (LLM, VLM, VLA, etc.), Generative models, Emerging topics - other | |||||
| The Second Workshop on the Evaluation of the Generative Foundation Models | EvaGenFM | Wisdom Ikezogwo | |||
Foundation models (LLM, VLM, VLA, etc.), Generative models, Image and video synthesis and generation, Emerging topics - other | |||||
| Workshop on Agentic AI for Visual Media | A4VM | Jinjin Gu | |||
Foundation models (LLM, VLM, VLA, etc.), Medical and biological vision, cell microscopy | |||||
| The 3rd Workshop on Foundation Models for Medical Vision | FMV | Jun Ma | |||
Foundation models (LLM, VLM, VLA, etc.), Multimodal learning | |||||
| Big Model Adaptation In Computer Vision | BigMAC | Yuki Asano | |||
Foundation models (LLM, VLM, VLA, etc.), Multimodal learning, Vision for healthcare, Vision, language and reasoning | |||||
| Medical Reasoning with Vision Language Foundation Models | Med-Reasoner | Anas Zafar | |||
Generative models | |||||
| 1st Workshop on Generative 3D Reconstruction | GenRec3D | Daniel Barath | |||
| 2nd Workshop on GenAI for Storytelling | AISTORY | Andrew Shin | |||
| 4th Workshop on Generative Models for Computer Vision | GCV | Adam Kortylewski | |||
| Personalization in Generative AI Workshop | P13N | Pinar Yanardag | |||
| Video Generative Models: Benchmarks and Evaluation | VGBE | Shuo Xing | |||
Human modeling & understanding: Face, body, pose, gesture, movement | |||||
| 10th Affective & Behavior Analysis in-the-wild | ABAW | Dimitrios Kollias | |||
| 2nd Workshop on Photorealistic 3D Head Avatars | P3HA | Tobias Kirschstein | |||
| Computer Vision for Biomechanics Workshop | CVBW | Ethan Goan | |||
| PhysHuman: Physically Grounded Human Perception and Modeling | PhysHuman | Cheng Zhang, Feng Liu, Youngjoong Kwon | |||
| The 3rd Workshop on Human Motion Generation - New Perspective on Simulation, Animation, and VR applications | HuMoGen | Chuan Guo | |||
| The 7th International Workshop on Eye and Gaze in Computer Vision | GAZE 2026 | Seonwook Park, Yihua Cheng | |||
| Workshop on Multimodal Human Motion Analysis | MOMA | Olivia Nocentini | |||
Human modeling & understanding: Face, body, pose, gesture, movement, Vision for healthcare, Vision for societal good | |||||
| 2nd Workshop on Computer Vision for Children | CV4CHL | Yifan Shen | |||
Image and video synthesis and generation | |||||
| 1st Workshop on Journey to the Oscars: Generative AI for Movie-Grade Video Production (J2O), CVPR 2026 | J2O | Felix Juefei-Xu | |||
| 1st Workshop on Video World Models: Interaction, Memory, and Efficiency | VideoWorldModel | Jiwen Yu | |||
| AI for Creative Visual Content Generation, Editing and Understanding | CVEU | Ozgur Kara | |||
| Workshop Proposal: AI-assisted Long Video Creation | AILV | Yudong Jiang | |||
Low-level vision | |||||
| The 1st Workshop on Low‑Level Vision Frontiers with Generative AI, Preference Optimization, and Agentic Systems | LoViF | Xin Li | |||
Medical and biological vision, cell microscopy | |||||
| 11th Workshop on Computer Vision and Multimodal Microscopy Image Analysis | CVMI | Steve Finkbeiner | |||
| Bridging AI and Medical Reality: Computer Vision for Real-world Clinical Translation | CV4Clinic 2026 | Yicheng Wu | |||
| Multimodal Foundation Models for Biomedicine: Challenges and Opportunities | MMFM-BIOMED | Yuhui Zhang | |||
| Proposal for 12th Workshop on Medical Computer Vision, CVPR 2026 | MCV | Zongwei Zhou | |||
Multimodal detection, recognition, segmentation | |||||
| 22nd Workshop on Perception Beyond the Visible Spectrum | PBVS | Riad I. Hammoud | |||
Multimodal detection, recognition, segmentation, Vision applications and systems | |||||
| 4th Workshop on Vision Based Industrial Inspection | VISION'26 | Shancong Mou | |||
Multimodal learning | |||||
| 9th Multimodal Learning and Applications Workshop | MULA 2026 | Paolo Rota | |||
| Sight and Sound | WSS2026 | Andrew Owens | |||
| The 5th Workshop on “What is Next in Multimodal Foundation Models?” | MMFM | Edson Araujo | |||
| Workshop on Any-to-any Multimodal Learning | A2A-MML | Shengqiong Wu | |||
Open world learning | |||||
| Open-World Vision | OWV | Shu Kong | |||
| Visual Anomaly and Novelty Detection - 4th Edition | VAND | Philipp Seeböck | |||
Photogrammetry and remote sensing | |||||
| EarthVision: Large Scale Computer Vision for Remote Sensing Imagery | EarthVision | Ronny Haensch | |||
| The 1st Workshop on Monitoring the World through an Imperfect Lens | MWIL | Miriam Cha | |||
| The Second CVPR Workshop on Foundation and Large Vision Models in Remote Sensing (MORSE) | MORSE | Saurabh Prasad | |||
Recognition: categorization, detection, retrieval | |||||
| 13th Workshop on Fine-grained Visual Categorization | FGVC13 | Nico Lang | |||
| The Third Workshop on Anomaly Detection with Foundation Models | ADFM | Kuan-Chuan Peng | |||
Robot perception | |||||
| 4th Workshop on Maritime Computer Vision | MaCVi | Benjamin Kiefer | |||
| Unified Robotic Vision with Cross-Modal Sensing and Alignment | URVIS | Zongwei Wu | |||
Robot perception, World models | |||||
| Workshop on World Models Meet Active Sensing and Closed-Loop Planning | WMAS | Jieneng Chen | |||
Scene analysis and understanding | |||||
| 6th Workshop on 3D Scene Understanding for Vision, Graphics, and Robotics | 3DSUN | Yixin Chen | |||
| OpenSUN3D: 6th Workshop on Open-World 3D Scene Understanding with Foundation Models | OpenSUN3D | Francis Engelmann | |||
| Pixel-level Video Understanding in the Wild Challenge | PVUW | Henghui Ding | |||
Synthetic data for vision | |||||
| The 3rd Workshop on Synthetic Data for Computer Vision | SynData4CV | Jieyu Zhang | |||
| The 5th DataCV Workshop and Challenge | DataCV | Liang Zheng | |||
Transfer, low-shot, continual, long-tail, learning | |||||
| Domain Generalization: Evolution, Breakthroughs, and Future Horizons (2nd Edition) | DG-EBF | Muhammad Haris Khan | |||
Transparency, safety, fairness, accountability, and ethics in vision | |||||
| Machine Unlearning for Vision | MUV | Alessio Sampieri | |||
| The 3rd Workshop on New Trends in AI-Generated Media and Security | AIMS | Shu Hu | |||
| Trustworthy, Robust, Uncertainty-Aware, and Explainable Visual Intelligence and Beyond | TRUE-V | Tsui-Wei Weng | |||
Video: Action and event understanding | |||||
| 6th International Workshop on Long-form Video Understanding, Generation and Action | LOVEU | Mike Zheng Shou | |||
| 8th International Workshop on Large Scale Holistic Video Understanding | HVU 2026 | Ali Diba | |||
| The 1st Workshop on Vision for Intelligent Task Assistants | VITA 2026 | Ehsan Elhamifar | |||
Video: Action and event understanding, Video: Low-level analysis, motion, and tracking, Emerging topics - other | |||||
| Computer Vision with Small Data: Beyond Scale -- Toward Data-Efficient Dynamically-Aware Video Intelligence | CV4Smalls | Sarah Ostadabbas | |||
Vision and graphics | |||||
| AI for Content Creation | AI4CC | James Tompkin | |||
| Appearance Understanding and Generation | APPX | Elena Garces | |||
| See the World in a Different Light: Physical Appearance Modeling and Relighting in the Age of Generative AI | Lumina | Xilong Zhou | |||
| The 3rd AI for Visual Arts Workshop and Challenges | AI4VA | Deblina Bhattacharjee | |||
| The 3rd Workshop on AI for Content Generation, Quality Enhancement and Streaming | AIGENS | Marcos V. Conde | |||
Vision applications and systems | |||||
| 12th IEEE International Workshop on Computer Vision in Sports | CVsports | Rikke Gade | |||
| 6th Omnidirectional Computer Vision Workshop | OmniCV6 | Pierre Moulon | |||
| AI4RWC: The 2nd International Workshop on Vision Intelligence for Real-world Challenges | AI4RWC | Daqian Shi | |||
| Artificial Intelligence for Space | AI4Space | Daniele Gammelli | |||
| Computer Vision for the Built World | CV4AEC | Iro Armeni | |||
| CV4Science: Using Computer Vision for the Sciences | CV4Science | Utkarsh Mall | |||
| The 3rd MetaFood Workshop (MTF) | MTF | Yuhao Chen | |||
| The 7th International Workshop and CVML Challenge on Agriculture-Vision: Challenges & Opportunities for Computer Vision in Agriculture | V4A | Chris Padwick | |||
Vision for accessibility | |||||
| 2nd Workshop on Multimodal Sign Language Recognition | MSLR | Raffaele Mineo | |||
| Generative AI for Sign Language | GenSign | Hezhen Hu | |||
| VizWiz Grand Challenge: Interpreting Images and Videos Taken by Blind People | VizWiz | Danna Gurari | |||
Vision for healthcare | |||||
| PHAROS AI Factory for Medical Imaging & Healthcare | PHAROS-AIF-MIH | Stefanos Kollias | |||
Vision for privacy and security | |||||
| Intelligent Screening and Imaging For Detection and Exploitation via X-ray | INSIDE-X | Naoufel Werghi | |||
Vision for scientific discovery | |||||
| 3D Geometry Generation for Scientific Computing (2nd Edition) | 3D4S | Wuyang Chen | |||
Vision for societal good | |||||
| 6th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling | CV4Animals | Tuan-Anh Vu | |||
| Authenticity & Provenance in the age of Generative AI | APAI | Shruti Agarwal | |||
| Computer Vision × Education: Building a Cross‑Community Agenda for Multimodal Vision in Classrooms | CV4Edu | Ekta Sood | |||
| From Perception to Persuasion: Challenges and Advances in Misinformation Detection in Society | PP-MisDet | PRIYANKA SINGH | |||
Vision for XR, AR, VR | |||||
| Generative AI for XR and Identity-based Applications | GenXR-ID | Brendan David-John | |||
Vision, language and reasoning | |||||
| 2nd Workshop on Knowledge-Intensive Multimodal Reasoning | KnowledgeMR | Arman Cohan | |||
| Cognitive Foundations for Multimodal Models | COGVL | Aditya Chinchure | |||
| GRAIL-V: Grounded Retrieval & Agentic Intelligence for Vision-Language | GRAIL-V | Amit Agarwal | |||
| Multimodal Algorithmic Reasoning Workshop | MAR | Anoop Cherian | |||
| The 2nd Workshop on Multi-Modal Reasoning for AI Agents | MMR | Yijiang Li | |||
| The 2nd Workshop on Test-time Scaling for Computer Vision | ViSCALE | Yinpeng Dong | |||
| Workshop on Vision-based Assistants in the Real-World | VAR | Apratim Bhattacharyya | |||
| Workshop on Visual Concepts | VisCon | Joy Hsu | |||
World models | |||||
| 4D World Models: Bridging Generation and Reconstruction | ReGen4D | Aayush Prakash | |||
| End-to-End 3D Learning | E2E3D | Zhiwen Fan | |||
| Geometry-Free Novel View Synthesis and Controllable Video Models | GeoFreeNVS | Andrea Tagliasacchi | |||
| The Seventh Annual Embodied Artificial Intelligence Workshop | EAI | Anthony Francis | |||
Successful Page Load