CVPR 2026 Career Opportunities
Here we highlight career opportunities submitted by our Exhibitors, and other top industry, academic, and non-profit leaders. We would like to thank each of our exhibitors for supporting CVPR 2026.
Search Opportunities
Location: Beijing, Shanghai
Responsibilities 1. Develop and iterate multimodal foundation models, covering joint understanding and generation of image, video, speech, 3D, and other modalities. 2. Research cutting-edge technologies including cross-modal alignment, contrastive learning, diffusion models, video generation, image editing, 3D generation, and style transfer. 3. Build multimodal data pipelines and evaluation systems, drive business adoption, and apply models to core scenarios such as search, recommendation, AIGC, healthcare, autonomous driving, cloud storage/document editing, video understanding, and problem‑solving. 4. Explore areas including visual perception, multimodal understanding models, image/video generation, model compression/lightweighting, and document multimodal understanding. 5. Construct data pipelines for multimodal learning, optimize training and inference efficiency, and handle end‑to‑end model training, tuning, and deployment.
Requirements 1. PhD preferred; Master’s degree or above in Computer Science, Pattern Recognition, Artificial Intelligence, Electronic Engineering, Mathematics, or related fields. 2. Solid foundation in computer vision, image processing, and deep learning, with in-depth research experience in areas such as multimodal model training, document multimodal understanding, open‑vocabulary object detection, and model compression/lightweighting. 3. Familiar with diffusion models and multimodal foundation models (e.g., CLIP, Flamingo, Qwen‑VL), with strong interest and project experience in image generation, video generation, 3D generation, digital humans, and related fields. 4. Proficient in Python and deep learning frameworks such as PyTorch, PaddlePaddle, or TensorFlow; strong ability to reproduce research results from papers. 5. Experience in multimodal pre‑training, distillation, video generation, or model lightweighting is a strong plus. 6. Publications in top‑tier conferences (e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, AAAI) or journals, or contributions to open‑source projects, are highly valued. 7. Strong teamwork, communication, and problem‑solving skills, with a passion for technology and a drive for innovation.
USA, California, Santa Clara
Intelligent machines powered by artificial intelligence—computers that can learn, reason, and interact with people—are transforming every industry. GPU-accelerated deep learning provides the foundation for machines to perceive, reason, and solve complex problems. NVIDIA GPUs run deep learning algorithms that simulate aspects of human intelligence, acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.
We are seeking an exceptional Senior Perception Engineer to help design and productize NVIDIA’s next-generation autonomous driving perception stack. You will work on the core 3D obstacle perception pipeline, contribute to architecture and algorithm design, and remain deeply hands-on with implementation, including modern transformer-based, multi-modal, and vision-language techniques where they add real value.
What you’ll be doing: - Develop and improve the technical design, architecture, and roadmap for 3D obstacle perception to support end-to-end autonomous driving functionalities, leveraging state-of-the-art CNN and transformer-based architectures where appropriate. - Design and implement advanced 3D perception models using multi-camera inputs and/or multi-sensor fusion (camera, radar, lidar) for obstacle detection and tracking, including opportunities to explore BEV and transformer-based 3D perception. - Build efficient, production-grade deep learning models: define objectives with the team, select and prototype architectures, run experiments, and follow best practices for training and evaluation, using techniques such as large-scale pretraining, distillation, and parameter-efficient fine-tuning (e.g., LoRA). - Help define and maintain KPI frameworks to quantify perception performance; analyze large-scale real and synthetic datasets to identify failure modes and systematically improve accuracy, robustness, and efficiency, incorporating approaches like self-supervised and representation learning when beneficial. - Contribute to the data strategy for perception: specify data and labeling requirements, help prioritize data collection and annotation, and collaborate with data and ground-truth teams, including model-assisted workflows (e.g., active learning, auto-labeling, vision-language models (VLMs)) and model-in-the-loop tooling. - Collaborate with safety, systems, and software teams to ensure perception solutions meet product requirements for safety, latency, resource usage, and software robustness, and are ready for deployment at scale.
What we need to see: - PhD with 4+ years, MS with 6+ years, or BS (or equivalent experience) with 8+ years of relevant experience in Computer Science, Computer Engineering, or a related technical field. - Hands-on experience developing deep learning–based perception or closely related systems for complex real-world problems, with strong proficiency in frameworks such as PyTorch and a track record of taking models from prototype to production. - Proven experience in data-driven development, including close collaboration with data, labeling, and ground-truth teams on data strategy, labeling quality, and iterative model improvement. - Strong programming skills in Python and/or C++, with experience building reliable, high-performance, production-quality software. - Excellent communication and collaboration skills, with the ability to work effectively across multidisciplinary teams.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
Location: Mountain View, CA
The Localization team is responsible for the software that safely and accurately estimates positioning for the vehicles in the Aurora fleet. This involves processing high-rate data from a diverse suite of sensors in order to provide real-time, high-precision measurements of the location and motion of vehicles traveling on highways, surface streets, and endpoint facilities. We are searching for a Senior Software Engineer to join us in solving these technical challenges.
In this role, you will
Design, implement, and maintain solutions for estimating the pose of vehicles and other assets, such as trailers Rigorously evaluate and test solutions to verify the safety and efficiency of the localization system Collaborate with engineers and stakeholders on partner teams to solve problems of high importance to Aurora products Required Qualifications
-5+ years of industry experience building software in a production environment -Proficient in C++ and Python using modern best practices -Graduate level degree in CS/Robotics/EE -Deep knowledge of state estimation theory and practice (one or more of sensor fusion, Kalman Filtering, non-linear estimation, SLAM) -Experience in estimation and modeling with sensor data (lidar, IMU, radar, camera) -Deep understanding in relevant areas of mathematics, including spatial transforms, linear algebra, probabilistic estimation
Desirable Qualifications
-Prior experience in robotics or autonomous vehicle applications -Prior experience with ML solutions and/or GPU development
Foster City, CA
The Prediction & Behavior ML team is responsible for developing machine learning (ML) algorithms that learn and predict behaviors from data, applying them both on-vehicle to influence driving behavior and off-vehicle to provide ML capabilities to simulation and validation. Given the tight integration of behavior forecasting and motion planning, our team collaborates closely with the Planner team to advance overall vehicle behavior. We also work closely with our Perception, Simulation, and Systems Engineering teams to accelerate our ability to validate our driving performance.
As a Learned Trajectory Machine Learning Engineer you will be responsible for developing deep learned models that produce trajectories for our vehicles to drive. Given the tight integration of behavior prediction and motion planning, you will closely collaborate with the Planner and Perception teams in the advancement of our overall vehicle behavior. In this role, you will: You will develop new deep learning models that use imitation learning and reinforcement learning to generate driving plans for our autonomous vehicle. You will also work on techniques to estimate the quality of those driving plans along the dimensions of safety, progress, comfort etc. You will leverage our large-scale machine learning infrastructure to discover new solutions and push the boundaries of the field You will develop metrics and tools to analyze errors and understand improvements of our systems You will collaborate with engineers on Perception, Planning, and Simulation to solve the overall Autonomous Driving problem in complex urban environments Qualifications BS, MS, or PhD degree in computer science or related field Experience with training and deploying transformer-based model architectures and reinforcement learning Experience with production Machine Learning pipelines: dataset creation, training frameworks, metrics pipelines Fluency in Python with a basic understanding of C++ Extensive experience with programming, algorithm design, and strong mathematics skills Bonus Qualifications Conference or Journal publications in Machine Learning or Robotics related venues Prior experience with Prediction and/or autonomous vehicles or robotics in general $277,000 - $407,000 a year Base Salary Range
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
USA, California, Santa Clara
We are seeking a highly technical and strategic Senior Technical Lead to join our team, with a focus on engaging developer ecosystems across emerging technology domains such as Generative AI, Autonomous Vehicles, and Simulation Platforms. In this pivotal role, you will work directly with software solution providers, developers, and industry professionals to foster the adoption of NVIDIA’s advanced AI and computing platforms—including Omniverse, Cosmos, and GenAI frameworks. The ideal candidate brings a blend of deep technical expertise and commercial go-to-market experience, combined with a passion for developer advocacy and a talent for communicating how NVIDIA technology can solve complex, real-world challenges.
What You'll Be Doing: -Serve as a technical advisor and problem solver with partner engineering teams, collaborating on architecture, code, and integration for Omniverse and AI enabled-solutions. - Develop and maintain deep technical expertise in NVIDIA Cosmos and Omniverse Platforms and related technologies (APIs, USD, NIMs, Blueprints) through prototyping, technical integration and creation of reference architectures. - Advise on technical enablement resources such as sample code, guides, demonstration pipelines, and tools to highlight the application of technologies in solving real-world problems. - Engage with partner software organizations, from engineering teams to technical leaders, and decision-makers to understand their goals, solve technical challenges, and promote best practices for successful integrations. - Represent and advocate for the partner technical needs and feedback to NVIDIA’s internal product and engineering teams, supplying actionable insights from field deployments to influence product roadmaps. - Support product launches, technical go-to-market activities by providing technical validation, demonstrating integrated solutions, and ensuring excellence in customer- and partner-facing materials. - Guide partners and startups through onboarding and integration with NVIDIA’s programs, fostering co-innovation and the development of next-generation solutions.
What We Need to See: - Master’s or Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience. - 12+ years of experience of hands-on experience in a technical AI role, with a strong emphasis on AV End-to-End models and GenAI model development. - Experience writing production code in Python or C++, and proficiency with Linux. - Hands-on experience with DevOps tools such as GitLab, Docker, and Kubernetes. - Strong understanding of AV systems (Sensors, dynamics, perception, prediction, planning, control). - Experience with DL and RL algorithms and frameworks such as PyTorch. - Skilled at collaborating across engineering, product, sales, and marketing teams, with strong interpersonal abilities to simplify complex technical concepts for diverse audiences. - Experience leading technical collaborations with engineering and product teams—including architectural design, code reviews, technical mentorship, and delivery of technical talks or workshops. - Self-starter with a vision for growth, real passion for continuous learning and sharing findings across the team.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Role: Robotics Software - Perception ML Engineer Location: SF Bay Area (in-person)
About the role
We’re looking for an experienced Perception ML Engineer to join our team. As an early member, you will play a pivotal role in building and shaping the AI capabilities of our robots. You will work on challenging problems around manipulation and navigation in dynamic outdoor environments. You'll need to thrive in a fast-paced startup environment where you'll wear multiple hats and have a direct impact on our product's evolution. Ideally, you have a proven track record of developing and deploying perception ML systems in production, and you're passionate about pushing the boundaries of what's possible in robotics with AI.
What you’ll get to work on
- Develop and deploy state-of-the-art perception models for challenging manipulation and navigation problems.
- Help drive our technical approach, with particular focus on real-world deployments.
- Research and develop techniques around 3D scene understanding, detection, segmentation, pose and world/video models.
- Develop robust metrics to establish performance of large vision models.
- Work with multimodal data (e.g. cameras, LIDAR, tactile).
- Develop and improve real/synthetic data pipelines for challenging perception problems.
- Build scalable training and validation infrastructure.
- Collaborate with other teams in the company.
What we look for
- M.S/Ph.D degree in robotics, vision, computer science, mechanical engineering, electrical engineering or other engineering disciplines (or equivalent experience).
- 4+ years of hands-on experience developing perception solutions for robotics applications like manipulation and navigation.
- Experience working on one or more of detection, segmentation, pose estimation, tracking, 3D reconstruction, world models and VLMs.
- Experience working with synthetic data pipelines and robotics simulation infrastructure (e.g. Isaac Sim, Gazebo, Blender, Unity).
- Extensive experience with Python and common ML frameworks like PyTorch.
- Should be comfortable taking ownership of tasks with light supervision.
- Must have excellent problem-solving skills.
- Legally authorized to work in the United States.
Sunnyvale, CA
About the Role CoStar Matterport is seeking a Manager of Software Development to support the demand for Matterport’s revolutionary platform. Matterport is the leading spatial data product focused on digitizing and indexing the built world. Our all-in-one 3D data platform enables anyone to turn a space into an accurate and immersive digital twin which can be used to design, build, operate, promote, and understand any space. As the Manager, Software Development, you will join our Platform team as manager. You’ll be responsible for managing a team that crafts and scales services which connect the Matterport Platform with Matterport’s internal systems. This team is responsible for connecting Platform to our ERP and CRM systems, amongst other integrations with the broader enterprise. The ideal candidate is an individual who thrives on new challenges, possesses a strong platform development background, and has a drive to invent with others.
This role is located in our Sunnyvale, CA office and has a schedule of 4 days on-site and 1 day work from home.
Responsibilities: - Lead a team that delivers on multiple projects with other teams. - Work closely with our product and design teams, as well as business stakeholders, to drive and execute on the next set of product innovations required to maintain and accelerate our position as an industry leader. - Build products with a focus on scalability. - Foster an inclusive team culture focusing on learning, best-practices and efficiency - Enable teams to be successful by removing obstacles and developing strategic relationships throughout the business. - Help set the direction and goals for the team, in terms of project impact, product quality, and engineering efficiency. - Understand and stay current on competitive landscape, market trends and emerging technology. - Understand how billing systems and ERPs work - Dive into coding and technical tasks as necessary and as time permits with your other management responsibilities. - Work with Product to organize and define requirements for the team - Run our SDLC processes for the team. - Hire, coach, and develop your team.
Basic Qualifications: - Bachelor's degree from an accredited, not-for-profit University or College - A track record of commitment to prior employers - Experience leading multiple software engineering teams in a high growth and fast-paced environment. - 8+ years of experience developing cloud platforms at scale in languages such as: Kotlin, Java, Go, or Python.
DoorDash Labs, established in 2018, serves as the innovation hub for DoorDash, focusing on developing automation and robotics solutions to enhance last-mile logistics. The team's mission is to create technologies that support and augment human networks, aiming to improve efficiency for Dashers, merchants, and consumers alike. We’re ruthlessly focused on business impact. We are a highly senior and nimble team composed of a core group of veterans from a variety of different robotics industries.
We are seeking a seasoned Senior Planner Engineer with a proven track record of shipping production-level autonomy systems. The engineers will design, implement, and deploy behavior and motion planning algorithms that enable safe, reliable, and fully autonomous delivery. The ideal candidate has lived through the full development lifecycle of an autonomous product and intimately understands the practical challenges and trade-offs required to build planning systems that are not just novel, but robust, reliable, and performant. You will be a key technical voice, expected to solve our most critical challenges by drawing upon your extensive prior experience in the field.
Robotics Research Engineer
About Us
We are reimagining manufacturing through advanced robotics. Our mission is to rebuild the American manufacturing industry as an AI-first, assembly-focused, dual-use contract manufacturer. We aim to empower manufacturers with intelligent, efficient, and adaptable robotic systems that redefine productivity and quality.
As a founding member of our engineering team, you will have a direct and significant impact on our product, culture, and ultimate success.
This role is 100% in-person at our office in the Mission, SF.
The Role
We are hiring a Robotics Research Engineer to design, implement, and deploy intelligent robotic systems for manufacturing.
This is not a purely academic role — and not a generic software role.
You will work across:
- ROS2-based system architecture
- Manipulation and navigation
- Perception and VLA-based control
- Traditional controls fused with learned methods
- On-site deployment in production manufacturing workcells
You should be equally comfortable:
- Reading robotics papers
- Tuning controllers
- Debugging sensor calibration
- Pushing production code to robots running on a factory floor
Key Responsibilities
Robotics Systems & Architecture
- Design and implement distributed robotic systems in ROS2
- Build modular autonomy stacks for manipulation and mobile platforms
- Own system-level performance, latency, and reliability
Robotics Foundation Models (VLA)
- Develop and integrate Vision–Language–Action models for robotic control
- Design data collection pipelines and fine-tune foundation models for assembly tasks
- Benchmark and evaluate real-world performance
Controls + Learning
- Fuse traditional control with learned policies
- Implement hybrid perception–planning–control architectures
- Improve robustness, repeatability, and safety in physical systems
Manipulation, Navigation & Deployment
- Develop algorithms for dexterous manipulation and industrial navigation
- Ship systems from prototype to production
-
Debug real-world edge cases:
-
Lighting
- Calibration drift
- Wear
- Latency
- Maintain deployed robotic cells
What We're Looking For
Deployment Mentality
- Experience shipping systems into real environments (not just lab demos)
- Comfort debugging hardware–software integration issues
- Ability to own systems end-to-end
Ownership
- Thrives in high-velocity, ambiguous environments
- Takes full technical ownership of complex systems
- Willing to get into the weeds of mechanical, electrical, and software challenges
Nice to Have
- PhD in Robotics or related field
- Experience working on dual-use or DoD-related systems
- Ability to obtain a government security clearance
- Experience with safety-critical robotic systems
Growth Opportunities
- Define the technical direction of robotics foundation model deployment in manufacturing
- Build and lead the autonomy team
- Shape the next generation of AI-driven assembly systems
- Become a core technical interface with defense and industrial partners
Why Join Us?
This is one of the only places where:
- World-class manufacturing operators
- Mechanical engineers
- Robotics researchers
- Software engineers
sit in the same room — building production systems together.
We are committed to being deeply embedded in the U.S. industrial base.
Our focus is simple:
> Build adaptive robotic assembly systems that make American manufacturing scalable, resilient, and competitive again.
If you want to publish papers, this may not be the role.
If you want to build and deploy the systems those papers were meant to enable — this is it.
Compensation
The base salary range for this full-time position in San Francisco is:
$150,000 — $250,000 USD
Compensation packages at Foundry Robotics for eligible roles include:
- Base salary
- Equity
- Benefits
Foster City, CA
Be the visionary Perception Architect designing the sensory "nervous system" for our next generation of autonomous products. Evolve our end-to-end perception architecture, serving as the essential liaison between Sensing teams and the core ML system. Drive architectural evolution toward next-generation sensing and compute platforms by influencing Hardware-Software Co-Design. In this role, you will: System Synthesis: Define the end-to-end perception architecture—assess new sensing (LiDAR, Radar, Camera) to the final fused world model. Hardware-Software Co-Design: Partner with Compute and Sensing teams to influence the design, ensuring our algorithms have the "computational oxygen" they need. Algorithmic Strategy: Lead the transition from current pipelines to AI innovation pipelines, while maintaining the safety and interpretability required for edge deployment. Future-Proofing: Evaluate emerging technologies and determine their viability for the 3-5 year roadmap. Performance Budgeting: Collaborate on rigorous metrics for latency and accuracy across diverse edge-computing environments. Lead cross functional teams: to create a viable plan to move to new platforms and unlock further technical capabilities for Perception systems. Coordinate projects across partner teams to make sure work is aligned with the overall org direction. Qualifications Education: Ph.D. or MS in Computer Science, Robotics, Electrical Engineering, or a related field with a focus on Computer Vision. Experience: 8+ years in autonomous systems with at least 2 years in an architectural role, shipping and evolving perception systems Deep Domain Expertise: Mastery of systems focused on Computer Vision, Deep learning, Foundational models, Sensor Fusion, and State Estimation (SLAM, EKF/UKF). Deep understanding of sensor modalities and their impact to perception/autonomy systems, including: Camera, lidar, radar. Platform Proficiency: Deep understanding of hardware acceleration, component budgeting and transition to new platforms. Mathematical Rigor: Strong foundation in 3D geometry, linear algebra, and probabilistic robotics. Hands on Skills: Proficiency in C++ (production-grade) and Python (rapid prototyping). Ability to run experiments across system components and run metrics platforms. Soft Skills: Curious, Data driven, with ability to translate complex technical trade-offs to non-technical stakeholders and executive leadership. $363,000 - $470,000 a year Base Salary Range