

CVPR 2026 Career Opportunities

Here we highlight career opportunities submitted by our Exhibitors, and other top industry, academic, and non-profit leaders. We would like to thank each of our exhibitors for supporting CVPR 2026.

World Models for Manipulation Engineer

We build world models that simulate manipulation scenes faithfully enough to validate, and one day train, policies without touching a robot. You'll develop the generative models that make this work, with the controllability and physical fidelity to match real-robot behavior.

What You'll Do

  • Train video and dynamics models: Develop world models with action conditioning for manipulation policies.
  • Push long-horizon coherence: Develop architectures and training methods that extend rollout quality on hard physical tasks.
  • Own training infrastructure: Run multi-GPU clusters, write custom CUDA, debug at scale.
  • Build the world-model data engine: Design, implement, and improve a data engine that allows the world model to compound learning across customers and manipulation tasks.

Requirements

  • Very strong coding in Python and PyTorch, or similar.
  • Video generation experience: deep experience training image or video generation models end-to-end.
  • Large-scale training: track record operating training runs at cluster scale.
  • 3D vision: working knowledge of multi-view geometry, scene reconstruction, and physical priors.

Role Details

Job type: Full-time
Experience: Any, new grads ok
Location: San Francisco, CA, US
Remote: No
US visas: Will sponsor
Equity: 0.50% - 2.00%
Salary: $150K - $275K
Hiring manager: Hemanth Sarabu

About One Robot

One Robot builds task-specific world models and an evaluation platform for robot manipulation policies.

Training end-to-end policies for robots is vibes-based today. Teams collect data, train, deploy on a real robot, find out what fails, collect more, retry. We replace the trial-and-error with rigorous validation that tells you where your policy will fail and what data to collect to fix it.

Robotics can't industrialize without an evaluation layer. We're building it.

We're solving challenging technical problems around long-horizon autoregressive generation, world model controllability, and closing the sim-to-real gap. We work with real customer data, real failures, and real deployment pressure.

We're based in San Francisco, backed by Accel, YC, several exited founders, and engineering leaders at leading AI companies.

We're small and deliberately so. Everyone is an IC with deep ownership of a wide surface area. The culture is fast iteration and direct responsibility.

Hemanth Sarabu and Elton Shon co-founded One Robot after leading robot learning together at Industrial Next (YC W22), bringing experience from Google, NASA JPL, and Tesla.

Job Responsibilities

  • Conduct innovative research and development in training large-scale generative AI models for image and video synthesis.
  • Develop, implement, and rigorously evaluate advanced techniques for conditional generation and editing, with an emphasis on improving instruction compliance, controllability, and visual clarity in pre-trained image and video models.
  • Collaborate cross-functionally with researchers, engineers, and product teams to translate multimodal innovations into scalable downstream applications integrated into Adobe products.
  • Develop and maintain robust evaluation pipelines to assess generative models across quality, efficiency, robustness, and safety metrics.
  • Translate research concepts and published work into production-ready implementations using Python and modern machine learning frameworks.

What You'll Need to Succeed

  • Master's or Ph.D. degree in Computer Science, Machine Learning, or a related field.
  • Extensive practical experience in large-scale generative AI training, focused on image and video generation and editing.
  • Familiarity with diffusion models, transformers, or other state-of-the-art generative architectures.
  • Excellent communication skills and ability to collaborate across cross-functional teams.
  • Strong coding and prototyping ability in Python, PyTorch, and ML infrastructure tools.
  • Experience working with product teams on technology transfer.
  • Strong history of publishing in Computer Science, AI/ML, or related areas.

Founding Robot Learning

We're expanding the platform into policy training: building the components that let policies validate and improve through our world model. You'll train manipulation policies, including VLAs, end-to-end imitation, and RL, and push the world model and eval forward.

What You'll Do

  • Train manipulation policies: Build VLAs, diffusion policies, or end-to-end imitation models and run them on real robots.
  • Validate the world model end-to-end: Train policies in simulation, deploy on real hardware, and find what doesn't transfer.
  • Push policy capabilities forward: Build the infrastructure that lets policies train and improve on our platform.

Requirements

  • Very strong coding in Python and PyTorch.
  • Real-robot policy training: Track record training manipulation policies that ran on physical hardware.
  • Demonstration data: Hands-on experience curating real-robot demonstration datasets.

Role Details

Job type: Full-time
Experience: Any, new grads ok
Location: San Francisco, CA, US
Remote: No
US visas: Will sponsor
Equity: 0.50% - 2.00%
Salary: $150K - $275K
Hiring manager: Hemanth Sarabu

Locations: Toronto, ON

You will…

  • Be part of a team of multidisciplinary Engineers and Researchers using an AI-first approach to enable safe self-driving at scale.
  • Build reliable and scalable tools and frameworks to support Autonomous Vehicle (AV) development.
  • Lead technical and architecture discussions, collaborating with Researchers and Engineers.
  • Mentor other software engineers via code reviews, technical design reviews, and sharing general software development best practices.
  • Assist in task planning and estimation.

Qualifications:

  • MS/PhD or Bachelor's degree in Computer Science, Robotics, or a similar technical field of study.
  • 5+ years of industry experience reading and developing production-quality software.
  • Experience using languages such as Python, Go, C++, or Rust.
  • Experience working in a team environment on a common codebase.
  • Ability to learn new technologies quickly.
  • Open-minded and collaborative team player with willingness to help others.
  • Passionate about self-driving technologies, solving hard problems, and creating innovative solutions.

Bonus/nice to have:

  • Experience programming in C++ for a real world robotic system.
  • Comfortable with Linux/other Unix environments.
  • Comfortable with Docker.
  • Comfortable with Git workflows.
  • Experience in robotics or machine learning.
  • Experience with automated testing.
  • Experience working in an Agile/Scrum environment.
  • Experience working with internal cross-functional partners/stakeholders when building software frameworks.
  • Experience in one or more of the following areas: application development, distributed systems, data storage and processing, parallel computing environments, emulation at scale, software performance, optimization, and profiling, concurrency and determinism, test-driven and API-driven development methodologies, system design/architecture, algorithms, data structure design, low-level threading, and front-end development and tools.

Bengaluru, Karnataka, India

Minimum qualifications:

  • Currently enrolled in a Bachelor's, Master's, or PhD degree program in Computer Science, Linguistics, Statistics, Biostatistics, Applied Mathematics, Operations Research, Economics, or Natural Sciences.
  • Experience in one area of computer science (e.g., Natural Language Understanding, Human Computer Interactions, Computer Vision, Machine Learning, Deep Learning, Algorithmic Foundations of Optimization, Quantum Information Science, Data Science, Software Engineering, or similar areas).

Preferred qualifications:

  • Currently enrolled in a full-time degree program and returning to the program after completion of the internship.
  • Experience contributing to research communities or efforts, including publishing papers in major conferences or journals.
  • Experience with one or more general purpose programming languages (e.g., Python, Java, JavaScript, C/C++, etc.).

About the job

Researchers across Google are working to advance the state of the art in computing and build the next generation of intelligent systems for all Google products. To achieve this, we invest in foundational research and work on projects that utilize the latest computer science techniques developed by skilled software developers and research scientists. Whether we're shaping the future of sustainability, optimizing algorithms, or pioneering AI systems, our teams strive to continuously progress science, advance society, and improve the lives of billions of people.

Student Researcher projects are exploratory and direct experiences that drive scientific advancement across a multitude of research areas. Students will work collaboratively on projects that explore innovative research challenges and support the creation of breakthrough technologies.

The Student Researcher Program fosters academic collaborations by hiring students onto research projects aligned to company priorities in scientific advancement. The program offers placements on teams across Google, for research, engineering, and science roles. As a Student Researcher, you will have the opportunity to participate in research projects focused on developing solutions for real-world, large-scale problems.

The program is open to students enrolled in a Bachelor's, Master's, or PhD program. Projects vary in duration and location based on team and student requirements. It is required that you are located in one of the specific country locations identified for this role for the full duration of the engagement. When you apply, you will be considered for Student Researcher positions across all of Google's research teams - including Google DeepMind, Google Research, Google Cloud and more. This allows us to find the right project match for your skills and interests.

Responsibilities

  • Participate in research to develop solutions for real-world, large-scale problems.

USA, California, Santa Clara

Are you passionate about pushing the boundaries of AI at the intersection of the digital and physical worlds? Join our research team as we revolutionize the future of physical AI through groundbreaking generative models. We are now hiring Research Scientists to join our Cosmos team! As a Research Scientist specializing in Generative AI for Physical AI, you'll be at the forefront of developing next-generation algorithms that bridge the gap between virtual and physical realms. You'll work with state-of-the-art technology and have access to massive computational resources to bring your ideas to life.

What you'll be doing:

  • Pioneer revolutionary generative AI algorithms for physical AI applications, with a focus on advanced video generative models and video-language models.
  • Architect and implement sophisticated data processing pipelines that produce premium-quality training data for Generative AI and Physical AI systems.
  • Design and develop cutting-edge physics simulation algorithms that enhance Physical AI training.
  • Scale and optimize large-scale training systems to efficiently harness the power of 20,000+ GPUs for training foundation models.
  • Author influential research papers to share your groundbreaking discoveries with the global AI community.
  • Drive innovation through close collaboration with research teams, diverse internal product groups, and external researchers.
  • Build lasting impact by facilitating technology transfer and contributing to open-source initiatives.

What we need to see:

  • PhD in Computer Science, Computer Engineering, Electrical Engineering, or related field (or equivalent experience).
  • Deep expertise in PyTorch and related libraries for Generative AI and Physical AI development.
  • Strong foundation in diffusion, vision-language, and reasoning models and their applications.
  • Proven experience with reinforcement learning algorithms and implementations.
  • Robust knowledge of physics simulation and its integration with AI systems.
  • Demonstrated proficiency in 3D generative models and their applications.

Ways to stand out from the crowd:

  • Publications or contributions to major AI conferences (ICLR, NeurIPS, ICML, CVPR, ECCV, SIGGRAPH, ICCV, etc.).
  • Experience with large-scale distributed training systems.
  • Background in robotics or physical systems.
  • Open-source contributions to prominent AI projects.
  • History of successful research-to-product transitions.

You'll be part of a team that's defining the future of Physical AI, with access to world-class computing resources and the opportunity to work on problems that matter. Your research won't just live in papers – it will be implemented in real-world systems that push the boundaries of what's possible in AI. Join us in shaping the future of AI where digital intelligence meets physical reality. NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and autonomous research scientist with a genuine passion for advancing the state of AI? If so, we want to hear from you!


Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 264,500 USD.

You will also be eligible for equity and benefits.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

USA, California, Santa Clara

Intelligent machines powered by artificial intelligence—computers that can learn, reason, and interact with people—are transforming every industry. GPU-accelerated deep learning provides the foundation for machines to perceive, reason, and solve complex problems. NVIDIA GPUs run deep learning algorithms that simulate aspects of human intelligence, acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.

We are seeking an exceptional Senior Perception Engineer to help design and productize NVIDIA’s next-generation autonomous driving perception stack. You will work on the core 3D obstacle perception pipeline, contribute to architecture and algorithm design, and remain deeply hands-on with implementation, including modern transformer-based, multi-modal, and vision-language techniques where they add real value.

What you’ll be doing:

  • Develop and improve the technical design, architecture, and roadmap for 3D obstacle perception to support end-to-end autonomous driving functionalities, leveraging state-of-the-art CNN and transformer-based architectures where appropriate.
  • Design and implement advanced 3D perception models using multi-camera inputs and/or multi-sensor fusion (camera, radar, lidar) for obstacle detection and tracking, including opportunities to explore BEV and transformer-based 3D perception.
  • Build efficient, production-grade deep learning models: define objectives with the team, select and prototype architectures, run experiments, and follow best practices for training and evaluation, using techniques such as large-scale pretraining, distillation, and parameter-efficient fine-tuning (e.g., LoRA).
  • Help define and maintain KPI frameworks to quantify perception performance; analyze large-scale real and synthetic datasets to identify failure modes and systematically improve accuracy, robustness, and efficiency, incorporating approaches like self-supervised and representation learning when beneficial.
  • Contribute to the data strategy for perception: specify data and labeling requirements, help prioritize data collection and annotation, and collaborate with data and ground-truth teams, including model-assisted workflows (e.g., active learning, auto-labeling, vision-language models (VLMs)) and model-in-the-loop tooling.
  • Collaborate with safety, systems, and software teams to ensure perception solutions meet product requirements for safety, latency, resource usage, and software robustness, and are ready for deployment at scale.

What we need to see:

  • PhD with 4+ years, MS with 6+ years, or BS (or equivalent experience) with 8+ years of relevant experience in Computer Science, Computer Engineering, or a related technical field.
  • Hands-on experience developing deep learning–based perception or closely related systems for complex real-world problems, with strong proficiency in frameworks such as PyTorch and a track record of taking models from prototype to production.
  • Proven experience in data-driven development, including close collaboration with data, labeling, and ground-truth teams on data strategy, labeling quality, and iterative model improvement.
  • Strong programming skills in Python and/or C++, with experience building reliable, high-performance, production-quality software.
  • Excellent communication and collaboration skills, with the ability to work effectively across multidisciplinary teams.


Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

This posting is for an existing vacancy.

CV/ML Platform Engineer, Austin, TX

Company Overview

Allen Control Systems (ACS) is a cutting-edge defense startup founded by two former Navy electrical engineers with a proven track record in robotics and software. We are developing an autonomous gun turret using advanced computer vision and control systems to precisely detect, track, and neutralize enemy drones.

With an engineering-first culture, ACS values technical excellence and innovation. Backed by our founders’ successful exits from two previous ventures acquired for a combined $180M in 2022, we are committed to ensuring that the groundbreaking technologies we develop have a real-world impact.

Position Overview

We are seeking an experienced CV/ML Platform Engineer, specializing in Computer Vision and Machine Learning, to design, build, and own the data, model, and compute infrastructure powering the ACS CV/ML team. You will help manage a 130+ GPU bare-metal Kubernetes cluster, own CV/ML CI/CD pipelines, and ensure ML model training proceeds at high volume with low friction.

What You'll Do:

  • Deploy and operate Kubernetes clusters on bare-metal infrastructure hosting 130+ NVIDIA GPUs, with hybrid burst capability to AWS for scalable compute and storage workloads.
  • Manage NVIDIA GPU clusters for ML training.
  • Own the ACS CV/ML CI/CD pipeline.
  • Improve and maintain core ML infrastructure, such as model registration and versioning, experiment tracking, and model and data provenance tracking.
  • Improve and maintain ML model testing, performance analysis, and reporting tools.
  • Automate repetitive model training and testing tasks to increase developer velocity.
  • Work with Software Team Platform Engineers to ensure efficient coordination and minimal duplication between CV/ML infrastructure and wider Software infrastructure.
  • Collaborate with the Software Team to automate the optimization of models (TensorRT/quantization) for deployment on NVIDIA Jetson and other edge hardware.

Required Technical Skills:

  • 2+ years of experience in Platform Engineering or DevOps/MLOps.
  • Strong programming skills are required for automating ML lifecycles and building custom CLI tools for CV engineers.
  • Hands-on experience with NVIDIA GPU infrastructure, including managing CUDA libraries and development environments, GPU Operator, device plugins, and scheduling (MIG, Volcano, or fractional GPU sharing).
  • Experience implementing and maintaining MLOps platforms such as Kubeflow, MLflow, Weights & Biases (W&B), or DVC for experiment tracking and model versioning.
  • Familiarity with high-performance storage solutions (e.g., MinIO, WEKA, or Ceph) and data orchestration tools capable of handling terabytes of video/image data.
  • Proven track record building CI/CD pipelines that include automated model validation, performance benchmarking, and artifact management for both cloud and edge targets.
  • Experience with model optimization toolchains, including TensorRT, ONNX, and quantization techniques, specifically for cross-compilation to ARM targets like NVIDIA Jetson.
  • Proficiency with observability stacks (ELK, Prometheus/Grafana) adapted for ML, including monitoring GPU health, training throughput, and model inference metrics.
  • Strong Linux systems knowledge (Debian/Ubuntu), including networking for high-throughput data, storage, and security hardening for defense-grade production environments.

What We Offer

  • Competitive salary
  • Health, Dental, Vision Insurance
  • Paid Time Off

Allen Control Systems is an Equal Opportunity Employer, providing equal employment opportunities to all employees and applicants for employment. Allen Control Systems prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

Please note that this is only an interest form for our Summer 2027 technical internships. Our job applications will be released in September 2026.

Through your registration, you will be one of the first to know about our opportunities! Technical intern positions include:

  • Software Engineers
  • Machine Learning Engineers
  • Data Scientists
  • Research Scientists
  • Research Engineers
  • Applied Scientists

The RBKS AI team is responsible for innovating AI features for Ring and Blink cameras, with a mission to make our neighborhoods safer. We are working at the intersection of computer vision, generative AI (GenAI), and ambient intelligence. The team is seeking an Applied Science Manager to lead initiatives that combine advanced computer vision and multimodal GenAI capabilities. This role offers a unique opportunity to lead a world-class team while shaping next-generation home security technology and advancing the field of AI algorithms and systems.

The team is focused on turning research in computer vision and GenAI into products that benefit millions of customers worldwide, such as real-time object detection, video understanding, and multimodal LLMs. We are at the forefront of developing AI solutions that blend seamlessly into our products while respecting privacy and delivering an unprecedented level of intelligent security.

Key job responsibilities

  • Lead and guide a team of applied scientists in designing and developing advanced computer vision and GenAI models and algorithms for comprehensive video understanding, including but not limited to object detection, recognition and spatial understanding.
  • Drive technical strategy and roadmap for privacy-preserving CV and GenAI models and systems, ensuring the team delivers efficient fine-tuning and on-device and in-cloud inference solutions.
  • Partner with product and engineering leadership to translate business objectives into technical roadmaps, and ensure delivery of high-quality science artifacts that ship to products.
  • Build and maintain strategic partnerships with science, engineering, product, and program management teams across the organization.
  • Recruit, mentor, and develop top-tier applied science talent; provide technical and career guidance to team members while fostering a culture of innovation and excellence.
  • Set technical direction and establish best practices for AI products/features across multiple projects and initiatives.