AI DevOps Engineer Jobs

Discover the latest remote and onsite AI DevOps Engineer roles across top active AI companies. Updated hourly.

Check out 1001 new AI DevOps Engineer opportunities posted on The Homebase

Data Center Substation/Utility Engineering - Electrical

New
Top rated
Lambda AI
Full-time
Full-time
Posted

This role provides strategic, technical, and executive leadership for Lambda’s high-voltage substation, transmission, and power-generation programs supporting AI data centers. The responsibilities include oversight of HV substation design and interconnections, utility negotiations, program planning, construction oversight, and ensuring regulatory compliance and system reliability.

Undisclosed
YEAR

(USD)

Maybe global
Hybrid

SOC Engineer

New
Top rated
Replit
Full-time
Full-time
Posted

The SOC Engineer will monitor and assess emerging security threats relevant to Replit's cloud infrastructure, conduct targeted security investigations, and analyze signals across the environment. They will collaborate across Security, SRE, and Engineering teams to drive effective mitigation strategies and ensure proper containment and documentation of incidents.

Undisclosed
YEAR

(USD)

Maybe global
Hybrid

Security Operations Lead

New
Top rated
Replit
Full-time
Full-time
Posted

Lead and mature the global Security Operations Center (SOC) for 24/7 monitoring, detection engineering, SIEM ownership, triage, and response in a high-scale, cloud-native, and AI-driven environment. Oversee the integration of AI-based SOC tools, manage cross-functional collaborations, and drive detection coverage across multi-cloud, SaaS, endpoints, and AI/ML systems.

Undisclosed
YEAR

(USD)

Maybe global
Hybrid

Cloud Security Lead

New
Top rated
Replit
Full-time
Full-time
Posted

The Cloud Security Lead will design and oversee the security of Replit's cloud-native, multi-cloud, and AI infrastructure, focusing on GCP with additional responsibilities across AWS and Azure. The role involves hands-on leadership in cloud security engineering, infrastructure vulnerability management, secure development practices, and cross-team collaboration to ensure robust security for millions of users.

Undisclosed
YEAR

(USD)

Maybe global
Hybrid

Talent Sourcer – AI & ML Research

New
Top rated
Symbolica AI
Full-time
Full-time
Posted

Lead the design, build, and optimization of robust infrastructure and tools to support advanced AI and machine learning research and product deployments. Oversee the transition of research prototypes into production-ready, secure, and high-performance software environments, managing clusters, observability, and deployment systems.

Undisclosed

()

Maybe global
On-site

Legal Intern [Summer 2026]

New
Top rated
Figure AI
Full-time
Full-time
Posted

The Training Infrastructure Engineer will design, deploy, and maintain large-scale distributed training infrastructure and frameworks for deep learning on robot datasets. They will collaborate with AI researchers to implement new model architectures and optimize systems for rapid experimentation and development.

Undisclosed
YEAR

(USD)

Maybe global
On-site

DevSecOps Engineer

New
Top rated
webAI
Full-time
Full-time
Posted

Design, secure, and maintain compliant cloud and edge infrastructure for deploying AI models in government and regulated environments. Ensure automation, compliance (FedRAMP, NIST), and support for secure MLOps workflows and continuous integration pipelines.

Undisclosed

()

Maybe global
Hybrid

Staff Engineer, Systems Test (R4151)

New
Top rated
Shield AI
Full-time
Full-time
Posted

Lead system-level integration, test planning, and validation for advanced autonomous aircraft systems, driving implementation across simulation, HIL, VIL, and flight environments. Architect and evolve test infrastructure, collaborate with multidisciplinary teams, and provide mentorship while ensuring robust performance and reliability for AI autonomy platforms.

Undisclosed
YEAR

(USD)

Maybe global
On-site

Staff Software Engineer, GPU Infrastructure (HPC)

New
Top rated
Cohere
Full-time
Full-time
Posted

Build and operate large-scale, ML-optimized GPU/TPU superclusters across multiple clouds, ensuring efficient performance and scalability for AI workloads. Collaborate with AI researchers to provide infrastructure solutions, troubleshoot complex system issues, and drive innovation in ML/HPC environments.

Undisclosed

()

Maybe global
Remote OK

Manager - Security Architecture

New
Top rated
Lambda AI
Full-time
Full-time
Posted

Lead and build a high-performing security architecture team responsible for securing Lambda's AI infrastructure, model data, and sensitive digital assets. Partner with engineering and IT teams to set security policies, guide design reviews, oversee compliance efforts, and develop customer-facing security documentation.

Undisclosed
YEAR

(USD)

Maybe global
Hybrid

Want to see more AI DevOps Engineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.
(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Have questions about roles, locations, or requirements for AI DevOps Engineer jobs?

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What does an AI DevOps Engineer do?","answer":"AI DevOps Engineers build and maintain ML pipelines in cloud environments, implementing CI/CD workflows specifically for AI applications. They create monitoring solutions that track not just system health but also data quality and model performance. Their daily work includes developing cloud infrastructure code using tools like Terraform and Ansible, ensuring AI applications scale effectively. They collaborate with data scientists to deploy models, troubleshoot production issues, and implement security protocols. Unlike traditional developers, they bridge the gap between data science and operations, ensuring ML models transition smoothly from development to production environments."},{"question":"What skills are required for AI DevOps Engineer jobs?","answer":"AI DevOps Engineers need strong cloud platform expertise, particularly in AWS, Azure, or GCP. Proficiency with infrastructure-as-code tools like Terraform and Ansible is essential. Container orchestration skills using Docker and Kubernetes help manage AI workloads. Experience with CI/CD pipelines through Jenkins or GitLab CI enables automated model deployment. Python scripting ability supports both automation and ML pipeline integration. Monitoring skills using Prometheus and Grafana help track model performance. Beyond technical abilities, these roles require collaboration skills to work effectively with data scientists and developers, plus problem-solving aptitude to troubleshoot complex AI system issues."},{"question":"What qualifications are needed for AI DevOps Engineer jobs?","answer":"Most AI DevOps Engineer positions require a minimum of 3 years of software development experience and 2+ years of cloud deployment experience, with Azure often preferred. A computer science or related degree is typically expected, though equivalent experience may substitute. Employers look for candidates with hands-on experience using development and deployment tools like GitLab and Atlassian suite products. While not always mandatory, certifications in cloud platforms (AWS Solutions Architect, Azure DevOps Engineer) and container orchestration (CKA) strengthen applications. Experience building CI/CD pipelines specifically for ML workflows gives candidates a significant advantage in the hiring process."},{"question":"What is the salary range for AI DevOps Engineer jobs?","answer":"AI DevOps Engineer salaries vary based on several key factors. Geographic location significantly impacts compensation, with tech hubs like San Francisco and New York offering higher wages. Experience level creates substantial differences, with senior engineers earning considerably more. Specialized expertise in high-demand tools like Kubernetes or specific cloud platforms (AWS/Azure/GCP) can boost earnings. Industry sector also matters—financial services and healthcare organizations often pay premium rates for AI infrastructure expertise. Company size influences packages too, with large enterprises typically offering better benefits but startups potentially providing equity. Security clearances for sensitive projects may command additional compensation."},{"question":"How long does it take to get hired as an AI DevOps Engineer?","answer":"The hiring timeline for AI DevOps Engineers typically ranges from 4-8 weeks. The process usually begins with a screening call, followed by technical assessments testing cloud infrastructure skills and coding abilities. Candidates often face 2-3 rounds of interviews, including sessions with engineering managers and team members. Many employers include practical challenges related to containerization, CI/CD pipeline setup, or infrastructure-as-code implementations. Companies hiring for specialized AI infrastructure may extend the process with additional technical evaluations. Candidates with demonstrated experience in both DevOps and machine learning environments generally move through the pipeline faster than those from only traditional DevOps backgrounds."},{"question":"Are AI DevOps Engineer jobs in demand?","answer":"AI DevOps Engineer roles show strong demand as organizations integrate machine learning into their product offerings. Major companies like Boeing actively recruit for these positions to support AI applications in secure cloud environments. The specialized skillset—combining traditional DevOps practices with ML pipeline expertise—creates a smaller talent pool than for general DevOps roles. Organizations increasingly recognize that successful AI deployment requires specialized infrastructure and monitoring beyond conventional applications. This demand spans industries from technology and finance to manufacturing and healthcare, as each sector adopts AI capabilities requiring robust deployment pipelines, monitoring solutions, and infrastructure that traditional DevOps approaches don't fully address."},{"question":"What is the difference between AI DevOps Engineer and Traditional DevOps Engineer?","answer":"Traditional DevOps Engineers focus on application delivery pipelines, infrastructure automation, and system monitoring for conventional software. AI DevOps Engineers extend these skills to handle machine learning workflows, requiring specialized knowledge of model deployment, training pipelines, and experiment tracking. While both roles use similar tools (Docker, Kubernetes, CI/CD platforms), AI DevOps Engineers must understand data quality monitoring and model performance metrics that don't exist in traditional applications. They work more closely with data scientists and ML engineers, bridging the gap between data science and operations. AI DevOps requires additional considerations around computational resources, GPU scheduling, and optimizing infrastructure for machine learning workloads."}]