AI DevOps Engineer Jobs

Discover the latest remote and onsite AI DevOps Engineer roles across top active AI companies. Updated hourly.

Join our AI community Interested in Hiring?

Hiring by

Check out 1001 new AI DevOps Engineer opportunities posted on The Homebase

View detail

Infrastructure Engineer

New

Top rated

Faculty

–

Full-time

–

Posted

Feb 20, 2026 22:58

The Infrastructure Engineer is responsible for designing, building, and deploying robust, secure, and scalable cloud infrastructure for AI and machine learning workflows. They will work in a cross-functional team and partner with technical and non-technical stakeholders from the initial idea generation through to implementation and shipping. The role involves enabling Machine Learning Engineers and Data Scientists by contributing to internal best practices, standards, and reusable code repositories. The engineer will proactively identify and recommend new ways customers can leverage cloud infrastructure to address their key challenges, create and maintain reusable company-wide libraries and infrastructure-as-code, and research and integrate the best open-source technologies to enhance Faculty's infrastructure capabilities.

Undisclosed

()

London, United Kingdom

Maybe global

Hybrid

View detail

Staff DevOps Engineer

New

Top rated

webAI

–

Full-time

–

Posted

Feb 19, 2026 0:39

As a Staff DevOps Engineer at webAI, you will design and architect secure, scalable cloud and edge infrastructure for deploying AI workloads across multi-cloud and hybrid environments, build and maintain production-grade Infrastructure as Code managing over 100 resources with GitOps workflows and automated validation, design and operate production Kubernetes clusters optimized for AI/ML workloads with GPU support, implement secure CI/CD pipelines with integrated security controls and automated deployment workflows, lead MLOps infrastructure initiatives including model deployment pipelines and monitoring, design observability and monitoring systems with tools like Prometheus and Grafana aligned to performance indicators, implement security best practices including least-privilege access and automated compliance validation, lead incident response and reliability initiatives including on-call rotations and post-mortems, architect disaster recovery and business continuity strategies with automated backup and failover processes, develop reusable infrastructure modules to standardize deployment patterns, mentor engineers on cloud architecture and DevOps best practices, and drive technical documentation and knowledge sharing including runbooks and infrastructure standards.

Undisclosed

()

Austin, United States

Maybe global

Onsite

View detail

Site Reliability Engineer, Managed AI

New

Top rated

Crusoe

–

Full-time

–

Posted

Jan 24, 2026 8:16

The Site Reliability Engineer is responsible for designing and operating reliable managed AI services focused on serving and scaling large language model workloads. They build automation and reliability tooling to support distributed AI pipelines and inference services, define, measure, and improve SLIs/SLOs across AI workloads to ensure performance and reliability, and collaborate with AI, platform, and infrastructure teams to optimize large-scale training and inference clusters. Additionally, they automate observability by building telemetry and performance tuning strategies for latency-sensitive AI services, investigate and resolve reliability issues in distributed AI systems using telemetry, logs, and profiling, and contribute to the architecture of next-generation distributed systems designed specifically for AI-first environments.

$204,000 – $247,000

Undisclosed

YEAR

(USD)

San Francisco, United States

Maybe global

Onsite

View detail

Site Reliability Engineer, Inference Infrastructure

New

Top rated

Cohere

–

Full-time

–

Posted

Jan 13, 2026 6:00

As a Site Reliability Engineer on the Model Serving team, you will build self-service systems that automate managing, deploying, and operating services, including custom Kubernetes operators supporting language model deployments. You will automate environment observability and resilience, enabling all developers to troubleshoot and resolve problems, and take steps to ensure defined SLOs are met, including participating in an on-call rotation. Additionally, you will build strong relationships with internal developers and influence the Infrastructure team’s roadmap based on their feedback, as well as develop the team through knowledge sharing and an active review process.

Undisclosed

()

Toronto, Canada

Maybe global

Remote

View detail

DevOps Engineer

New

Top rated

Obviant

–

Full-time

–

Posted

Dec 9, 2025 10:33

The DevSecOps / Platform Engineer will design, implement, and operate secure, cloud-native infrastructure powering core data and application platforms for a defense-focused company. They will develop CI/CD pipelines, automate deployments, uphold security practices, and collaborate across teams to ensure reliability, scalability, and compliance for government users.

Undisclosed

()

Maybe global

Hybrid

View detail

Staff Software Engineer, Infrastructure

New

Top rated

Decagon

–

Full-time

–

Posted

Dec 9, 2025 4:35

You will design, build, and operate production infrastructure for high-scale, low-latency systems, owning critical services end-to-end to improve reliability and performance. The role also involves partnering with research and product teams, optimizing service latencies, evolving CI/CD and self-service tooling, and leading infrastructure-as-code and GitOps practices.

Undisclosed

YEAR

(USD)

Maybe global

On-site

View detail

Staff Infrastructure Security Engineer

New

Top rated

Crusoe

–

Full-time

–

Posted

Dec 9, 2025 4:16

The engineer will architect, deploy, and operationalize foundational security services to support Crusoe's move toward Zero Trust, serving as a technical leader for secrets management and identity architecture. Responsibilities span from driving enterprise-wide platforms like HashiCorp Vault to defining trust patterns and secure onboarding in a hybrid, multi-cloud environment.

Undisclosed

()

Maybe global

On-site

View detail

Enterprise Security Engineer

New

Top rated

PhysicsX

–

Full-time

–

Posted

Dec 4, 2025 23:47

You will be responsible for building and operationalizing the company's compliance program, implementing controls, and supporting audits in a fast-paced SaaS environment. Key tasks include managing GRC tools, automating workflows for compliance standards such as SOC 2 and ISO 27001, and supporting responses to customer security assessments.

Undisclosed

YEAR

(USD)

Maybe global

View detail

Freelance AI Red Team Engineer

New

Top rated

Mindrift

–

Part-time

Full-time

–

Posted

Dec 3, 2025 6:00

As a Freelance AI Red Team Engineer, you will evaluate and red team AI models, agents, and machine learning systems for safety risks and vulnerabilities. You will also develop automation tools, create rigorous test scenarios, and contribute to security research initiatives in the AI domain.

Undisclosed

HOUR

(USD)

Maybe global

Remote Solely

View detail

Freelance AI Red Team Engineer

New

Top rated

Mindrift

–

Part-time

Full-time

–

Posted

Dec 3, 2025 6:00

Evaluate and red team AI models and agents for vulnerabilities and safety risks, and develop automation tools and test harnesses for AI systems. Contribute to security research initiatives, including designing and implementing challenging attack scenarios for AI models.

Undisclosed

HOUR

(USD)

Maybe global

Remote Solely

Want to see more AI DevOps Engineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.

Join our community

(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Have questions about roles, locations, or requirements for AI DevOps Engineer jobs?

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What does an AI DevOps Engineer do?","answer":"AI DevOps Engineers build and maintain ML pipelines in cloud environments, implementing CI/CD workflows specifically for AI applications. They create monitoring solutions that track not just system health but also data quality and model performance. Their daily work includes developing cloud infrastructure code using tools like Terraform and Ansible, ensuring AI applications scale effectively. They collaborate with data scientists to deploy models, troubleshoot production issues, and implement security protocols. Unlike traditional developers, they bridge the gap between data science and operations, ensuring ML models transition smoothly from development to production environments."},{"question":"What skills are required for AI DevOps Engineer jobs?","answer":"AI DevOps Engineers need strong cloud platform expertise, particularly in AWS, Azure, or GCP. Proficiency with infrastructure-as-code tools like Terraform and Ansible is essential. Container orchestration skills using Docker and Kubernetes help manage AI workloads. Experience with CI/CD pipelines through Jenkins or GitLab CI enables automated model deployment. Python scripting ability supports both automation and ML pipeline integration. Monitoring skills using Prometheus and Grafana help track model performance. Beyond technical abilities, these roles require collaboration skills to work effectively with data scientists and developers, plus problem-solving aptitude to troubleshoot complex AI system issues."},{"question":"What qualifications are needed for AI DevOps Engineer jobs?","answer":"Most AI DevOps Engineer positions require a minimum of 3 years of software development experience and 2+ years of cloud deployment experience, with Azure often preferred. A computer science or related degree is typically expected, though equivalent experience may substitute. Employers look for candidates with hands-on experience using development and deployment tools like GitLab and Atlassian suite products. While not always mandatory, certifications in cloud platforms (AWS Solutions Architect, Azure DevOps Engineer) and container orchestration (CKA) strengthen applications. Experience building CI/CD pipelines specifically for ML workflows gives candidates a significant advantage in the hiring process."},{"question":"What is the salary range for AI DevOps Engineer jobs?","answer":"AI DevOps Engineer salaries vary based on several key factors. Geographic location significantly impacts compensation, with tech hubs like San Francisco and New York offering higher wages. Experience level creates substantial differences, with senior engineers earning considerably more. Specialized expertise in high-demand tools like Kubernetes or specific cloud platforms (AWS/Azure/GCP) can boost earnings. Industry sector also matters—financial services and healthcare organizations often pay premium rates for AI infrastructure expertise. Company size influences packages too, with large enterprises typically offering better benefits but startups potentially providing equity. Security clearances for sensitive projects may command additional compensation."},{"question":"How long does it take to get hired as an AI DevOps Engineer?","answer":"The hiring timeline for AI DevOps Engineers typically ranges from 4-8 weeks. The process usually begins with a screening call, followed by technical assessments testing cloud infrastructure skills and coding abilities. Candidates often face 2-3 rounds of interviews, including sessions with engineering managers and team members. Many employers include practical challenges related to containerization, CI/CD pipeline setup, or infrastructure-as-code implementations. Companies hiring for specialized AI infrastructure may extend the process with additional technical evaluations. Candidates with demonstrated experience in both DevOps and machine learning environments generally move through the pipeline faster than those from only traditional DevOps backgrounds."},{"question":"Are AI DevOps Engineer jobs in demand?","answer":"AI DevOps Engineer roles show strong demand as organizations integrate machine learning into their product offerings. Major companies like Boeing actively recruit for these positions to support AI applications in secure cloud environments. The specialized skillset—combining traditional DevOps practices with ML pipeline expertise—creates a smaller talent pool than for general DevOps roles. Organizations increasingly recognize that successful AI deployment requires specialized infrastructure and monitoring beyond conventional applications. This demand spans industries from technology and finance to manufacturing and healthcare, as each sector adopts AI capabilities requiring robust deployment pipelines, monitoring solutions, and infrastructure that traditional DevOps approaches don't fully address."},{"question":"What is the difference between AI DevOps Engineer and Traditional DevOps Engineer?","answer":"Traditional DevOps Engineers focus on application delivery pipelines, infrastructure automation, and system monitoring for conventional software. AI DevOps Engineers extend these skills to handle machine learning workflows, requiring specialized knowledge of model deployment, training pipelines, and experiment tracking. While both roles use similar tools (Docker, Kubernetes, CI/CD platforms), AI DevOps Engineers must understand data quality monitoring and model performance metrics that don't exist in traditional applications. They work more closely with data scientists and ML engineers, bridging the gap between data science and operations. AI DevOps requires additional considerations around computational resources, GPU scheduling, and optimizing infrastructure for machine learning workloads."}]