Member of Technical Staff, GPU Optimization
- Optimize model training and inference pipelines (data loading, preprocessing, checkpointing, deployment) to improve throughput, latency, and memory efficiency on NVIDIA GPUs.
- Design, implement, and benchmark custom CUDA and Triton kernels for performance-critical operations.
- Integrate low-level optimizations into PyTorch-based codebases, including custom operators, low-precision formats, and TorchInductor passes.
- Profile and debug the entire stack, from kernel launches to multi-GPU I/O paths, using tools such as Nsight, nvprof, PyTorch Profiler, and custom tooling.
- Collaborate with colleagues to co-design model architectures and data pipelines that are hardware-friendly while maintaining state-of-the-art quality.
- Stay current with the latest GPU and compiler technologies and assess their impact.
- Work closely with infrastructure and backend teams to improve cluster orchestration, scaling strategies, and observability for large experiments.
- Provide clear, data-driven insights on performance, quality, and cost trade-offs.
- Contribute to a culture of fast iteration, thoughtful profiling, and performance-centric design.
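The benchmarking discipline this role calls for follows a standard warm-up/repeat/median pattern. The sketch below is a minimal, hypothetical pure-Python harness illustrating only that structure; a real GPU kernel benchmark would additionally synchronize the device around each timing (e.g. with CUDA events or `torch.cuda.synchronize`), which is omitted here.

```python
import statistics
import time


def benchmark(fn, *args, warmup=3, iters=20):
    """Time fn(*args): discard warm-up runs, return the median wall-clock ms.

    Hypothetical sketch: shows only the warm-up / repeat / median structure.
    GPU benchmarks must also synchronize the device before reading timers,
    since kernel launches are asynchronous.
    """
    for _ in range(warmup):  # warm-up: exclude one-time setup and caching costs
        fn(*args)
    samples = []
    for _ in range(iters):
        t0 = time.perf_counter()
        fn(*args)
        samples.append((time.perf_counter() - t0) * 1e3)  # milliseconds
    return statistics.median(samples)  # median is robust to scheduler noise


# Usage: compare two implementations of the same operation on identical inputs.
baseline_ms = benchmark(sorted, list(range(10_000, 0, -1)))
```

Reporting the median (rather than the mean or minimum) is a common choice because it is robust to occasional scheduler or thermal outliers.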
Member of Technical Staff, ML Engineer
The Machine Learning Engineer will partner closely with Researchers to bring large-scale multimodal video diffusion models into production. Responsibilities include:

- Developing high-performance GPU-based inference pipelines for large multimodal diffusion models.
- Building, optimizing, and maintaining serving infrastructure to deliver low-latency predictions at large scale.
- Collaborating with DevOps teams to containerize models, manage autoscaling, and ensure uptime SLAs.
- Applying techniques such as quantization, pruning, and distillation to reduce latency and memory footprint without compromising quality.
- Implementing continuous fine-tuning workflows to adapt models based on real-world data and feedback.
- Designing and maintaining automated CI/CD pipelines for model deployment, versioning, and rollback.
- Implementing robust monitoring (latency, throughput, concept drift) and alerting for critical production systems.
- Exploring cutting-edge GPU acceleration frameworks (e.g., TensorRT, Triton Inference Server, TorchServe) to continuously improve throughput and reduce costs.
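The quantization technique mentioned above can be illustrated with a minimal sketch of symmetric per-tensor int8 quantization, the idea underlying many weight-compression schemes. This is a hypothetical, dependency-free toy; production pipelines use library support (e.g. TensorRT or `torch.ao.quantization`) rather than hand-rolled code like this.

```python
def quantize_int8(values):
    """Symmetric per-tensor int8 quantization: x ~= scale * q, q in [-127, 127].

    Hypothetical sketch of the core idea only: pick a scale from the largest
    magnitude, round each value to the nearest representable integer.
    """
    max_abs = max(abs(v) for v in values) or 1.0
    scale = max_abs / 127.0
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from int8 codes."""
    return [scale * v for v in q]


weights = [0.02, -1.5, 0.73, 1.5]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
# Round-trip error per element is bounded by scale / 2.
```

Storing int8 codes plus one float scale cuts memory roughly 4x versus float32, which is the latency/footprint trade-off the listing refers to.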
Machine Learning Engineer
Design, build, and maintain scalable machine learning systems encompassing data ingestion, preprocessing, training, testing, and deployment. Develop and optimize end-to-end ML pipelines covering data collection, labeling, training, validation, and monitoring to ensure system reliability and reproducibility. Implement robust MLOps practices such as model versioning, experiment tracking, continuous integration and deployment (CI/CD) for ML, and continuous monitoring in production. Collaborate with product and engineering teams to integrate and deploy models into real-time products with an emphasis on efficiency and scalability. Ensure data quality, observability, and performance across all AI systems. Stay current with the latest advancements in AI infrastructure, tooling, and research to maintain leadership in AI innovation.
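The experiment-tracking and model-versioning practices described above can be sketched as a tiny in-memory tracker that keys each run by a content hash of its hyperparameters, so identical configurations map to the same run ID. All names here (`RunTracker`, `start_run`, `log_metric`) are hypothetical; real systems such as MLflow or Weights & Biases add persistent storage, artifacts, and UIs.

```python
import hashlib
import json
import time


class RunTracker:
    """Hypothetical minimal experiment tracker: params + metrics per run."""

    def __init__(self):
        self.runs = {}

    def start_run(self, params):
        # Content-address the run by its config so reruns of the same
        # hyperparameters are reproducibly identified.
        run_id = hashlib.sha256(
            json.dumps(params, sort_keys=True).encode()
        ).hexdigest()[:12]
        self.runs[run_id] = {"params": params, "metrics": [], "start": time.time()}
        return run_id

    def log_metric(self, run_id, name, value, step):
        self.runs[run_id]["metrics"].append(
            {"name": name, "value": value, "step": step}
        )


tracker = RunTracker()
rid = tracker.start_run({"lr": 3e-4, "batch_size": 256})
tracker.log_metric(rid, "val_loss", 0.42, step=100)
```

Hashing a canonical JSON dump (`sort_keys=True`) is what makes the ID deterministic, which is the property that supports the reproducibility goal in the listing.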
Founding Senior Machine Learning Engineer
Train and fine-tune large language models (LLMs) and audio models to maximize speed, accuracy, and production-readiness for real-time AI voice experiences. Build datasets, define rigorous metrics, and measure model performance across high-impact voice AI tasks to guide development. Work closely with engineering to deploy models into production, monitor their performance, and ensure they remain fast, reliable, and accurate at scale. Build scalable pipelines to collect structured human feedback, benchmark subjective quality, and inform model iterations. Design and maintain machine learning infrastructure needed for fast experimentation, robust training, and continuous deployment.
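One concrete example of the "rigorous metrics" for voice AI tasks mentioned above is word error rate (WER), the standard edit-distance-based metric for speech transcription. The sketch below is a straightforward dynamic-programming implementation, offered as an illustration; production evaluations would use a tested library rather than this.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words.

    Classic Levenshtein DP over word sequences; counts substitutions,
    insertions, and deletions equally.
    """
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = minimum edits to turn ref[:i] into hyp[:j]
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # delete all i reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j  # insert all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / max(len(ref), 1)


wer = word_error_rate("turn on the lights", "turn off the light")
# 2 substitutions over 4 reference words -> 0.5
```

Note that WER can exceed 1.0 when the hypothesis contains many spurious insertions, which is why it is usually paired with the subjective-quality benchmarks the listing also calls for.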