Machine Learning Engineer Jobs

Discover the latest remote and onsite Machine Learning Engineer roles across top active AI companies. Updated hourly.

Check out 1871 new Machine Learning Engineer opportunities posted on The Homebase

Staff Strategic Sourcing Manager (Hardware)

New
Top rated
Together AI
Full-time
Full-time
Posted

Advance inference efficiency end-to-end by designing and prototyping algorithms, architectures, and scheduling strategies for low-latency, high-throughput inference. Implement and maintain changes in high-performance inference engines including kernel backends, speculative decoding, and quantization. Profile and optimize performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Design and operate reinforcement learning (RL) and post-training pipelines to jointly optimize algorithms and systems where most of the cost is inference. Make RL and post-training workloads more efficient with inference-aware training loops such as asynchronous RL rollouts and speculative decoding. Use these pipelines to train, evaluate, and iterate on frontier models on top of the inference stack. Co-design algorithms and infrastructure to tightly couple objectives, rollout collection, and evaluation with efficient inference, identifying bottlenecks across the training engine, inference engine, data pipeline, and user-facing layers. Run ablations and scale-up experiments to understand trade-offs between model quality, latency, throughput, and cost, and feed insights back into model, RL, and system design. Own critical systems at production scale by profiling, debugging, and optimizing inference and post-training services under real production workloads. Drive roadmap items requiring engine modification including changing kernels, memory layouts, scheduling logic, and APIs. Establish metrics, benchmarks, and experimentation frameworks for rigorous validation of improvements. Provide technical leadership by setting technical direction for cross-team efforts, and mentor engineers and researchers on full-stack ML systems work and performance engineering.

$200,000 – $280,000
Undisclosed
YEAR

(USD)

San Francisco
Maybe global
Onsite

Machine Learning Engineer, TTS Systems

New
Top rated
Bland
Full-time
Full-time
Posted

As an ML Engineer focused on Text To Speech (TTS), you will own the deployment, optimization, and maintenance of production TTS systems. Responsibilities include deploying and optimizing large-scale TTS models into production environments for reliable, low-latency inference; implementing and refining post-training and modern inference techniques to maximize throughput and audio quality; collaborating with cross-functional teams to ensure seamless rollout, A/B testing, and iterative improvement of production models; maintaining high availability and scalable infrastructure for multi-speaker, expressive, and controllable TTS use cases; and designing and documenting best practices for efficient TTS inference and system reliability.

$160,000 – $250,000
Undisclosed
YEAR

(USD)

San Francisco, United States
Maybe global
Hybrid

Research Engineer, Core ML

New
Top rated
Together AI
Full-time
Full-time
Posted

Advance inference efficiency end-to-end by designing and prototyping algorithms, architectures, and scheduling strategies for low-latency, high-throughput inference. Implement and maintain changes in high-performance inference engines including kernel backends, speculative decoding, quantization, and profiling and optimizing performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Unify inference with RL and post-training by designing and operating RL and post-training pipelines, optimizing algorithms and systems jointly for inference-heavy workloads, and making RL workloads more efficient with inference-aware training loops. Use RL pipelines to train, evaluate, and iterate on models, co-design algorithms and infrastructure to tightly couple objectives, rollout collection, and evaluation with efficient inference, and quickly identify bottlenecks across all layers. Run experiments to understand trade-offs between model quality, latency, throughput, and cost, and feed insights back into design. Own critical systems at production scale by profiling, debugging, and optimizing inference and post-training services under real production workloads, driving roadmap items involving engine modifications, establishing metrics and experimentation frameworks to validate improvements. Provide technical leadership by setting technical direction for cross-team efforts intersecting inference, RL, and post-training, and mentoring engineers and researchers on full-stack ML systems and performance engineering.

$200,000 – $280,000
Undisclosed
YEAR

(USD)

San Francisco
Maybe global
Onsite

Research Engineer / Machine Learning Engineer - B2B Applications

New
Top rated
OpenAI
Full-time
Full-time
Posted

As a Research Engineer in OpenAI's Applied Voice Team, you will design and build advanced machine learning models including state-of-the-art speech models such as speech-to-speech, transcribing, and text to speech, transforming research breakthroughs into tangible B2B applications like API and ChatGPT AVM. You will collaborate closely with software engineers, product managers, and forward deployed engineers to understand business challenges, address customer concerns, and deliver AI-powered solutions. You will implement scalable data pipelines, optimize models for performance and accuracy, ensure production readiness, and contribute to projects requiring cutting-edge technology and innovative approaches. Additionally, you will engage with the latest developments in machine learning and AI, participate in code reviews, share knowledge, and lead by example to maintain high-quality engineering practices. You will also monitor and maintain deployed models to ensure they continue delivering value, thereby influencing how AI benefits individuals, businesses, and society.

$295,000 – $445,000
Undisclosed
YEAR

(USD)

San Francisco, United States
Maybe global
Onsite

AI/ML Engineer

New
Top rated
Air Apps
Full-time
Full-time
Posted

Develop, train, and optimize machine learning models for various mobile app features. Research and implement state-of-the-art AI techniques to improve user engagement and app performance. Collaborate with cross-functional teams to integrate AI-driven solutions into applications. Design and maintain scalable ML pipelines, ensuring efficient model deployment and monitoring. Analyze large datasets to derive insights and drive data-driven decision-making. Stay updated with the latest AI trends and best practices, incorporating them into development processes. Optimize AI models for mobile environments to ensure high performance and low latency.

€60,000 – €76,000
Undisclosed
YEAR

(EUR)

Lisbon or Lisboa, Portugal
Maybe global
Onsite

Tech Lead, Android Core Product - Casablanca, Morocco

New
Top rated
Speechify
Full-time
Full-time
Posted

Work alongside machine learning researchers, engineers, and product managers to bring AI Voices to customers for diverse use cases. Deploy and operate the core ML inference workloads for the AI Voices serving pipeline. Introduce new techniques, tools, and architecture to improve the performance, latency, throughput, and efficiency of deployed models. Build tools to provide visibility into bottlenecks and sources of instability and design and implement solutions to address the highest priority issues.

$140,000 – $200,000
Undisclosed
YEAR

(USD)

Casablanca, Morocco
Maybe global
Remote

Tech Lead, Android Core Product - Guadalajara, Mexico

New
Top rated
Speechify
Full-time
Full-time
Posted

Work alongside machine learning researchers, engineers, and product managers to bring AI Voices to their customers for a diverse range of use cases; deploy and operate the core ML inference workloads for AI Voices serving pipeline; introduce new techniques, tools, and architecture to improve performance, latency, throughput, and efficiency of deployed models; build tools to gain visibility into bottlenecks and sources of instability and design and implement solutions to address the highest priority issues.

$140,000 – $200,000
Undisclosed
YEAR

(USD)

Guadalajara, Mexico
Maybe global
Remote

Tech Lead, Android Core Product - Cebu, Philippines

New
Top rated
Speechify
Full-time
Full-time
Posted

Work alongside machine learning researchers, engineers, and product managers to bring AI Voices to customers for a diverse range of use cases. Deploy and operate the core machine learning inference workloads for the AI Voices serving pipeline. Introduce new techniques, tools, and architecture to improve performance, latency, throughput, and efficiency of deployed models. Build tools to identify bottlenecks and sources of instability and design and implement solutions to address the highest priority issues.

$140,000 – $200,000
Undisclosed
YEAR

(USD)

Cebu, Philippines
Maybe global
Remote

Tech Lead, Android Core Product - Alexandria, Egypt

New
Top rated
Speechify
Full-time
Full-time
Posted

Work alongside machine learning researchers, engineers, and product managers to bring our AI Voices to their customers for a diverse range of use cases. Deploy and operate the core ML inference workloads for our AI Voices serving pipeline. Introduce new techniques, tools, and architecture that improve the performance, latency, throughput, and efficiency of our deployed models. Build tools to give us visibility into our bottlenecks and sources of instability and then design and implement solutions to address the highest priority issues.

$140,000 – $200,000
Undisclosed
YEAR

(USD)

Alexandria, Egypt
Maybe global
Remote

Tech Lead, Android Core Product - Nairobi, Kenya

New
Top rated
Speechify
Full-time
Full-time
Posted

Work alongside machine learning researchers, engineers, and product managers to bring AI Voices to customers for various use cases; deploy and operate core ML inference workloads for the AI Voices serving pipeline; introduce new techniques, tools, and architecture to improve performance, latency, throughput, and efficiency of deployed models; build tools to identify bottlenecks and sources of instability and design and implement solutions to address the highest priority issues.

$140,000 – $200,000
Undisclosed
YEAR

(USD)

Nairobi, Kenya
Maybe global
Remote

Want to see more Machine Learning Engineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.
(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Have questions about roles, locations, or requirements for Machine Learning Engineer jobs?

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What does a Machine Learning Engineer do?","answer":"Machine Learning Engineers design, build, and deploy AI systems that solve real-world problems. They transform research prototypes into production-ready solutions by creating scalable ML pipelines, optimizing model performance, and handling data preprocessing workflows. They integrate models with applications via APIs, implement monitoring systems, and ensure models perform reliably in production environments. Daily tasks include collaborating with data scientists, fine-tuning algorithms, building deployment infrastructure, and maintaining data privacy. They work across diverse applications like recommendation engines, fraud detection systems, and computer vision tools while ensuring models remain accurate and efficient."},{"question":"What skills are required for Machine Learning Engineer jobs?","answer":"Strong programming skills in Python are fundamental, alongside proficiency with ML frameworks like TensorFlow and PyTorch. Machine Learning Engineers need solid mathematics and statistics knowledge, particularly in linear algebra, calculus, and probability theory. Experience with cloud platforms (AWS, GCP, Azure) is essential for deploying models at scale. Skills in data preprocessing, feature engineering, and model evaluation are critical for building effective systems. Engineers should understand MLOps practices, RESTful APIs, containerization tools like Docker, and version control systems. Practical experience with deep learning architectures and natural language processing is valuable for specialized roles."},{"question":"What qualifications are needed for Machine Learning Engineer jobs?","answer":"Most Machine Learning Engineer positions require a bachelor's degree in computer science, mathematics, or related field, with many employers preferring advanced degrees for senior roles. Beyond formal education, employers value demonstrated experience building and deploying machine learning models. A strong portfolio showcasing completed projects is often more important than academic credentials alone. Relevant certifications from cloud providers or in specific ML frameworks can strengthen applications. Employers look for candidates with verifiable experience in model deployment, optimization, and maintenance. Knowledge of software engineering best practices like testing, version control, and documentation is increasingly essential in this hybrid role."},{"question":"What is the salary range for Machine Learning Engineer jobs?","answer":"Machine Learning Engineer salaries vary based on several key factors. Geographic location significantly impacts compensation, with tech hubs like San Francisco, Seattle, and New York typically offering higher wages. Experience level creates substantial differences, with senior engineers earning considerably more than entry-level positions. Specialized expertise in areas like computer vision, reinforcement learning, or NLP can command premium compensation. Company size and industry also influence pay scales, with large tech companies and finance firms often offering higher salaries than startups or non-profits. Educational background, portfolio quality, and demonstrated impact on previous business outcomes further affect earning potential."},{"question":"How long does it take to get hired as a Machine Learning Engineer?","answer":"The hiring timeline for Machine Learning Engineer positions typically ranges from 4-12 weeks, depending on the company's hiring process and your qualifications. The interview process often includes technical screenings, coding challenges, system design discussions, and model implementation exercises. Candidates with strong portfolios demonstrating deployed ML projects may progress more quickly through initial screens. Specialized roles requiring expertise in deep learning or specific domain knowledge might have longer evaluation periods. Companies often test both theoretical understanding and practical implementation skills through multi-stage interviews. Building relationships with hiring managers through professional networks can sometimes accelerate the process."},{"question":"Are Machine Learning Engineer jobs in demand?","answer":"Machine Learning Engineer jobs remain in high demand across industries as organizations implement AI solutions to solve complex problems. Companies actively recruit ML Engineers for applications in recommendation systems, fraud detection, computer vision, natural language processing, and autonomous technologies. The role's hybrid nature—combining software engineering and data science expertise—makes qualified candidates particularly valuable. Organizations need specialists who can both develop models and deploy them in production environments. While the field is competitive, professionals with demonstrated experience building and maintaining ML systems at scale continue to find strong opportunities, especially those with specialized knowledge in emerging areas like reinforcement learning."},{"question":"What is the difference between Machine Learning Engineer and Data Scientist?","answer":"Machine Learning Engineers focus on implementing and deploying models in production environments, while Data Scientists concentrate on research, analysis, and prototype development. ML Engineers build scalable pipelines, optimize model performance, and create deployment infrastructure using software engineering practices. Data Scientists explore data, develop statistical insights, and experiment with algorithms to solve business problems. ML Engineers work extensively with frameworks like TensorFlow and deployment tools, whereas Data Scientists may spend more time with analytical tools and statistical methods. While Data Scientists uncover patterns and build proofs of concept, ML Engineers transform these prototypes into robust, production-ready systems that can operate at scale."}]