Location
United States
United States
United States
United States
Salary
(Yearly)
(Yearly)
(Yearly)
(Yearly)
(Hourly)
Undisclosed
0
0
-
0
Date posted
September 23, 2025
Job type
Full-time
Experience level
Mid level

Job Description

< Remote - United States >

Job Description:
Stability AI’s Engineering Operations team is looking for a Senior Site Reliability Engineer (SRE) to join our growing team and play a pivotal role in improving and shaping our cloud infrastructure. The person will closely work with engineering, IT, security, and product teams to drive innovation and reliability in an evolving environment. Candidates should have the initiative to build and improve a maturing cloud landscape.

Responsibilities:

  • Developing and enforcing SRE best practices and standards across the organization.
  • Architecting and managing scalable systems in AWS and other cloud environments, focusing on high availability and resilience.
  • Implementing and maintaining infrastructure as code using Terraform.
  • Setting up and refining monitoring, logging, and alerting systems.
  • Driving incident management and root cause analysis to improve system reliability.
  • Championing SRE principles and mentoring junior team members.

Qualifications:

  • Collaborating with development teams to enhance CI/CD pipelines.
  • Experience scaling resource intensive systems, be it storage, networking, or compute.
  • Knowledge and experience with Kubernetes or other container scaling solutions
  • Background in software development or automation scripting.
  • Knowledge and experience with Grafana, ELK stack, or similar tools.
  • Cloud security experience.

Equal Employment Opportunity:

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

 

Apply now
Stability AI is hiring a Senior Site Reliability Engineer . Apply through Homebase and and make the next move in your career!
Apply now
Companies size
101-200
employees
Founded in
Headquaters
London, United Kingdom
Country
United Kingdom
Industry
Research Services
Social media
Visit website

Similar AI jobs

Here are other jobs you might want to apply for.

US.svg
United States

Platform Engineer

Full-time
MLOps / DevOps Engineer
IN.svg
India

Application Security Engineer

Full-time
MLOps / DevOps Engineer
CA.svg
Canada
GB.svg
United Kingdom
US.svg
United States

Member of Technical Staff, Infrastructure & Data

Full-time
MLOps / DevOps Engineer
US.svg
United States

Senior Infrastructure Engineer

Full-time
MLOps / DevOps Engineer
GB.svg
United Kingdom
CA.svg
Canada
US.svg
United States

Member of Technical Staff, Applied AI Engineer

Full-time
MLOps / DevOps Engineer
US.svg
United States

Senior Platform Engineer

Full-time
MLOps / DevOps Engineer
Open Modal