Location
United States
Salary
Undisclosed
Date posted
September 23, 2025
Job type
Full-time
Experience level
Mid level

Job Description

< Remote - United States >

Stability AI’s Engineering Operations team is looking for a Senior Site Reliability Engineer (SRE) to join our growing team and play a pivotal role in improving and shaping our cloud infrastructure. This person will work closely with engineering, IT, security, and product teams to drive innovation and reliability in an evolving environment. Candidates should have the initiative to build and improve a maturing cloud landscape.

Responsibilities:

  • Developing and enforcing SRE best practices and standards across the organization.
  • Architecting and managing scalable systems in AWS and other cloud environments, focusing on high availability and resilience.
  • Implementing and maintaining infrastructure as code using Terraform.
  • Setting up and refining monitoring, logging, and alerting systems.
  • Driving incident management and root cause analysis to improve system reliability.
  • Championing SRE principles and mentoring junior team members.

Qualifications:

  • Experience collaborating with development teams to enhance CI/CD pipelines.
  • Experience scaling resource-intensive systems, be it storage, networking, or compute.
  • Knowledge and experience with Kubernetes or other container scaling solutions.
  • Background in software development or automation scripting.
  • Knowledge and experience with Grafana, ELK stack, or similar tools.
  • Cloud security experience.

Equal Employment Opportunity:

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

 

Stability AI is hiring a Senior Site Reliability Engineer. Apply through Homebase and make the next move in your career!
Company size
101-200 employees
Headquarters
London, United Kingdom
Country
United Kingdom
Industry
Research
