Location
San Francisco, United States
Salary
USD 145,000 - 195,000 (Yearly)
Date posted
August 20, 2025
Job type
Full-time
Experience level
Mid level

Job Description

About LangChain

At LangChain, our mission is to make intelligent agents ubiquitous. We help developers build mission-critical AI applications across the entire agent development lifecycle. Our open source frameworks, LangChain and LangGraph, see over 70 million downloads per month. Developers rely on LangChain for composable integrations and LangGraph for controllable agent orchestration. Our commercial agent platform, consisting of LangSmith and LangGraph Platform, enables teams to build, test, run, and manage agents at scale across their organization.

Founded in 2023, LangChain powers top engineering teams at companies like Replit, Lovable, Clay, Klarna, LinkedIn, and more.

About the role

We are seeking Platform and Infrastructure Engineers with deep expertise in Kubernetes, cloud platforms, and modern deployment technologies to build and maintain the infrastructure that powers AI applications in our cloud and customer environments. You'll architect and operate the critical systems behind our customers' AI observability and deployments, working directly with cutting-edge technologies at the intersection of AI and distributed systems.

  • Design and Scale Infrastructure: Build and maintain scalable, high-throughput infrastructure solutions using Kubernetes, Helm, Docker, and multi-cloud environments (AWS, Azure, GCP) to support flagship SaaS products like LangSmith and LangGraph Platform.

  • Drive Reliability and Performance: Ensure platform reliability, security, and performance through robust monitoring, alerting, automated recovery systems, and proactive maintenance, including performance tuning and database optimization.

  • Contribute to Platform Strategy: Influence infrastructure strategy, tooling, and operational practices as the organization scales from startup to enterprise.

  • Enable Secure, Efficient Operations: Implement security best practices, compliance requirements, and infrastructure cost optimization strategies while architecting for high availability, disaster recovery, and resource efficiency.

  • Develop Automation and CI/CD Pipelines: Build and optimize CI/CD pipelines, infrastructure as code, and deployment automation strategies to streamline application delivery.

  • Support Customer Deployments: Create and maintain deployment solutions and monitoring tools for customer-hosted environments, and collaborate with engineering teams on application rollout and support.

  • Participate in Incident Response: Take part in the on-call rotation with a focus on learning, automation, and continuous improvement of incident response processes.

  • Document and Evolve Best Practices: Maintain comprehensive infrastructure documentation and stay up to date with emerging technologies and best practices in cloud-native systems.

How to be successful in this role

  • Experience: 3+ years building and operating production systems at scale

  • Programming proficiency: Strong hands-on software engineering skills (Python, Go, Rust)

  • Infrastructure expertise: Deep knowledge of Kubernetes, containerized infrastructure, cloud platforms (AWS, Azure, GCP)

  • Observability mastery: Hands-on experience with observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry or similar)

  • Proficiency in infrastructure as code tools (Terraform, CloudFormation, etc.)

  • Database expertise: Production experience with OSS datastores (PostgreSQL, Redis, Kafka)

  • Experience with CI/CD pipelines and automation tools

  • Strong communication skills for cross-functional collaboration with other engineers and customers

Nice to Have

  • Proficiency with analytical databases (e.g. ClickHouse)

  • Background in high-growth startups

  • Previous experience in AI/ML infrastructure

Compensation & Benefits

  • Competitive salary and equity stake for role and stage of company. Commensurate with experience.

  • Annual salary range: $145,000-$195,000 USD for Senior Engineers

LangChain is hiring a Platform/Infrastructure Engineer. Apply through Homebase and make the next move in your career!
Company size
101-200 employees
