Location
San Francisco United States
San Francisco United States
Salary
(Yearly)
(Yearly)
(Yearly)
(Yearly)
(Hourly)
Undisclosed
-
Date posted
October 7, 2025
Job type
Full-time
Experience level
Mid level

Job Description

Our Mission

Reflection’s mission is to build open superintelligence and make it accessible to all.

We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.

About the Role

  • Design, build, and operate Reflection’s large-scale GPU infrastructure powering pre-training, post-training, and inference.

  • Develop reliable, high-performance systems for scheduling, orchestration, and observability across thousands of GPUs.

  • Optimize cluster utilization, throughput, and cost efficiency while maintaining reliability at scale.

  • Build tools and automation for distributed training, inference, monitoring, and experiment management.

  • Collaborate closely with research, training, and platform teams to accelerate development and enable large-scale training and inference.

  • Push the limits of hardware, networking, and software to accelerate the path from idea to model.

About You

  • Deep systems or infrastructure engineering experience in high-performance or distributed computing environments.

  • Strong understanding of GPUs, CUDA, NCCL, and large-scale training and inference frameworks and libraries (PyTorch, DeepSpeed, JAX, Megatron-LM, SGLang, vLLM, etc.).

  • Hands-on experience with containerization, orchestration, and cluster management (Kubernetes, Slurm, or similar).

  • Familiar with modern observability stacks and performance profiling tools.

  • High agency and the ability to thrive in a fast-paced, high-ownership startup environment.

  • Excited to build from zero to one defining how frontier-scale training/RL infrastructure is architected and operated.

  • Motivated by enabling researchers and engineers to build the world’s most capable open-weight AI systems.

What We Offer:

We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company, and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

  • Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.

  • Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.

  • Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.

  • Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time.

  • Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Apply now
Reflection is hiring a Member of Technical Staff - GPU Infrastructure. Apply through Homebase and and make the next move in your career!
Apply now
Companies size
1-10
employees
Founded in
2024
Headquaters
Paris, France
Country
France
Industry
Software Development
Social media
Visit website

Similar AI jobs

Here are other jobs you might want to apply for.

US.svg
United States

Member of Technical Staff - GPU Infrastructure

Full-time
MLOps / DevOps Engineer
US.svg
United States

AI Platform Security Engineer

Full-time
MLOps / DevOps Engineer
US.svg
United States

Engineering Manager, Core Services

Full-time
MLOps / DevOps Engineer
US.svg
United States

Detection & Response Engineer

Full-time
MLOps / DevOps Engineer
US.svg
United States

Director, Platform Engineering

Full-time
MLOps / DevOps Engineer
US.svg
United States

Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

Full-time
MLOps / DevOps Engineer
Open Modal