About Us
Hippocratic AI has developed a safety-focused Large Language Model (LLM) for healthcare. The company believes that a safe LLM can dramatically improve healthcare accessibility and health outcomes in the world by bringing deep healthcare expertise to every human. No other technology has the potential to have this level of global impact on health.
Why Join Our Team
Innovative Mission: We are developing a safe, healthcare-focused large language model (LLM) designed to revolutionize health outcomes on a global scale.
Visionary Leadership: Hippocratic AI was co-founded by CEO Munjal Shah, alongside a group of physicians, hospital administrators, healthcare professionals, and artificial intelligence researchers from leading institutions, including El Camino Health, Johns Hopkins, Stanford, Microsoft, Google, and NVIDIA.
Strategic Investors: We have raised a total of $278 million in funding, backed by top investors such as Andreessen Horowitz, General Catalyst, Kleiner Perkins, NVIDIA’s NVentures, Premji Invest, SV Angel, and six health systems.
World-Class Team: Our team is composed of leading experts in healthcare and artificial intelligence, ensuring our technology is safe, effective, and capable of delivering meaningful improvements to healthcare delivery and outcomes.
For more information, visit www.HippocraticAI.com.
We value in-person teamwork and believe the best ideas happen together. Our team is expected to be in the office five days a week in Palo Alto, CA unless explicitly noted otherwise in the job description.
Overview
Hippocratic AI is seeking a PM to lead the development of our model evaluation and data generation platform. In this role, you’ll drive the creation of high-quality training and test datasets that inform our model’s roadmap and ensure safety in healthcare deployments.
Responsibilities
Define the strategy and architecture for model evaluation across agent behaviors.
Collaborate with data scientists, ML engineers, and clinicians to craft robust benchmarks.
Design and manage internal and external workflows for data labeling and generation.
Monitor data quality and iterate on tooling and process efficiency.
Work closely with the model training team to align data feedback loops with product performance.
Qualifications
3+ years in product management with experience in ML evaluation, labeling, or data pipelines.
Familiarity with language model datasets, especially in high-stakes or regulated settings.
Experience collaborating with labeling vendors, data QA teams, or managing Mechanical Turk-style pipelines.
Attention to detail in process design and tooling for human-in-the-loop systems.
***Be aware of recruitment scams impersonating Hippocratic AI. All recruiting communication will come from @hippocraticai.com email addresses. We will never request payment or sensitive personal information during the hiring process. If anything