Location
San Francisco United States
San Francisco United States
Salary
(Yearly)
(Yearly)
(Yearly)
(Yearly)
(Yearly)
Salary information is not provided for this position.
Undisclosed
0
USD
150000
-
190000
Category
Data Scientist
Date posted
July 30, 2025
Job type
Full-time
Experience level
Mid level

Job Description

About the role

As a Data Scientist at Greenlite, you'll assess and improve the quality of AI systems that help banks and fintechs fight financial crime at a massive scale. You'll work directly with our biggest customers—institutions serving over a billion people—establishing metrics, guidelines, and evaluation frameworks that ensure our AI agents meet the highest standards for accuracy and regulatory compliance. Real customer needs inform your data science work and directly impact our AI quality, so you need to build robust evaluation systems, work effectively with engineering and product teams, understand regulated environments, and adapt quickly.

This is a core data science role on our Engineering team. You're an exceptional data scientist who understands that financial institutions need rigorous quality assessment for AI systems operating in highly regulated environments. You'll establish evaluation metrics, design feedback loops, and create guidelines that ensure our LLM outputs meet compliance standards and customer expectations. You're not just analyzing data—you're defining quality standards based on what our most sophisticated customers need for reliable AI-powered compliance operations.

Please note: We work in person Monday through Friday in our SF office.

What you'll do

Month 1:

  • Master our AI agent platform and understand complex financial compliance quality requirements

  • Shadow experienced team members on LLM output evaluation and customer quality assessments

  • Establish your first evaluation metrics for compliance use cases like sanctions screening and AML investigations

Month 2:

  • Own end-to-end quality assessment frameworks for major bank and fintech AI deployments

  • Build comprehensive evaluation dashboards and statistical analysis of LLM performance across customer environments

  • Partner with engineering and product teams to establish quality guidelines and improvement recommendations

  • Become the go-to expert for AI quality assessment in financial compliance applications

Ongoing:

  • Design and implement rigorous evaluation frameworks for LLM outputs in highly regulated environments

  • Establish metrics and guidelines that ensure AI agent responses meet regulatory compliance and customer quality standards

  • Lead statistical analysis of AI performance trends, failure modes, and improvement opportunities across customer deployments

  • Collaborate with Engineering, Customer, and Product teams as the AI quality expert

What we're looking for

  • 3-5 years of industry experience in data science, AI evaluation, or machine learning with focus on quality assessment

  • Bachelor's or Master's degree in Computer Science, Statistics, Mathematics, Physics, or other quantitative field

  • Strong proficiency in Python and experience with statistical analysis libraries (pandas, numpy, scipy, matplotlib)

  • Experience with LLM evaluation techniques, prompt analysis, and AI system quality assessment

  • Strong statistical knowledge including experimental design, hypothesis testing, and performance measurement

  • Experience with SQL, data visualization, and building dashboards for stakeholder communication

  • Understanding of evaluation metrics, A/B testing, and statistical significance in AI system assessment

Bonus points:

  • Experience with LLM evaluation frameworks, prompt engineering assessment, or AI safety metrics

  • Background in financial services, compliance, or other highly regulated industries

  • Experience with statistical analysis of text data, content quality assessment, or human-AI interaction evaluation

  • Familiarity with regulatory requirements for AI systems in banking or financial services

  • Previous experience establishing quality standards and evaluation processes for AI/ML products

About you

You're a data scientist who thrives at the intersection of statistical rigor and AI quality assessment. You understand that practical data science in AI means establishing reliable metrics and evaluation frameworks that ensure LLM outputs meet the accuracy and compliance standards that regulated industries demand. You're comfortable balancing technical analysis with clear communication of quality insights, and you want your evaluation expertise to have a direct, measurable impact on how financial institutions trust and adopt AI to fight crime.

Compensation & Benefits

  • Comprehensive healthcare, 401k matching, commuter benefits

  • 15 days PTO + holidays, unlimited sick days

  • Flexible leave options

  • Working late? We've got you covered with DoorDash and an Uber home

Join us in building AI that protects the global financial system from financial crimes that fund terrorism, human trafficking, and other serious threats.

Companies size
11-50
employees
Founded in
2022
Headquaters
New York City, NY, United States
Country
United States
Industry
Software Development
Social media
Visit website

Similar AI jobs

Here are other jobs you might want to apply for.

CA.svg
Canada

Data Scientist

Full-time
Data Scientist
US.svg
United States

Senior Data Scientist

Full-time
Data Scientist
US.svg
United States

Data Scientist, Claude Code

Full-time
Data Scientist
GB.svg
United Kingdom

Lead Data Scientist

Full-time
Data Scientist
US.svg
United States

Data Scientist

Full-time
Data Scientist
US.svg
United States

Data Scientist

Full-time
Data Scientist