About the role
As a Data Scientist at Greenlite, you'll assess and improve the quality of AI systems that help banks and fintechs fight financial crime at a massive scale. You'll work directly with our biggest customers—institutions serving over a billion people—establishing metrics, guidelines, and evaluation frameworks that ensure our AI agents meet the highest standards for accuracy and regulatory compliance. Real customer needs inform your data science work and directly impact our AI quality, so you need to build robust evaluation systems, work effectively with engineering and product teams, understand regulated environments, and adapt quickly.
This is a core data science role on our Engineering team. You're an exceptional data scientist who understands that financial institutions need rigorous quality assessment for AI systems operating in highly regulated environments. You'll establish evaluation metrics, design feedback loops, and create guidelines that ensure our LLM outputs meet compliance standards and customer expectations. You're not just analyzing data—you're defining quality standards based on what our most sophisticated customers need for reliable AI-powered compliance operations.
Please note: We work in person Monday through Friday in our SF office.
What you'll do
Month 1:
Master our AI agent platform and understand complex financial compliance quality requirements
Shadow experienced team members on LLM output evaluation and customer quality assessments
Establish your first evaluation metrics for compliance use cases like sanctions screening and AML investigations
Month 2:
Own end-to-end quality assessment frameworks for major bank and fintech AI deployments
Build comprehensive evaluation dashboards and statistical analysis of LLM performance across customer environments
Partner with engineering and product teams to establish quality guidelines and improvement recommendations
Become the go-to expert for AI quality assessment in financial compliance applications
Ongoing:
Design and implement rigorous evaluation frameworks for LLM outputs in highly regulated environments
Establish metrics and guidelines that ensure AI agent responses meet regulatory compliance and customer quality standards
Lead statistical analysis of AI performance trends, failure modes, and improvement opportunities across customer deployments
Collaborate with Engineering, Customer, and Product teams as the AI quality expert
What we're looking for
3-5 years of industry experience in data science, AI evaluation, or machine learning with focus on quality assessment
Bachelor's or Master's degree in Computer Science, Statistics, Mathematics, Physics, or other quantitative field
Strong proficiency in Python and experience with statistical analysis libraries (pandas, numpy, scipy, matplotlib)
Experience with LLM evaluation techniques, prompt analysis, and AI system quality assessment
Strong statistical knowledge including experimental design, hypothesis testing, and performance measurement
Experience with SQL, data visualization, and building dashboards for stakeholder communication
Understanding of evaluation metrics, A/B testing, and statistical significance in AI system assessment
Bonus points:
Experience with LLM evaluation frameworks, prompt engineering assessment, or AI safety metrics
Background in financial services, compliance, or other highly regulated industries
Experience with statistical analysis of text data, content quality assessment, or human-AI interaction evaluation
Familiarity with regulatory requirements for AI systems in banking or financial services
Previous experience establishing quality standards and evaluation processes for AI/ML products
About you
You're a data scientist who thrives at the intersection of statistical rigor and AI quality assessment. You understand that practical data science in AI means establishing reliable metrics and evaluation frameworks that ensure LLM outputs meet the accuracy and compliance standards that regulated industries demand. You're comfortable balancing technical analysis with clear communication of quality insights, and you want your evaluation expertise to have a direct, measurable impact on how financial institutions trust and adopt AI to fight crime.
Compensation & Benefits
Comprehensive healthcare, 401k matching, commuter benefits
15 days PTO + holidays, unlimited sick days
Flexible leave options
Working late? We've got you covered with DoorDash and an Uber home
Join us in building AI that protects the global financial system from financial crimes that fund terrorism, human trafficking, and other serious threats.