Evaluation Scenario Writer - AI Agent Testing Specialist
Contributors create structured test cases simulating complex human workflows, define gold-standard behavior and scoring logic to evaluate agent actions, analyze agent logs, failure modes, and decision paths, work with code repositories and test frameworks to validate scenarios, iterate on prompts, instructions, and test cases to improve clarity and difficulty, and ensure scenarios are production-ready, easy to run, and reusable.
Software Engineering Manager
Oversee the design and operation of the core platform including third-party providers, storage, billing, observability, security, and API. Provide technical leadership for various product and platform features. Improve developer experience to increase the speed of the team's shipping process. Guide efforts that bridge AI research to production across all modalities such as video, audio, image, and text. Understand the capabilities and limitations of state-of-the-art AI models and determine the best ways to leverage them in products. Partner with product, design, and research teams to ensure development aligns closely with user needs and business objectives.
Senior Software Engineer, Consumer Experience
As a Senior Software Engineer, you will drive the design and development of the core experiences that help students find opportunities, spanning search, discovery, jobs, and job search. You will set technical direction, uplevel engineering quality, and play a key role in scaling the platform and AI-powered experiences. You will design and implement scalable, high-availability systems powering search and discovery experiences across the platform. Lead the development of agentic AI experiences for students, including conversational AI for resume help, job search, and interview prep. Work with OpenAI real-time APIs and agentic frameworks to deliver intelligent, conversational features, and guide best practices for their use across teams. Collaborate with cross-functional partners (Product, Design, Data, GTM) to define, build, and iterate on features that drive member value and engagement. Own projects end-to-end, from technical design and implementation to rollout and monitoring, ensuring reliability and performance at scale. Champion engineering excellence through code reviews, technical mentorship, and improving tooling, standards, and architecture.
Software Engineering Manager, AI Observability & Evals Platform (San Francisco, CA)
The Engineering Manager will lead the team building LangSmith, the observability and evaluation platform for LLM applications. Responsibilities include building, mentoring, and growing a high-performing engineering team while fostering collaboration, ownership, and accountability. The manager will strengthen LangChain's engineering culture through mentorship, high-quality code, and technical excellence, shape the long-term technical direction, and ensure scalability and reliability of the LangSmith AI Observability Platform. They will partner with product and design teams to define scope, sequence, and success criteria for key initiatives, maintain a high bar for technical excellence with urgency and focus, write clean, maintainable, and well-tested code in Go/Python and Typescript, and engage directly with customers to understand needs and translate them into actionable product improvements.
Forward-Deployed Engineer - Enterprise (Founding Team)
As a Forward-Deployed Software Engineer at Fractional AI, you will play a leading role in building and shipping AI applications for customers by understanding their needs and deploying solutions in their environments. Your responsibilities include designing and developing custom AI solutions tailored to clients' unique needs, ensuring seamless integration and scalability. You will partner with a high-caliber team throughout the full project lifecycle, including requirements gathering, prototyping, system design, coding, testing, deployment, and support. You will actively shape the engineering and broader company culture, influencing learning, hiring, and celebration processes. Your role includes owning delivery from early scoping through deployment and iteration, maintaining a balance between coding and customer-facing problem solving. You will work closely with other engineers, product managers, and leadership, and engage with customers frequently to ensure solutions meet their needs.
Software Engineering Intern
As a Software Engineering Intern at Parallel, you will work on how AI agents retrieve and understand information from the open web. You will help train and scale embedding and ranking models that sit behind the company's APIs, where model quality, latency, and freshness all matter.
Product Engineer, GTM Innovation
As a product engineer on the GTM Innovation team, you will build high-impact applications and tools that accelerate OpenAI’s go-to-market efforts. You will work across the full product lifecycle for GTM, including prototyping, iterating, shipping, and maintaining products. You will embed with Sales, Technical Success, and Revenue Operations teams to identify user needs and build solutions for them. Applying OpenAI’s models in novel ways to solve real-world customer and internal workflow problems is also part of the role. Additionally, you will translate learnings from your work into feedback for Applied and Research teams to inform product development.
Forward Deployed Software Engineer (UK Defence)
As a Forward Deployed Software Engineer at Northslope, you will be responsible for helping customers solve their most valuable problems by working closely with stakeholders to conceive, architect, build, and maintain applications that have a measurable impact on their business. You will understand customers' most critical objectives and build operationalized workflows on their enterprise data that end users actually use to make change within the business. Your role involves iterating quickly, building from scratch, and helping organizations transform their operations at an uncommon speed. You will not only ship code but also help reshape how entire organizations work by putting powerful tools in the hands of executives and real operators.
Head of Engineering
As Head of Engineering at Adthena, you will be responsible for the technical execution behind a platform that processes vast volumes of search data, runs complex distributed computations, and delivers enterprise-grade insights. You will own the engineering roadmap execution for all products and platform capabilities, join architectural discussions concerning distributed data processing, ML pipelines, and web application architecture, and ensure the systems are scalable and reliable. You will champion engineering best practices, establish predictable delivery processes across all teams, implement measurable engineering KPIs, and drive continuous improvement in release quality. Additionally, you will lead and mentor direct reports, build an engineering organization that scales, foster a collaborative environment, partner with Product, Data Science, ML, and DevOps teams, contribute to the company-wide technology strategy, identify opportunities to accelerate innovation, and manage engineering budgets, tooling spend, cloud infrastructure costs, and vendor relationships.
Senior Software Engineer, Agent Development
As a full-stack product engineer in the Agent Building team, you will build tools that empower customers to create and optimize their AI agents. This involves working closely with the Agent Software Engineering and Product Management teams to understand workflows and design scalable abstractions and interfaces. Responsibilities include eliminating manual engineering work by developing self-serve functionality for agent configuration, training, and deployment; creating AI-powered tools for non-technical teams to manage workflows without coding; designing configuration abstractions that balance power with simplicity for diverse use cases; building monitoring and analytics to provide actionable insights for agent improvement; and owning features end-to-end from technical architecture through deployment and iteration.
Access all 4,256 remote & onsite AI jobs.
Frequently Asked Questions
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.