Member of Technical Staff (All Levels) - Agent Data
As an Agent Data engineer at Basis, you will own projects completely from scoping to delivery and be the Responsible Party for the systems you design, deciding how to build them, measure success, and when to ship. You will manage yourself, plan your own projects, work closely with your pod, and take full responsibility for execution and quality. Your tasks include building and standardizing the data platform by designing data pipelines that ingest, validate, and transform accounting data into reliable datasets, defining schemas and data contracts, building validation, lineage tracking, and drift detection into every pipeline, and creating interfaces for data discovery, computation, and observation. You will model the domain as a system by translating accounting concepts into well-structured ontologies, creating abstractions to help AI systems reason about real-world constraints, and designing for clarity through schema, code, and documentation. Additionally, you will lead through clarity and technical excellence by owning the architectural vision for your area, running effective design reviews, mentoring engineers on system thinking including load testing, schema design, and observability patterns, and simplifying systems by removing accidental complexity and enforcing clean, stable abstractions.
Software Engineer, Distributed Data Systems
As a Data Engineer, you will architect and build the data infrastructure that powers all company operations, including crawling billions of pages, training embedding models, and serving real-time search. You will have autonomy in designing systems that scale to hundreds of petabytes. Responsibilities include designing lakehouse architectures, building and operating large-scale distributed data processing pipelines, creating streaming pipelines for real-time indexing, architecting data layers for embedding training infrastructure, and scaling deployments to handle analytical queries across petabytes of data.
Don't See Your Role? Apply Here!
The job posting does not specify explicit responsibilities for any particular role. It describes the company's mission, the importance of their work on AI model evaluations, and mentions various roles they are exploring without detailing specific responsibilities.
[UMOS ONE] Data & AI Engineering Lead
The responsibilities include developing AI models and integrating Agentic AI for routing, dispatching, and prediction, specifically using features extracted from knowledge graphs to develop AI-based optimal routing, dispatching technologies, demand prediction, ETA prediction, and improving analytic prediction models. The role also involves designing and implementing the integration architecture with Agentic AI systems. Additionally, responsibilities cover the design and development of mobility and logistics-specific ontologies, building knowledge graph-based data models, integrating and refining large heterogeneous data, and managing relationships among service entities to enhance data intelligence. Furthermore, the position requires designing, building, and operating large-scale data pipelines (ETL/ELT) for UMOS platforms, establishing and automating MLOps pipelines for stable model operation, and developing and integrating efficient API interfaces with service backend systems.
Data Engineer – Spark Specialist
Help users discover and master the Dataiku platform through user training, office hours, demos, and ongoing consultative support. Analyse and investigate various kinds of data and machine learning applications across industries and use cases. Provide strategic input to the customer and account teams that help make customers successful. Scope and co-develop production-level data science projects with customers. Mentor and help educate data scientists and other customer team members to aid in career development and growth.
Data Engineer
The Data Engineer will design, build, and maintain data pipelines, manage data ingestion, and develop reliable data models to support AI and ML workflows. The role also involves close collaboration with ML and product teams to ensure clean, structured, and high-quality data delivery for analytics and product features.
AI Pilot Vibe Coding Assistant (Freelance)
AI Pilot Vibe Coding Assistants collaborate with AI-driven systems to generate, refine, and submit accurate, well-structured outputs based on complex prompts. They handle coding, automation, data processing, troubleshooting technical issues, and improving AI output quality across diverse domains.
Data Engineer
The Data Engineer will design, build, and maintain scalable data pipelines to support analytics and data-driven decision making at Replit. They will collaborate across teams to deliver ETL/ELT workflows, ensure data quality, and build unified data models for in-depth analysis.
Member of Technical Staff, Data Engineering
As a Data Engineer specializing in pretraining data, you will be responsible for developing and maintaining data pipelines that support Cohere's advanced language models. You will manage the entire lifecycle of training data, including ingestion, cleaning, optimization, and modeling for optimal model performance, while collaborating with cross-functional teams to ensure the quality and efficiency of data curation.
Data Operations Manager
Build and scale data and financial operations to support deployment and growth of AI agents for major institutional clients. Take ownership of billing, collections, data infrastructure, dashboards, and cross-functional operations to provide actionable, real-time visibility to business leaders.
Access all 4,256 remote & onsite AI jobs.
Frequently Asked Questions
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.