AI Data Engineer Jobs

Discover the latest remote and onsite AI Data Engineer roles across top active AI companies. Updated hourly.

Check out 207 new AI Data Engineer opportunities posted on The Homebase

Sr. Data Engineer (Poland)

New
Top rated
Craft
Full-time
Full-time
Posted

You will build and optimize data pipelines, extract and model diverse datasets, and design maintainable software systems. The role also involves setting data strategies, incorporating best practices, and leveraging AI-powered tools to accelerate development.

Undisclosed

()

Maybe global
Remote Solely

Staff Data Engineer

New
Top rated
Figure AI
Full-time
Full-time
Posted

Design, deploy, and maintain Figure's training clusters. Architect and maintain scalable deep learning frameworks for training on massive robot datasets. Work together with AI researchers to implement training of new model architectures at a large scale. Implement distributed training and parallelization strategies to reduce model development cycles. Implement tooling for data processing, model experimentation, and continuous integration.

$150,000 – $350,000
Undisclosed
YEAR

(USD)

San Jose, United States
Maybe global
Onsite

Senior Data Engineer (Evergreen)

New
Top rated
Demandbase
Full-time
Full-time
Posted

Lead initiatives to improve entity identification datasets and design new data pipelines critical to the company's intelligence platform. Collaborate with the data science team to enable development and deployment of ML/AI models and optimize pipeline performance.

Undisclosed
YEAR

(USD)

Maybe global
Remote OK

Software Engineer, ML Data Platform

New
Top rated
Mirage
Full-time
Full-time
Posted

You will build and scale large-scale distributed data systems and feature pipelines to support machine learning products, including real-time streaming and batch processing workflows. The role also involves designing storage infrastructure, orchestrating workflows, optimizing performance and cost, and collaborating with ML teams to enable analytics and data science workflows.

Undisclosed
YEAR

(USD)

Maybe global
On-site

Analytics Engineer

New
Top rated
Anduril
Full-time
Full-time
Posted

Develop and maintain data systems architecture for data ingestion, transformation, and analytics for mission-critical air defense operations. Collaborate with stakeholders to support operational workflows, ensure data quality, perform root cause analysis, and occasionally travel to deploy solutions in real-world scenarios.

Undisclosed
YEAR

(USD)

Maybe global
On-site

Big Data Solutions Consultant, Spark Expert

New
Top rated
Databricks
Full-time
Full-time
Posted

The Big Data Solutions Consultant guides customers through the implementation of transformational big data projects, including end-to-end development and deployment of industry-leading big data and AI applications using the Databricks platform. They facilitate technical workshops, architect and deploy solutions, and ensure best practices are followed while mentoring other team members and supporting project scoping.

Undisclosed

()

Maybe global
On-site

Database Specialist - Contract

New
Top rated
PathAI
Contractor
Full-time
Posted

The Database Specialist will analyze and optimize storage strategies for machine learning experiment data and metadata, refactor ETL pipelines, and support ML R&D and analytics through improving infrastructure. Collaboration with ML, SRE, and platform teams is key, along with providing knowledge transfer for long-term maintainers.

Undisclosed

()

Maybe global
Remote Solely

Head of Data & Analytics

New
Top rated
Sanity
Full-time
Full-time
Posted

The Head of Data & Analytics will build and lead Sanity.io's data function, scaling a team of data engineers, analytics engineers, and analysts to deliver actionable insights. They will own and execute the data strategy, establish scalable processes and best practices, and partner cross-functionally to drive data-driven decisions and business outcomes.

Undisclosed
YEAR

(USD)

Maybe global
Remote OK

Data Engineer

New
Top rated
Jasper
Full-time
Full-time
Posted

The Data Engineer will design, scale, and maintain data infrastructure and data processing pipelines to power the training of state-of-the-art multimodal models. They will collaborate with research scientists and engineers to collect, clean, and process large-scale datasets and support ongoing innovation within the research team.

Undisclosed

()

Maybe global
Hybrid

Lead Data Engineer

New
Top rated
Air Ops
Full-time
Full-time
Posted

Own and scale the data platform that enables analytics on AI search visibility and content performance. Lead the design and operation of data pipelines, data modeling, and tooling while managing a high-output data and analytics engineering team.

Undisclosed

()

Maybe global
Hybrid

Want to see more AI Data Engineer jobs?

View all jobs

Access all 4,256 remote & onsite AI jobs.

Join our private AI community to unlock full job access, and connect with founders, hiring managers, and top AI professionals.
(Yes, it’s still free—your best contributions are the price of admission.)

Frequently Asked Questions

Have questions about roles, locations, or requirements for AI Data Engineer jobs?

Question text goes here

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

[{"question":"What does an AI Data Engineer do?","answer":"AI Data Engineers build and manage data pipelines specifically for AI and machine learning models. They design architectures that process diverse data types such as text, images, and videos for model consumption. Their daily work includes implementing data validation systems, ensuring quality, and integrating large-scale datasets from multiple sources. They create real-time data workflows, handle vector databases like FAISS or Milvus, and optimize performance of AI data infrastructure. Using tools like Python, SQL, Apache Spark and Airflow, they collaborate with data scientists and ML engineers to transform raw data into formats that support model training and deployment."},{"question":"What skills are required for AI Data Engineer jobs?","answer":"Strong programming skills in Python and SQL form the foundation for AI Data Engineer roles. Proficiency with data engineering frameworks like Apache Spark, Airflow, and Ray is essential for building robust pipelines. Experience with cloud platforms (AWS, GCP, Azure) and vector databases enables handling of AI-specific data needs. Skills in data quality assurance, monitoring, and error handling ensure reliable AI systems. Engineers should understand embedding techniques for unstructured data processing and have experience with ETL processes at scale. Soft skills like cross-functional collaboration are valuable as these roles bridge technical teams with AI scientists and business stakeholders."},{"question":"What qualifications are needed for AI Data Engineer jobs?","answer":"Most AI Data Engineer positions require a bachelor's degree in computer science, data engineering, or related technical fields, with many employers preferring master's degrees for senior roles. Hands-on experience building data pipelines for machine learning applications is crucial. Employers look for demonstrated expertise with cloud data services like Redshift, BigQuery or Snowflake, and familiarity with MLOps practices. Knowledge of data preprocessing techniques for unstructured data (text, images, videos) sets successful candidates apart. Professional certifications in cloud platforms or data technologies can strengthen qualifications, especially when combined with proven experience integrating large-scale datasets for AI workflows."},{"question":"What is the salary range for AI Data Engineer jobs?","answer":"Compensation for AI Data Engineers varies based on several key factors. Location significantly impacts pay, with tech hubs like San Francisco and New York offering higher salaries than smaller markets. Experience level creates substantial differences, with senior engineers commanding significantly more than entry-level positions. Specialized skills in emerging AI tools, vector databases, and specific cloud platforms can increase earning potential. Company size also matters—large tech companies and well-funded AI startups often pay premium rates. The specialized nature of preparing data for AI applications typically positions these roles at higher compensation levels than traditional data engineering positions with similar years of experience."},{"question":"How long does it take to get hired as an AI Data Engineer?","answer":"The hiring timeline for AI Data Engineers typically spans 4-8 weeks from application to offer. The process usually includes an initial resume screening, followed by a technical phone interview covering Python, SQL, and data pipeline concepts. Candidates then face 1-3 rounds of technical interviews focusing on data engineering problems, system design for AI workflows, and coding exercises. Some companies add take-home assignments demonstrating pipeline building for AI data. Final rounds often include discussions with potential team members and hiring managers. Specialized skills in AI data preprocessing and experience with vector databases can accelerate the process, especially for candidates with proven experience in similar roles."},{"question":"Are AI Data Engineer jobs in demand?","answer":"AI Data Engineer positions show strong demand as organizations build infrastructure for AI initiatives. This specialized role bridges traditional data engineering and AI needs, with job postings appearing at major institutions like Stanford and companies like OpenAI. The role is gaining recognition as essential for AI implementation success, particularly as companies scale their machine learning operations. Demand stems from the unique requirements of AI data pipelines, which differ significantly from traditional analytics infrastructure. Organizations need engineers who understand the specific data preprocessing needs of machine learning models and can build robust pipelines for handling diverse data types including text, images, and videos."},{"question":"What is the difference between AI Data Engineer and Data Engineer?","answer":"While both roles build data pipelines, AI Data Engineers specifically focus on preparing data for machine learning and AI systems rather than business analytics. They work extensively with unstructured data (text, images, videos), implementing specialized preprocessing techniques that traditional Data Engineers rarely handle. AI Data Engineers commonly use vector databases like FAISS and embedding libraries that aren't typical in standard data engineering. They must understand model training data requirements and build infrastructure supporting model deployment. Traditional Data Engineers concentrate on structured data flows, data warehousing, and analytics support, while AI Data Engineers create pipelines optimized for machine learning with features like data versioning, lineage tracking, and real-time AI-ready data delivery."}]