Top Machine Learning Engineer Jobs Openings in 2025
Looking for opportunities in Machine Learning Engineer? This curated list features the latest Machine Learning Engineer job openings from AI-native companies. Whether you're an experienced professional or just entering the field, find roles that match your expertise, from startups to global tech leaders. Updated everyday.
Curriculum Engineer
AIFund
51-100
-
No items found.
Contractor
Remote
true
AI is the new electricity. Millions of AI engineers are needed to transform industries with AI, particularly in the realm of GenAI, and we’re building an education platform to train them. With a mission to grow and connect the global AI community, DeepLearning.AI is an education technology company that is empowering the global workforce to build an AI-powered future through world-class education, hands-on training, and a collaborative community. We’re a small tech company with serious credentials, exciting marketing challenges, and wonderful teammates.
DeepLearning.AI is looking for a Curriculum Engineer to work alongside our growing team of developers and engineers. In this role, you will apply your programming skills to help Subject Matter Experts (SMEs) create and / or revise AI and ML coding exercises, write and maintain high-quality code, test and debug new code exercises on the Coursera platform, and build and deploy autograder software for coding exercises. Your experience as an educator, your technical skills in the AI space and your ability to communicate and work well with a team will all be critical for this role.
We are open to remote global workers as long as they are available to work within 3 time zones of California (PDT / GMT -7).
Here’s what you’ll do:Help Subject Matter Experts (SMEs) to create and / or revise coding assignments: In this role, you will be responsible for creating and / or revising coding exercises in collaboration with SMEs. Although you do not have to be an expert in the subject of every course, you have to be sufficiently technical to develop coding exercises given a set of specifications, or review / revise code that was written by others. As such, we expect you to be proficient in math (calculus, linear algebra and statistics) as well as Python programming (functions, classes, data structures, machine learning frameworks) at a level where you can rapidly learn AI topic areas that are new to you and contribute to / provide feedback on course code exercises. Write and maintain high-quality code: Your software engineering skills are more important for this role than your knowledge of AI, but at least a basic knowledge of AI / machine learning (enough to be a successful learner in the course you’re working on) is a requirement. You’ll be expected to collaborate with SMEs and others to write well documented code that is easily human readable and maintain that code using best practices in GitHub. Python is our preferred language, but a high degree of proficiency in another OOP language similar to Python could serve as a replacement for the Python requirement identified above. Test and debug new code exercises on the Coursera platform and revise / update existing coding exercises: The development of code exercises often happens offline, but you’ll also be responsible for uploading and testing exercises within the Coursera Labs environment. You will also work on maintenance and updates of existing coding exercises. This includes implementing improvements such as fixing bugs / refactoring individual code exercises, collecting and modifying datasets to ensure the code exercises have the desired pedagogical impact, as well as implementing course-wide revisions to update to the latest version of a framework or package being used in the code exercises.Build and deploy autograder software for coding exercises: All of our coding exercises are set up to be automatically graded by software. Familiarity with writing good software unit tests, working with docker containers and debugging across various platforms is critical for this role. Apart from writing robust code, you will always be working with an eye toward how to “humanize” the autograder systems we deploy, such that learners get meaningful automated feedback when their code needs to be revised.
Here are the skills you should have:Technical background in math and programming at a level sufficient to follow and successfully complete an online course in machine learning such as DLAI’s Deep Learning Specialization. We expect you to be proficient in math (calculus, linear algebra, statistics etc.) as well as Python programming (functions, classes, data structures etc.) at a level where you can rapidly learn ML / AI topic areas that are new to you. Basic knowledge of AI / machine learning is required. Graduate degree (Master’s or PhD) in fields like Computer Science, Artificial Intelligence, Data Science, or a related STEM area, or a Bachelor’s degree and equivalent industry experience with hands-on expertise in machine learning, software development, or data-driven model deployment.In-depth knowledge and experience designing and/or teaching technical courses online or offline, with experience in designing, structuring, and teaching technical courses specifically in machine learning, data science, or artificial intelligence. Demonstrated ability to create interactive, applied learning experiences. Knowledge of software testing best practices and experience with Github, Docker.Excellent communicator with an ability to author and edit high-quality written content in English.Knowledge of instructional design best practices, specifically as it relates to content creation.A team player with willingness to be flexible in timing and tasks to produce the best quality product.
Bonus if you have:Industry experience as a data scientist, machine learning engineer or similar.Previous experience developing online asynchronous curriculum in the areas of AI, machine learning, data science, robotics or similar.Familiarity with pedagogical practices such as defining learning objectives and backward design, as well as experience applying such practices in the creation of online educational content.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Data Scientist
Data Science & Analytics
Apply
August 19, 2025
Curriculum Developer
AIFund
51-100
-
No items found.
Contractor
Remote
true
AI is the new electricity. Millions of AI engineers are needed to transform industries with AI, particularly in the realm of GenAI, and we’re building an education platform to train them. With a mission to grow and connect the global AI community, DeepLearning.AI is an education technology company that is empowering the global workforce to build an AI-powered future through world-class education, hands-on training, and a collaborative community. We’re a small tech company with serious credentials, exciting marketing challenges, and wonderful teammates.
DeepLearning.AI is looking for a Curriculum Developer (CD) to work with our internal Curriculum Product Managers (CPMs) and network of world experts in the AI space to create high-quality education content. In this role, you will leverage your expertise in learning experience design to help Subject Matter Experts (SMEs) produce high-quality courses and effective learning experiences. Your experience as an educator, your technical skills in the AI space, and your ability to communicate and work well with a team will all be critical for this role.
We are open to remote global workers as long as they are available to work within 3 time zones of California (PDT / GMT -7).
Responsibilities:Convert the SME’s vision into effective learning content: Our SMEs are experts in their technical domain but are not always teaching or curriculum experts. You will take their ideas and rough materials (hand-sketched slides, bullet-pointed notes, or recorded video lectures) and convert them into learner-facing content (slides, scripts, quizzes, reading items, etc.). Although you do not have to be an expert in the subject of every course, you must be sufficiently technical to understand the material and its intent, and rapidly upskill, before or in the process of creating a new program. Review and provide feedback on technical content: You will be learner zero, and the judge and advocate of how other learners may perceive the material. This requires expertise in instructional design and an ability to empathize with learners. You will need to identify areas of improvement in material produced by SMEs or your peers, suggest solutions, and implement them. Ensure consistently high content quality and good pedagogy. Effective courses are accurate, cohesive, and engaging. You will be working with peers, with the guidance of the SME and CPM to provide the connective tissue that aligns all the various pieces of feedback and content together. In this generalist role, you could be creating assessments one day, and making video edits to align the scripts and slides the next, all with great attention to detail.
Requirements:Technical background in math and programming at a level sufficient to follow and successfully complete an online course in machine learning such as DLAI’s Deep Learning Specialization. We expect you to be proficient in math (calculus, linear algebra, statistics etc.) as well as Python programming (functions, classes, data structures etc.) at a level where you can rapidly learn ML / AI topic areas that are new to you. Basic knowledge of AI / machine learning is required. Graduate degree (Master’s or PhD) in fields like Computer Science, Artificial Intelligence, Data Science, or a related STEM area, or equivalent industry experience with hands-on expertise in machine learning, software development, or data-driven model deployment.In-depth knowledge and experience designing and/or teaching technical courses online or offline , with experience in designing, structuring, and teaching technical courses specifically in machine learning, data science, or artificial intelligence. Demonstrated ability to create interactive, applied learning experiences.Experience designing and/or teaching technical courses online or offline. Must have created slide presentations, reading or lecture material, and assessments for post-secondary or adult learners. Excellent communicator with the ability to author and edit high-quality written content in English.A team player with a willingness to be flexible in timing and tasks to produce the best quality product.
Preferred:A PhD in Math, Data Science, Computer Science, or a related field. Industry experience as a data scientist, machine learning engineer, or similar.Previous experience developing online asynchronous curricula in the areas of AI, machine learning, data science, robotics, or similar.Familiarity with pedagogical practices such as defining learning objectives and backward design, as well as experience applying such practices in the creation of online educational content.
Machine Learning Engineer
Data Science & Analytics
Apply
August 19, 2025
AI/ML Software Engineer II
Conductor
201-500
USD
0
110000
-
130000
United States
Full-time
Remote
false
The rise of generative AI is fundamentally changing how people search for information and discover brands online. Conductor is the only end-to-end, AI-first platform that enables enterprises to create high-performing, valuable content at scale and ensure their brand is found everywhere customers are looking—from Google to generative AI engines like ChatGPT. Recognized by Forrester as a Leader and #1 rated by customers on G2 and TrustRadius, we are committed to building a workplace where our people can grow and make a positive impact. Conductor is a mission-driven company with a commitment to innovation, customer success, and culture. For Conductor, success is improving the lives of everyone in our orbit—our customers, our customers' customers, our employee-owners, and our communities. About the Role We are seeking a skilled and experienced Senior Machine Learning Engineer to join our team. Reporting to the Engineering Director, the successful candidate will take on a pivotal role in designing, developing, and implementing innovative solutions to our customers' problems, while adhering to the standards and principles of our R&D department. The Senior ML Engineer will work closely with data engineers, product managers, application developers, UI/UX designers, and other stakeholders to create scalable, high-performance, reliable, and secure SaaS applications. The ideal candidate is passionate about solving problems, influencing strategic thinking, and mentoring others within the team. Reporting to the Engineering Director, this engineer’s key responsibilities are: Design and implement end-to-end AI systems and ML pipelines, creating a scalable framework that powers iterative R&D workflows. Orchestrate multi-model decision workflows and implement efficient agent handoff systems that optimize for both performance and user experience. Design human-in-the-loop interactions allowing AI agents to collaborate effectively with content marketers, incorporate feedback, and refine workflows autonomously. Optimize AI models for efficiency, interpretability, and real-world performance in enterprise environments. Define AI evaluation frameworks and metrics to ensure continuous improvement of models and systems. Collaborate with cross-functional teams to unlock insights, innovation, and intelligence in our applications. Stay current with state-of-the-art techniques in AI/ML and apply this knowledge to our architectures. Mentor other engineers and foster a culture of knowledge sharing and collaboration. Understand challenges in the SEO space and solve problems, not just deliver features. Who you are: 3+ years of experience in software development, with at least 3 years in building scalable and secure SaaS application platforms and systems for intelligent applications. Deep knowledge of modern technologies including: LLM architectures (GPT-4, Claude, Gemini) Retrieval-augmented generation (RAG) and vector database (Milvus) optimizations Multi-agent AI architectures and agentic frameworks Experience with model evaluation, including designing self-assessment prompts and implementing performance dashboards using relevant metrics (confidence and perplexity scores, win-rate comparisons between model versions, etc.). Proficiency with traditional ML technologies (PyTorch, TensorFlow, AWS SageMaker, MLflow) and strong Python coding skills. Experience with cloud technologies, particularly in data processing and distributed systems. Knowledge of AWS technologies is a plus. Excellent communication skills, with the ability to translate complex technical concepts for both technical and non-technical stakeholders. Solution-oriented approach using rapid prototyping, experimentation, and iterative development. Self-directed, proactive, and comfortable working across organizational stacks, taking ownership of end-to-end problem-solving in a fast-paced environment. ------------------------------------------------------------------------------------------------------------------------------- Compensation: Conductor maintains competitive, performance-based compensation programs. The NYC base salary range for this role is currently $110,000 - 130,000 per year. Placement within that range will be dependent on experience. Benefits: Conductor offers the following attractive benefits and perks including: 100% covered employee medical plan, a dental & vision plans, 401(k) with employer contribution, an unlimited vacation policy, 10 sick days, short-term disability, long-term disability, generous paid parental leave, employee assistance program, flexible savings accounts, paid holidays, life and accidental death insurance, and a host of perks. Conductor LLC is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Bringing in diverse perspectives and challenging our assumptions is the clear key to growth; it drives innovation, creativity, faster problem-solving, and stronger decision making. All aspects of employment including the decision to hire, promote, train, discipline, or discharge, will be based on merit, competence, performance, and business needs. Conductor does not discriminate against any employee or applicant on the basis of race, color, ancestry, national origin, religion or religious creed, mental or physical disability, medical condition, genetic information, sex (including pregnancy, childbirth, and related medical conditions), sexual orientation, gender identity, gender expression, age, marital status, military or veteran status, or other characteristics protected by state or federal law or local ordinance. In addition, it is the policy of Conductor to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
August 19, 2025
Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
501-1000
-
France
United Kingdom
Canada
Full-time
Remote
true
Who are we?Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.Join us on our mission and shape the future!The integration team is responsible for developing and scaling machine learning algorithms and infrastructure for LLM post-training, with a focus on large-scale, distributed RL methods. We strive for excellence in both engineering and science by meticulously designing experiments and design docs. While tasks are assigned according to everyone’s expertise, there is a global team effort to write production code and support the team research efforts, depending on individual interests and organizational needs.In particular, this role aims to enhance the global quality of the post-training codebase by implementing new tools to ease and support research, optimizing post-training algorithms, and scaling distributed RL to unprecedented levels.Please Note: We have offices in London, Paris, Toronto, San Francisco, New York but we are also remote-friendly! Applicants for this role may work anywhere between UTC−06:00 and UTC+01:00.As a Member of Technical Staff, you will:Design and write high-performing and scalable software for training models.Develop new tools to support and accelerate research and LLM training.Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem.Craft and implement techniques to improve performance and speed up our training cycles, both on SFT, offline preference, and the RL regime.Research, implement, and experiment with ideas on our cluster and data infrastructure.Collaborate, Collaborate, and Collaborate with other scientists, engineers, and teams!You are an ideal candidate if you have:Extremely strong software engineering skills.Value test-driven development methods, clean code, and strive to reduce technical debts at all levels.Proficiency in Python and related ML frameworks such as JAX, Pytorch and/or XLA/MLIR.Experience using and debugging large-scale distributed training strategies (memory/speed profiling).[Bonus] Experience with distributed training infrastructures (Kubernetes) and associated frameworks (Ray).[Bonus] Hands-on experience with the post-training phase of model training, with a strong emphasis on scalability and performance.[Bonus] Experience in ML, LLM and RL academic research.This role is perfect for you if you:Have a deep passion for quality work.Enjoy tuning and optimising large LLM models.Comfortable working with people with different levels of software engineering skills, from beginner to more advanced.Comfortable diving into complex ML codebases to identify and resolve issues, ensuring the smooth operation of our systems.Thrive in a fast-paced, technically challenging environment, where you can contribute your innovative ideas and solutions.If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates that want the same thing, Cohere is the place for you.We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.Full-Time Employees at Cohere enjoy these Perks:🤝 An open and inclusive culture and work environment 🧑💻 Work closely with a team on the cutting edge of AI research 🍽 Weekly lunch stipend, in-office lunches & snacks🦷 Full health and dental benefits, including a separate budget to take care of your mental health 🐣 100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK🎨 Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement🏙 Remote-flexible, offices in Toronto, New York, San Francisco and London and co-working stipend✈️ 6 weeks of vacationNote: This post is co-authored by both Cohere humans and Cohere technology.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
August 19, 2025
Forward Deployed Engineer - US
Parloa
201-500
-
United States
Full-time
Remote
false
YOUR MISSION: As a Forward Deployed Engineer (FDE) at Parloa, you will be on the front lines of AI transformation at the world’s most sophisticated enterprises. You will implement, integrate, and extend Parloa's platform to meet complex, real-world customer needs - serving as the embedded technical expert during and after deployment. This is a high-autonomy, field-facing engineering role where you will work side-by-side with customer technical teams and internal stakeholders to unlock rapid adoption and value. You will own the hands-on development work required to activate customer use cases, from workflow integrations to custom extensions, in production environments. IN THIS ROLE YOU WILL: Deploy and integrate Parloa's platform into complex enterprise environments, ensuring successful implementation and real-world impact Own system configurations, API integrations, custom code extensions, and performance tuning in customer-specific setups Act as a trusted technical partner to customer engineering teams, working collaboratively through ambiguity and unique constraints Solve tough problems in real-time; debug, unblock, and optimize field implementations at speed Partner with Deployment Strategists and Product teams to ensure feedback loops from the field inform broader platform improvements Be on-site with customers during key phases of deployment; travel required up to 25% OUR TECH STACK Backend: TypeScript, Python, Node.js Infrastructure: Terraform, Azure, Kubernetes, CI/CD Pipelines Databases: MongoDB, CosmosDB, PostgreSQL Monitoring & Observability: Prometheus, Grafana, OpenTelemetry, ElasticSearch, Kibana, Datadog AI & Data: LLMs, Prompt Engineering, RAG WHAT YOU BRING TO THE TABLE: 4+ years of experience as a software, implementation, or systems integration engineer in a customer-facing capacity Hands-on experience with Python or TypeScript, Terraform, Kubernetes, and cloud platforms (preferably Azure) Strong working knowledge of APIs, MongoDB/PostgreSQL/CosmosDB, CI/CD pipelines, and observability tools like Grafana or Prometheus Comfort working independently in fast-paced, field-facing roles with high levels of ambiguity and responsibility Track record of delivering successful implementations for large enterprise clients Degree in Computer Science, Engineering, or related technical field Certifications in cloud or infrastructure technologies (AWS, Azure, GCP, Kubernetes) are a strong plus Demonstrated hands-on experience developing or deploying AI solutions using LLMs, including crafting and optimizing prompts for specific use cases, and familiarity with designing or integrating agentic architectures for autonomous task workflows WHATS IN IT FOR YOU: Join a diverse team of 40+ nationalities with flat hierarchies and a collaborative company culture, and enjoy an immersive onboarding experience in Berlin to dive into our product and culture. Opportunity to build and scale your career at the intersection of customer-facing roles and engineering in a dynamic startup on its journey to become an international leader in SaaS platforms for Conversational AI. A beautiful office with flair in the heart of NYC with all the conveniences, such as social area, snacks, and drinks. Competitive compensation and equity package. Flexible working hours, unlimited PTO, and travel opportunities. Access to a training and development budget for continuous professional growth. ClassPass membership, Nilo Health, Health insurance, weekly sponsored office lunches. Regular team events, game nights, and other social activities. Hybrid work environment - we believe in hiring the best talent, no matter where they are based. However, we love to build real connections and want to welcome everyone in the office on certain days Your recruiting process at Parloa: Recruiter video call → Meet your manager → Challenge Task → Leadership Assessment→ Bar Raiser Interview Why Parloa? Parloa is one of the fastest growing startups in the world of Generative AI and customer service. Parloa’s voice-first GenAI platform for contact centers is built on the best AI technology to automate customer service with natural-sounding conversations for outstanding experiences on all communication channels. Leveraging natural language processing (NLP) and machine learning, Parloa creates intelligent phone and chat solutions for businesses that turn contact centers into value centers by boosting customer service efficiency. The Parloa platform resolves the majority of customer queries quickly and automatically, allowing human agents to focus on complex issues and relationships. Parloa was founded in 2018 by Malte Kosub and Stefan Ostwald and today employs over 400+ people in Berlin, Munich, and New York. When you join Parloa, you become part of a dynamic and innovative team made up of over 34 nationalities that’s revolutionizing an entire industry. We’re passionate about growing together and creating opportunities for personal and professional development. With our recent $120 million Series C investment, we’re expanding globally and looking for talented individuals to join us on this exciting journey. Do you have questions about Parloa, the role, or our team before you apply? Please feel free to get in touch with our Hiring Team. Parloa is committed to upholding the highest data protection standards for our clients' and employees' data. All our employees are instrumental in ensuring the utmost care, GDPR, and ISO compliance, including ISO 27001, in handling sensitive information. * We provide equal opportunities to all qualified applicants regardless race, gender, sexual orientation, age, religion, national origin, disability status, socioeconomic background and other characteristics.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
August 18, 2025
Evaluation Scenario Writer - AI Agent Testing Specialist
Mindrift
1001-5000
USD
0
0
-
55
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleWe’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.Although every project is unique, you might typically: Designing structured test scenarios based on real-world tasks Defining the golden path and acceptable agent behavior Annotating task steps, expected outputs, and edge cases Working with devs to test your scenarios and improve clarity Reviewing agent outputs and adapting tests accordingly How to get startedSimply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.Requirements You have a Bachelor's or Master’s degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields. You have 3+ years of experience. Your level of English is advanced (C1) or above. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge. BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $55/hour depending on your skills, experience, and project needs. Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments. Work on advanced AI projects and gain valuable experience that enhances your portfolio. Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Data Scientist
Data Science & Analytics
NLP Engineer
Software Engineering
Apply
August 17, 2025
Evaluation Scenario Writer - AI Agent Testing Specialist
Mindrift
1001-5000
USD
0
0
-
55
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleWe’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.Although every project is unique, you might typically: Designing structured test scenarios based on real-world tasks Defining the golden path and acceptable agent behavior Annotating task steps, expected outputs, and edge cases Working with devs to test your scenarios and improve clarity Reviewing agent outputs and adapting tests accordingly How to get startedSimply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.Requirements You have a Bachelor's or Master’s degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields. You have 3+ years of experience. Your level of English is advanced (C1) or above. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge. BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $55/hour depending on your skills, experience, and project needs. Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments. Work on advanced AI projects and gain valuable experience that enhances your portfolio. Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
NLP Engineer
Software Engineering
Apply
August 17, 2025
Evaluation Scenario Writer - AI Agent Testing Specialist
Mindrift
1001-5000
-
India
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleWe’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.Although every project is unique, you might typically: Designing structured test scenarios based on real-world tasks Defining the golden path and acceptable agent behavior Annotating task steps, expected outputs, and edge cases Working with devs to test your scenarios and improve clarity Reviewing agent outputs and adapting tests accordingly How to get startedSimply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.Requirements You have a Bachelor's or Master’s degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields. You have 3+ years of experience. Your level of English is advanced (C1) or above. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge. BenefitsWhy this freelance opportunity might be a great fit for you? Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments. Work on advanced AI projects and gain valuable experience that enhances your portfolio. Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
NLP Engineer
Software Engineering
Apply
August 17, 2025
Evaluation Scenario Writer - AI Agent Testing Specialist
Mindrift
1001-5000
USD
0
0
-
55
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleWe’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.Although every project is unique, you might typically: Designing structured test scenarios based on real-world tasks Defining the golden path and acceptable agent behavior Annotating task steps, expected outputs, and edge cases Working with devs to test your scenarios and improve clarity Reviewing agent outputs and adapting tests accordingly How to get startedSimply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.Requirements You have a Bachelor's or Master’s degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields. You have 3+ years of experience. Your level of English is advanced (C1) or above. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge. BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $55/hour depending on your skills, experience, and project needs. Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments. Work on advanced AI projects and gain valuable experience that enhances your portfolio. Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
NLP Engineer
Software Engineering
Apply
August 17, 2025
Evaluation Scenario Writer - AI Agent Testing Specialist
Mindrift
1001-5000
USD
0
0
-
55
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleWe’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.Although every project is unique, you might typically: Designing structured test scenarios based on real-world tasks Defining the golden path and acceptable agent behavior Annotating task steps, expected outputs, and edge cases Working with devs to test your scenarios and improve clarity Reviewing agent outputs and adapting tests accordingly How to get startedSimply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.Requirements You have a Bachelor's or Master’s degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields. You have 3+ years of experience. Your level of English is advanced (C1) or above. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge. BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $55/hour depending on your skills, experience, and project needs. Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments. Work on advanced AI projects and gain valuable experience that enhances your portfolio. Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
NLP Engineer
Software Engineering
Apply
August 17, 2025
Evaluation Scenario Writer - AI Agent Testing Specialist
Mindrift
1001-5000
-
Poland
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleWe’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.Although every project is unique, you might typically: Designing structured test scenarios based on real-world tasks Defining the golden path and acceptable agent behavior Annotating task steps, expected outputs, and edge cases Working with devs to test your scenarios and improve clarity Reviewing agent outputs and adapting tests accordingly How to get startedSimply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.Requirements You have a Bachelor's or Master’s degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields. You have 3+ years of experience. Your level of English is advanced (C1) or above. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge. BenefitsWhy this freelance opportunity might be a great fit for you? Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments. Work on advanced AI projects and gain valuable experience that enhances your portfolio. Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
NLP Engineer
Software Engineering
Apply
August 17, 2025
Applied Research Engineer, Agents
Labelbox
201-500
USD
0
250000
-
300000
United States
Poland
Full-time
Remote
false
Shape the Future of AI At Labelbox, we're building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data-centric approaches that are fundamental to AI development, and our work becomes even more essential as AI capabilities expand exponentially. About Labelbox We're the only company offering three integrated solutions for frontier AI development: Enterprise Platform & Tools: Advanced annotation tools, workflow automation, and quality control systems that enable teams to produce high-quality training data at scale Frontier Data Labeling Service: Specialized data labeling through Alignerr, leveraging subject matter experts for next-generation AI models Expert Marketplace: Connecting AI teams with highly skilled annotators and domain experts for flexible scaling Why Join Us High-Impact Environment: We operate like an early-stage startup, focusing on impact over process. You'll take on expanded responsibilities quickly, with career growth directly tied to your contributions. Technical Excellence: Work at the cutting edge of AI development, collaborating with industry leaders and shaping the future of artificial intelligence. Innovation at Speed: We celebrate those who take ownership, move fast, and deliver impact. Our environment rewards high agency and rapid execution. Continuous Growth: Every role requires continuous learning and evolution. You'll be surrounded by curious minds solving complex problems at the frontier of AI. Clear Ownership: You'll know exactly what you're responsible for and have the autonomy to execute. We empower people to drive results through clear ownership and metrics. Role Overview As an Applied Research Engineer at Labelbox, you’ll sit at the junction of advanced AI research and real product impact, with a focus on the data that makes modern agents work—browser interactions, SWE/code traces, GUI sessions, and multi-turn workflows. You’ll drive the data landscape required to advance capable, adaptable agents and help shape Labelbox’s strategy for collecting, synthesizing, and evaluating it. You will possess expertise in LLM agents and planning/execution loops, plus creativity in tackling problems across data design, interaction, and measurement. You’ll publish meaningful results, collaborate with customer researchers in frontier AI labs, and turn prototypes into reliable, scalable features. Your Impact Create frameworks and tools to construct, train, benchmark and evaluate autonomous agent capabilities. Design agent-focused data programs using supervised fine-tuning (SFT) and reinforcement learning (RL) methodologies. Develop data pipelines from diverse sources like code repositories, web browsers, and computer systems. Implement and adapt popular open-source agent libraries and benchmarks with proprietary datasets and models. Engage with research teams in frontier AI labs and the wider AI community to understand evolving agent data needs for frontier models and share best practices. Collaborate closely with frontier AI lab customers to understand requirements and guide model development. Publish research findings in academic journals, conferences, and blog posts. What You Bring Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or related field. At least 3 years of experience addressing sophisticated ML problems with successful delivery to customers. Experience building and training autonomous agents—tool use, structured outputs, multi-step planning—across browsers/GUI, codebases, and databases using SFT and RL. Constructed and evaluated agentic benchmarks (e.g. SWE-bench, WebArena, τ-bench, OSWorld) and reliability/efficiency suites (e.g. WABER). Adept at interpreting research literature and quickly turning new ideas into prototypes. Deep understanding of frontier models (autoregressive, diffusion), post-training (SFT, RLVR, RLAIF, RLHF, et al.), and their human data requirements. Proficient in Python, data science libraries and deep learning frameworks (e.g., PyTorch, JAX, TensorFlow). Strong analytical and problem-solving abilities in ambiguous situations. Excellent communication skills. Track record of publications in top-tier AI/ML venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, etc.). Labelbox Applied Research At Labelbox Applied Research, we're committed to pushing the boundaries of AI and data-centric machine learning, with a particular focus on advanced human-AI interaction techniques. We believe that high-quality human data and sophisticated human feedback integration methods are key to unlocking the next generation of AI capabilities. Our research team works at the intersection of machine learning, human-computer interaction, and AI ethics to develop innovative solutions that can be practically applied in real-world scenarios. We foster an environment of intellectual curiosity, collaboration, and innovation. We encourage our researchers to explore new ideas, engage in open discussions, and contribute to the wider AI community through publications and conference presentations. Our goal is to be at the forefront of human-centric AI development, setting new standards for how AI systems learn from and interact with humans.Labelbox strives to ensure pay parity across the organization and discuss compensation transparently. The expected annual base salary range for United States-based candidates is below. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.Annual base salary range$250,000—$300,000 USDLife at Labelbox Location: Join our dedicated tech hubs in San Francisco or Wrocław, Poland Work Style: Hybrid model with 2 days per week in office, combining collaboration and flexibility Environment: Fast-paced and high-intensity, perfect for ambitious individuals who thrive on ownership and quick decision-making Growth: Career advancement opportunities directly tied to your impact Vision: Be part of building the foundation for humanity's most transformative technology Our Vision We believe data will remain crucial in achieving artificial general intelligence. As AI models become more sophisticated, the need for high-quality, specialized training data will only grow. Join us in developing new products and services that enable the next generation of AI breakthroughs. Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins. Our customers include Fortune 500 enterprises and leading AI labs. Your Personal Data Privacy: Any personal information you provide Labelbox as a part of your application will be processed in accordance with Labelbox’s Job Applicant Privacy notice. Any emails from Labelbox team members will originate from a @labelbox.com email address. If you encounter anything that raises suspicions during your interactions, we encourage you to exercise caution and suspend or discontinue communications.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
August 15, 2025
Senior AI Engineer (Europe remote - TS/Vue/NodeJS)
N8n
201-500
-
Germany
Full-time
Remote
true
n8n is a workflow automation platform that uniquely combines AI capabilities with business process automation. We give technical teams the flexibility of code with the speed of no-code, backed by a passionate community of builders. With 500+ integrations and fair-code principles, we're revolutionizing how businesses connect their systems and processes. We were founded end of 2019 and currently: 🧑🤝🧑 We’re a diverse team of + 120 talented people 🚀 Our annual recurring revenue is growing over 7x year-over-year ⭐️ With +118k GitHub stars, we are in the top 50 most popular projects of all time on Github 🍾 We were the 25th fastest growing startup last year and 4th BtoB SaaS Rising 100 this year in Europe according to Sifted 🌱 We were Sequoia's first seed investment in Germany, and most recently secured our $60M Series B (February '25, led by Highland)As a Senior AI Engineer, you'll drive intelligent features that redefine how users build automations. You'll build AI-powered capabilities, from natural language input to smart suggestions, that make creating workflows faster and more intuitive.You'll collaborate across engineering, product, and design to bring generative AI and LLM-based enhancements into the core user experience. You'll improve existing AI integrations, develop new ones, and shape how AI powers our product.You’ll work across the entire AI feature lifecycle:Architect and implement AI-powered capabilities: code generation, intelligent node creation, and workflow optimizationIntegrate LLM APIs and embedding models for text-to-workflow and natural language code suggestionsDesign and iterate on prompts to improve model output and user experienceBuild internal tooling, evaluation benchmarks, and automated testing for AI componentsCollaborate closely with other engineers to ensure AI features are reliable, performant, and scalableBalance experimentation with impact: ship quickly while focusing on user valueStay current with advances in LLMs, prompt frameworks, and developer tools - and bring those insights into our roadmapRequirements5+ years of experience building web-based products, ideally in B2B SaaS startupsStrong backend development skills with TypeScript, Node.js, and API designProven track record shipping AI-powered features in production with LLM APIs (OpenAI, Anthropic, etc.) and understanding of how to translate machine intelligence into user valueHands-on experience with prompt engineering, embedding models, and vector storesA user-focused mindset: you care about delivering features that solve real problemsA bias for shipping and learning - fast iteration is second nature to youBonus Points ForExperience fine-tuning LLMs or working with retrieval-augmented generation (RAG) systemsFrontend experience using Vue or ReactTechnical writing or documentation contributions, especially around developer tools or AIn8n is an equal opportunity employer and does not discriminate on the basis of race, religion, colour, national origin, gender, sexual orientation, gender identity, age, marital status, veteran status, or disability status.We can sponsor visas to Germany; for any other country, you need to have existing right to work.Our company language is English.You care about diversity and inclusion? We do too! Check out our Diversity, Inclusion and Belonging initiatives at n8n (https://www.notion.so/n8n/Diversity-inclusion-and-belonging-n8n-c1bec2fff536422d868b1a438d990e35).Location disclaimer: If you see multiple job postings for the same role, it is most likely because we're hiring remotely for this role and posting in different locations to make sure every potential candidate can see the role. Please apply to the location you're the most likely to work from in the future. Benefits Competitive compensation 💸 – We offer fair and attractive pay.Ownership 💪 – Our core value is to “empower others,” and we mean it—you’ll get a slice of n8n with equity.Work/life balance 🏖️ – We work hard but ensure you have time to recharge:Europe: 30 days of vacation, plus public holidays wherever you are.US: 15 vacation days, 8 sick days, plus public holidays wherever you are.Health & wellness 🩺 –Europe: We provide benefits according to local country norms.*US: Comprehensive medical (PPO 1200), dental, and vision plans.Future planning 💰 –Europe: We provide pension contributions according to local country norms.*US: 401(k) retirement plan.Financial security 🛡️ –Europe: We provide benefits according to local country norms.*US: Short-term & long-term disability insurance, life & AD&D coverage, and additional hospital coverage.Career growth 📈 – We hire rising stars who grow with us! You’ll get €1K (or equivalent) per year to spend on courses, books, events, or coaching to level up your skills.A passionate team 🤩 – We love our product, and we prove it with regular hackathons where we see who can build the coolest thing with it!Remote-first 🌏 – Our team works remotely across Europe, with regular off-sites for team bonding. Some roles, like sales in the US, are hybrid—please check the job description.Giving back 🤝 – We're big fans of open source, and you'll get $100 per month to support projects you care about.Transparency 🙏 – We all know what everyone’s working on, how the company is doing—the whole shebang.An ambitious but kind culture 😍 – People love working here—our eNPS for 2024 is 94!* Country-specific details are provided in your contract.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
August 15, 2025
Senior Machine Learning Engineer
Faculty
501-1000
-
United Kingdom
Full-time
Remote
false
About Faculty
At Faculty, we transform organisational performance through safe, impactful and human-centric AI. With more than a decade of experience, we provide over 350 global customers with software, bespoke AI consultancy, and Fellows from our award winning Fellowship programme. Our expert team brings together leaders from across government, academia and global tech giants to solve the biggest challenges in applied AI. Should you join us, you’ll have the chance to work with, and learn from, some of the brilliant minds who are bringing Frontier AI to the frontlines of the world.We operate a hybrid way of working, meaning that you'll split your time across client location, Faculty's Old Street office and working from home depending on the needs of the project. For this role, you can expect to be client-side for up-to three days per week at times and working either from home or our Old street office for the rest of your time. What You'll Be DoingWorking in our Defence business unit You will design, build, and deploy production-grade software, infrastructure, and MLOps systems that leverage machine learning. The work you do will help our customers solve a broad range of high-impact problems in the defence and national security space - examples of which can be found hereYou are engineering-focused, with a keen interest and working knowledge of operationalised machine learning. You have a desire to take cutting-edge ML applications into the real world. You will develop new methodologies and champion best practices for managing AI systems deployed at scale, with regard to technical, ethical and practical requirements. You will support both technical and non-technical stakeholders to deploy ML to solve real-world problems. To enable this, we work in cross-functional teams with representation from commercial, data science, product management and design specialities to cover all aspects of AI product delivery.The Machine Learning Engineering team is responsible for the engineering aspects of our customer delivery projects. As a Machine Learning Engineer, you’ll be essential to helping us achieve that goal by:Building software and infrastructure that leverages Machine Learning;Creating reusable, scalable tools to enable better delivery of ML systemsWorking with our customers to help understand their needsWorking with data scientists and engineers to develop best practices and new technologies; andImplementing and developing Faculty’s view on what it means to operationalise ML software.We’re a rapidly growing organisation, so roles are dynamic and subject to change. Your role will evolve alongside business needs, but you can expect your key responsibilities to include:
Working in cross-functional teams of engineers, data scientists, designers and managers to deliver technically sophisticated, high-impact systems.Leading on the scope and design of projectsOffering leadership and management to more junior engineers on the team Providing technical expertise to our customersTechnical DeliveryWho We're Looking ForAt Faculty, your attitude and behaviour are just as important as your technical skill. We look for individuals who can support our values, foster our culture, and deliver for our organisation.We like people who combine expertise and ambition with optimism -- who are interested in changing the world for the better -- and have the drive and intelligence to make it happen. If you’re the right candidate for us, you probably:Think scientifically, even if you’re not a scientist - you test assumptions, seek evidence and are always looking for opportunities to improve the way we do things.Love finding new ways to solve old problems - when it comes to your work and professional development, you don’t believe in ‘good enough’. You always seek new ways to solve old challenges.Are pragmatic and outcome-focused - you know how to balance the big picture with the little details and know a great idea is useless if it can’t be executed in the real world.To succeed in this role, you’ll need the following - these are illustrative requirements and we don’t expect all applicants to have experience in everything (70% is a rough guide):Understanding of and interest in the full machine learning lifecycle, including deploying trained machine learning models developed using common frameworks such as Scikit-learn, TensorFlow, or PyTorchUnderstanding of the core concepts of probability and statistics and familiarity with common supervised and unsupervised learning techniquesExperience in Software Engineering including programming in Python.Technical experience of cloud architecture, security, deployment, and open-source tools. Hands-on experience required of at least one major cloud platformDemonstrable experience with containers and specifically Docker and KubernetesComfortable in a high-growth startup environment.Outstanding verbal and written communication.Excitement about working in a dynamic role with the autonomy and freedom you need to take ownership of problems and see them through to executionWhat we can offer you:
The Faculty team is diverse and distinctive, and we all come from different personal, professional and organisational backgrounds. We all have one thing in common: we are driven by a deep intellectual curiosity that powers us forward each day.
Faculty is the professional challenge of a lifetime. You’ll be surrounded by an impressive group of brilliant minds working to achieve our collective goals.
Our consultants, product developers, business development specialists, operations professionals and more all bring something unique to Faculty, and you’ll learn something new from everyone you meet.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
August 14, 2025
Senior/Staff AI/ML Engineer
Articul8
51-100
-
United States
Full-time
Remote
false
About us:At Articul8 AI, we relentlessly pursue excellence and create exceptional AI products that exceed customer expectations. We are a team of dedicated individuals who take pride in our work and strive for greatness in every aspect of our business. We believe in using our advantages to make a positive impact on the world and inspiring others to do the same.Job Description:Articul8 AI is seeking a Senior or Staff AI/ML Engineer to design, develop, and deploy AI-driven solutions that solve real-world problems at scale. You will work on regression and classification AI/ML models, integrating them with LLM Agentic frameworks, while optimizing performance for production environments. This role requires deep expertise in model architectures, AI/ML frameworks, cloud platforms, and software engineering best practices for training and deploying at scale.Responsibilities:Design, develop, and deploy AI/ML models for production use, from traditional ML regression algorithms to transformers-based architectures.Train, fine-tune, and optimize deep learning and LLM-based solutions.Collaborate with researchers, software engineers, and product teams to integrate AI capabilities into our applications.Evaluate and implement state-of-the-art AI/ML algorithms to improve model accuracy and efficiency.Optimize models for cloud and on-prem environments, ensuring low latency and high availability.Develop APIs and microservices to serve AI models in production.Stay up to date with the latest AI trends, research, and best practices.Ensure ethical AI practices, data privacy, and security compliance.Required Qualifications:8+ years of experience in AI/ML modeling development and deployment.Master’s / PhD in Computer Science, AI, Machine Learning, or an equivalent engineering field.Strong programming skills in Python or PyTorchExperience with LLMs, NLP, computer vision, or reinforcement learning.Experience with containerization (Docker, Kubernetes)Proficiency in data preprocessing and model evaluation techniques, as well as MLops like MLflow, Kubeflow.Experience with distributed computing frameworks (Spark, Ray, Dask).Experience with AWS, GCP, or Azure for AI model deployment.Strong problem-solving and analytical skills.Excellent communication and teamwork skills.Preferred Qualifications:Experience in multi-cloud AI deployments.Prior experience as technical lead, guiding junior engineers and data scientist throughout product life-cycles, from POC to model deployment.Prior experience working on AI SaaS platforms.Professional Attributes:Problem Solving: ability to break down complex problems into manageable components, devising creative solutions, and iteratively refining ideas based on feedback and experimental evidence.Collaboration and Communication: proficiency in working cross-functionally—communicating clearly, providing constructive criticism, delegating responsibilities, and respecting diverse perspectives.Project Management and Prioritization: demonstrated aptitude in balancing multiple projects, deadlines, and allocating time efficiently between short-term objectives and long-term goals.Critical Thinking: ability to carefully evaluate assumptions, questioning established methodologies, challenging own biases, and maintaining skepticism when interpreting results.Curiosity and Continuous Learning: ability to stay curious about advances in related fields and constantly seeking opportunities to expand knowledge base.Emotional Intelligence and Intellectual Humility: capable of displaying empathy, resilience, adaptability, and self-awareness. Ability to recognize own limitations, embracing uncertainty, acknowledging mistakes, and valuing others' contributions.What We Offer:By joining our team, you become part of a community that embraces diversity, inclusiveness, and lifelong learning. We nurture curiosity and creativity, encouraging exploration beyond conventional wisdom. Through mentorship, knowledge exchange, and constructive feedback, we cultivate an environment that supports both personal and professional development.If you're ready to join a team that's changing the game, apply now to become a part of the Articul8 team. Join us on this adventure and help shape the future of Generative AI in the enterprise.
Machine Learning Engineer
Data Science & Analytics
Apply
August 14, 2025
Manager Forward Deployed Engineering
Taktile
101-200
USD
0
250000
-
315000
United States
Full-time
Remote
false
About The RoleTaktile is a high-growth, post product-market-fit start-up, on a fast trajectory to becoming market leader in the field of automated decisioning. We are looking for a Forward Deployed Engineer Manager to help us transform how our customers make critical business decisions by overseeing a team of Forward Deployed Engineers onboarding them onto Taktile and ensuring they get real value from our platform. You ensure your team acts as a trusted advisor and supports customers in reaching their goals while maximizing Taktile’s impact.If you’re passionate about tech and AI, and have extensive experience with Python, SQL, and REST APIs, you’ll thrive here.What You'll do as Forward Deployed Engineering ManagerOversee Taktile deployments in production, technical delivery across multiple projects from scoping to stable production traffic.Apply technical expertise, problem-solving skills and creativity to help organizations address real-world challenges by partnering and problem solving with your team members. Your day could include reviewing solution architectures, co-developing decision logic or AI agents, or aligning with key customer stakeholders together with your team members.Reliably review solution design and scoping proposals, sequence delivery, and proactively remove blockers. You are making thoughtful trade-offs between scope, speed, and quality to ensure successful and timely project delivery.Manage capacity of your team and partners with RevOps/Customer operations to improve or introduce scalable processes to the Forward Deployed Engineering teams at Taktile.Partner with Taktile’s product management team to turn your understanding of customer needs into actionable product insights, directly influencing the evolution of Taktile’s product roadmap.You play a key role in scaling the Forward Deployed Engineering function by creating reusable resources, best practices, and tools that share your expertise and drive organizational growthYou actively coach and mentor Forward Deployed Engineers on your team, supporting their development and success.You hire, grow and retain a team of exceptional Forward Deployed Engineers.About YouYou bring 8+ years of engineering or technical deployment experience that includes customer-facing work.You had first experience of leading a technical customer-facing team of 3x direct reports.You have strong technical background, preferred in fields such as Computer Science, Mathematics, Software Engineering, Physics, and Data Science.You can write and review production-grade code using Python and SQL. You possess a strong understanding of REST APIs.You excel at breaking down complex problems and making quick, well-informed decisions even under pressure.You build strong relationships with both technical and business stakeholders at all levels, driven by curiosity and a customer-centric mindset that helps you understand their needs and solve their challenges.You are creative and proactive, always seeking new ways to deliver value and stand out with customers.You are collaborative and work well with your peers in product teams, engineers and other GTM teams.You are humble and have a growth mindset, with a willingness to learn new skills and methodologies and bring best practices into our business.You have excellent written and spoken English.You are open to a hybrid work model and can work from our NYC office at least three days per weekIdeal Qualifications (but not required)You have 8+ years of experience as a Forward Deployed Engineer, Solution Engineer, Implementation Specialist or an equivalent position within a B2B SaaS company.You have led a large technical customer facing team of 5-10 direct reports, have experience in hiring and retaining exceptional talent.You have experience in building AI applications.You have experience in applying and optimizing statistical and machine learning models to solve business problems.You have experience with at least one of the major cloud platforms (AWS, Azure, GCP).You are fluent in Spanish and/or Portuguese.What We OfferWork with colleagues that lift you up, challenge you, celebrate you and help you grow. We come from many different backgrounds, but what we have in common is the desire to operate at the very top of our fields. If you are similarly capable, caring, and driven, you'll find yourself at home here.Make an impact and meaningfully shape an early-stage company.Experience a truly flat hierarchy and communicate directly with founding team members. Having an opinion and voicing your ideas is not only welcome but encouraged, especially when they challenge the status quo.Learn from experienced mentors and achieve tremendous personal and professional growth. Get to know and leverage our network of leading tech investors and advisors around the globe.Receive a top-of-market equity and cash compensation package.Get access to a self-development budget you can use to e.g. attend conferences, buy books or take classes.Use the equipment of your choice including meaningful home office set-up.Our StanceWe're eager to meet talented and driven candidates regardless of whether they tick all the boxes. We're looking for someone who will add to our culture, not just fit within it. We strongly encourage individuals from groups traditionally underestimated and underrepresented in tech to apply.We seek to actively recognize and combat racism, sexism, ableism and ageism. We embrace and support all gender identities and expressions, and celebrate love in its many forms. We won't inquire about how you identify or if you've experienced discrimination, but if you want to tell your story, we are all ears.About UsTaktile is building the world's leading software platform for running critical and highly-automated decisions. Our customers use our product to catch fraudsters, prevent money laundering, and expand access to credit for small businesses, among many other use cases. Taktile is already making millions of such decisions across the globe every day.Taktile is based in Berlin, London and New York City. It was founded by machine learning and data science veterans with extensive experience building and running production ML in financial services. Our team consists of engineers, entrepreneurs, and researchers with a diverse set of backgrounds. Some of us attended top universities such as Harvard, Oxford, and Stanford and some of us have no degree at all. We have accumulated extensive work experience at leading tech companies, startups, and the enterprise software sphere.Our backers include Y Combinator, Index Ventures, and stellar angels such as the founders of Looker, GitHub, Mulesoft, Datadog and UiPath.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Solutions Architect
Software Engineering
Apply
August 13, 2025
Lead Cyber Security Evaluation Expert
Scale AI
5000+
USD
0
180000
-
200000
United States
Full-time
Remote
false
Scale is at the frontier of the AI industry, improving the world’s leading Generative AI and Large Language Models through model evaluations, human-powered supervised fine-tuning (SFT) datasets, world-class Reinforcement Learning with Human Feedback (RLHF), and more. We are seeking a deeply experienced and cross-functional Lead Cybersecurity Evaluation Expert to advise and oversee the technical quality and strategic scope of cutting-edge Cyber Test & Evaluation (T&E) projects assessing Large Language Models (LLMs). This internal expert will serve as the lead advisor across multiple cyber domains, guiding dataset development efforts, validating expert contributions from subcontractors, and ensuring that benchmarks reflect real-world complexity, domain authenticity, and technical rigor. The ideal candidate will possess deep hands-on knowledge across multiple cybersecurity domains—such as network exploitation, cryptographic systems, LLM adversarial testing, APT analysis, and cyber ethics—and have prior experience in red teaming, incident response, or threat intelligence. This role is pivotal to ensuring that all T&E artifacts generated by subcontracted experts meet the highest standards of realism, fidelity, and relevance. Key Responsibilities Domain oversight: Provide strategic oversight across all cyber subdomains including but not limited to malicious network traffic, cryptographic systems, adversarial LLM prompts, threat intelligence, and cyber ethics. Scoping & strategy: Collaborate with the Program Manager (you) to define project goals, deliverable scopes, evaluation frameworks, and technical benchmarks. Expert vetting: Assess the technical credibility of cyber experts proposed by subcontractors; conduct interviews and review technical artifacts to validate expertise. Quality control: Review and validate the accuracy, depth, and applicability of all datasets and question-answer pairs produced by subcontracted experts. Standardization: Establish and enforce evaluation rubrics, scenario fidelity criteria, and documentation standards to ensure consistency across all workstreams. Cross-domain bridging: Identify cross-domain gaps, propose integrated benchmark scenarios, and ensure logical alignment between adjacent domains (e.g., how network behavior supports APT identification). Stakeholder communication: Provide subject-matter advice to internal and external stakeholders on technical feasibility, risks, and coverage completeness. Required Skills 8+ years of hands-on experience in cybersecurity, with demonstrated proficiency across multiple domains (e.g., red teaming, cryptography, network forensics, cyber threat intelligence, adversarial ML). Proven experience in one or more of the following: red-teaming LLMs, TTP identification using MITRE ATT&CK, cryptographic protocol evaluation, or creation of high-fidelity cyber scenarios. Familiarity with cybersecurity testing methodologies (e.g., penetration testing, adversarial simulation, red team exercises). Strong analytical, evaluative, and problem-solving abilities. Excellent communication skills with a strong technical writing background. Preferred Qualifications Prior experience leading or advising multi-expert technical teams across multiple cybersecurity disciplines. Understanding of LLM architectures and AI model evaluation processes. Familiarity with T&E in government or defense settings (e.g., AFWERX, MITRE, DoD AI efforts). Certifications such as CISSP, OSCP, GCIH, GCIA, GPEN, or equivalent. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You’ll also receive benefits including, but not limited to: Comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.The base salary range for this full-time position in the location of Washington DC is:$180,000—$200,000 USDPLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
Machine Learning Engineer
Data Science & Analytics
Apply
August 13, 2025
Member of technical staff (Data Research)
H Company
201-500
-
France
United Kingdom
Full-time
Remote
false
About H:
H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential.H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning, and collaboration, where everyone has something to contribute.
About the Team: The AI Data team advances the performance of Large Language Models (LLMs) and Vision-Language Models (VLMs) through cutting-edge data-centric techniques. From synthetic data generation to model distillation and AI-driven preference alignment, we develop high-quality datasets that enhance model efficiency, reasoning, and adaptability. Our work directly impacts the training and fine-tuning of frontier AI systems, ensuring they learn from richer, more diverse, and better-structured data.Join us in shaping the future of AI through cutting-edge data optimization. We’re looking for driven individuals who thrive in fast-changing environments, adapt to new research paradigms, and eagerly take on challenges—whether deploying models, inspecting data, or pioneering new synthetic and reinforcement learning data methods.Key Responsibilities:Develop and implement cutting-edge data strategies to improve the performance, efficiency, and applicability of LLMs, VLMs and Action Models:Generate and augment synthetic multimodal datasets, including images, text, and action trajectories, to advance model capabilities in areas like VQA, agent behaviors, and virtual navigationApply model distillation techniques to optimize large-scale models for edge deployment, ensuring scalability without compromising performanceDesign and iterate on evaluation frameworks to target edge cases and measure model improvements across multiple domainsLead research into aligning data with human and AI preferences, implementing feedback loops to refine agent decision-making and learning behaviorsCollaborate effectively with cross-functional teams to integrate data-driven solutions into LLM, VLM and Agent systemsStay at the forefront of breakthroughs in AI data strategies, model distillation, and multimodal learning through active scientific explorationRequirements:Technical skills:Strong, polyvalent programming skills in Python covering parallel computing, system design, large-scale deployments, AWS deployments and model evaluationsExperience developing and maintaining multimodal data pipelinesExperience in training and deploying LLMs, VLMs or Pytorch modelsResearch skills:MSc or PhD in machine learning, computer vision, natural language processing, or a related fieldDeep understanding of training and evaluation paradigms for multimodal modelsSoft skills:Strong communication skills with technical and non-technical staffEffectiveness in fast-changing environmentsNice to Have:Experience with agent-specific data pipelines and improvement techniques is a plusExperience managing efficient multi-modal human annotation platforms is a plusLocation:Paris or London.This role is hybrid, and you are expected to be in the office 3 days a week on average.Please expect some travel between offices on a reasonable cadence (e.g., every 4-6 weeks).What We Offer:Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startupsCollaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environmentEnjoy a competitive salaryUnlock opportunities for professional growth, continuous learning, and career developmentIf you want to change the status quo in AI, join us.
Machine Learning Engineer
Data Science & Analytics
Data Engineer
Data Science & Analytics
Research Scientist
Product & Operations
Apply
August 13, 2025
Member of technical staff (Models)
H Company
201-500
-
France
United Kingdom
Full-time
Remote
false
About H:
H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential.H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning, and collaboration, where everyone has something to contribute.
About the Team: The Models team builds the foundational models that power our cutting-edge agentic technology. We focus on training techniques to optimize model capabilities specifically for agent applications. This allows us to achieve the best performance at a given inference cost. Our work spans the development of Large Language Models (LLMs) and Vision-Language Models (VLMs), enabling agents to perceive, understand, and act within complex environments. We are deeply involved in enhancing these models through training methods with a focus on improved instruction following, tool use, and interaction with dynamic environments via large-scale reinforcement learning and reward modeling. We operate at the intersection of research and product, translating cutting-edge research into practical solutions that drive the next generation of AI. We are looking for bright, motivated individuals to join our ranks and shape the future of superintelligent AI.Key Responsibilities:Develop and train advanced LLMs and VLMs, including multimodal architecturesResearch and implement training methods for enhanced capabilities like instruction following and tool useDesign and optimize data pipelines and training systems for large-scale distributed trainingCollaborate with cross-functional teams to integrate models into agentic AI systemsEvaluate model performance and communicate findings to stakeholdersStay current with advancements in LLMs, VLMs, and related fieldsRequirements:Technical skills:Strong programming skills (Python, Git)Expertise in deep learning frameworks (PyTorch, JAX, TensorFlow)Experience with large-scale distributed training of LLMs and VLMsHands-on experience with LLM training, alignment, and reinforcement learningKnowledge of multimodal architectures and applicationsResearch skills:Publications in top-tier AI conferences (e.g., NeurIPS, ICML, CVPR, ACL, ICCV)Advanced degree (PhD or MSc) in a relevant field (e.g., ML, DL, NLP, CV)Soft skills:Excellent communication and presentation skillsStrong collaboration and teamwork skillsPassion for AI and problem-solvingBonuses:Industry experienceExperience in LLM training with RLExperience with data processing techniquesLocation:Paris or London.This role is hybrid, and you are expected to be in the office 3 days a week on average.Please expect some travel between offices on a reasonable cadence (e.g., every 4-6 weeks).What We Offer:Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startupsCollaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environmentEnjoy a competitive salaryUnlock opportunities for professional growth, continuous learning, and career developmentIf you want to change the status quo in AI, join us.
Machine Learning Engineer
Data Science & Analytics
Computer Vision Engineer
Software Engineering
NLP Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
August 13, 2025
AI Solutions Intern
Nice
5000+
-
United States
Intern
Remote
false
At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.About the Role Join NiCE Public Safety as an AI Solutions Intern for our Fall Internship Program where you'll work with real 911 audio, AI models, and public safety data, creating tools that power the next generation of emergency response. You won’t be on the front lines. You’ll be behind them – writing the logic, testing the models, and building the dashboards that support real heroes across the U.S. In this internship, you will gain hands-on experience with AI, cloud, and analytics, real-world skills in software used by 911 professionals nationwide, the chance to test and improve tools that support real emergency teams and side-by-side work with senior engineers. This opportunity can lead to future full employment. Working hours will be 5-20 hours per week. How you will make an impact? Training and testing AI scoring tools for emergency calls Analyzing patterns from 911 data and build tools for dispatchers Working with APIs, microservices, and real-world cloud apps Working on automation Have you got what it takes? Experience with Python, JavaScript, or another scripting language Experience with Whisper, OpenAI APIs, LangChain or anything AI/ML Experience with AWS, GCP, Azure or cloud tools Experience APIs, webhooks, or building scrappy internal tools You will have an advantage if you also have: Experience/knowledge of CCaaS, CX, and Conversational Ai solutions. Experience/knowledge in the public safety sector. What’s in it for you? Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NiCE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NiCEr! Requisition ID: 8271
Reporting into: Senior Product Manager
Role Type: InternAbout NiCE NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NiCE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions. Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries. NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
August 12, 2025
No job found
There is no job in this category at the moment. Please try again later