Top AI Machine Learning Engineer Job Openings in 2025
Looking for AI Machine Learning Engineer opportunities? This curated list features the latest AI Machine Learning Engineer job openings from AI-native companies. Whether you're an experienced professional or just entering the field, find roles that match your expertise, from startups to global tech leaders. Updated every day.
Latest AI Jobs
Showing 61 – 79 of 79 jobs
Machine Learning Engineering Manager, Enterprise
Scale AI
5000+
USD
212000
-
254400
United States
Full-time
Remote
false
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we’re accelerating the usage of frontier data and models by building complex agents for enterprises around the world through our Scale Generative AI Platform (SGP).
The SGP ML team works on the front lines of this AI revolution. We interface directly with clients to build cutting-edge products using the arsenal of proprietary research and resources developed at Scale. As an ML Engineering Manager, you’ll manage a team of high-calibre Applied AI Engineers and MLEs who work with clients to train ML models to satisfy their business needs. Your team’s work will range from training next-generation AI cybersecurity firewall LLMs to training foundation agentic action models making predictions about business-saving outcomes. You will guide your team towards using data-driven experiments to provide key insights around model strengths and inefficiencies in an effort to improve products. If you are excited about shaping the future of the modern AI movement, we would love to hear from you!
You will:
Train state-of-the-art models, developed both internally and from the community, in production to solve problems for our enterprise customers.
Manage a team of 5+ Applied AI Engineers / ML Engineers.
Work with product and research teams to identify opportunities for ongoing and upcoming services.
Explore approaches that integrate human feedback and assisted evaluation into existing product lines.
Create state-of-the-art techniques to integrate tool-calling into production-serving LLMs.
Work closely with customers - some of the most sophisticated ML organizations in the world - to quickly prototype and build new deep learning models targeted at multi-modal content understanding problems.
Ideally you’d have:
At least 3 years of model training, deployment and maintenance experience in a production environment
At least 1-2 years of management or tech leadership experience
Strong skills in NLP, LLMs and deep learning
Solid background in algorithms, data structures, and object-oriented programming
Experience working with a cloud technology stack (e.g., AWS or GCP) and developing machine learning models in a cloud environment
Experience building products with LLMs, including knowing the ins and outs of evaluation, experimentation, and designing solutions to get the most out of the models
PhD or Masters in Computer Science or a related field
Nice to haves:
Experience in dealing with large-scale AI problems, ideally in the generative-AI field
Demonstrated expertise in large vision-language models for diverse real-world applications, e.g. classification, detection, question-answering, etc.
Published research in areas of machine learning at major conferences (NeurIPS, ICML, EMNLP, CVPR, etc.) and/or journals
Strong high-level programming skills (e.g., Python) and experience with frameworks and tools such as DeepSpeed, PyTorch Lightning, Kubeflow, TensorFlow, etc.
Strong written and verbal communication skills to operate in a cross-functional team environment
Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Director approval.
Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for an equity grant. You’ll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.
Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, and Seattle is: $212,000-$254,400 USD
PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
About Us: At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.
We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.
We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
Machine Learning Engineer
Data Science & Analytics
July 17, 2025
Staff Software Engineer, Pilots
Hayden AI
101-200
USD
0
221000
-
260000
United States
Full-time
Remote
false
About Us
At Hayden AI, we are on a mission to harness the power of computer vision to transform the way transit systems and other government agencies address real-world challenges. From bus lane and bus stop enforcement to transportation optimization technologies and beyond, our innovative mobile perception system empowers our clients to accelerate transit, enhance street safety, and drive toward a sustainable future.
Job Summary:
Hayden is seeking a Staff-Level Perception generalist to drive the full life cycle of our perception algorithms. In this pivotal role, you will contribute across all phases of the perception stack -- from early prototyping and validation to production deployment and real-world performance monitoring. You’ll work closely with cross-functional teams to support pilot programs critical to our market exploration, both locally and internationally. As part of a fast-paced startup entering a scale-up phase, you’ll play a foundational role in building reliable, scalable, and high-impact perception systems.
Responsibilities:
Software Engineering: Lead the development of robust, production-grade C++ perception modules. Deliver well-tested, high-performance code that is maintainable and scalable.
Pilot Program Support: Develop deep familiarity with all perception submodules. Coordinate work across perception teams to deliver customized solutions for pilot programs and international launches.
Cross-Functional Collaboration: Partner with front-end, back-end, embedded systems, and product teams to ensure seamless integration and monitoring of the perception stack in production environments.
Technical Leadership: Act as a hands-on tech lead, providing architectural guidance, conducting code and design reviews, and mentoring both senior and junior engineers. Foster a culture of technical excellence and collaboration within the team.
Required Qualifications:
Master’s degree in Robotics, Machine Learning, Computer Science, Electrical Engineering, or a related field, or equivalent practical experience.
10+ years of relevant experience with a proven track record in building and deploying real-world perception systems using a combination of Computer Vision, Machine Learning, and Robotics techniques. Experience in automotive or robotics domains is preferred.
Proficiency in C++ and Python, with the ability to write performant, reliable, and maintainable software in production environments.
Demonstrated skill in debugging and analyzing complex perception stacks across simulation, development, and real-world deployments.
Expertise in at least one of the following areas, with familiarity in others:
Machine Learning & Deep Learning
Computer Vision (e.g., feature detection, tracking, geometric methods)
Robotics (e.g., SLAM, state estimation, Kalman Filters, Particle Filters)
Excellent verbal and written communication abilities. Able to explain complex technical concepts clearly to both engineers and non-engineers.
Preferred Qualifications:
Past experience in providing technical direction, conducting code/design reviews, and mentoring junior and mid to senior engineers is a plus.
A strong academic track record in relevant fields (e.g., robotics, computer vision, machine learning) is a plus.
Machine Learning Engineer
Data Science & Analytics
Computer Vision Engineer
Software Engineering
Robotics Engineer
Software Engineering
Software Engineer
Software Engineering
July 17, 2025
ACL 2025: Member of Technical Staff
Cohere
501-1000
0
0
-
0
Canada
United Kingdom
United States
Full-time
Remote
true
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.
Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.
Join us on our mission and shape the future!
It's lovely to meet you in Vienna for ACL 2025. We're recruiting for a range of roles available for review here: https://jobs.ashbyhq.com/cohere
Please drop your CV under this posting and our Talent Team will reach out after the conference if there is a suitable role open.
Below is a job description for one of our Member of Technical Staff roles.
Why this role?
Design and implement novel research ideas, ship state-of-the-art models to production, and maintain deep connections to academia. We have one of the highest ratios of compute to engineers in the world. We do not delineate strongly between engineering and research. Everyone will contribute to writing production code and conducting research depending on individual interest and organizational needs. We have all the compute, data, and talent available for you to do your best work.
Please Note: We have offices in Toronto, London, San Francisco and New York but also embrace being remote-friendly!
There are no restrictions on where you can be located for this role.
As a Member of Technical Staff, you will:
Design, build and scale AI systems for serving our users.
Research, implement, and experiment with ideas on our supercompute and data infrastructure.
Learn from and work with the best researchers in the field.
You may be a good fit if you have:
Extremely strong software engineering skills.
Proficiency in Python and related ML frameworks such as TensorFlow, TF-Serving, JAX, and XLA/MLIR.
Experience writing kernels for GPUs using CUDA.
Experience using large-scale distributed training strategies.
Familiarity with autoregressive sequence models, such as Transformers.
Bonus: papers at top-tier venues (such as NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).
If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates who want the same thing, Cohere is the place for you.
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities.
Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
Full-Time Employees at Cohere enjoy these Perks:
🤝 An open and inclusive culture and work environment
🧑‍💻 Work closely with a team on the cutting edge of AI research
🍽 Weekly lunch stipend, in-office lunches & snacks
🦷 Full health and dental benefits, including a separate budget to take care of your mental health
🐣 100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK
🎨 Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
🏙 Remote-flexible, offices in Toronto, New York, San Francisco and London and co-working stipend
✈️ 6 weeks of vacation
Note: This post is co-authored by both Cohere humans and Cohere technology.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
July 16, 2025
Technical Founder in Residence
AI Fund
51-100
-
United States
Full-time
Remote
true
About the Role:
We’re launching a new venture in Generative AI and are seeking a founder-in-residence to lead it from concept to company. If you're a builder with technical GenAI experience and a track record of building products from 0 to 1, this is your opportunity to shape a venture at the ground floor.
As the Founder-in-Residence, you’ll set the product vision, lead the technical execution, and serve as the face of the company. This role requires a unique mix of deep technical expertise, product intuition, business execution, storytelling, and leadership.
Who We Are:
AI is the new electricity. Just as electricity transformed entire industries 100 years ago, AI is poised to do the same today. AI Fund is a venture studio that co-founds companies alongside entrepreneurs, and we build companies from inception to market launch. Founded in 2017 by Dr. Andrew Ng, the $370 million venture studio is backed by top-tier VC firms and investors, and brings to bear the AI Fund team’s combined experience as AI pioneers, entrepreneurs, venture capitalists, and builders.
Why Partner With AI Fund:
We’ve been there. We’ve founded and scaled successful companies ourselves, and we know that creating meaningful startups is really hard. We accelerate the company building process. Coming up with great ideas, turning the idea into a tangible product, assembling great teams, and helping raise capital is what we do. We shorten a process that can take years down to months. We make sure you are building a compelling product supported by great AI technology, and are surrounded by great teammates. But we also know that the process is not about us. It’s about great Founders and empowering them to do great things.
What We Are Looking For:
Technical Expertise:
Hands-on experience building and deploying GenAI or ML-powered applications (e.g., ones involving components such as LLMs, agentic workflows, reasoning systems, chatbots, evals, etc.).
Full-stack engineering understanding with a grasp of scalable, cloud-native application architecture.
Ability to architect and ship complex systems across frontend, backend, and infrastructure.
Product & Startup Experience:
Proven track record of building 0→1 products.
Experience navigating ambiguity, iterating quickly, and building products from scratch.
Team Leadership:
Experience building, managing, or scaling high-performing teams.
Strong leadership presence with the ability to inspire, align, and influence cross-functional teams.
Go-to-Market Skills:
Strong customer empathy: actively seeks out user feedback and integrates it into product decisions.
Understanding of market dynamics and competitive positioning.
Comfortable collaborating with sales, marketing, and GTM teams to shape product strategy and messaging.
Bonus Points For:
Experience presenting to investors or participating in capital raises.
Deep knowledge or strong familiarity with our target industry (e.g., renewable energy, infrastructure, compliance, etc.).
Previous co-founder or founding engineer of a VC-backed software company.
Characteristics We Value:
Accountability: an obligation or willingness to accept responsibility or to account for one's actions while doing so with the highest regard for integrity.
Leadership: able to influence others to follow you and lead the team to a brighter future.
Grit: able to stick with projects and work hard through good and bad times. High pain tolerance and can perform well under stress or pressure.
Scrappy: takes initiative and proactively gets things done with low resources, doing creative things, begging, borrowing, and whatever is needed in an ambiguous environment or situation.
Ownership orientation: demonstrated orientation of extreme ownership over all aspects of the company and extremely results-driven in nature.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Product Manager
Product & Operations
July 16, 2025
Engineering Manager, Evals (API)
OpenAI
5000+
USD
0
325000
-
405000
United States
Full-time
Remote
false
About the Team:
OpenAI's mission is to ensure that artificial general intelligence (AGI) benefits all of humanity. Through our API, we realize our mission by enabling everyone to harness the power of AGI safely, effectively, and at scale. Our API is the most widely used AI platform in the industry, empowering startups, indie developers, and Fortune 500 companies alike.
About the Role:
We're seeking an Engineering Manager to lead the team responsible for our Evals product on the OpenAI API. The Evals platform enables customers to rigorously evaluate the effectiveness of OpenAI models on their own use cases. It ensures effective assessment and improvement of AI model performance and reliability.
In this role, you will:
Build, mentor, and grow a high-performing engineering team focused on developing, scaling, and maintaining our Evals product.
Collaborate closely with product managers, researchers, and cross-functional stakeholders to define strategic vision and roadmap.
Guide your team through technical and architectural decisions, emphasizing scalability, robustness, and reliability.
Foster a culture of innovation, continuous learning, and accountability within the team.
Manage project timelines, priorities, and resource allocation effectively to meet organizational goals.
Ensure alignment with OpenAI’s broader mission and ethical guidelines for AI development and deployment.
Qualifications:
Proven track record managing teams that deliver high-quality products at scale.
Strong technical background with an understanding of modern software engineering practices and architecture.
Exceptional collaboration and communication skills, capable of aligning diverse stakeholders toward common objectives.
Experience or strong interest in machine learning, AI evaluation methodologies, and performance assessment.
Ability to operate effectively in a fast-paced, ambiguous startup environment.
Preferred Qualifications:
Prior experience building evaluation frameworks or infrastructure in AI/ML domains, especially involving large language models.
Familiarity with cloud platforms and distributed systems.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.
Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
July 16, 2025
AI Deployment Engineer
Bland
51-100
USD
0
120000
-
175000
United States
Full-time
Remote
false
AI Deployment Engineer (Pre-sales to Post-sales)
About Bland AI
We’re a Series B startup, and have raised $65 million from Emergence Capital, Y Combinator, and the founders of PayPal and Twilio. We have a 40-person team, and we serve customers like Better.com by delivering the most friendly, helpful, and human-like AI phone agents in the world.
Why This Role Exists
Every customer is different – the results they want to drive, expectations of their customer base, and internal systems are totally varied. We need people who are excited to dive into that ambiguity, turn fuzzy goals into concrete plans, ship the first pathways themselves, then broaden adoption across our customer’s org.
What You’ll Do
Presales partner. Jump on discovery calls, white-board integrations, and explain Bland’s unique features (and ability to drive value) when speaking with product and engineering leaders. You’re working directly with an AE to sell Bland – and to scope an implementation timeline and plan.
Own implementation start-to-finish. Design, build, and iterate pathways; wire up APIs; test relentlessly until the agent has “Bland tone” and feels human.
Coordinate stakeholders. Engineers, product, operations, and anyone else who’s involved with implementation—you’ll corral them, set timelines and expectations, persistently follow up, and make sure timelines are hit and outcomes are delivered.
Move fast and iterate. You build a thorough first agent (based on the scoping you define with the customer) and then you get it into production as fast as possible. You listen back to real calls, fix edge cases, then actively share those results with the customer’s team to build excitement and demonstrate the progress you’re driving.
Expand the footprint. By embedding yourself in the customer’s organization, you understand the business priorities, know the value calls are driving, and you identify new opportunities for expansion and then are incredibly persistent to scope those expansions and work with sales to close the upsell.
Be the face of Bland. You are the customer’s champion, their best employee, and you treat them with unreasonable hospitality. You travel on-site, get to know our customers on a human level, and develop real relationships with our champions and other stakeholders, going above and beyond to host training sessions and dinners.
Must-Have Qualifications
3-7 yrs in solutions engineering, product-minded software engineering, founding a startup, or any role that proves you can own a customer-facing build from zero to production.
Comfortable reading & writing REST/JSON; able to sling quick Python or JS scripts to glue systems together.
Track record of relentless ownership—moments where you ran through walls and surpassed immense challenges (can be both in personal and professional life)
Clear communicator who can translate LLM quirks - and the specifics of how Bland works – to everyone on our customer’s team (at the right level of complexity for your audience).
Ready for the intensity of a fast-growing, early-stage startup—the work is hard, the pace is high. This is not a big tech job. You’re joining Bland because you want to push yourself, you’re ambitious, and you care about doing great work.
Will spend the first month full-time in our beautiful Jackson Square office in San Francisco, then stay mostly in-person or on customer sites. As we build our AI engineering team, we want people to be in-person as much as possible and we have a strong in-office culture.
Nice-to-Haves
Built side-projects or shipped production features with LLMs.
Prior life as a founder, solutions architect, PM, CSM, or any other role where you had high ownership over delivering outcomes and were customer-facing. Prior experience owning pre-sales to post-sales to expansions is a huge plus (as long as you’re flexible and excited to learn how to work with LLMs!)
Exceptional new grads with strong ownership and ambition are welcome to apply.
You’ll Thrive Here If…
You’re smart, relentless, and love working with customers
You’re organized, keep tight timelines, and deliver clear updates (exceptionally thorough)
You like hopping on planes to visit customers on site (not a requirement, but hopefully it’s something you enjoy).
You care about the craft and about our customers’ customers – all our phone agents should sound truly human and have “Bland tone” – they should not sound like a corporate robot or like a phone tree.
Relentlessness is the most important quality
If you think you’re missing relevant experience but you’re a fast learner who’s excited for a new challenge – and you have the intangibles our team is looking for – please reach out. As long as you’re resourceful and a fast learner (and you can prove it to our team) we would love to meet you.
Compensation & Perks
Salary: $120k – $175k base + meaningful equity + benefits.
Gorgeous office in Jackson Square, San Francisco (rooftop views & great coffee shops nearby).
Machine Learning Engineer
Data Science & Analytics
Solutions Architect
Software Engineering
July 16, 2025
Senior Team Lead AI Solutions Engineering (f/m/d)
Aleph Alpha
201-500
-
Germany
Full-time
Remote
false
OVERVIEW
At Aleph Alpha, we are shaping the future of AI with European values at the core. The heart of our product is developing cutting-edge generative AI solutions with a strong emphasis on sovereignty, ethical development, and societal benefit. Our generative AI offering empowers businesses, governments, and individuals to achieve their full potential.
TEAM
To bring our vision to life, we work closely with our partners to unlock the transformative power of generative AI. Our Customer Team empowers them by leveraging Aleph Alpha’s sovereign solution stack, ensuring they can harness AI’s full potential with confidence and security. As our Senior Team Lead, AI Solutions Engineering (f/m/d), you will drive the implementation of scalable solution concepts to support business growth in alignment with company objectives. You will work closely with Engineering and Product to help them shape a stable production environment.
YOUR RESPONSIBILITIES
Lead, inspire, and manage a diverse team of engineers, fostering a culture of accountability, inclusion, collaboration, and continuous learning.
Oversee end-to-end project delivery, ensuring timelines, quality standards, and a wide range of stakeholder expectations are effectively met.
Mentor and support the development of all team members through regular, constructive feedback, goal setting, and inclusive career development planning.
Supervise project delivery teams, matching the right skills and strengths to each project, and empowering them to deliver high-quality outcomes on time.
Act as a trusted escalation point and oversight lead for both internal and external projects, ensuring alignment, responsiveness, and timely resolution.
Stay up to date with the latest developments in AI and related technologies to help shape an innovation strategy that reflects market needs and diverse perspectives.
Analyze complex technical and organizational challenges to identify root causes and co-create effective, timely, and sustainable solutions.
Collaborate closely with cross-functional teams including sales, engineering, support, operations, IT, and business stakeholders to ensure project success and shared ownership of outcomes.
YOUR PROFILE
You have a proven track record of leading complex projects from planning to execution in dynamic, fast-paced environments.
You lead, coach, and develop high-performing technical teams, helping individuals grow while aligning with business goals.
You bring expert-level knowledge of Machine Learning, Artificial Intelligence, and Large Language Models, enabling you to guide technical discussions, make informed decisions, and align team efforts with strategic goals.
You have extensive experience delivering software and solution implementation projects in complex customer and partner environments, across both pre- and post-sales phases.
You are solution-oriented, with a strong record of turning around challenging projects and solving organizational issues with speed and creativity.
You are fluent in English and German, enabling effective communication with diverse, international stakeholders and teams.
WHAT YOU CAN EXPECT FROM US
Become part of an AI revolution!
30 days of paid vacation
Access to a variety of fitness & wellness offerings via Wellhub
Mental health support through nilo.health
JobRad® Bike Lease
Substantially subsidized company pension plan for your future security
Subsidized Germany-wide transportation ticket
Budget for additional technical equipment
Flexible working hours for better work-life balance and hybrid working model
Virtual Stock Option Plan
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 15, 2025
Senior Machine Learning Engineer
Faculty
501-1000
0
0
-
0
United Kingdom
Full-time
Remote
false
About Faculty
At Faculty, we transform organisational performance through safe, impactful and human-centric AI. With more than a decade of experience, we provide over 350 global customers with software, bespoke AI consultancy, and Fellows from our award-winning Fellowship programme. Our expert team brings together leaders from across government, academia and global tech giants to solve the biggest challenges in applied AI. Should you join us, you’ll have the chance to work with, and learn from, some of the brilliant minds who are bringing Frontier AI to the frontlines of the world.
We're always on the lookout for talented individuals whose principles and interests align with our own. While we don't have a vacancy open in a specific team at the moment, we are starting to plan for a period of growth in Machine Learning.
By registering your application for this position you'll be considered for Senior Machine Learning roles in our Applied AI Consultancy more generally and we'll reach out as these open up.
What You'll Be Doing
As a Senior Machine Learning Engineer at Faculty, you'll design, build, and deploy production-grade software, infrastructure, and MLOps systems that leverage machine learning.
You'll be engineering-focused, with a keen interest in and working knowledge of operationalised machine learning. You have a desire to take cutting-edge ML applications into the real world. You will develop new methodologies and champion best practices for managing AI systems deployed at scale, with regard to technical, ethical and practical requirements. You will support both technical and non-technical stakeholders to deploy ML to solve real-world problems. To enable this, we work in cross-functional teams with representation from commercial, data science, product management and design specialities to cover all aspects of AI product delivery.
The Machine Learning Engineering team is responsible for the engineering aspects of our customer delivery projects. As a Machine Learning Engineer, you’ll be essential to helping us achieve that goal by:
Building software and infrastructure that leverages Machine Learning;
Creating reusable, scalable tools to enable better delivery of ML systems;
Working with our customers to help understand their needs;
Working with data scientists and engineers to develop best practices and new technologies; and
Implementing and developing Faculty’s view on what it means to operationalise ML software.
We’re a rapidly growing organisation, so roles are dynamic and subject to change. Your role will evolve alongside business needs, but you can expect your key responsibilities to include:
Working in cross-functional teams of engineers, data scientists, designers and managers to deliver technically sophisticated, high-impact systems.
Leading on the scope and design of projects
Offering leadership and management to more junior engineers on the team
Providing technical expertise to our customers
Technical Delivery
Who We're Looking For
To succeed in this role, you’ll need the following - these are illustrative requirements and we don’t expect all applicants to have experience in everything (70% is a rough guide):
Understanding of and interest in the full machine learning lifecycle, including deploying trained machine learning models developed using common frameworks such as Scikit-learn, TensorFlow, or PyTorch
Understanding of the core concepts of probability and statistics and familiarity with common supervised and unsupervised learning techniques
Experience in Software Engineering including programming in Python.
Technical experience of cloud architecture, security, deployment, and open-source tools. Hands-on experience of at least one major cloud platform is required
Demonstrable experience with containers and specifically Docker and Kubernetes
Comfortable in a high-growth startup environment.
Outstanding verbal and written communication.
What we can offer you:
The Faculty team is diverse and distinctive, and we all come from different personal, professional and organisational backgrounds. We all have one thing in common: we are driven by a deep intellectual curiosity that powers us forward each day.
Faculty is the professional challenge of a lifetime. You’ll be surrounded by an impressive group of brilliant minds working to achieve our collective goals.
Our consultants, product developers, business development specialists, operations professionals and more all bring something unique to Faculty, and you’ll learn something new from everyone you meet.
Machine Learning Engineer
Data Science & Analytics
Apply
July 15, 2025
Senior Backend AI Engineer
Mintlify
11-50
USD
0
180000
-
250000
United States
Full-time
Remote
false
Why Mintlify?
We're on a mission to empower builders.
Massive reach: Our docs platform serves 100 million+ developers every year and powers documentation for 10,000+ companies, including Anthropic, Cursor, Windsurf, Scale AI, X, and over 20% of the last YC batch.
Small team, huge impact: We’re only 25 people today, backed by $22 million in funding, so each new hire shapes the company’s trajectory.
Culture of slope over y-intercept: We value learning velocity, grit, and unapologetically unique personalities.
We grew in value faster than headcount and we’re looking to align the two quickly.
What you'll work on here
Backend software engineering & infrastructure engineering to support the AI and agentic flows
RAG/data ingestion pipelines
Prompt engineering
Model performance and evals
What you bring to the table
4+ years of software development experience
Deep customer empathy, including the desire to speak with customers and make product decisions
Strong ability to learn new technologies and be productive in unfamiliar domains
Passion for tasteful user experience
Strong ownership mentality
Deep experience in LLM fine-tuning, RAG systems, and AI automations
Bonus points: Previously founded a startup. Extra bonus for a dev-tools startup
Why you should join our engineering team
You're all about finding the intersection between what excites you and business priorities, and you're excited for your role to evolve accordingly
You crave a mix of collaborative and heads-down builder time, and are excited to contribute to a small-but-mighty team
You're looking for an environment where the best ideas win and you acknowledge when you're wrong
Company Benefits:
Competitive compensation and equity | Free Waymos
20 days paid time off every year | Health, dental, vision
401k or RRSP | Free lunch and dinners
$420/mo. wellness stipend | Annual team offsite
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 15, 2025
Backend AI Engineer
Mintlify
11-50
USD
0
140000
-
200000
United States
Full-time
Remote
false
Why Mintlify?
We're on a mission to empower builders.
Massive reach: Our docs platform serves 100 million+ developers every year and powers documentation for 10,000+ companies, including Anthropic, Cursor, Windsurf, Scale AI, X, and over 20% of the last YC batch.
Small team, huge impact: We’re only 25 people today, backed by $22 million in funding, so each new hire shapes the company’s trajectory.
Culture of slope over y-intercept: We value learning velocity, grit, and unapologetically unique personalities.
We grew in value faster than headcount and we’re looking to align the two quickly.
What you'll work on here
Backend software engineering & infrastructure engineering to support the AI and agentic flows
RAG/data ingestion pipelines
Prompt engineering
Model performance and evals
What you bring to the table
1+ year of software development experience
Deep customer empathy, including the desire to speak with customers and make product decisions
Strong ability to learn new technologies and be productive in unfamiliar domains
Passion for tasteful user experience
Deep experience in LLM fine-tuning, RAG systems, and AI automations
Bonus points: Previously founded a startup. Extra bonus for a dev-tools startup
Why you should join our engineering team
You're all about finding the intersection between what excites you and business priorities, and you're excited for your role to evolve accordingly
You crave a mix of collaborative and heads-down builder time, and are excited to contribute to a small-but-mighty team
You're looking for an environment where the best ideas win and you acknowledge when you're wrong
Company Benefits:
Competitive compensation and equity | Free Waymos
20 days paid time off every year | Health, dental, vision
401k or RRSP | Free lunch and dinners
$420/mo. wellness stipend | Annual team offsite
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 14, 2025
Senior ML Engineer
Lovable
201-500
-
Sweden
Full-time
Remote
false
TL;DR - We’re looking for Founding ML Engineers who will shape how we fine-tune, serve, and evaluate LLMs and frontier models in production - and help define what it means to build a truly lovable AI product.
Why Lovable?
Lovable lets anyone and everyone build software with plain English. From solopreneurs to Fortune 100 teams, millions of people use Lovable to transform raw ideas into real products - fast. We are at the forefront of a foundational shift in software creation, which means you have an unprecedented opportunity to change the way the digital world works. Over 2 million people in 200+ countries already use Lovable to launch businesses, automate work, and bring their ideas to life. And we’re just getting started.
We’re a small, talent-dense team building a generation-defining company from Stockholm. We value extreme ownership, high velocity and low-ego collaboration. We seek out people who care deeply, ship fast, and are eager to make a dent in the world.
What we’re looking for
Led or contributed to cutting-edge LLM research at top AI labs / globally leading tech startups
Trained and fine-tuned LLMs on large-scale code, language, or multimodal datasets
Deep understanding of transformer architectures, attention mechanisms, and model optimization
Shipped ML systems in production, with real users and real uptime
Built fast, production-level systems while maintaining strong practices around reproducibility, monitoring, and model performance
You hold somewhat strong opinions about model safety, latency and helpfulness, but aren’t afraid to experiment
What you’ll do
In one sentence: Train, tune, and scale frontier LLMs that power lovable products.
Own training pipelines for LLMs, from data curation to evaluation and deployment
Fine-tune models on high-quality, domain-specific data (code, natural language, product usage signals)
Work closely with product engineers to integrate models into real user-facing features
Build retrieval pipelines, evaluation frameworks, and experimentation tools
Push the limits of what’s possible with current/upcoming open models, and help define what we should train next
Our tech stack
We're building with tools that both humans and AI love:
Frontend: React for lightning-fast interfaces
Backend: Golang and Rust for serious performance
Cloud: Cloudflare, Fly.io, Google Cloud Run, AWS, Terraform
DevOps & Tooling: CI/CD pipelines, observability, infra-as-code
And always on the lookout for what's next!
How we hire
Fill in a short form then jump on an intro call with the team.
Complete the take-home exercise
Show us how you approach problems during two technical interviews
Join us for trial work lasting 2 days preferably on-site. We'll see how you tick and you get to meet the team and explore whether joining Lovable feels right for you.
About your application
Please submit your application in English - it’s our company language so you’ll be speaking lots of it if you join
We treat all candidates equally - if you’re interested please apply through our careers portal
Machine Learning Engineer
Data Science & Analytics
Apply
July 13, 2025
AI Researcher & Engineer - Multimodal (Real-time Video)
X AI
5000+
USD
180000
-
440000
United States
Full-time
Remote
false
About xAI
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
Our team is pushing the frontier of multimodal intelligence through Grok Voice, the world’s smartest AI assistant that is able to listen, see, and respond to you in real time. We actively research reinforcement learning to develop novel video understanding capabilities that solve user problems in both the physical and digital worlds. We own the full stack of post-training: from data curation to model training, deployment, and iterating end-to-end on the user experience. Ideal candidates thrive at the intersection of research and engineering.
What you’ll do
Research, design, and implement methods to enhance video understanding, whether through developing new models, systems, or tools.
Improve data quality by curating robust datasets, building scalable data pipelines, and analyzing user interactions with models.
Develop and apply evaluation metrics to measure model performance and systematically identify and address failure modes.
Manage the complete experimental lifecycle: from designing experiments and training models to deployment and iterative refinement based on feedback and data.
Ideal Experience
You’d be an exceptional candidate if you possess some (or all) of the following:
Experience in LLM reinforcement learning, tool use, and agentic approaches.
Experience in real-world computer vision. For example, experience in visual/multimodal search and dealing with noisy visual data.
Strong engineering background with experience working with large-scale, modern backend services.
An attitude to just execute and solve problems. You’re willing to dive into new codebases you’ve not seen before if it means you can get stuff done faster.
Tech Stack
Python
JAX / PyTorch
Rust
Interview Process
After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:
One-on-one research discussion & coding interviews (three meetings total)
Project deep-dive: Present your past exceptional work and your vision with xAI to a small audience.
Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet. We do not condone usage of AI in interviews and have tools to detect AI usage.
Benefits
Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.
Annual Salary Range
$180,000 - $440,000 USD
xAI is an equal opportunity employer.
California Consumer Privacy Act (CCPA) Notice
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Computer Vision Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
July 12, 2025
Machine Learning Scientist, NLP (All Levels)
Abridge
201-500
USD
0
200000
-
300000
United States
Full-time
Remote
false
About Abridge
Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most: their patients.
Our enterprise-grade technology transforms patient-clinician conversations into structured clinical notes in real-time, with deep EMR integrations. Powered by Linked Evidence and our purpose-built, auditable AI, we are the only company that maps AI-generated summaries to ground truth, helping providers quickly trust and verify the output. As pioneers in generative AI for healthcare, we are setting the industry standards for the responsible deployment of AI across health systems.
We are a growing team of practicing MDs, AI scientists, PhDs, creatives, technologists, and engineers working together to empower people and make care make more sense. We have offices located in the SoHo neighborhood of New York, the Mission District in San Francisco, and East Liberty in Pittsburgh.
The Role
From transcribing medical conversations to delivering key takeaways, our trailblazing work in machine learning research makes the Abridge experience possible. We're currently looking to hire research scientists with experience in machine learning and natural language processing. The ideal candidate will bring technical mastery, fluency with foundation models, genuine interest in the medical domain, and strong critical thinking skills to the role. At Abridge, all of our ML work has a strong research component, and all of our research scientists contribute directly to real products that impact the lives of doctors.
What You'll Do
Advance the state of the art in medical NLP, in areas including conversation summarization, evidence extraction, outcome prediction, evaluation techniques, and experimentation.
Actively contribute to the wider research community by sharing and publishing original research
Help to define important problems, identify appropriate baselines, develop state-of-the-art methods, and ship them into production.
Dial deeply into real-time feedback from clinicians to guide further refinements and innovations
Be results-oriented in the face of ambiguous problems and uncertain outcomes
What You'll Bring
Strong research background, as demonstrated through papers and a graduate degree (MS or PhD) in Electrical Engineering, Computer Sciences, Mathematics, or equivalent experience.
High-impact publications at peer-reviewed AI conferences (e.g. *CL, NeurIPS, ICML, ICLR).
Significant real-world impact, as demonstrated through open source contributions and deployed technology.
Strong programming skills with proven experience crafting, prototyping, and delivering machine learning solutions into production.
Experience with deep learning libraries (e.g. PyTorch, JAX, TensorFlow) and platforms, multi-GPU training, and statistical analyses of observational and experimental data.
Must be willing to work from our SF office at least 3x per week
This position requires a commitment to a hybrid work model, with the expectation of coming into the office a minimum of three (3) times per week. Relocation assistance is available for candidates willing to move to San Francisco within 6 months of accepting an offer.
We value people who want to learn new things, and we know that great team members might not perfectly match a job description. If you’re interested in the role but aren’t sure whether or not you’re a good fit, we’d still like to hear from you.
Why Work at Abridge?
At Abridge, we’re transforming healthcare delivery experiences with generative AI, enabling clinicians and patients to connect in deeper, more meaningful ways. Our mission is clear: to power deeper understanding in healthcare. We’re driving real, lasting change, with millions of medical conversations processed each month.
Joining Abridge means stepping into a fast-paced, high-growth startup where your contributions truly make a difference. Our culture requires extreme ownership: every employee has the ability to (and is expected to) make an impact on our customers and our business.
Beyond individual impact, you will have the opportunity to work alongside a team of curious, high-achieving people in a supportive environment where success is shared, growth is constant, and feedback fuels progress. At Abridge, it’s not just what we do, it’s how we do it. Every decision is rooted in empathy, always prioritizing the needs of clinicians and patients.
We’re committed to supporting your growth, both professionally and personally. Whether it's flexible work hours, an inclusive culture, or ongoing learning opportunities, we are here to help you thrive and do the best work of your life.
If you are ready to make a meaningful impact alongside passionate people who care deeply about what they do, Abridge is the place for you.
How we take care of Abridgers:
Generous Time Off: 13 paid holidays, flexible PTO for salaried employees, and accrued time off for hourly employees.
Comprehensive Health Plans: Medical, Dental, and Vision plans for all full-time employees. Abridge covers 100% of the premium for you and 75% for dependents. If you choose an HSA-eligible plan, Abridge also makes monthly contributions to your HSA.
Paid Parental Leave: 16 weeks paid parental leave for all full-time employees.
401k and Matching: Contribution matching to help invest in your future.
Pre-tax Benefits: Access to Flexible Spending Accounts (FSA) and Commuter Benefits.
Learning and Development Budget: Yearly contributions for coaching, courses, workshops, conferences, and more.
Sabbatical Leave: 30 days of paid Sabbatical Leave after 5 years of employment.
Compensation and Equity: Competitive compensation and equity grants for full time employees.
... and much more!
Diversity & Inclusion
Abridge is an equal opportunity employer. Diversity and inclusion is at the core of what we do. We actively welcome applicants from all backgrounds (including but not limited to race, gender, educational background, and sexual orientation).
Staying safe - Protect yourself from recruitment fraud
We are aware of individuals and entities fraudulently representing themselves as Abridge recruiters and/or hiring managers. Abridge will never ask for financial information or payment, or for personal information such as bank account number or social security number during the job application or interview process. Any emails from the Abridge recruiting team will come from an @abridge.com email address. You can learn more about how to protect yourself from these types of fraud by referring to this article. Please exercise caution and cease communications if something feels suspicious about your interactions.
Machine Learning Engineer
Data Science & Analytics
NLP Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
July 11, 2025
ML Research Engineer
Oumi
11-50
USD
0
140000
-
220000
United States
Full-time
Remote
true
About Oumi
Why we exist: Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on humanity, and that developing it collectively, in the open, is the best path forward to ensure that it is done efficiently and safely.
What we do: Oumi provides an all-in-one platform to build state-of-the-art AI models, end to end, from data preparation to production deployment, empowering innovators to build cutting-edge models at any scale. Oumi also develops open foundation models in collaboration with academic collaborators and the open community.
Our Approach: Oumi is fundamentally an open-source first company, with open collaboration across the community as a core principle. Our work is:
Open Source First: All our platform and core technology is open source
Research-driven: We conduct and publish original research in AI, collaborating with our community of academic research labs and collaborators
Community-powered: We believe in the power of open collaboration and welcome contributions from researchers and developers worldwide
Role Overview
We’re looking for a Research Engineer to join our team working on generative AI and LLMs. In this role, you'll bridge research and engineering: designing scalable infrastructure, enabling cutting-edge experiments, and helping open-source the next generation of LLMs. You will collaborate closely with our research team and the open-source community to build tools, run evaluations, and contribute to models that are safe, performant, and accessible.
What you'll do:
Design and build systems to support training, fine-tuning, and evaluating large language models.
Partner with researchers to define experiments, write reusable code, run benchmarks, and interpret results.
Work on LLM alignment and tuning using techniques like reinforcement learning from human feedback (RLHF), supervised fine-tuning, and prompt optimization.
Develop scalable ML pipelines for distributed training (e.g., across multi-GPU and multi-node environments).
Contribute to open-source tooling and models to support transparency and community collaboration.
Optimize performance across the ML stack, from data loading to deployment.
What you’ll bring:
Strong experience in machine learning, deep learning, or NLP, especially in generative AI or LLMs.
Solid programming skills in Python, and experience with ML frameworks like PyTorch.
Experience designing or maintaining ML infrastructure at scale (e.g., cloud-based training, distributed systems).
Comfort working in highly collaborative environments with research and engineering teams.
Bonus: experience with academic publications, open-source contributions, or LLM alignment work.
Share Oumi's values: Beneficial for all, Customer-obsessed, Radical Ownership, Exceptional Teammates, Science-grounded.
Benefits
Competitive salary: $140,000 - $220,000
Equity in a high-growth startup
Comprehensive health, dental and vision insurance
21 days PTO
Regular team offsites and events
Machine Learning Engineer
Data Science & Analytics
Research Scientist
Product & Operations
Software Engineer
Software Engineering
Apply
July 11, 2025
Member of Technical Staff, Large Generative Models
Captions
101-200
USD
0
175000
-
275000
United States
Full-time
Remote
false
Captions is the leading AI video company. Our mission is to empower anyone, anywhere to tell their stories through video. Over 10 million creators and businesses have used Captions to simplify video creation with truly novel and groundbreaking AI capabilities.
We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. As an early member of our team, you’ll have an opportunity to have an outsized impact on our products and our company's culture.
Our Technology
Mirage Announcement: our proprietary omni-modal foundation model
Seeing Voices (technical paper): generating A-roll video from audio with Mirage
Mirage Studio for generating expressive videos at scale
“Captions: For Talking Videos”, available in the iOS app store
Press Coverage
Lenny’s Podcast: Interview with Gaurav Misra (CEO)
Latest Fundraise: Series C Announcement
The Information: 50 Most Promising Startups
Fast Company: Next Big Things in Tech
Business Insider: 34 most promising AI startups
TIME: The Best Inventions of 2024
Our Investors
We’re very fortunate to have some of the best investors and entrepreneurs backing us, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SVAngel, 20VC, Ludlow Ventures, Chapter One, and more.
** Please note that all of our roles will require you to be in-person at our NYC HQ (located in Union Square). We do not work with third-party recruiting agencies; please do not contact us. **
About the role:
Captions is seeking an exceptional Research Engineer (MOTS) to advance the state-of-the-art in large-scale multimodal video diffusion models. You'll conduct novel research on generative modeling architectures, develop new training techniques, and scale models to billions of parameters. 
As a key member of our ML Research team, you'll work at the cutting edge of multimodal generation while building systems that enable natural, controllable video creation. We're already training large-scale models with demonstrated product impact, and we're excited to continue expanding the scope and capabilities of our research.
We're especially excited about pushing the boundaries of audio-video generation, with a focus on realistic and charismatic human behavior that enables natural storytelling and creative iteration. Our models power creative tools used by millions of creators, and we're tackling fundamental challenges in how to generate compelling human motion, expression, and speech.
Key Responsibilities:
Research & Architecture Development:
Design and implement novel architectures for large-scale video and multimodal diffusion models
Develop new approaches to multimodal fusion, temporal modeling, and video control
Research temporal video editing techniques and controllable generation
Research and validate scaling laws for video generation models
Create new loss functions and training objectives for improved generation quality
Drive rapid experimentation with model architectures and training strategies
Validate research directly through product deployment and user feedback
Model Training & Optimization:
Train and optimize models at massive scale (10s-100s of billions of parameters)
Develop sophisticated distributed training approaches using FSDP, DeepSpeed, Megatron-LM
Design and implement model surgery techniques (pruning, distillation, quantization)
Create new approaches to memory optimization and training efficiency
Research techniques for improving training stability at scale
Conduct systematic empirical studies of architecture and optimization choices
Technical Innovation:
Advance state-of-the-art in video model architecture design and optimization
Develop new approaches to temporal modeling for video generation
Create novel solutions for multimodal learning and cross-modal alignment
Research and implement new optimization techniques for generative modeling and sampling
Design and validate new evaluation metrics for generation quality
Systematically analyze and improve model behavior across different regimes
Requirements:
Research Experience:
Master's or PhD in Computer Science, Machine Learning, or related field
Track record of research contributions at top ML conferences (NeurIPS, ICML, ICLR)
Demonstrated experience implementing and improving upon state-of-the-art architectures
Deep expertise in generative modeling approaches (diffusion, autoregressive, VAEs, etc.)
Strong background in optimization techniques and loss function design
Experience with empirical scaling studies and systematic architecture research
Technical Expertise:
Strong proficiency in modern deep learning tooling (PyTorch, CUDA, Triton, FSDP, etc.)
Experience training diffusion models with 10B+ parameters
Experience with very large language models (200B+ parameters) is a plus
Deep understanding of attention, transformers, and modern multimodal architectures
Expertise in distributed training systems and model parallelism
Proven ability to implement and improve complex model architectures
Track record of systematic empirical research and rigorous evaluation
Engineering Capabilities:
Ability to write clean, modular research code that scales
Strong software engineering practices including testing and code review
Experience with rapid prototyping and experimental design
Strong analytical skills for debugging model behavior and training dynamics
Facility with profiling and optimization tools
Track record of bringing research ideas to production
Experience maintaining high code quality in a research environment
About the Team:
You'll work directly alongside our research and engineering teams in our NYC office. 
We've intentionally built a culture where technical innovation and research excellence are highly valued - your success will be measured by your contributions to improving our models and advancing the field, not by your ability to navigate politics. We're a team that loves diving deep into complex technical problems and emerging with practical breakthroughs.

Our team values:
- Open technical discussions and collaboration
- Rapid iteration and practical solutions
- Deep technical expertise and continuous learning
- Direct impact on research and product outcomes

What sets us apart:
- Opportunity to advance the state-of-the-art in video generation
- Direct impact on products used by millions of creators
- Access to massive compute resources and diverse, large-scale datasets
- Environment that values both research excellence and practical impact
- Ability to validate research through direct product feedback

Benefits:
- Comprehensive medical, dental, and vision plans
- 401K with employer match
- Commuter Benefits
- Catered lunch multiple days per week
- Dinner stipend every night if you're working late and want a bite!
- Doordash DashPass subscription
- Health & Wellness Perks (Talkspace, Kindbody, One Medical subscription, HealthAdvocate, Teladoc)
- Multiple team offsites per year with team events every month
- Generous PTO policy

Captions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

Please note benefits apply to full time employees only.
Machine Learning Engineer
Data Science & Analytics
Research Scientist
Product & Operations
Apply
July 11, 2025
Member of Technical Staff, GPU Optimization
Captions
101-200
USD
175000
-
275000
United States
Full-time
Remote
false
Captions is the leading AI video company: our mission is to empower anyone, anywhere to tell their stories through video. Over 10 million creators and businesses have used Captions to simplify video creation with truly novel and groundbreaking AI capabilities.

We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. As an early member of our team, you'll have an opportunity to have an outsized impact on our products and our company's culture.

Our Technology
- Mirage Announcement: our proprietary omni-modal foundation model
- Seeing Voices (technical paper): generating A-roll video from audio with Mirage
- Mirage Studio: for generating expressive videos at scale
- "Captions: For Talking Videos": available in the iOS app store

Press Coverage
- Lenny's Podcast: Interview with Gaurav Misra (CEO)
- Latest Fundraise: Series C Announcement
- The Information: 50 Most Promising Startups
- Fast Company: Next Big Things in Tech
- Business Insider: 34 most promising AI startups
- TIME: The Best Inventions of 2024

Our Investors
We're very fortunate to have some of the best investors and entrepreneurs backing us, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SVAngel, 20VC, Ludlow Ventures, Chapter One, and more.

** Please note that all of our roles will require you to be in-person at our NYC HQ (located in Union Square). We do not work with third-party recruiting agencies; please do not contact us. **

About the Role
As an expert in making AI models run fast (really fast), you live at the intersection of CUDA, PyTorch, and generative models, and get excited by the idea of squeezing every last bit of performance out of modern GPUs. You will have the opportunity to turn our cutting-edge video generation research into scalable, production-grade systems.
From designing custom CUDA or Triton kernels to profiling distributed inference pipelines, you'll work across the full stack to make sure our models train and serve at peak performance.

Key Responsibilities
- Optimize model training and inference pipelines, including data loading, preprocessing, checkpointing, and deployment, for throughput, latency, and memory efficiency on NVIDIA GPUs
- Design, implement, and benchmark custom CUDA and Triton kernels for performance-critical operations
- Integrate low-level optimizations into PyTorch-based codebases, including custom ops, low-precision formats, and TorchInductor passes
- Profile and debug the entire stack, from kernel launches to multi-GPU I/O paths, using Nsight, nvprof, PyTorch Profiler, and custom tools
- Work closely with colleagues to co-design model architectures and data pipelines that are hardware-friendly and maintain state-of-the-art quality
- Stay on the cutting edge of GPU and compiler tech (e.g., Hopper features, CUDA Graphs, Triton, FlashAttention, and more) and evaluate their impact
- Collaborate with infrastructure and backend experts to improve cluster orchestration, scaling strategies, and observability for large experiments
- Provide clear, data-driven insights and trade-offs between performance, quality, and cost
- Contribute to a culture of fast iteration, thoughtful profiling, and performance-centric design

Required Qualifications
- Bachelor's degree in Computer Science, Electrical/Computer Engineering, or equivalent practical experience
- 3+ years of hands-on experience writing and optimizing CUDA kernels for production ML workloads
- Deep understanding of GPU architecture: memory hierarchies, warp scheduling, tensor cores, register pressure, and occupancy tuning
- Strong Python skills and familiarity with PyTorch internals, TorchScript, and distributed data-parallel training
- Proven track record profiling and accelerating large-scale training and inference jobs (e.g., mixed precision, kernel fusion, custom collectives)
- Comfort working in Linux environments with modern CI/CD, containerization, and cluster managers such as Kubernetes

Preferred Qualifications
- Advanced degree (MS/PhD) in Computer Science, Electrical/Computer Engineering, or related field
- Experience with multi-modal AI systems, particularly video generation or computer vision models
- Familiarity with distributed training frameworks (DeepSpeed, FairScale, Megatron) and model parallelism techniques
- Knowledge of compiler optimization techniques and experience with MLIR, XLA, or similar frameworks
- Experience with cloud infrastructure (AWS, GCP, Azure) and GPU cluster management
- Ability to translate research goals into performant code, balancing numerical fidelity with hardware constraints
- Strong communication skills and experience mentoring junior engineers

Benefits:
- Comprehensive medical, dental, and vision plans
- 401K with employer match
- Commuter Benefits
- Catered lunch multiple days per week
- Dinner stipend every night if you're working late and want a bite!
- Doordash DashPass subscription
- Health & Wellness Perks (Talkspace, Kindbody, One Medical subscription, HealthAdvocate, Teladoc)
- Multiple team offsites per year with team events every month
- Generous PTO policy

Captions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

Please note benefits apply to full time employees only.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
MLOps / DevOps Engineer
Data Science & Analytics
Apply
July 11, 2025
AI Engineer
E2B
11-50
-
Czech Republic
Full-time
Remote
false
About the role
Your job will be to inspire developers with what they can build with E2B. Part of that job is creating examples based on what we often see our users doing, and another part is leading by example by building experimental projects using E2B.

You'll be building both smaller examples that you can find in our Cookbook and bigger projects like Fragments or AI Analyst.

This role requires a high degree of creativity and the ability to finish projects by taking them from 0 to 1.

What we're looking for
- 3+ years of experience as a software engineer
- Comfortable in a fast-paced field and environment
- Interested in the latest news in the AI field
- Excited to work in person from Prague on a devtool product
- Detail-oriented with great taste
- Excited to work closely with our users
- Not afraid to take ownership of a part of our product

If you join E2B, you'll get a lot of freedom. We expect you to be proactive and take ownership. You'll be taking projects from 0 to 1 with the support of the rest of the team.

What it's like to work at E2B
- Work at a fast-growing startup on an early team (we grow 20%-100% MoM)
- We ship fast but don't release junk
- We like hard work and problems. Challenges mean potential value.
- We have a long runway and can offer a competitive salary for a startup at our stage
- Work closely with other AI companies on the edge of what's possible today
- Dogfooding our own product on projects like Fragments
- No meetings; a writing-heavy, transparent culture
- You're the decision maker day-to-day; important product and roadmap decisions are on Vasek (CEO) and Tomas (CTO)
- Spend 10-20% of the roadmap on highly experimental projects

Hiring process
We aim to have the whole process done in 7-10 days. We understand that it's important to move fast and try to follow up within 24 hours after each stage.
- 30-minute call with Vasek (CEO). We'll go over your past work experience and what you're looking for to make sure this would be a good fit for both of us.
- First technical interview with Tomas (CTO). A call of about 1 hour. You'll be asked thorough technical questions. Often these are questions about problems that we ourselves experienced while building E2B.
- Second technical interview. Another call of 1-2 hours. Expect live coding on this call. We'll ask you to solve specific problems (don't worry, it's not LeetCode) that are related to your role.
- One day of in-person hacking at our office (paid). We invite you to our office to work on the product with us. This is a great opportunity for all of us to see what it's like working together and for you to meet the team.
- Last call with Vasek. A final 30-minute call with the CEO to talk more about the role and answer any of your questions.
- Decision and potential offer.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 10, 2025
AI Engineer
E2B
11-50
-
United States
Full-time
Remote
false
About the role
You'll be working on tooling on top of our sandbox.

What we're looking for
- 3+ years of experience as a software engineer
- Comfortable in a fast-paced field and environment
- Interested in the latest news in the AI field
- Excited to work in person from San Francisco on a devtool product
- Detail-oriented with great taste
- Excited to work closely with our users
- Not afraid to take ownership of a part of our product

If you join E2B, you'll get a lot of freedom. We expect you to be proactive and take ownership. You'll be taking projects from 0 to 1 with the support of the rest of the team.

What it's like to work at E2B
- Work at a fast-growing startup on an early team (we grow 20%-100% MoM)
- We ship fast but don't release junk
- We like hard work and problems. Challenges mean potential value.
- We have a long runway and can offer a competitive salary for a startup at our stage
- Work closely with other AI companies on the edge of what's possible today
- Dogfooding our own product on projects like Fragments
- No meetings; a writing-heavy, transparent culture
- You're the decision maker day-to-day; important product and roadmap decisions are on Vasek (CEO) and Tomas (CTO)
- Spend 10-20% of the roadmap on highly experimental projects

Hiring process
We aim to have the whole process done in 7-10 days. We understand that it's important to move fast and try to follow up within 24 hours after each stage.
- 30-minute call with Vasek (CEO). We'll go over your past work experience and what you're looking for to make sure this would be a good fit for both of us.
- First technical interview with Tomas (CTO). A call of about 1 hour. You'll be asked thorough technical questions. Often these are questions about problems that we ourselves experienced while building E2B.
- Second technical interview. Another call of 1-2 hours. Expect live coding on this call. We'll ask you to solve specific problems (don't worry, it's not LeetCode) that are related to your role.
- One day of in-person hacking at our office (paid). We invite you to our office to work on the product with us. This is a great opportunity for all of us to see what it's like working together and for you to meet the team.
- Last call with Vasek. A final 30-minute call with the CEO to talk more about the role and answer any of your questions.
- Decision and potential offer.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 10, 2025
ICML 2025 - Job Application
Scale AI
5000+
USD
176000
-
325000
United States
Full-time
Remote
false
This posting is for candidates who attended ICML '25 and met with a member of our team.

It was great meeting you at ICML 2025! Whether we chatted at our booth, during a poster session, or one of the workshops – we're thrilled to connect with people who are passionate about pushing the boundaries of machine learning and AI.

At Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles.

We're currently growing our team across multiple roles. If you're excited about the challenges we're working on, drop your info here, and we'll make sure someone from our team reaches out if there's a good fit with one of our open roles. Even if the timing isn't quite right, we'd love to stay in touch. We look forward to continuing the conversation! In the meantime, you can read more about our research at scale.com/research.

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO.
Additionally, this role may be eligible for additional benefits such as a commuter stipend.

The base salary range for this full-time position in the location of San Francisco is: $176,000-$325,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Us: At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com. Please see the United States Department of Labor's Know Your Rights poster for additional information.

We comply with the United States Department of Labor's Pay Transparency provision.
PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Data Engineer
Data Science & Analytics
Computer Vision Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
July 10, 2025
Platform Architect
webAI
101-200
-
United States
Full-time
Remote
false
About Us:
webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. We recognize the evolving demands of a data-driven society for scalability and flexibility, and we firmly believe that the future of AI lies in distributed processing at the edge, bringing computation closer to the source of data generation. Our mission is to build a future where a company's valuable data and intellectual property remain entirely private, enabling the deployment of large-scale AI models directly on standard consumer hardware without compromising the information embedded within those models. We are developing an end-to-end platform that is secure, scalable, and fully under the control of our users, empowering enterprises with AI that understands their unique business. We are a team driven by truth, ownership, tenacity, and humility, and we seek individuals who resonate with these core values and are passionate about shaping the next generation of AI.

About the Role:
We are seeking a visionary Platform Architect to lead core architectural decisions across our distributed AI stack, from runtime orchestration, through application interfaces, to mesh-aware networking. This role lives at the intersection of systems architecture, AI infrastructure, and distributed computing. You will define, validate, and evolve the technical blueprint that powers webAI's intelligent mesh and ensures it scales with enterprise-grade resilience, security, and performance.

Key Responsibilities:

End-to-End Architecture:
Own the design and evolution of the platform architecture across the runtime, application, and mesh-network layers. Prioritize modularity, composability, and real-world deployability.

AI Runtime Design:
Define and build high-performance runtimes, including task scheduling, memory management, and hardware abstraction for heterogeneous environments (CPU, GPU, NPU). Contribute to or integrate frameworks like ONNX, TensorRT, or custom inference stacks.

Networking Layer:
Architect resilient peer-to-peer communication and mesh networking systems across diverse radios (Wi-Fi, Bluetooth, WebRTC, etc.), including device discovery, data synchronization, and fault tolerance.

Security & Observability:
Embed zero-trust principles, encryption at rest and in transit, signed inference, and platform-level logging, monitoring, and diagnostics by design.

Technical Leadership:
Serve as a technical mentor for engineering teams, validate architectural choices, conduct design reviews, and collaborate with stakeholders to ensure alignment with product vision and performance requirements.

Required Skills & Qualifications:

Experience:
- 10+ years of relevant experience in systems architecture, distributed computing, or AI infrastructure, with a proven track record designing and scaling production-grade distributed platforms. We require depth of experience and demonstrated impact over tenure.

AI Runtime Design
- Experience building or contributing to high-performance runtimes (ONNX, TensorRT, custom runtimes).
- Task scheduling, memory management, hardware abstraction (CPU, GPU, NPU).

Tooling & Stack Preferences
- Systems programming in Rust, Go, or C++ (must-have: not just Python or web dev).
- Bonus: experience with federated learning, agent communication, MPC, or privacy-preserving computation.
- Understanding of real-time, low-latency, and resource-constrained environments.

Networking Layer
- Cross-radio P2P communication (Wi-Fi, Bluetooth, WebRTC, etc.).
- Device discovery, resilient mesh networking, and data synchronization.

Security & Observability
- Familiarity with zero-trust models, signed inference, encryption at rest & in transit.
- Built-in logging, monitoring, and platform-level diagnostics.

We at webAI are committed to living out the core values we have put in place as the foundation on which we operate as a team. We seek individuals who exemplify the following:
- Truth - Emphasizing transparency and honesty in every interaction and decision.
- Ownership - Taking full responsibility for one's actions and decisions, demonstrating commitment to the success of our clients.
- Tenacity - Persisting in the face of challenges and setbacks, continually striving for excellence and improvement.
- Humility - Maintaining a respectful and learning-oriented mindset, acknowledging the strengths and contributions of others.

Benefits:
- Competitive salary and performance-based incentives.
- Comprehensive health, dental, and vision benefits package.
- 401k Match
- $200/month Health and Wellness Stipend
- $400/year Continuing Education Credit
- $500/year Function Health subscription (US-based only)
- Free parking for in-office employees
- Unlimited Approved PTO
- Parental Leave for Eligible Employees
- Supplemental Life Insurance
webAI is an Equal Opportunity Employer and does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We adhere to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, it is the policy of webAI to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 9, 2025