Top Machine Learning Engineer Jobs Openings in 2025
Looking for opportunities in Machine Learning Engineer? This curated list features the latest Machine Learning Engineer job openings from AI-native companies. Whether you're an experienced professional or just entering the field, find roles that match your expertise, from startups to global tech leaders. Updated everyday.
Senior Machine Learning Engineer
Faculty
501-1000
0
0
-
0
United Kingdom
Full-time
Remote
false
About Faculty
At Faculty, we transform organisational performance through safe, impactful and human-centric AI. With more than a decade of experience, we provide over 350 global customers with software, bespoke AI consultancy, and Fellows from our award winning Fellowship programme. Our expert team brings together leaders from across government, academia and global tech giants to solve the biggest challenges in applied AI. Should you join us, you’ll have the chance to work with, and learn from, some of the brilliant minds who are bringing Frontier AI to the frontlines of the world.We're always on the lookout for talented individuals whose principles and interests align with our own. While we don't have a vacancy open in a specific team at the moment, we are starting to plan for a period of growth in Machine Learning.
By registering your application for this position you'll be considered for Senior Machine Learning roles in our Applied AI Consultancy more generally and we'll reach out as these open up.
What You'll Be DoingAs a Senior Machine Learning Engineer at Faculty, you'll design, build, and deploy production-grade software, infrastructure, and MLOps systems that leverage machine learning.You'll be engineering-focused, with a keen interest and working knowledge of operationalised machine learning. You have a desire to take cutting-edge ML applications into the real world. You will develop new methodologies and champion best practices for managing AI systems deployed at scale, with regard to technical, ethical and practical requirements. You will support both technical and non-technical stakeholders to deploy ML to solve real-world problems. To enable this, we work in cross-functional teams with representation from commercial, data science, product management and design specialities to cover all aspects of AI product delivery.The Machine Learning Engineering team is responsible for the engineering aspects of our customer delivery projects. As a Machine Learning Engineer, you’ll be essential to helping us achieve that goal by:Building software and infrastructure that leverages Machine Learning;Creating reusable, scalable tools to enable better delivery of ML systemsWorking with our customers to help understand their needsWorking with data scientists and engineers to develop best practices and new technologies; andImplementing and developing Faculty’s view on what it means to operationalise ML software.We’re a rapidly growing organisation, so roles are dynamic and subject to change. Your role will evolve alongside business needs, but you can expect your key responsibilities to include:
Working in cross-functional teams of engineers, data scientists, designers and managers to deliver technically sophisticated, high-impact systems.Leading on the scope and design of projectsOffering leadership and management to more junior engineers on the team Providing technical expertise to our customersTechnical DeliveryWho We're Looking ForTo succeed in this role, you’ll need the following - these are illustrative requirements and we don’t expect all applicants to have experience in everything (70% is a rough guide):Understanding of and interest in the full machine learning lifecycle, including deploying trained machine learning models developed using common frameworks such as Scikit-learn, TensorFlow, or PyTorchUnderstanding of the core concepts of probability and statistics and familiarity with common supervised and unsupervised learning techniquesExperience in Software Engineering including programming in Python.Technical experience of cloud architecture, security, deployment, and open-source tools. Hands-on experience required of at least one major cloud platformDemonstrable experience with containers and specifically Docker and KubernetesComfortable in a high-growth startup environment.Outstanding verbal and written communication.What we can offer you:
The Faculty team is diverse and distinctive, and we all come from different personal, professional and organisational backgrounds. We all have one thing in common: we are driven by a deep intellectual curiosity that powers us forward each day.
Faculty is the professional challenge of a lifetime. You’ll be surrounded by an impressive group of brilliant minds working to achieve our collective goals.
Our consultants, product developers, business development specialists, operations professionals and more all bring something unique to Faculty, and you’ll learn something new from everyone you meet.
Machine Learning Engineer
Data Science & Analytics
Apply
July 15, 2025
Senior Backend AI Engineer
Mintlify
11-50
USD
0
180000
-
250000
United States
Full-time
Remote
false
Why Mintlify?We're on a mission to empower builders. Massive reach: Our docs platform serves 100 million+ developers every year and powers documentation for 10,000+ companies, including Anthropic, Cursor, Windsurf, Scale AI, X, and over 20% of the last YC batch.Small team, huge impact: We’re only 25 people today, backed by $22 million in funding, each new hire shapes the company’s trajectory.Culture of slope over y-intercept: We value learning velocity, grit, and unapologetically unique personalities.We grew in value faster than headcount and we’re looking to align the two quickly.What you'll work on hereBackend software engineering & infrastructure engineering to support the AI and agentic flowsRAG/data ingestion pipelinesPrompt engineeringModel performance and evalsWhat you bring to the table4+ years of software development experienceDeep customer empathy, including the desire to speak with customers and make product decisionsStrong ability to learn new technologies and be productive in unfamiliar domainsPassion for tasteful user experienceStrong ownership mentalityDeep experience in LLM fine-tuning, RAG systems, and AI automationsBonus points: Previously founded a startup. Extra bonus for a dev-tools startupWhy you should join our engineering teamYou're all about finding the intersection between what excites you and business priorities, and you're excited for your role to evolve accordinglyYou crave a mix of collaborative and heads-down builder time, and are excited to contribute to a small-but-mighty teamYou're looking for an environment where the best ideas win and acknowledge when you're wrongCompany Benefits:Competitive compensation and equity | Free Waymos20 days paid time off every year | Health, dental, vision401k or RRSP | Free lunch and dinners$420/mo. wellness stipend | Annual team offsite
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 15, 2025
Backend AI Engineer
Mintlify
11-50
USD
0
140000
-
200000
United States
Full-time
Remote
false
Why Mintlify?We're on a mission to empower builders. Massive reach: Our docs platform serves 100 million+ developers every year and powers documentation for 10,000+ companies, including Anthropic, Cursor, Windsurf, Scale AI, X, and over 20% of the last YC batch.Small team, huge impact: We’re only 25 people today, backed by $22 million in funding, each new hire shapes the company’s trajectory.Culture of slope over y-intercept: We value learning velocity, grit, and unapologetically unique personalities.We grew in value faster than headcount and we’re looking to align the two quickly.What you'll work on hereBackend software engineering & infrastructure engineering to support the AI and agentic flowsRAG/data ingestion pipelinesPrompt engineeringModel performance and evalsWhat you bring to the table1+ year of software development experienceDeep customer empathy, including the desire to speak with customers and make product decisionsStrong ability to learn new technologies and be productive in unfamiliar domainsPassion for tasteful user experienceDeep experience in LLM fine-tuning, RAG systems, and AI automationsBonus points: Previously founded a startup. Extra bonus for a dev-tools startupWhy you should join our engineering teamYou're all about finding the intersection between what excites you and business priorities, and you're excited for your role to evolve accordinglyYou crave a mix of collaborative and heads-down builder time, and are excited to contribute to a small-but-mighty teamYou're looking for an environment where the best ideas win and acknowledge when you're wrongCompany Benefits:Competitive compensation and equity | Free Waymos20 days paid time off every year | Health, dental, vision401k or RRSP | Free lunch and dinners$420/mo. wellness stipend | Annual team offsite
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 14, 2025
Senior ML Engineer
Lovable
201-500
-
Sweden
Full-time
Remote
false
TL;DR - We’re looking for Founding ML Engineers who will shape how we fine-tune, serve, and evaluate LLMs and frontier models in production - and help define what it means to build a truly lovable AI product.Why Lovable?Lovable lets anyone and everyone build software with plain English. From solopreneurs to Fortune 100 teams, millions of people use Lovable to transform raw ideas into real products - fast. We are at the forefront of a foundational shift in software creation, which means you have an unprecedented opportunity to change the way the digital world works. Over 2 million people in 200+ countries already use Lovable to launch businesses, automate work, and bring their ideas to life. And we’re just getting started.We’re a small, talent-dense team building a generation-defining company from Stockholm. We value extreme ownership, high velocity and low-ego collaboration. We seek out people who care deeply, ship fast, and are eager to make a dent in the world.What we’re looking forLed or contributed to cutting-edge LLM research at top AI labs / globally leading tech startupsTrained and fine-tuned LLMs on large-scale code, language, or multimodal datasetsDeep understanding of transformer architectures, attention mechanisms, and model optimizationShipped ML systems in production, with real users and real uptimeBuilt fast, production-level systems while maintaining strong practices around reproducibility, monitoring, and model performanceYou hold somewhat strong opinions about model safety, latency and helpfulness, but aren’t afraid to experimentWhat you’ll doIn one sentence: Train, tune, and scale frontier LLMs that power lovable products.Own training pipelines for LLMs, from data curation to evaluation and deploymentFine-tune models on high-quality, domain-specific data (code, natural language, product usage signals)Work closely with product engineers to integrate models into real user-facing featuresBuild retrieval pipelines, evaluation frameworks, and experimentation toolsPush the limits of what’s possible with current/upcoming open models, and help define what we should train nextOur tech stackWe're building with tools that both humans and AI love:Frontend: React for lightning-fast interfacesBackend: Golang and Rust for serious performanceCloud: Cloudflare, Fly.io, Google Cloud Run, AWS, TerraformDevOps & Tooling: CI/CD pipelines, observability, infra-as-codeAnd always on the lookout for what's next!How we hireFill in a short form then jump on an intro call with the team.Complete the take-home exerciseShow us how you approach problems during two technical interviewsJoin us for trial work lasting 2 days preferably on-site. We'll see how you tick and you get to meet the team and explore whether joining Lovable feels right for you.About your applicationPlease submit your application in English - it’s our company language so you’ll be speaking lots of it if you joinWe treat all candidates equally - if you’re interested please apply through our careers portal
Machine Learning Engineer
Data Science & Analytics
Apply
July 13, 2025
AI Researcher & Engineer - Multimodal (Real-time Video)
X AI
5000+
USD
180000
-
440000
United States
Full-time
Remote
false
About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.Our team is pushing the frontier of multimodal intelligence through Grok Voice, the world’s smartest AI assistant that is able to listen, see, and respond to you in real time. We actively research reinforcement learning to develop novel video understanding capabilities that solve user problems in both the physical and digital worlds. We own the full-stack of post-training: from data curation to model training, deployment, and iterating end-to-end on the user experience. Ideal candidates thrive well at the intersection of research and engineering. What you’ll do Research, design, and implement methods to enhance video understanding, whether through developing new models, systems, or tools. Improve data quality by curating robust datasets, building scalable data pipelines, and analyzing user interactions with models. Develop and apply evaluation metrics to measure model performance and systematically identify and address failure modes. Manage the complete experimental lifecycle: from designing experiments and training models to deployment and iterative refinement based on feedback and data. Ideal Experience You’d be an exceptional candidate if you possess some (or all) of the following: Experience in LLM reinforcement learning, tool use, and agentic approaches. Experience in real-world computer vision. For example, experience in visual/multimodal search and dealing with noisy visual data. Strong engineering background with experience working with large-scale, modern backend services. An attitude to just execute and solve problems. You’re willing to dive into new codebases you’ve not seen before if it means you can get stuff done faster. Tech Stack Python JAX / PyTorch Rust Interview Process After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews: One-on-one research discussion & coding interviews (three meetings total) Project deep-dive: Present your past exceptional work and your vision with xAI to a small audience. Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet. We do not condone usage of AI in interviews and have tools to detect AI usage. Benefits Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks. Annual Salary Range $180,000 - $440,000 USDxAI is an equal opportunity employer. California Consumer Privacy Act (CCPA) Notice
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Computer Vision Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
July 12, 2025
Machine Learning Scientist, NLP (All Levels)
Abridge
201-500
USD
0
200000
-
300000
United States
Full-time
Remote
false
About AbridgeAbridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients.Our enterprise-grade technology transforms patient-clinician conversations into structured clinical notes in real-time, with deep EMR integrations. Powered by Linked Evidence and our purpose-built, auditable AI, we are the only company that maps AI-generated summaries to ground truth, helping providers quickly trust and verify the output. As pioneers in generative AI for healthcare, we are setting the industry standards for the responsible deployment of AI across health systems.We are a growing team of practicing MDs, AI scientists, PhDs, creatives, technologists, and engineers working together to empower people and make care make more sense. We have offices located in the SoHo neighborhood of New York, the Mission District in San Francisco, and East Liberty in Pittsburgh.The RoleFrom transcribing medical conversations to delivering key takeaways, our trailblazing work in machine learning research makes the Abridge experience possible. We're currently looking to hire research scientists with experience in machine learning and natural language processing. The ideal candidate will bring technical mastery, fluency with foundation models, genuine interest in the medical domain, and strong critical thinking skills to the role. At Abridge, all of our ML work has a strong research component, and all of our research scientists contribute directly to real products that impact the lives of doctors. What You'll DoAdvance the state of the art in medical NLP, in areas including conversation summarization, evidence extraction, outcome prediction, evaluation techniques, and experimentation.Actively contribute to the wider research community by sharing and publishing original researchHelp to define important problems, identify appropriate baselines, develop state-of-the-art methods, and ship them into production.Dial deeply into real-time feedback from clinicians to guide further refinements and innovationsBe results-oriented in the face of ambiguous problems and uncertain outcomesWhat You'll BringStrong research background, as demonstrated through papers and a graduate degree (MS or PhD) in Electrical Engineering, Computer Sciences, Mathematics, or equivalent experience.High-impact publications at peer-reviewed AI conferences (e.g. *CL, NeurIPS, ICML, ICLR).Significant real-world impact, as demonstrated through open source contributions and deployed technology.Strong programming skills with proven experience crafting, prototyping, and delivering machine learning solutions into production.Experience with deep learning libraries (e.g. PyTorch, Jax, Tensorflow) and platforms, multi-GPU training, and statistical analyses of observational and experimental data.Must be willing to work from our SF office at least 3x per weekThis position requires a commitment to a hybrid work model, with the expectation of coming into the office a minimum of (3) three times per week. Relocation assistance is available for candidates willing to move to San Francisco within 6 months of accepting an offer.We value people who want to learn new things, and we know that great team members might not perfectly match a job description. If you’re interested in the role but aren’t sure whether or not you’re a good fit, we’d still like to hear from you.Why Work at Abridge?At Abridge, we’re transforming healthcare delivery experiences with generative AI, enabling clinicians and patients to connect in deeper, more meaningful ways. Our mission is clear: to power deeper understanding in healthcare. We’re driving real, lasting change, with millions of medical conversations processed each month.Joining Abridge means stepping into a fast-paced, high-growth startup where your contributions truly make a difference. Our culture requires extreme ownership—every employee has the ability to (and is expected to) make an impact on our customers and our business.Beyond individual impact, you will have the opportunity to work alongside a team of curious, high-achieving people in a supportive environment where success is shared, growth is constant, and feedback fuels progress. At Abridge, it’s not just what we do—it’s how we do it. Every decision is rooted in empathy, always prioritizing the needs of clinicians and patients.We’re committed to supporting your growth, both professionally and personally. Whether it's flexible work hours, an inclusive culture, or ongoing learning opportunities, we are here to help you thrive and do the best work of your life.If you are ready to make a meaningful impact alongside passionate people who care deeply about what they do, Abridge is the place for you.How we take care of Abridgers:Generous Time Off: 13 paid holidays, flexible PTO for salaried employees, and accrued time off for hourly employees.Comprehensive Health Plans: Medical, Dental, and Vision plans for all full-time employees. Abridge covers 100% of the premium for you and 75% for dependents. If you choose a HSA-eligible plan, Abridge also makes monthly contributions to your HSA. Paid Parental Leave: 16 weeks paid parental leave for all full-time employees.401k and Matching: Contribution matching to help invest in your future.Pre-tax Benefits: Access to Flexible Spending Accounts (FSA) and Commuter Benefits.Learning and Development Budget: Yearly contributions for coaching, courses, workshops, conferences, and more.Sabbatical Leave: 30 days of paid Sabbatical Leave after 5 years of employment.Compensation and Equity: Competitive compensation and equity grants for full time employees.... and much more!Diversity & InclusionAbridge is an equal opportunity employer. Diversity and inclusion is at the core of what we do. We actively welcome applicants from all backgrounds (including but not limited to race, gender, educational background, and sexual orientation).Staying safe - Protect yourself from recruitment fraudWe are aware of individuals and entities fraudulently representing themselves as Abridge recruiters and/or hiring managers. Abridge will never ask for financial information or payment, or for personal information such as bank account number or social security number during the job application or interview process. Any emails from the Abridge recruiting team will come from an @abridge.com email address. You can learn more about how to protect yourself from these types of fraud by referring to this article. Please exercise caution and cease communications if something feels suspicious about your interactions.
Machine Learning Engineer
Data Science & Analytics
NLP Engineer
Software Engineering
Research Scientist
Product & Operations
Apply
July 11, 2025
ML Research Engineer
Oumi
11-50
USD
0
140000
-
220000
United States
Full-time
Remote
true
About OumiWhy we exist: Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on humanity, and that developing it collectively, in the open, is the best path forward to ensure that it is done efficiently and safely.What we do: Oumi provides an all-in-one platform to build state-of-the-art AI models, end to end, from data preparation to production deployment, empowering innovators to build cutting-edge models at any scale. Oumi also develops open foundation models in collaboration with academic collaborators and the open community.Our Approach: Oumi is fundamentally an open-source first company, with open-collaboration across the community as a core principle. Our work is:Open Source First: All our platform and core technology is open sourceResearch-driven: We conduct and publish original research in AI, collaborating with our community of academic research labs and collaboratorsCommunity-powered: We believe in the power of open-collaboration and welcome contributions from researchers and developers worldwideRole OverviewWe’re looking for a Research Engineer to join our team working on generative AI and LLMs. In this role, you'll bridge research and engineering—designing scalable infrastructure, enabling cutting-edge experiments, and helping open-source the next generation of LLMs. You will collbaborate closely with our research team and the open-source community to build tools, run evaluations, and contribute to models that are safe, performant, and accessible.What you'll do:Design and build systems to support training, fine-tuning, and evaluating large language models.Partner with researchers to define experiments, write reusable code, run benchmarks, and interpret results.Work on LLM alignment and tuning using techniques like reinforcement learning (RLHF), supervised fine-tuning, and prompt optimization.Develop scalable ML pipelines for distributed training (e.g., across multi-GPU and multi-node environments).Contribute to open-source tooling and models to support transparency and community collaboration.Optimize performance across the ML stack—from data loading to deployment.What you’ll bring:Strong experience in machine learning, deep learning, or NLP—especially in generative AI or LLMs.Solid programming skills in Python, and experience with ML frameworks like PyTorch.Experience designing or maintaining ML infrastructure at scale (e.g., cloud-based training, distributed systems).Comfort working in highly collaborative environments with research and engineering teams.Bonus: experience with academic publications, open-source contributions, or LLM alignment work.Share Oumi's values: Beneficial for all, Customer-obsessed, Radical Ownership, Exceptional Teammates, Science-grounded.BenefitsCompetitive salary: $140,000 - $220,000Equity in a high-growth startupComprehensive health, dental and vision insurance21 days PTORegular team offsites and events
Machine Learning Engineer
Data Science & Analytics
Research Scientist
Product & Operations
Software Engineer
Software Engineering
Apply
July 11, 2025
Member of Technical Staff, Large Generative Models
Captions
101-200
USD
0
175000
-
275000
United States
Full-time
Remote
false
Captions is the leading AI video company—our mission is to empower anyone, anywhere to tell their stories through video. Over 10 million creators and businesses have used Captions to simplify video creation with truly novel and groundbreaking AI capabilities.We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. As an early member of our team, you’ll have an opportunity to have an outsized impact on our products and our company's culture.Our TechnologyMirage Announcement our proprietary omni-modal foundation modelSeeing Voices (technical paper) generating A-roll video from audio with MirageMirage Studio for generating expressive videos at scale"Captions: For Talking Videos” available in the iOS app storePress CoverageLenny’s Podcast: Interview with Gaurav Misra (CEO)Latest Fundraise: Series C AnnouncementThe Information: 50 Most Promising StartupsFast Company: Next Big Things in TechBusiness Insider: 34 most promising AI startupsTIME: The Best Inventions of 2024Our InvestorsWe’re very fortunate to have some the best investors and entrepreneurs backing us, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SVAngel, 20VC, Ludlow Ventures, Chapter One, and more.** Please note that all of our roles will require you to be in-person at our NYC HQ (located in Union Square) We do not work with third-party recruiting agencies, please do not contact us** About the role:Captions is seeking an exceptional Research Engineer (MOTS) to advance the state-of-the-art in large-scale multimodal video diffusion models. You'll conduct novel research on generative modeling architectures, develop new training techniques, and scale models to billions of parameters. As a key member of our ML Research team, you'll work at the cutting edge of multimodal generation while building systems that enable natural, controllable video creation. We're already training large-scale models with demonstrated product impact, and we're excited to continue expanding the scope and capabilities of our research.We're especially excited about pushing the boundaries of audio-video generation, with a focus on realistic and charismatic human behavior that enables natural storytelling and creative iteration. Our models power creative tools used by millions of creators, and we're tackling fundamental challenges in how to generate compelling human motion, expression, and speech. Key Responsibilities:Research & Architecture Development:Design and implement novel architectures for large-scale video and multimodal diffusion modelsDevelop new approaches to multimodal fusion, temporal modeling, and video controlResearch temporal video editing techniques and controllable generationResearch and validate scaling laws for video generation modelsCreate new loss functions and training objectives for improved generation qualityDrive rapid experimentation with model architectures and training strategiesValidate research directly through product deployment and user feedbackModel Training & Optimization:Train and optimize models at massive scale (10s-100s of billions of parameters)Develop sophisticated distributed training approaches using FSDP, DeepSpeed, Megatron-LMDesign and implement model surgery techniques (pruning, distillation, quantization)Create new approaches to memory optimization and training efficiencyResearch techniques for improving training stability at scaleConduct systematic empirical studies of architecture and optimization choicesTechnical Innovation:Advance state-of-the-art in video model architecture design and optimization Develop new approaches to temporal modeling for video generationCreate novel solutions for multimodal learning and cross-modal alignmentResearch and implement new optimization techniques for generative modeling and samplingDesign and validate new evaluation metrics for generation qualitySystematically analyze and improve model behavior across different regimesRequirements:Research Experience:Master's or PhD in Computer Science, Machine Learning, or related fieldTrack record of research contributions at top ML conferences (NeurIPS, ICML, ICLR)Demonstrated experience implementing and improving upon state-of-the-art architecturesDeep expertise in generative modeling approaches (diffusion, autoregressive, VAEs, etc.)Strong background in optimization techniques and loss function designExperience with empirical scaling studies and systematic architecture researchTechnical Expertise:Strong proficiency in modern deep learning tooling (PyTorch, CUDA, Triton, FSDP, etc.)Experience training diffusion models with 10B+ parametersExperience with very large language models (200B+ parameters) is a plusDeep understanding of attention, transformers, and modern multimodal architecturesExpertise in distributed training systems and model parallelismProven ability to implement and improve complex model architecturesTrack record of systematic empirical research and rigorous evaluationEngineering Capabilities:Ability to write clean, modular research code that scalesStrong software engineering practices including testing and code reviewExperience with rapid prototyping and experimental designStrong analytical skills for debugging model behavior and training dynamicsFacility with profiling and optimization toolsTrack record of bringing research ideas to productionExperience maintaining high code quality in a research environmentAbout the Team:You'll work directly alongside our research and engineering teams in our NYC office. We've intentionally built a culture where technical innovation and research excellence are highly valued - your success will be measured by your contributions to improving our models and advancing the field, not by your ability to navigate politics. We're a team that loves diving deep into complex technical problems and emerging with practical breakthroughs.Our team values:Open technical discussions and collaborationRapid iteration and practical solutionsDeep technical expertise and continuous learningDirect impact on research and product outcomesWhat sets us apart:Opportunity to advance the state-of-the-art in video generationDirect impact on products used by millions of creatorsAccess to massive compute resources and diverse, large-scale datasetsEnvironment that values both research excellence and practical impactAbility to validate research through direct product feedbackBenefits:Comprehensive medical, dental, and vision plans401K with employer matchCommuter BenefitsCatered lunch multiple days per weekDinner stipend every night if you're working late and want a bite! Doordash DashPass subscriptionHealth & Wellness Perks (Talkspace, Kindbody, One Medical subscription, HealthAdvocate, Teladoc)Multiple team offsites per year with team events every monthGenerous PTO policyCaptions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.Please note benefits apply to full time employees only.
Machine Learning Engineer
Data Science & Analytics
Research Scientist
Product & Operations
Apply
July 11, 2025
Member of Technical Staff, GPU Optimization
Captions
101-200
USD
0
175000
-
275000
United States
Full-time
Remote
false
Captions is the leading AI video company—our mission is to empower anyone, anywhere to tell their stories through video. Over 10 million creators and businesses have used Captions to simplify video creation with truly novel and groundbreaking AI capabilities.We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. As an early member of our team, you’ll have an opportunity to have an outsized impact on our products and our company's culture.Our TechnologyMirage Announcement our proprietary omni-modal foundation modelSeeing Voices (technical paper) generating A-roll video from audio with MirageMirage Studio for generating expressive videos at scale"Captions: For Talking Videos” available in the iOS app storePress CoverageLenny’s Podcast: Interview with Gaurav Misra (CEO)Latest Fundraise: Series C AnnouncementThe Information: 50 Most Promising StartupsFast Company: Next Big Things in TechBusiness Insider: 34 most promising AI startupsTIME: The Best Inventions of 2024Our InvestorsWe’re very fortunate to have some the best investors and entrepreneurs backing us, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SVAngel, 20VC, Ludlow Ventures, Chapter One, and more.** Please note that all of our roles will require you to be in-person at our NYC HQ (located in Union Square) We do not work with third-party recruiting agencies, please do not contact us** About the RoleAs an expert in making AI models run fast—really fast—you live at the intersection of CUDA, PyTorch, and generative models, and get excited by the idea of squeezing every last bit of performance out of modern GPUs. You will have the opportunity to turn our cutting-edge video generation research into scalable, production-grade systems. From designing custom CUDA or Triton kernels to profiling distributed inference pipelines, you'll work across the full stack to make sure our models train and serve at peak performance.Key ResponsibilitiesOptimize model training and inference pipelines, including data loading, preprocessing, checkpointing, and deployment, for throughput, latency, and memory efficiency on NVIDIA GPUsDesign, implement, and benchmark custom CUDA and Triton kernels for performance-critical operationsIntegrate low-level optimizations into PyTorch-based codebases, including custom ops, low-precision formats, and TorchInductor passesProfile and debug the entire stack—from kernel launches to multi-GPU I/O paths—using Nsight, nvprof, PyTorch Profiler, and custom toolsWork closely with colleagues to co-design model architectures and data pipelines that are hardware-friendly and maintain state-of-the-art qualityStay on the cutting edge of GPU and compiler tech (e.g., Hopper features, CUDA Graphs, Triton, FlashAttention, and more) and evaluate their impactCollaborate with infrastructure and backend experts to improve cluster orchestration, scaling strategies, and observability for large experimentsProvide clear, data-driven insights and trade-offs between performance, quality, and costContribute to a culture of fast iteration, thoughtful profiling, and performance-centric designRequired QualificationsBachelor's degree in Computer Science, Electrical/Computer Engineering, or equivalent practical experience3+ years of hands-on experience writing and optimizing CUDA kernels for production ML workloadsDeep understanding of GPU architecture: memory hierarchies, warp scheduling, tensor cores, register pressure, and occupancy tuningStrong Python skills and familiarity with PyTorch internals, TorchScript, and distributed data-parallel trainingProven track record profiling and accelerating large-scale training and inference jobs (e.g., mixed precision, kernel fusion, custom collectives)Comfort working in Linux environments with modern CI/CD, containerization, and cluster managers such as KubernetesPreferred QualificationsAdvanced degree (MS/PhD) in Computer Science, Electrical/Computer Engineering, or related fieldExperience with multi-modal AI systems, particularly video generation or computer vision modelsFamiliarity with distributed training frameworks (DeepSpeed, FairScale, Megatron) and model parallelism techniquesKnowledge of compiler optimization techniques and experience with MLIR, XLA, or similar frameworksExperience with cloud infrastructure (AWS, GCP, Azure) and GPU cluster managementAbility to translate research goals into performant code, balancing numerical fidelity with hardware constraintsStrong communication skills and experience mentoring junior engineersBenefits:Comprehensive medical, dental, and vision plans401K with employer matchCommuter BenefitsCatered lunch multiple days per weekDinner stipend every night if you're working late and want a bite! Doordash DashPass subscriptionHealth & Wellness Perks (Talkspace, Kindbody, One Medical subscription, HealthAdvocate, Teladoc)Multiple team offsites per year with team events every monthGenerous PTO policyCaptions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.Please note benefits apply to full time employees only.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
MLOps / DevOps Engineer
Data Science & Analytics
Apply
July 11, 2025
AI Engineer
E2B
11-50
-
Czech Republic
Full-time
Remote
false
About the roleYour job will be to inspire developers what they can build E2B. Part of that job is creating examples based on what we often see that our users are doing and another part of that is leading by example by building experimental projects using E2B.You’ll be building both smaller examples that you can find in our Cookbook but also bigger projects like Fragments or AI Analyst.This role requires a high amount of creativity and ability to finish the projects by taking them from 0 to 1.What we’re looking for3+ years of experience being a software engineerBeing comfortable with fast-pace field and environmentBeing interested in the latest news in the AI fieldExcited to work in person from Prague on a devtool productDetail oriented with a great tasteExcited to work closely with our usersNot being afraid to take ownership of the part of our productIf you join E2B, you’ll get a lot of freedom. We expect you to be proactive and take ownership. You’ll be taking projects from 0 to 1 with the support of the rest of the team.What it’s like to work at E2BWork at a fast growing startup at an early team (we grow 20%-100% MoM)We ship fast but don’t release junkWe like hard work and problems. Challenges mean potential value.We have a long runway and can offer a competitive salary for the startup at our stageWork closely with other AI companies on the edge of what’s possible todayDogfooding our own product on projects like FragmentsNo meetings, highly writing and transparent cultureYou’re the decision maker in day-to-day, important product and roadmap decisions are on Vasek (CEO) and Tomas (CTO)Spend 10-20% of the roadmap on highly experimental projectsHiring processWe aim to have the whole process done in 7-10 days. We understand that it’s important to move fast and try to follow up in 24 hours after each stage.30-minute call with Vasek (CEO). We’ll go over your past work experience and what you’re looking for to make sure this would be a good fit for both of us.First technical interview with Tomas (CTO). About 1 hour long call. You’ll get asked thorough technical questions. Often these are questions about problems that we ourselves experienced while building E2B.Second technical interview. Another 1-2 hours long call. Expect live coding on this call. We’ll ask you to solve specific problems (don’t worry, it’s not a leet code) that are related to your role.One day of in-person hacking at our office (paid). We invite you to our office to work on the product with us. This is a great opportunity for all of us to try how it’s working together and for you to meet the team.Last call with Vasek. Last 30-minute call with the CEO to talk more about the role and answer any of your questions.Decision and potential offer.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 10, 2025
AI Engineer
E2B
11-50
-
United States
Full-time
Remote
false
About the roleYou’ll be working on tooling on top of our sandboxWhat we’re looking for3+ years of experience being a software engineerBeing comfortable with fast-pace field and environmentBeing interested in the latest news in the AI fieldExcited to work in person from San Francisco on a devtool productDetail oriented with a great tasteExcited to work closely with our usersNot being afraid to take ownership of the part of our productIf you join E2B, you’ll get a lot of freedom. We expect you to be proactive and take ownership. You’ll be taking projects from 0 to 1 with the support of the rest of the team.What it’s like to work at E2BWork at a fast growing startup at an early team (we grow 20%-100% MoM)We ship fast but don’t release junkWe like hard work and problems. Challenges mean potential value.We have a long runway and can offer a competitive salary for the startup at our stageWork closely with other AI companies on the edge of what’s possible todayDogfooding our own product on projects like FragmentsNo meetings, highly writing and transparent cultureYou’re the decision maker in day-to-day, important product and roadmap decisions are on Vasek (CEO) and Tomas (CTO)Spend 10-20% of the roadmap on highly experimental projectsHiring processWe aim to have the whole process done in 7-10 days. We understand that it’s important to move fast and try to follow up in 24 hours after each stage.30-minute call with Vasek (CEO). We’ll go over your past work experience and what you’re looking for to make sure this would be a good fit for both of us.First technical interview with Tomas (CTO). About 1 hour long call. You’ll get asked thorough technical questions. Often these are questions about problems that we ourselves experienced while building E2B.Second technical interview. Another 1-2 hours long call. Expect live coding on this call. We’ll ask you to solve specific problems (don’t worry, it’s not a leet code) that are related to your role.One day of in-person hacking at our office (paid). We invite you to our office to work on the product with us. This is a great opportunity for all of us to try how it’s working together and for you to meet the team.Last call with Vasek. Last 30-minute call with the CEO to talk more about the role and answer any of your questions.Decision and potential offer.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 10, 2025
Platform Architect
webAI
101-200
-
United States
Full-time
Remote
false
About Us:webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. We recognize the evolving demands of a data-driven society for scalability and flexibility, and we firmly believe that the future of AI lies in distributed processing at the edge, bringing computation closer to the source of data generation. Our mission is to build a future where a company's valuable data and intellectual property remain entirely private, enabling the deployment of large-scale AI models directly on standard consumer hardware without compromising the information embedded within those models. We are developing an end-to-end platform that is secure, scalable, and fully under the control of our users, empowering enterprises with AI that understands their unique business. We are a team driven by truth, ownership, tenacity, and humility, and we seek individuals who resonate with these core values and are passionate about shaping the next generation of AI.About the Role:We are seeking a visionary Platform Architect to lead core architectural decisions across our distributed AI stack — from runtime orchestration, through application interfaces, to mesh-aware networking. This is role lives at the intersection of systems architecture, AI infrastructure, and distributed computing. You will define, validate, and evolve the technical blueprint that powers webAI’s intelligent mesh and ensures it scales with enterprise-grade resilience, security, and performance.Key Responsibilities:End-to-End Architecture:
Own the design and evolution of the platform architecture across the runtime, application, and mesh-network layers. Prioritize modularity, composability, and real-world deployability.AI Runtime Design:
Define and build high-performance runtimes, including task scheduling, memory management, and hardware abstraction for heterogeneous environments (CPU, GPU, NPU). Contribute to or integrate frameworks like ONNX, TensorRT, or custom inference stacks.Networking Layer:
Architect resilient peer-to-peer communication and mesh networking systems across diverse radios (Wi-Fi, Bluetooth, WebRTC, etc.), including device discovery, data synchronization, and fault tolerance.Security & Observability:
Embed zero-trust principles, encryption at rest and in transit, signed inference, and platform-level logging, monitoring, and diagnostics by design.Technical Leadership:
Serve as a technical mentor for engineering teams, validate architectural choices, conduct design reviews, and collaborate with stakeholders to ensure alignment with product vision and performance requirements.Required Skills & Qualifications:Experience:10+ years of relevant experience in systems architecture, distributed computing, or AI infrastructure, with a proven track record designing and scaling production-grade distributed platforms. We require depth of experience and demonstrated impact over tenure.AI Runtime DesignExperience building or contributing to high-performance runtimes (ONNX, TensorRT, custom runtimes).Task scheduling, memory management, hardware abstraction (CPU, GPU, NPU).Tooling & Stack PreferencesSystems programming in Rust, Go, or C++ (must-have: not just Python or web dev).Bonus: experience with federated learning, agent communication, MPC, or privacy-preserving computation.Understanding of real-time, low-latency, and resource-constrained environments.Networking LayerCross-radio P2P communication (Wi-Fi, Bluetooth, WebRTC, etc).Device discovery, resilient mesh networking, and data synchronization.Security & ObservabilityFamiliarity with zero-trust models, signed inference, encryption at rest & in transit.Build-in logging, monitoring, and platform-level diagnostics.We at webAI are committed to living out the core values we have put in place as the foundation on which we operate as a team. We seek individuals who exemplify the following:Truth - Emphasizing transparency and honesty in every interaction and decision.Ownership - Taking full responsibility for one’s actions and decisions, demonstrating commitment to the success of our clients. Tenacity - Persisting in the face of challenges and setbacks, continually striving for excellence and improvement.Humility - Maintaining a respectful and learning-oriented mindset, acknowledging the strengths and contributions of others.Benefits:Competitive salary and performance-based incentives.Comprehensive health, dental, and vision benefits package.401k Match$200/mos Health and Wellness Stipend$400/year Continuing Education Credit$500/year Function Health subscription (US-based only)Free parking, for in-office employeesUnlimited Approved PTOParental Leave for Eligible EmployeesSupplemental Life Insurance
webAI is an Equal Opportunity Employer and does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We adhere to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, it is the policy of webAI to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 9, 2025
Generative AI Engineer
Dataiku
1001-5000
USD
0
162000
-
185000
United States
Full-time
Remote
false
Dataiku is The Universal AI Platform™, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. Providing no-, low-, and full-code capabilities, Dataiku meets teams where they are today, allowing them to begin building with AI using their existing skills and knowledge.As a Generative AI Engineer on the ED&A team, you will design, develop, and deploy enterprise AI applications for internal stakeholders — primarily using Dataiku’s platform and associated tooling. You’ll work with large language models (LLMs) from third-party providers (OpenAI, Anthropic, AWS Bedrock, Azure, etc.), build full-stack web applications, and create specialized “Agents” (and corresponding “Agent Tools”) capable of executing complex tasks. Success in this role requires strong programming abilities (Python, SQL, HTML/CSS/JS), excellent communication skills, and a deep interest in operationalizing generative AI across the company. How you'll make an impact Solution Development & Integration Design end-to-end Generative AI solutions on Dataiku’s platform (or Python-based frameworks where needed). Integrate and optimize third-party LLM APIs (OpenAI, Anthropic, AWS Bedrock, Azure etc), focusing on prompt engineering and orchestration (e.g., LangChain). Build and maintain robust data pipelines, ensuring AI applications are performant and secure. Agent & Agent Tool Creation Develop and implement autonomous or semi-autonomous AI “Agents” that interact with multiple data sources and services. Design Agent Tools (e.g., data retrieval modules, decisioning logic, automated workflows) that Agents can leverage to fulfill complex tasks. Monitor Agent performance and iterate based on usage data, business feedback, and changing requirements. Web Application Development Create front-end user interfaces (UI) for AI applications using HTML, CSS, and JavaScript—often within Dataiku’s webapps framework. Collaborate on UX design, ensuring internal stakeholders find solutions intuitive and responsive. Collaboration & Governance Work closely with analytics and data engineering teams to maintain metadata on reference datasets for LLM consumption. Adhere to data governance, security, and regulatory compliance requirements for all GenAI solutions. Partner with stakeholders to identify opportunities for AI-driven process optimization and automation. Continuous Learning & Thought Leadership Provide product feedback to the development team to improve the product. Stay current with industry trends, particularly around AI agent frameworks and LLM best practices (LangChain, prompt engineering, vector databases). What you'll need to be successful Must have strong Python skills (including familiarity with typical data science libraries). Experience developing AI, ML, or analytics solutions in a production setting; familiarity with agentic workflows (task decomposition, tool chaining, autonomous decision-making) and hands-on experience building AI agents is a plus. Web development fundamentals (HTML, CSS, JavaScript); experience with Dataiku webapps or similar frameworks a bonus. Familiarity with LangChain or similar LLM orchestration libraries is strongly preferred. Exposure to third-party LLMs (OpenAI, Anthropic, AWS Bedrock, Azure) or other NLP/AI APIs. Bachelor’s or Master’s in Computer Science, Data Science, Engineering, or a related field; equivalent experience also considered. Demonstrated ability to integrate multiple technologies, optimize workflows, and deliver user-friendly AI solutions. Strong communication and presentation skills, capable of collaborating effectively with both technical and non-technical stakeholders. Problem-solving mindset with a passion for innovation and delivering measurable business value. Openness to learning new tools (e.g., Dataiku) and adapting to an evolving tech landscape. Must be currently based within the EST timezone, but ideally located in New York City. Why join our team? Work with Cutting-Edge Tech: You’ll gain hands-on experience with Dataiku’s platform, LangChain, and leading LLM providers. Impact & Visibility: Our GenAI solutions address critical business needs, offering high visibility and meaningful impact. Collaborative Environment: We prioritize teamwork, mentorship, and continuous learning—perfect for both experienced engineers and those just starting out. Career Growth: From contributing to technical roadmaps to presenting results to stakeholders, you’ll have countless opportunities to develop your skills and influence the company’s AI strategy. Compensation and Benefits
The final compensation package for this role will be determined during the interview process and is based on a variety of factors, including, but not limited to, geographic location, internal equity, education, skill set, experience and training. Eligible roles may also be entitled to receive commission or other variable compensation through Dataiku's incentive compensation program. Dataiku also offers comprehensive benefits, including stock options, medical, dental, and vision plans, flexible spending accounts, pre-tax commuter benefits, a 401k company match, paid vacations and sick leave, paid parental leave, employer paid disability coverage, and additional health and wellbeing perks and benefits. Dataiku reserves the right to amend or modify employee perks and benefits at any time.US only national base pay ranges$162,000—$185,000 USD What are you waiting for! At Dataiku, you'll be part of a journey to shape the ever-evolving world of AI. We're not just building a product; we're crafting the future of AI. If you're ready to make a significant impact in a company that values innovation, collaboration, and your personal growth, we can't wait to welcome you to Dataiku! And if you’d like to learn even more about working here, you can visit our Dataiku LinkedIn page. Our practices are rooted in the idea that everyone should be treated with dignity, decency and fairness. Dataiku also believes that a diverse identity is a source of strength and allows us to optimize across the many dimensions that are needed for our success. Therefore, we are proud to be an equal opportunity employer. All employment practices are based on business needs, without regard to race, ethnicity, gender identity or expression, sexual orientation, religion, age, neurodiversity, disability status, citizenship, veteran status or any other aspect which makes an individual unique or protected by laws and regulations in the locations where we operate. This applies to all policies and procedures related to recruitment and hiring, compensation, benefits, performance, promotion and termination and all other conditions and terms of employment. If you need assistance or an accommodation, please contact us at: reasonable-accommodations@dataiku.com Protect yourself from fraudulent recruitment activity Dataiku will never ask you for payment of any type during the interview or hiring process. Other than our video-conference application, Zoom, we will also never ask you to make purchases or download third-party applications during the process. If you experience something out of the ordinary or suspect fraudulent activity, please review our page on identifying and reporting fraudulent activity here.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 9, 2025
Associate Professional Services Engineer
Nice
5000+
-
Philippines
Full-time
Remote
false
At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.So, what’s the role all about?
The Associate Professional Services Engineer is a customer-facing, billable role responsible for delivering specialized expertise and solutions for NICE products and services. This includes implementing, configuring, and optimizing AI-driven applications, such as chatbots, knowledge assistants, and other digital solutions, for enterprise contact centers. The position combines technical and business responsibilities, requiring engagement with clients to understand their needs, provide consultative support, and deliver cutting-edge solutions. How will you make an impact?
Technical Implementation: Configure, program, and develop NICE solutions, including AI, NLU, ACD, IVR, ASR, and CRM integrations. Lead the design and development of multiple AI and bot applications, ensuring alignment with customer requirements and industry best practices. Optimize and maintain multiple AI bots, including both generative and legacy models. Implement and enhance AI-driven services such as knowledge assistant engines and conversational intelligence. Project Management: Collaborate with project managers to design and oversee end-to-end project rollouts. Manage system lifecycle development, change control processes, and risk analysis for enterprise solutions. Ensure seamless project execution through inter-departmental coordination and clear communication. Business Consulting: Provide subject matter expertise on NICE digital products and AI solutions during client consultations. Conduct business analysis to assess user needs, design tailored solutions and provide industry guidance. Promote the use of AI tools to enhance decision-making and operational efficiency across business units. Leadership and Mentorship: Mentor and lead project teams, sharing expertise and fostering a collaborative environment. Develop documentation and processes for emerging digital products from R&D teams. Have you got what it takes?
Bachelor’s degree in technical (e.g., Computer Science, Information Systems, Electrical Engineering) or business field (e.g., Marketing, MIS) or equivalent work experience. 3+ years of professional experience, with 1+ years in digital channels or AI/Bot software applications preferred. Proven expertise in: Technical configuration and programming of AI and contact center technologies. CRM integrations, APIs, and other ecosystem technologies. Best practices for contact center operations and KPIs. Familiarity with: Generative AI models, NLU techniques, and automation principles. Industry trends and emerging technologies in AI and digital customer engagement. Desired personal traits: Analytical and inquisitive mindset. Team-oriented with strong interpersonal skills. Early adopter of innovative technologies. Strong sense of accountability and ownership. Professional communication, behavior and demeanor. What’s in It for You?
Join an ever-growing, market-disrupting global company where teams of top talent work in a fast-paced, collaborative, and creative environment. NICE offers endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate about innovation and thrive in dynamic settings, this is your chance to make an impact.
Flexible Work Model:
Experience the NICE-FLEX hybrid model with two days in the office and three days of remote work per week, fostering collaboration and innovation while maintaining work-life balance.
Requisition ID: 7923
Reporting To: Tech Manager
Role Type: Individual ContributorAbout NiCE NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NiCE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions. Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries. NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
MLOps / DevOps Engineer
Data Science & Analytics
Apply
July 9, 2025
Research Engineer / Research Scientist, Pre-training
Anthropic
1001-5000
0
0
-
0
Switzerland
Full-time
Remote
false
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.About the team We are seeking passionate Research Scientists and Engineers to join our growing Pre-training team in Zurich. We are involved in developing the next generation of large language models. The team primarily focuses on multimodal capabilities: giving LLMs the ability to understand and interact with modalities other than text. In this role, you will work at the intersection of cutting-edge research and practical engineering, contributing to the development of safe, steerable, and trustworthy AI systems. Responsibilities In this role you will interact with many parts of the engineering and research stacks. Conduct research and implement solutions in areas such as model architecture, algorithms, data processing, and optimizer development Independently lead small research projects while collaborating with team members on larger initiatives Design, run, and analyze scientific experiments to advance our understanding of large language models Optimize and scale our training infrastructure to improve efficiency and reliability Develop and improve dev tooling to enhance team productivity Contribute to the entire stack, from low-level optimizations to high-level model design Qualifications & Experience We encourage you to apply even if you do not believe you meet every single criterion. Because we focus on so many areas, the team is looking for both experienced engineers and strong researchers, and encourage anyone along the researcher/engineer spectrum to apply. Degree (BA required, MS or PhD preferred) in Computer Science, Machine Learning, or a related field Strong software engineering skills with a proven track record of building complex systems Expertise in Python and deep learning frameworks Have worked on high-performance, large-scale ML systems, particularly in the context of language modeling Familiarity with ML Accelerators, Kubernetes, and large-scale data processing Strong problem-solving skills and a results-oriented mindset Excellent communication skills and ability to work in a collaborative environment You'll thrive in this role if you Have significant software engineering experience Are able to balance research goals with practical engineering constraints Are happy to take on tasks outside your job description to support the team Enjoy pair programming and collaborative work Are eager to learn more about machine learning research Are enthusiastic to work at an organization that functions as a single, cohesive team pursuing large-scale AI research projects Have ambitious goals for AI safety and general progress in the next few years, and you’re excited to create the best outcomes over the long-term Sample Projects Optimizing the throughput of novel attention mechanisms Proposing Transformer variants, and experimentally comparing their performance Preparing large-scale datasets for model consumption Scaling distributed training jobs to thousands of accelerators Designing fault tolerance strategies for training infrastructure Creating interactive visualizations of model internals, such as attention patterns If you're excited about pushing the boundaries of AI while prioritizing safety and ethics, we want to hear from you!Logistics Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process
Machine Learning Engineer
Data Science & Analytics
Research Scientist
Product & Operations
Software Engineer
Software Engineering
Apply
July 8, 2025
Research Engineer, Focused Bets
OpenAI
5000+
USD
295000
-
440000
United States
Full-time
Remote
false
About the TeamThe Strategic Deployment team makes frontier models more capable, reliable, and aligned to transform high-impact domains. On one hand, this involves deploying models in real-world, high-stakes settings to drive AI-driven transformation and elicit insights—training data, evaluation methods, and techniques—to shape our frontier model development. On the other hand, we leverage these learning to build the science and engineering of impactful frontier model deployment.Put differently, we want to understand: if AGI is viewed as AI being able to majorly transform our economy, how close are we to AGI? What’s still missing? How do we bridge these gaps?
About the RoleAs a Research Engineer on the Focused Bets effort in the Strategic Deployment team, you will help OpenAI identify real-world domains that are ripe for transformation through frontier AI capabilities. You’ll act as a technical lead and hands-on builder, partnering with subject matter experts to understand the key aspects of a given domain, pinpointing the most critical tasks in that domain, and developing technical proofs-of-concept that build conviction driving OpenAI’s strategic engagements.We’re looking for people who combine a strong machine learning background with an entrepreneurial mindset—engineers who can thrive in ambiguity, quickly iterate toward the signal, and adapt their approach as new insights emerge. You should be deeply curious about where AI can (and can’t yet) have impact, and excited to push the boundaries of what’s possible.This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.In this role, you will:Drive research and prototyping sprints into specific domains in collaboration with domain experts.Translate real-world tasks into tractable ML/engineering problems and develop proof-of-concept solutions using OpenAI models and tools.Deliver working demos, run in-the-wild evaluations, and distill insights into actionable guidance for OpenAI’s research and deployment teams.
You might thrive in this role if you:Care about real-world impact of AI.Are excited to drive how frontier models are developed and deployed.Have hands-on experience in AI research, systems, or applied science.Are excited by startup-style ambiguity and enjoy working across disciplines to drive a project from zero to one.Enjoy working in open-ended problem spaces and high-feedback environments.Are a curious, adaptable generalist who enjoys learning fast, asking deep questions, and solving open-ended problems.About OpenAIOpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic. For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.OpenAI Global Applicant Privacy PolicyAt OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Machine Learning Engineer
Data Science & Analytics
Apply
July 8, 2025
Member of Technical Staff - Reasoning Workflows
Talent Labs
11-50
-
United Kingdom
Full-time
Remote
false
We are looking for a highly skilled Member of Technical Staff to lead the development of cutting-edge workflows. You will build autonomous systems that can navigate complex scientific tasks entirely through natural language conversation. In your role, you will architect and deploy reasoning systems that democratise access to breakthrough synthetic biology tools, enabling researchers worldwide to leverage our frontier models through intuitive chat interfaces.Who we are:At Latent Labs, we are building frontier models that learn the fundamentals of biology. We pursue ambitious goals with curiosity and are committed to scientific excellence. Before building Latent Labs, our team co-developed DeepMind's Nobel-prize winning AlphaFold, invented latent diffusion, and built pioneering lab data management systems as well as high throughput protein screening platforms. At Latent Labs you will be working with some of the brightest minds in generative AI and biology.Our team is committed to interdisciplinary exchange, continuous learning and collaboration. Team offsites help us foster a culture of trust across our London and San Francisco sites.We're looking for innovators passionate about tackling complex challenges and maximising positive global impact. Join us on our moonshot mission.Who You Are:You are a strong software engineer with deep experience in Python, API design, and distributed systems architecture.You are an expert in LLM orchestration. You have hands-on experience with LLM APIs (OpenAI, Anthropic, etc.) and orchestration frameworks like LangChain, LlamaIndex, or have built custom agent frameworks from scratch.You understand intelligent information retrieval. You have experience with RAG (Retrieval-Augmented Generation) systems, vector databases, and embedding models for knowledge extraction.You can architect complex workflows. You have experience with workflow orchestration tools (Airflow, Prefect, Temporal) or have built custom pipeline systems for multi-step autonomous processes.You bridge science and engineering. You are comfortable with scientific computing libraries (NumPy, SciPy, pandas) and understand scientific literature formats, databases (PubMed, arXiv), and academic data processing.What Sets You Apart:You have a research background. You are a former academic researcher who transitioned to industry ML/AI roles, or a research software engineer with deep ML/AI experience.You're passionate about scientific automation. You have experience with document processing, OCR, text extraction from academic papers, and scientific data formats.You understand the research ecosystem. You have worked in academic or pharmaceutical research environments and understand research workflows and publishing processes.You're a multimodal specialist. You have a background in natural language processing, particularly for scientific text processing and citation networks.Your Responsibilities:Build autonomous scientific agents that can execute complex research workflows through natural language interaction—from protein structure analysis to experimental design.Architect end-to-end reasoning systems that integrate our platform capabilities with intelligent decision-making, enabling users to accomplish sophisticated tasks through simple chat interfaces.Develop knowledge discovery pipelines that can autonomously mine scientific literature, identify undrugged disease pathways, and propose novel therapeutic targets.Create scientific content at scale by building agents that can design experiments, generate hypotheses, and produce research-grade articles and blog posts.Pioneer autonomous lab workflows by developing agents that can design complex biological systems (like protein-based logic gates) and orchestrate their validation.Collaborate with scientists to understand research pain points and translate them into intelligent automation solutions.Publish and evangelise breakthrough applications of agentic workflows in synthetic biology through articles, blog posts, and scientific demonstrations.Apply:We offer strongly competitive compensation and benefits packages, including:Private health insurancePension/401(K) contributionsGenerous leave policies (including gender neutral parental leave)Hybrid workingTravel opportunities and moreWe also offer a stimulating work environment, and the opportunity to shape the future of synthetic biology through the application of breakthrough generative models.We welcome applicants from all backgrounds and we are committed to building a team that represents a variety of backgrounds, perspectives, and skills.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Data Engineer
Data Science & Analytics
Apply
July 8, 2025
Blender - AI Trainer
Ryz Labs
51-100
-
United States
Argentina
Contractor
Remote
true
Ryz Labs is seeking Blender - AI Trainer who are eager to collaborate with leading AI labs to push the boundaries of frontier AI models. You will work to train and evaluate these models to understand, replicate, and enhance the workflows developers use on these platforms.
Qualifications: - Have 2+ years of experience in working with Blender.- Possess deep expertise using and understanding Blender as a tool.- Demonstrate strong attention to detail.- Have excellent verbal and written communication skills.
About RYZ Labs:RYZ Labs is a startup studio built in 2021 by two lifelong entrepreneurs. The founders of RYZ have worked at some of the world's largest tech companies and some of the most iconic consumer brands. They have lived and worked in Argentina for many years and have decades of experience in Latam. What brought them together is the passion for the early phases of company creation and the idea of attracting the brightest talents in order to build industry-defining companies in a post-pandemic world.
Our teams are remote and distributed throughout the US and Latam. They use the latest cutting-edge technologies in cloud computing to create applications that are scalable and resilient. We aim to provide diverse product solutions for different industries, planning to build a large number of startups in the upcoming years.
At RYZ, you will find yourself working with autonomy and efficiency, owning every step of your development. We provide an environment of opportunities, learning, growth, expansion, and challenging projects. You will deepen your experience while sharing and learning from a team of great professionals and specialists.
Our values and what to expect:- Customer First Mentality - every decision we make should be made through the lens of the customer.- Bias for Action - urgency is critical, expect that the timeline to get something done is accelerated.- Ownership - step up if you see an opportunity to help, even if not your core responsibility. - Humility and Respect - be willing to learn, be vulnerable, and treat everyone who interacts with RYZ with respect.- Frugality - being frugal and cost-conscious helps us do more with less- Deliver Impact - get things done in the most efficient way. - Raise our Standards - always be looking to improve our processes, our team, and our expectations. The status quo is not good enough and never should be.
Machine Learning Engineer
Data Science & Analytics
Apply
July 8, 2025
Machine Learning Engineer
Notable
201-500
USD
0
175000
-
200000
United States
Full-time
Remote
true
Notable is the leading intelligent automation company for healthcare. Customers use Notable to drive patient acquisition, retention, and reimbursement, scaling growth without hiring more staff. We don’t just make software.We are on a mission to fix the broken U.S. healthcare system by helping to eliminate the massive administrative burden that is placed on our nation’s healthcare staff. We hire people from diverse backgrounds and are always looking for employees who bring fresh ideas to our space. Passion is paramount, and at Notable, you will get to work with other talented people who aim to set a new standard for innovation in healthcare.Role Summary:As an ML/AI engineer at Notable, you'll work on developing and deploying conversational AI models and agents across text, image and voice modalities, helping users automate tedious but valuable healthcare administrative tasks.What You’ll Do:Work with the product development team and product managers to define scope of work, timeline and product specificationsWork with backend engineers to deploy, maintain and scale AI modelsDefine interfaces between the microservices that runs and delivers AI modelsDiscover, collect, clean and transfer data to train AI modelsExperimentation of different AI models, methodologies, frameworks and communicate critical evaluation metrics to product teamsExplore, refine, improve best practices within the ML teamPush the boundaries of ML and AI and innovate on how to best leverage existing technologies to solve new problemsWhat We’re Looking For:7+ years of experience working in a relevant roleDemonstrated ability to translate business requirements and metrics into machine learning model specificationsQuickly prototype new models from open-sourced code and demonstrate resultsAbility to design and train new model architectures for complex dataExperience working with real-world data: large, messy, incomplete, irregular, etc.Experience working with a mix of structured and unstructured dataProficiency with Python and the standard ML stack (numpy, pandas, scikit-learn)Experience with a deep learning package, e.g. Tensorflow, PyTorchExperience deploying ML models in productionBeware of job scam fraudsters! Our recruiters use @notablehealth.com email addresses exclusively. We do not conduct interviews via text or instant message and we do not ask candidates to download software other than Zoom, to purchase equipment through us, or to provide sensitive personally identifiable information such as bank account or social security numbers. If you have been contacted by someone claiming to be me from a different domain about a job offer, please report it as potential job fraud to law enforcement and contact us here.
Machine Learning Engineer
Data Science & Analytics
Apply
July 7, 2025
Forward Deployed AI Engineer
webAI
101-200
-
United States
Full-time
Remote
false
About Us:webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. We recognize the evolving demands of a data-driven society for scalability and flexibility, and we firmly believe that the future of AI lies in distributed processing at the edge, bringing computation closer to the source of data generation. Our mission is to build a future where a company's valuable data and intellectual property remain entirely private, enabling the deployment of large-scale AI models directly on standard consumer hardware without compromising the information embedded within those models. We are developing an end-to-end platform that is secure, scalable, and fully under the control of our users, empowering enterprises with AI that understands their unique business. We are a team driven by truth, ownership, tenacity, and humility, and we seek individuals who resonate with these core values and are passionate about shaping the next generation of AI.About the Role: As a Forward Deployed AI Engineer at webAI, you will be on the front lines of integrating cutting-edge AI solutions into real-world enterprise environments. You will work directly with customers to understand their needs, deploy AI systems into their infrastructure, and ensure high performance and reliability at scale. You’ll help bridge the gap between research, engineering, and operations—playing a pivotal role in building the future of edge AI.This role will require a minimum of 25% travel, including occasional on-site work at customer locations to support deployment, troubleshooting, and integration of our platform within enterprise environments.Key Responsibilities:Collaborate closely with customers to scope, deploy, and maintain AI solutions in production environments.Debug and optimize data pipelines and AI systems running on customer networks.Translate complex, often ambiguous customer requirements into well-scoped technical solutions.Work across the stack—from model inference on consumer hardware to infrastructure automation.Read hardware schematics/logs to identify performance bottlenecks and suggest improvements.Serve as a trusted technical advisor to enterprise clients, representing the engineering team externally.Contribute feedback and insight to internal teams to continuously improve product robustness and usability.Required Skills & Qualifications:5+ years of combined experience in software engineering and machine learning.Proven track record of deploying and maintaining machine learning/AI systems in production.Strong expertise in MLX and/or PyTorch.Experience debugging complex systems involving data pipelines, model inference, and hardware interaction.Exceptional communication skills; able to challenge vague requirements and turn them into actionable plans.Comfortable working in dynamic environments with high customer exposure.Preferred Qualifications:Experience deploying models on edge devices or consumer hardware.Familiarity with distributed systems, DevOps tooling, and performance tuning.Prior experience in a customer-facing engineering role.Knowledge of privacy-preserving AI and secure compute environments.Master’s degree in a relevant technical discipline.
We at webAI are committed to living out the core values we have put in place as the foundation on which we operate as a team. We seek individuals who exemplify the following:Truth - Emphasizing transparency and honesty in every interaction and decision.Ownership - Taking full responsibility for one’s actions and decisions, demonstrating commitment to the success of our clients. Tenacity - Persisting in the face of challenges and setbacks, continually striving for excellence and improvement.Humility - Maintaining a respectful and learning-oriented mindset, acknowledging the strengths and contributions of others.Benefits:Competitive salary and performance-based incentives.Comprehensive health, dental, and vision benefits package.401k Match$200/mos Health and Wellness Stipend$400/year Continuing Education CreditFree parking, for in-office employeesUnlimited Approved PTOParental, Bereavement LeaveSupplemental Life Insurance
webAI is an Equal Opportunity Employer and does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We adhere to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, it is the policy of webAI to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
July 7, 2025
No job found
There is no job in this category at the moment. Please try again later