⚠️ Sorry, this job is no longer available.

Find AI Work That Works for You

Latest roles in AI and machine learning, reviewed by real humans for quality and clarity.

Edit filters

New AI Opportunities

Showing 6179  of 79 jobs
Tag
BJAK.jpg

Lead AI Engineer

Bjak
-
SG.svg
Singapore
Full-time
Remote
true
Shape AI That Powers the Future of Financial AccessAt BJAK, we use AI to solve meaningful, high-impact problems — from fraud detection and risk modeling to hyper-personalized financial experiences that make insurance and financial services more accessible. We’re looking for a Lead AI Engineer to play a pivotal role in building, scaling, and leading the next generation of intelligent systems.This role blends deep technical expertise with hands-on leadership. You’ll write production code, shape strategy, mentor engineers, and help build a lean, world-class AI team.The position supports fully work model.Why This Role MattersYou’ll design and deploy core AI systems that support millions of usersYou’ll partner closely with the Head of AI to drive technical execution and engineering excellenceYou’ll mentor junior engineers and elevate team performanceYou’ll play a central role in technical hiring and scaling the AI organisationWhat You’ll DoDesign, build, and ship production-grade ML models for high-impact use casesLead model architecture decisions, data pipeline design, and deployment strategiesCollaborate across product, engineering, and data teams to drive AI initiatives end-to-endOwn the full ML lifecycle: data preparation, training, validation, deployment, monitoringReview code, guide technical projects, and mentor engineers through challengesPromote experimentation, continuous learning, and AI best practicesInterview and evaluate candidates as part of the technical hiring processStay ahead of emerging ML techniques and bring cutting-edge approaches into productionYou’ll Thrive Here If You…Lead from the front - hands-on with code while lifting the team around youThink like an owner - solving problems proactively, not waiting for instructionsOperate with speed and depth - making fast, sound decisions with clear trade-offsBring clarity - adding structure and direction even in fast-moving environmentsCare about impact - focusing on solutions that matter in the real worldEnjoy mentoring - helping others grow is part of your leadership DNAAdapt quickly - ambiguity and changing priorities don’t slow you downHave a builder’s mindset - prototype fast, iterate fast, improve fastCommit to continuous learning - across technical, product, and leadership skillsetsRequirementsBachelor’s or Master’s degree in Computer Science, Engineering, or a related fieldStrong proficiency in PythonOver 4 years of AI/ML engineering experience with real-world deploymentDeep expertise with ML frameworks (TensorFlow, PyTorch, Scikit-learn)Solid understanding of end-to-end ML workflows: data pipelines, training, validation, monitoringExperience working on applied AI problems (e.g., recommendation, fraud, risk, NLP, etc.)Proven track record of technical leadership and mentoringExcellent communication, collaboration, and problem-solving skillsComfortable working remotely with distributed teams across time zonesNice to HaveExperience with MLOps tools (MLflow, Airflow, Docker, GCP/AWS)Familiarity with responsible AI principles (fairness, interpretability, model governance)Startup or scale-up environment experienceExperience developing internal tools, reusable ML components, or AI platformsWhat You’ll GetCompetitive salary and performance bonusesRemote-first work culture, with a hybrid work option for Malaysia-based team membersHigh ownership and visibility - your work directly impacts regional-scale systemsDirect reporting line to the Head of AI with leadership exposureOpportunity to build and grow a high-performance AI engineering teamFast career growth and learning in a high-velocity environmentFlat, collaborative culture where ideas move fast and great work gets noticedAbout BJAKBJAK is a leading digital insurance platform serving millions of users with transparent and affordable financial protection. We simplify financial services using AI, automation, and smart APIs, and we’re building next-generation intelligent systems to make finance accessible, fast, and fair for everyone.From personalized pricing engines to smart automation and fraud detection, we focus on technology that solves real problems at scale.If you’re ready to build, lead, and scale AI systems that matter - in a company that moves fast and thinks big - we’d love to hear from you.
Machine Learning Engineer
Data Science & Analytics
Hidden link
BJAK.jpg

Senior AI/ML Software Engineer

Bjak
0
0
-
0
HK.svg
Hong Kong
Full-time
Remote
true
Build AI Systems That Make Finance Simpler, Smarter, and More InclusiveAt BJAK, we use AI to make insurance and financial services easier to access, understand, and afford for millions of users. As a Senior AI/ML Software Engineer, you’ll help build the intelligent systems that power this mission - from personalized recommendations and fraud detection to automation and search.This is a fully remote role, working closely with cross-functional teams across regions. You’ll join a fast-paced, flat engineering environment where your execution and ideas shape real-world outcomes every day.Why This Role MattersYour work will directly improve user experience, efficiency, and platform intelligenceYou’ll contribute to production-grade AI systems that support millions of usersYou’ll collaborate across product, data, and engineering to build scalable ML tools end-to-endYou’ll grow quickly in a lean, high-impact environmentWhat You’ll DoWork with product, data, and engineering teams to define ML goals and technical strategiesDesign, build, and deploy machine learning models for personalization, automation, and insightsManage the full ML lifecycle: data preprocessing, feature engineering, training, tuning, evaluation, deploymentBuild scalable ML infrastructure and deployment pipelinesIntegrate ML outputs into user-facing products and backend systemsStay up-to-date with AI/ML research trends and apply relevant innovationsContribute to debugging, testing, and optimization of production ML systemsYou’ll Thrive Here If You...Take full ownership and ensure the models you build drive real-world resultsAre a self-starter who can figure things out even with ambiguityMove with urgency — ship fast, iterate fasterOwn problems end-to-end, from messy data to deploymentBring a humble, collaborative, team-first attitudeStay calm in fast-changing, high-growth environmentsLearn obsessively and share openlyRequirementsBachelor’s degree in Computer Science, Data Science, Engineering, or any related technical fieldStrong proficiency in Python2–4 years of experience in machine learning or backend software developmentHands-on experience with ML frameworks (TensorFlow, PyTorch, Scikit-learn)Solid understanding of ML workflows: data cleaning, model development, tuning, evaluationFamiliarity with model deployment, API development, or real-world ML product integrationExperience with Jupyter, Colab, or cloud-based ML platformsStrong analytical, problem-solving, and communication skillsComfortable working remotely and collaborating across time zonesNice to HaveExperience with NLP, computer vision, or recommendation systemsFamiliarity with AWS, GCP, or AzureExposure to Docker, Git, or CI/CD pipelinesBackground in high-growth startups or agile product teamsWhat You’ll GetCompetitive salary and performance-based bonusesFlexible, fully remote work arrangementDirect impact - your work reaches millions of usersFlat, transparent team environmentOpportunities for rapid personal and technical growthCross-regional collaboration and exposureAbout BJAKBJAK is a leading digital insurance platform, helping millions of users access affordable and transparent financial protection. We simplify financial services through AI, automation, and smart APIs, and we're building the next generation of intelligent systems to make finance better, faster, and fairer for everyone.If you’re excited about building ML systems that solve real-world problems at scale — and thrive in a high-impact, mission-driven team - we’d love to hear from you.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Hidden link
BJAK.jpg

Senior AI/ML Software Engineer

Bjak
0
0
-
0
SG.svg
Singapore
Full-time
Remote
true
Build AI Systems That Make Finance Simpler, Smarter, and More InclusiveAt BJAK, we use AI to make insurance and financial services easier to access, understand, and afford for millions of users. As a Senior AI/ML Software Engineer, you’ll help build the intelligent systems that power this mission - from personalized recommendations and fraud detection to automation and search.This is a fully remote role, working closely with cross-functional teams across regions. You’ll join a fast-paced, flat engineering environment where your execution and ideas shape real-world outcomes every day.Why This Role MattersYour work will directly improve user experience, efficiency, and platform intelligenceYou’ll contribute to production-grade AI systems that support millions of usersYou’ll collaborate across product, data, and engineering to build scalable ML tools end-to-endYou’ll grow quickly in a lean, high-impact environmentWhat You’ll DoWork with product, data, and engineering teams to define ML goals and technical strategiesDesign, build, and deploy machine learning models for personalization, automation, and insightsManage the full ML lifecycle: data preprocessing, feature engineering, training, tuning, evaluation, deploymentBuild scalable ML infrastructure and deployment pipelinesIntegrate ML outputs into user-facing products and backend systemsStay up-to-date with AI/ML research trends and apply relevant innovationsContribute to debugging, testing, and optimization of production ML systemsYou’ll Thrive Here If You...Take full ownership and ensure the models you build drive real-world resultsAre a self-starter who can figure things out even with ambiguityMove with urgency — ship fast, iterate fasterOwn problems end-to-end, from messy data to deploymentBring a humble, collaborative, team-first attitudeStay calm in fast-changing, high-growth environmentsLearn obsessively and share openlyRequirementsBachelor’s degree in Computer Science, Data Science, Engineering, or any related technical fieldStrong proficiency in Python2–4 years of experience in machine learning or backend software developmentHands-on experience with ML frameworks (TensorFlow, PyTorch, Scikit-learn)Solid understanding of ML workflows: data cleaning, model development, tuning, evaluationFamiliarity with model deployment, API development, or real-world ML product integrationExperience with Jupyter, Colab, or cloud-based ML platformsStrong analytical, problem-solving, and communication skillsComfortable working remotely and collaborating across time zonesNice to HaveExperience with NLP, computer vision, or recommendation systemsFamiliarity with AWS, GCP, or AzureExposure to Docker, Git, or CI/CD pipelinesBackground in high-growth startups or agile product teamsWhat You’ll GetCompetitive salary and performance-based bonusesFlexible, fully remote work arrangementDirect impact - your work reaches millions of usersFlat, transparent team environmentOpportunities for rapid personal and technical growthCross-regional collaboration and exposureAbout BJAKBJAK is a leading digital insurance platform, helping millions of users access affordable and transparent financial protection. We simplify financial services through AI, automation, and smart APIs, and we're building the next generation of intelligent systems to make finance better, faster, and fairer for everyone.If you’re excited about building ML systems that solve real-world problems at scale — and thrive in a high-impact, mission-driven team - we’d love to hear from you.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Hidden link
BJAK.jpg

Founding Machine Learning Engineer

Bjak
-
CN.svg
China
Full-time
Remote
true
Transform language models into real-world, high-impact product experiences.A1 is a self-funded AI division backed by BJAK, operating in full stealth. We’re building a new global consumer AI application focused on an important but underexplored use case — something practical, meaningful, and far beyond the typical chatbot or productivity agent.As a Machine Learning Engineer, you will work at the core of our product: fine-tuning frontier-level models, shaping data pipelines, evaluating behavior, and bringing intelligent features into real production environments. You’ll operate with high autonomy within a small, deeply technical team.Why This Role MattersYou will turn state-of-the-art models into real product capabilities used by a global audience.You’ll define how our models behave — accuracy, safety, tone, and user experience.You’ll help build a new consumer AI product from zero, directly influencing its technical and product direction.Your work will ensure our models are not only powerful, but safe, reliable, and impactful at scale.What You’ll DoFine-tune and adapt models as needed while also designing new model architectures and training pipelines from scratchDesign and curate datasets, craft prompts, and experiment with alignment and safety strategies for high-quality outputs.Build evaluation pipelines for model performance, safety, and alignment, and innovate on new metrics for real-world impact.Ship models to production, optimise for latency and scale, and monitor drift or degradationCollaborate with product, engineering, and design to launch polished, user-facing AI featuresExplore new modeling approaches, optimisation methods, and product use casesWhat It’s Like to Work HereYou take ownership - you solve problems end-to-end rather than wait for perfect instructionsYou learn through action - prototype → test → iterate → shipYou’re calm in ambiguity - zero-to-one building energises youYou bias toward speed with discipline - V1 now > perfect laterYou see failures and feedback as essential to growthYou work with humility, curiosity, and a founder’s mindsetYou lift the bar for yourself and your teammates every dayRequirementsStrong experience with transformers, deep learning, and fine-tuning methods (LoRA/QLoRA, SFT, distillation)Proficiency in PyTorch (preferred) or TensorFlowExperience in prompt engineering and dataset creation for alignment, tone, trust, and safetyFamiliarity with evaluation methods: perplexity, toxicity, relevance, robustnessSolid software engineering fundamentals — algorithms, data structures, clean codeAbility to operate in a fast, lean, and high-ownership environmentNice to HaveExperience with text generation, moderation, ranking, or personalisationExposure to RLHF or reinforcement learning for LLMsContributions to open-source ML projectsBackground in consumer-facing AI products or high-growth startupsWhat You’ll GetHigh ownership and autonomy from day oneDirect involvement in technical direction and product decisionsRemote-first flexibilityA high-impact role with visibility across ML, engineering, and productCompetitive compensation and performance-based bonusesBacking of a profitable US$2B group, with the speed of a startupInsurance coverage, flexible time off, and global travel insuranceOpportunity to shape a new global AI product from zeroOur Team & CultureWe operate as a dense, senior, high-performance team. We value clarity, speed, craftsmanship, and relentless ownership. We behave like founders — we build, ship, iterate, and hold ourselves to a high technical bar.If you value excellence, enjoy building real systems, and want to be part of a small team creating something globally impactful, you’ll thrive here.About A1A1 is a self-funded, independent AI division under BJAK, focused on building a new consumer AI product with global relevance. We’re in full stealth and assembling a small, elite team of ML and engineering builders who want to work on meaningful, high-impact problems.
Machine Learning Engineer
Data Science & Analytics
Hidden link
FERMÀT.jpg

Software Engineer, New Grad

Fermat
-
US.svg
United States
Full-time
Remote
false
FERMÀT is the AI native commerce platform that optimizes shopping experiences, leading to best-in-class shopper engagement and conversion. We help brands transform clicks into conversions with dynamic, personalized shopping experiences—built and optimized in minutes.Backed by VMG, Bain Capital Ventures, Greylock, QED, and named The Information’s #1 commerce startup, we’re a 70+ person team based in SF, Austin, NYC, and Bangalore. As a fast-growing Series B company, we’re building the infrastructure for the future of online retail—and we’re just getting started.About the Role:We're looking for New Grad Software Engineers to join FERMÀT and help build the future of AI-native commerce. You will work on challenging problems across our platform—from building scalable systems that handle millions of shoppers to creating AI-powered features that personalize shopping experiences.This is a generalist role where you'll rotate across different parts of our stack and product areas. You'll work alongside experienced engineers who are invested in helping you grow from a new grad into a strong engineer, while taking on real ownership of features that directly impact our customers. What You'll Work On:You'll tackle a diverse range of problems across our platform:Full-Stack DevelopmentBuild features that span from frontend interfaces to backend servicesCreate intuitive user experiences for complex commerce workflowsDesign and implement APIs that power our platformWork with databases to model and query commerce data at scaleAI & PersonalizationBuild features that leverage LLMs to generate and optimize shopping experiencesImplement personalization systems that adapt to shopper behaviorWork on experimentation infrastructure to test and measure AI-driven optimizationsHelp train and fine-tune models for commerce-specific use casesInfrastructure & ScaleBuild systems that handle millions of requests reliablyOptimize performance for high-traffic commerce scenariosDesign data pipelines that process and analyze shopper behaviorWork on deployment and monitoring systemsYou'll spend your first few months ramping up on our stack and shipping smaller features, with increasing ownership and complexity as you grow. We believe in giving engineers real problems to solve from day one. What We're Looking For:Required:Bachelor's or Master's degree in Computer Science, Software Engineering, or related fieldStrong foundation in data structures, algorithms, and software designExperience building projects with at least one programming language (Go, Python, JavaScript, Java, C++, or similar)Familiarity with web development (frontend or backend)Ability to write clean, well-tested codeStrong problem-solving skills and curiosity about how things workExcellent communication and collaboration skills—you can explain technical concepts clearlyExcitement about learning quickly and working across the stackNice to Have:Internship experience at a tech company or startupProjects involving AI/ML, LLMs, or data systemsExperience with TypeScript, React, Go, or PythonContributions to open source or personal projects you're proud ofCoursework or projects in distributed systems, databases, or machine learningOur Tech StackGolang • Python • TypeScript • React • Next.js • PostgreSQL • Google Cloud • Various LLM APIsDon't worry if you haven't used all of these. We're looking for engineers who can learn quickly and adapt to new technologies. What a Typical Week Looks Like:You'll spend most of your time coding—building features, fixing bugs, and learning our systems. You'll participate in daily standups, collaborate with engineers across the team, and pair program with senior engineers who can help you grow. You'll ship code regularly and see your work go live to real customers quickly.Early on, you'll work on well-scoped features with guidance. As you ramp up, you'll take on more complex problems and start making architectural decisions. How You'll Grow:Technical SkillsDevelop full-stack engineering skills across frontend, backend, and infrastructureGain hands-on experience with AI systems in productionLearn to build and scale systems that serve millions of usersMaster software engineering best practices: testing, code review, system designMentorship & SupportDedicated onboarding plan to set you up for successRegular 1:1s with your manager and senior engineersCode & tech spec reviews focused on learning and growthQuarterly goal-setting to help you progress in your careerCareer DevelopmentExposure to different parts of the stack and productOpportunities to specialize in areas that interest you (AI, infrastructure, product engineering)Clear path from new grad to mid-level and senior engineerCulture that values learning and experimentationWhy Join FERMÀT as a New GradReal impact from day one: Your code will be used by major brands and millions of shoppersLearn from the best: Work with experienced engineers from companies like Stripe, Google, and top startupsCutting-edge technology: Work on AI-native products at the intersection of commerce and machine learningFast-paced growth: Series B startup where you'll wear different hats and learn rapidlyStrong engineering culture: We value code quality, thoughtful design, and continuous learningBenefitsCompetitive salary + equity packageComprehensive health, dental, and vision insurance for you and all your dependents.Retirement benefits:US: 401(k) plan with 4% matchingIndia: Provident Fund with 12% matching4 months of paid parental leaveUnlimited PTO policy (with minimum 5 days PTO / quarter!)
Software Engineer
Software Engineering
Hidden link
FERMÀT.jpg

Senior Product Designer

Fermat
0
0
-
0
US.svg
United States
Full-time
Remote
false
FERMÀT is the AI native commerce platform that optimizes shopping experiences, leading to best-in-class shopper engagement and conversion. We help brands transform clicks into conversions with dynamic, personalized shopping experiences—built and optimized in minutes.Backed by VMG, Bain Capital Ventures, Greylock, QED, and named The Information’s #1 commerce startup, we’re a 70+ person team based in SF, Austin, NYC, and Bangalore. As a fast-growing Series B company, we’re building the infrastructure for the future of online retail—and we’re just getting started.About the RoleWe’re seeking a Senior Product Designer to own major product areas end-to-end across FERMÀT’s growing suite of offerings. With only two designers covering four products, you’ll step directly into high ownership, shaping some of the most critical surfaces in the company—from our core funnel builder to emerging AI-powered workflows used by premier global brands.This is a uniquely impactful opportunity to influence a new category: agentic commerce systems. You’ll operate with significant autonomy, partner with leadership, and help define a design culture built on craft, velocity, and AI-native workflows.What You’ll DoLead design for 1–2 major product areas (e.g., funnel builder, AI search, DPP), owning experiences from concept to production.Design high-quality, intuitive UI and interaction patterns in Figma for brand-facing tools and AI-powered workflowsConduct user research, facilitate feedback loops with brands, and transform insights into production-ready designs.Collaborate with PMs (primarily Rhea, also Shreyas) and engineering to drive product strategy and execution.Contribute to the design direction for both brand-facing workflows and large-scale consumer surfaces.Champion AI design tooling and elevate design workflows across the team.Help shape our design system and contribute to the foundations of a fast-growing product suite.Who You Are4-7 years of experience in a product design, ecommerce experience preffered.Exceptional visual/UI design craft, with a strong sense of aesthetics, layout, and interaction.A strong public portfolio demonstrating end-to-end product thinking and complex problem solving, not just screensExpert-level proficiency in Figma (components, prototyping, design systems).Active user of AI tools, with curiosity about how AI can reshape design workflowss.Strong communicator with a track record of partnering closely with PMs and engineering.Experience owning product areas independently in prior roles.Startup/SaaS experience with comfort in ambiguity preferred.BenefitsCompetitive salary + equity packageComprehensive health, dental, and vision insurance for you and all your dependents.Retirement benefits:US: 401(k) plan with 4% matchingIndia: Provident Fund with 12% matching4 months of paid parental leaveUnlimited PTO policy (with minimum 5 days PTO / quarter!)
Product Designer
Creative & Design
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
48
CA.svg
Canada
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $48/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
No items found.
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
44
SG.svg
Singapore
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $44/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
MLOps / DevOps Engineer
Data Science & Analytics
Machine Learning Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
48
AU.svg
Australia
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $48/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
35
IT.svg
Italy
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $35/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI/ML Penetration Tester

Mindrift
USD
0
0
-
52
CA.svg
Canada
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $52/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Hidden link
Mindrift.jpg

Freelance AI/ML Penetration Tester

Mindrift
USD
0
0
-
56
ES.svg
Spain
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $56/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
-
65
US.svg
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $65/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
52
AU.svg
Australia
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $52/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
17
BR.svg
Brazil
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $17/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
60
US.svg
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
56
FR.svg
France
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $56/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
44
SA.svg
Saudi Arabia
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $44/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
35
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $35/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
24
ZA.svg
South Africa
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $24/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
No job found
Your search did not match any job. Please try again
Department
Clear
Category
Clear
Country
Clear
Job type
Clear
Remote
Clear
Only remote job
Company size
Clear
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.