AI Marketing & Sales Jobs
Latest roles in AI Marketing & Sales, reviewed by real humans for quality and clarity.
People also search for:
All Jobs
Showing 61 – 79 of 79 jobs
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
44
Saudi Arabia
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $44/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
DevOps Engineer
Data Science & Analytics
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
60
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
22
South Africa
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $22/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
53
United Kingdom
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $53/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
DevOps Engineer
Data Science & Analytics
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
19
Brazil
Contractor
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $19/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
32
Poland
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $32/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
56
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $56/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
DevOps Engineer
Data Science & Analytics
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
65
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $65/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
DevOps Engineer
Data Science & Analytics
Machine Learning Engineer
Data Science & Analytics
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
35
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $35/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
52
Denmark
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $52/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
17
Philippines
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $17/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
60
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
32
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $32/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
No items found.
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
19
India
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $19/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
65
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $65/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
DevOps Engineer
Data Science & Analytics
Machine Learning Engineer
Data Science & Analytics
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
52
France
Contractor
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $52/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Apply
December 3, 2025
Freelance AI/ML Penetration Tester
Mindrift
1001-5000
USD
0
0
-
53
United Kingdom
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $53/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Software Engineer
Software Engineering
DevOps Engineer
Data Science & Analytics
Apply
December 3, 2025
AI Agent Evaluation Analyst (Freelance)
Mindrift
1001-5000
USD
0
0
-
17
India
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity.
About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings.
We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $17/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Apply
December 3, 2025
Freelance AI Red Team Engineer
Mindrift
1001-5000
USD
0
0
-
23
Mexico
Contractor
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $23/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
DevOps Engineer
Data Science & Analytics
Apply
December 3, 2025
AI Agent Deployment Engineering Manager
Magical
51-100
0
0
-
0
Canada
Full-time
Remote
false
AI Agent Deployment Engineering ManagerReports to: Head of Deployment / ProductMagical’s agentic AI platform brings AI Agents into the workplace to take over repetitive, soul-crushing workflows that slow teams down. Our customers, especially in healthcare, use Magical to automate high-stakes, high-complexity processes.The shift to agentic work is accelerating, and we’re leading it. Our AI Deployment Engineering team is the engine behind that shift. As we scale to a team of 30 people, we’re hiring a hands-on, detail-obsessed leader who can build both the people and the systems required to deliver world-class deployments in high-stakes customer environments.We’re backed by the investors behind OpenAI, Anthropic, Hugging Face, and Notion, including Greylock, Bain Capital, Coatue, Altman Capital, and Lightspeed.
About the RoleYou’ll lead and scale Magical’s AI Deployment Engineering team — the technical, customer-facing group responsible for implementing our AI Agents into customer environments.You'll manage and dive into the details every day:Reviewing workflow logic and agentic designsTroubleshooting issues directlyProject-managing customer launchesRecruiting, onboarding, and developing new AI Agent EngineersCreating systems, templates, and processes that make the team faster and more predictableYou will own both delivery and quality, ensuring customers go live quickly, reliably, and with an experience that feels unmistakably Magical.
What You’ll DoScale & Lead the TeamGrow the AI Agent Engineering team from 8 → ~25 by EOY 2026.Recruit, hire, and onboard top-tier talent with engineering, QA, or test backgrounds.Build a culture of ownership, precision, and customer obsession.Establish performance management, leveling, and clear expectations.Drive World-Class DeliveryOversee the design, build, testing, and launch of agentic workflows across healthcare and payer/provider customers.Stay hands-on with builds early on to set the quality bar and expectations.Ensure deployments meet reliability, latency, and quality standards.Operational ExcellenceBuild rigorous systems for project management, QA, review cycles, and documentation.Create reusable components, templates, and tooling that dramatically speed up future builds.Reduce deployment timelines from weeks to days through process improvement and standardization.Customer & Stakeholder ManagementAct as the senior escalation point for complex customer issues.Interface directly with customer engineering teams, PMs, and operational leaders.Set expectations clearly, communicate proactively, and manage timelines confidently.Feed real-world deployment learnings into Product and Engineering to improve the platform.Raise the Quality BarImplement detailed workflow and logic reviews with consistency.Drive best practices in agentic design, testing, and performance tuning.Ensure every deployment is robust, maintainable, and production-grade.
About YouMust-Have Leadership TraitsYou’ve managed 20+ technical ICs before (pods, squads, or distributed groups).Obsessively detail-oriented — nothing escapes your field of view.Hands-on operator who works in the weeds, not above them.Obsessive commitment to quality in logic, workflows, and customer experience.Customer-obsessed, especially in high-pressure or healthcare environments.Exceptional project manager who can juggle many active deployments simultaneously.Clear, direct communicator with customers and internal teams.Able to recruit, hire, and build a high-performing AI Agent Engineering team.Experience delivering technical solutions to customers, ideally in environments requiring precision and reliability.Background in engineering, QA, or testing — strong technical underpinning is mandatory.Bonus ExperienceHealthcare deployments (EHRs, payer systems, portals)Multi-agent orchestration or LLM/automation workQA automation or test engineering foundationsStartup or 0→1 team-building experienceConsulting or forward-deployed engineering roles
What We OfferThe opportunity to build and scale one of the world’s first Forward Deployed AI Engineering teams focused on HealthcareA chance to shape how AI Agents are deployed into mission-critical, real-world systemsCompetitive salary + meaningful equityUnlimited PTOEpic offsites (Iceland, Lisbon, Cancun, Costa Rica)Cutting-edge AI tools and infrastructure
No items found.
Apply
December 3, 2025
No job found
Your search did not match any job. Please try again
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.