⚠️ Sorry, this job is no longer available.

Find AI Work That Works for You

Latest roles in AI and machine learning, reviewed by real humans for quality and clarity.

Edit filters

New AI Opportunities

Showing 6179  of 79 jobs
Tag
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
-
52
GE.svg
Germany
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $52/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
22
ZA.svg
South Africa
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $22/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
53
GB.svg
United Kingdom
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $53/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
65
US.svg
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $65/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
19
BR.svg
Brazil
Contractor
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $19/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
32
PL.svg
Poland
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $32/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
52
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $52/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
35
PL.svg
Poland
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $35/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
56
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $56/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
41
SG.svg
Singapore
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $41/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
56
DK.svg
Denmark
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $56/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
56
GE.svg
Germany
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $56/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
35
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $35/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Machine Learning Engineer
Data Science & Analytics
MLOps / DevOps Engineer
Data Science & Analytics
Software Engineer
Software Engineering
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
65
US.svg
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $65/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
MLOps / DevOps Engineer
Data Science & Analytics
Machine Learning Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
52
DK.svg
Denmark
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $52/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
17
PH.svg
Philippines
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $17/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
Mindrift.jpg

Freelance AI Red Team Engineer

Mindrift
USD
0
0
-
35
ES.svg
Spain
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What we doThe Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.About the RoleGenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. Develop and implement automation scripts, custom tools, environments and test harnesses. Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. Advise on cybersecurity best practices and policy implications.How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.RequirementsYou hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields. Your level of English is advanced (C1) or above.Proficient in scripting and automation using Python, Bash, or PowerShell.Experienced with containerization and CI/CD security tools, especially Docker. Hands-on experience with penetration testing across web, API, network, and infrastructure environments. Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).Familiar with AI red-teaming frameworks such as garak or PyRIT. Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines. Proficient in offensive exploitation and exploit development.Skilled in reverse engineering using tools like Ghidra or equivalents. Expertise in network and application security, including web application security. Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals. Familiar with secure coding practices for full-stack development. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.BenefitsWhy this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $35/hour depending on your skills, experience, and project needs.Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.Work on advanced AI projects and gain valuable experience that enhances your portfolio.Influence how future AI models understand and communicate in your field of expertise.
MLOps / DevOps Engineer
Data Science & Analytics
Machine Learning Engineer
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
60
US.svg
United States
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $60/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
32
No items found.
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $32/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
No items found.
Hidden link
Mindrift.jpg

AI Agent Evaluation Analyst (Freelance)

Mindrift
USD
0
0
-
49
GB.svg
United Kingdom
Part-time
Remote
true
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we doThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe. Who we're looking for:We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?This is a flexible, project-based opportunity well-suited for:Analysts, researchers, or consultants with strong critical thinking skills.Students (senior undergrads / grad students) looking for an intellectually interesting gig.People open to a part-time and non-permanent opportunity. About the project:We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.What you’ll be doing:Reviewing evaluation tasks and scenarios for logic, completeness, and realism.Identifying inconsistencies, missing assumptions, or unclear decision points.Helping define clear expected behaviors (gold standards) for AI agents.Annotating cause-effect relationships, reasoning paths, and plausible alternatives.Thinking through complex systems and policies as a human would to ensure agents are tested properly.Working closely with QA, writers, or developers to suggest refinements or edge case coverage.How to get started:Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.RequirementsExcellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.Ability to assess scenarios holistically: What's missing, what’s unrealistic, what might break?Good communication and clear writing (in English) to document your findings. We also value applicants who have:Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.Exposure to LLMs, prompt engineering, or AI-generated content.Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).BenefitsGet paid for your expertise, with rates that can go up to $49/hour depending on your skills, experience, and project needs.Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.Participate in an advanced AI project and gain valuable experience to enhance your portfolio.Influence how future AI models understand and communicate in your field of expertise.
Data Analyst
Data Science & Analytics
Hidden link
No job found
Your search did not match any job. Please try again
Department
Clear
Category
Clear
Country
Clear
Job type
Clear
Remote
Clear
Only remote job
Company size
Clear
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.