AI Red Team Engineer — LLM Security & Pentesting (C1 English)
OpenTrain AI · Remote · Worldwide · Posted Jun 9, 2026
About OpenTrain
OpenTrain is a central job board for data-labeling and AI-training work. We aggregate roles from many AI companies and labeling platforms so people can discover remote, flexible projects in one place.
Creating an OpenTrain account is free, and applying takes only a few minutes. We connect qualified contributors with meaningful work that helps shape how AI systems behave.
About AI Training & Red Teaming
AI training—also called data labeling, annotation, or human feedback work—covers tasks where people prepare, test, and review examples that modern models learn from. Red teaming and adversarial evaluation focus on finding and mitigating model weaknesses so systems are safer and more reliable.
This role sits at the intersection of cybersecurity and model evaluation: you will probe LLM attack surfaces, craft adversarial prompts, and help build repeatable test suites that measure real-world risks.
The Role
We are hiring multiple AI Red Team Engineers to design and execute adversarial evaluations of LLMs, agents, and RAG pipelines in a remote, part-time contract capacity. Work is less than 20 hours per week and paid at $40 USD per hour.
You will follow detailed guidelines and strict ethical and safety standards while collaborating with other engineers and evaluators to produce reproducible findings and actionable mitigations.
- Commitment: Less than 20 hours/week
- Pay: $40 USD per hour (PAY_PER_HOUR)
- Employment type: Contractor, Part-time
- Labeling type: RED_TEAMING (TEXT); labeling software: OTHER
- Experience level: Intermediate
What You’ll Do
Design, run, and automate adversarial attacks against LLMs, agents, and retrieval-augmented generation (RAG) pipelines. Create test suites that stress prompt injection, jailbreaks, data exfiltration, function-calling and tool use.
Document findings with reproducible steps, risk ratings, concise reports, and suggested mitigations. Build small scripts and utilities to scale testing and integrate with CI/CD where appropriate.
- Craft and automate attack prompts and payloads for LLMs and agents
- Build test suites and scoring rubrics; grade model behaviors consistently
- Probe function-calling/tool use and RAG retrieval paths for leakage
- Produce reproducible documentation with risk ratings and mitigation suggestions
- Contribute scripts/utilities to scale and automate red-team workflows
Requirements & Qualifications
You must have a Bachelor’s or Master’s in Computer Science, Software Engineering, Cybersecurity, Digital Forensics, or a related field, and hands-on penetration testing experience across web, API, network, and infrastructure.
This project requires strong scripting and automation skills and domain knowledge specific to LLM security and offensive techniques.
- Degree: Bachelor’s or Master’s in a relevant technical field
- Pentesting: Practical experience with web, API, network, and infrastructure testing
- Scripting: Proficiency in Python plus Bash or PowerShell for automation
- Containers/CI: Experience with containerization and CI/CD security (e.g., Docker) and secure SDLC practices
- LLM security: Familiarity with prompt injection, jailbreaks, data leakage, and the OWASP Top 10 for LLMs
- Frameworks: Experience with red-teaming/eval frameworks such as garak or PyRIT
- Reverse engineering & OS security: Offensive exploitation and reverse engineering (e.g., Ghidra) and OS-level experience (Linux privilege escalation, Windows internals)
- Communication: Ability to write clear rubrics, adversarial prompts, and concise reports in advanced (C1) English
- Assessment: Availability to complete a HackerRank + platform assessment immediately after screening
Who Should Apply / Nice-To-Haves
This role is aimed at security professionals with hands-on pentesting and automation skills who want to focus on LLM and agent security. If you enjoy building reproducible tests and writing clear, concise findings, you’ll fit well here.
Preferred but not strictly required: prior experience at leading AI or security organizations or shipping LLM security work; these are listed as pluses.
- Ideal background: cybersecurity engineers, red teamers, security researchers with scripting skills
- Plus: prior LLM security or AI-security product experience at notable organizations
How It Works & Eligibility
Screening includes a HackerRank test plus a platform assessment; you must be available to complete these immediately after initial screening. Work is remote and contract-based with flexible hours under 20 hours/week.
Candidates must follow ethical testing rules and project-specific guidelines; you will not perform unauthorized testing or violate safety protocols.
- Start process: screening → HackerRank + platform assessment → contract onboarding
- Work model: remote, part-time contractor; follow provided testing guidelines and safety rules
Restricted Locations
The following locations are ineligible for this role: Iran, Cuba, North Korea, Syria, Sudan, Venezuela, Myanmar; Switzerland; China, Taiwan; Kenya; Armenia, Israel, Kazakhstan, UAE, Netherlands, Serbia, Kyrgyzstan, Turkey, Uzbekistan, Belarus, Russia, Ukraine, Abkhazia, South Ossetia; United States (restricted states): Alaska, Arkansas, California, Connecticut, Delaware, Georgia, Hawaii, Illinois, Indiana, Kansas, Louisiana, Maine, Maryland, Massachusetts, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, Ohio, Oregon, Tennessee, Utah, Vermont, Washington, West Virginia; and the following territories/regions: Antarctica, Aruba, Åland Islands, Saint Barthélemy, Bonaire, Sint Eustatius and Saba, Bouvet Island, Cocos (Keeling) Islands, Democratic Republic of the Congo, Cook Islands, Christmas Island, Western Sahara, Falkland Islands (Malvinas), French Guiana, Guadeloupe, South Georgia and the South Sandwich Islands, Heard Island and McDonald Islands, British Indian Ocean Territory, Northern Mariana Islands, Martinique, New Caledonia, Norfolk Island, Niue, French Polynesia, Saint Pierre and Miquelon, Pitcairn, Réunion, Saint Helena, Ascension and Tristan da Cunha, Svalbard and Jan Mayen, Sint Maarten (Dutch part), French Southern Territories, Tokelau, United States Minor Outlying Islands, Holy See, Virgin Islands (British), Wallis and Futuna, Mayotte.