AI Infrastructure Automation Engineer
OpenTrain AI · Remote · Worldwide · Posted Mar 29, 2026
About OpenTrain
OpenTrain collects and lists data-labeling and AI-training jobs from many companies and platforms so candidates can find this work in one place. Creating an OpenTrain account is free and applying takes only a few minutes.
We connect contractors with projects that need human expertise to train and evaluate AI — from transcription and translation to model feedback and evaluation like the role below.
About AI training work
AI models learn from human-provided examples and judgments. This field—often called data labeling, annotation, or human feedback—involves crafting prompts, rating model outputs, and defining evaluation rubrics so systems behave reliably and safely.
This role focuses on the intersection of model evaluation and infrastructure: your judgments and prompts will shape how AI handles deployment, scaling, monitoring, and failure modes in real-world systems.
The role
We are seeking an experienced engineer to help train and evaluate AI systems that generate infrastructure design and automation guidance. You will create prompts, define evaluation criteria, and review or improve AI-generated runbooks and troubleshooting steps.
Work is contractor, part-time, remote, 20+ hours per week. Employment types: CONTRACTOR and PART_TIME. Data type: TEXT. Labeling task type: EVALUATION_RATING. Labeling software: OTHER.
- Level: Intermediate.
- Pay: PAY_PER_HOUR, USD $15–$45/hr (typical up to $45/hr).
- Time requirement: 20+ hours/week; schedule flexible within contractor arrangement.
What you'll do
Day-to-day work centers on evaluating and improving AI outputs for infrastructure automation, plus building evaluation frameworks and prompts that produce reliable, actionable guidance.
- Generate and refine prompts targeting deployment, scaling, monitoring, and fault-tolerance strategies.
- Define and apply evaluation criteria and rubrics for uptime, monitoring quality, and fault-tolerance behaviors.
- Review, score, and edit AI-generated DevOps runbooks, automation scripts, and troubleshooting steps.
- Assess self-hosted automation platforms for reliability and performance under load; document findings in text-based evaluations.
- Contribute content and evaluation examples to AI tutoring systems that teach complex infrastructure tasks.
Requirements
You must meet all substantive requirements listed below. Provide a CV in English that states your English proficiency level and includes your email address and phone number.
- 2+ years of DevOps, infrastructure, or backend systems experience.
- Proven skill deploying and operating self-hosted environments.
- Hands-on experience with monitoring, uptime, and fault-tolerance practices.
- Hands-on text annotation, rubric-based evaluation, or QA experience (text-focused).
- Experience evaluating LLM outputs specifically for legal reasoning quality.
- Demonstrated English proficiency at B2 level or higher.
- CV must be in English and include your English level, an email address, and a phone number.
Who should apply
Apply if you are an infrastructure-minded engineer who enjoys translating operational knowledge into clear, testable evaluation criteria and written guidance for models.
This role suits people who can judge technical correctness under real-world constraints and who have prior experience annotating or rating text-based outputs.
- Ideal candidates: DevOps engineers, SREs, backend engineers, or infrastructure architects with annotation/QC experience.
- You should be comfortable writing and reviewing technical prose (runbooks, postmortems, monitoring checks).
Location, restrictions, and eligibility
This is a remote, worldwide role except for the restricted locations listed below. Applicants must confirm they are not located in excluded countries or territories.
The client cannot acquire services from certain countries, specified U.S. states, and listed territories; check the full list before applying.
- Restricted countries: Iran, Cuba, North Korea, Syria, Sudan, Venezuela, Myanmar, Russia, Belarus, Palestine.
- Restricted countries/regions also include: China, Taiwan, Switzerland, Kenya.
- Restricted U.S. states: Alaska, Arkansas, California, Connecticut, Delaware, Georgia, Hawaii, Illinois, Indiana, Kansas, Louisiana, Maine, Maryland, Massachusetts, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, Ohio, Oregon, Tennessee, Utah, Vermont, Washington, West Virginia.
- Additional excluded territories include Antarctica, Aruba, Åland Islands, Saint Barthélemy, Bonaire/Sint Eustatius and Saba, Bouvet Island, Cocos (Keeling) Islands, Democratic Republic of the Congo, Cook Islands, Christmas Island, Western Sahara, Falkland Islands (Malvinas), French Guiana, Guadeloup
How to apply
Create or sign in to your free OpenTrain account and submit your CV in English. Your CV must state your English proficiency (B2 or higher), and include an email address and phone number.
Applications are reviewed for required experience and annotation background; shortlisted candidates may be asked to complete evaluation or sample tasks.
- Include: CV in English, English level, email address, and phone number.
- Type of assignment: contractor, part-time; pay-per-hour (USD $15–$45/hr).
- Work will be text-based evaluation/rating of AI outputs (EVALUATION_RATING) using other labeling tools.