API Integration Developer for AI Evaluation

Join OpenTrain to design prompts and evaluate LLM-generated API integration plans, payloads, and workflows for SaaS systems. Contractor, remote, 20+ hrs/week; pay up to $45/hr. Apply with an English CV that states your English proficiency level, email, and phone.

Generative AI RLHF

100% Remote Hourly · $15–$45/hr

Compensation: $15–$45/hr
Eligibility: Worldwide
Experience: Intermediate
Posted: Mar 29, 2026

About OpenTrain

OpenTrain is the #1 platform for finding and building careers in AI training and data labeling. The platform helps people start and grow careers teaching AI by discovering projects, building a profile, and applying quickly — creating an OpenTrain account is free.

We connect experienced contributors with projects that shape how AI systems behave. This role is posted by OpenTrain AI and focuses on API integrations, interoperability scenarios, and LLM evaluation.

About AI training and why this work matters

AI training (also called data labeling, annotation, or human feedback work) is the human side of building modern models: people create examples, evaluate outputs, and shape model behavior. Contributors work on tasks like evaluating model responses, writing prompts, and reviewing technical outputs.

This type of work is often 100% remote and flexible, making it a strong fit for people who want part-time technical work that directly affects cutting-edge AI systems.

The role

We are seeking an Integration Developer (API Specialist) to generate prompts that challenge models to design integrations between business systems, and to evaluate AI-generated integration plans, payloads, and workflows. Typical domains include integrations between CRMs, ad platforms, email tools, and databases such as Airtable, Google Sheets, and Notion using REST APIs and webhooks.

This is a part-time contractor role requiring 20+ hours per week. Work is remote and focused on text-based evaluation and annotation tasks (data type: TEXT) across evaluation rating, fine-tuning, RLHF, function calling, and programming-related review.

Employment: Contractor, Part-time
Time: 20+ hours/week
Pay: $15–$45/hour (USD)
Label types: EVALUATION_RATING, FINE_TUNING, RLHF, COMPUTER_PROGRAMMING_CODING, FUNCTION_CALLING
Labeling software: OTHER

What you'll do

You will design prompts and evaluation tasks that ask models to produce integration architectures, API payloads, webhook flows, error-handling strategies, and data mappings. You will review AI outputs against rubrics, score solutions, and provide corrective annotations.

Work examples include reviewing model-generated request/response examples, validating schema mappings, checking webhook retry and error handling, comparing batch vs. real-time approaches, and suggesting improvements or failure mitigations.

Create and refine prompts that test API integration and interoperability scenarios
Evaluate LLM-generated integration plans, sample payloads, data mappings, and workflow diagrams
Annotate and rate outputs using rubrics and provide corrective feedback for fine-tuning
Identify and document API failure modes and propose troubleshooting steps
Work with examples involving REST APIs, webhooks, SaaS tools (Airtable, Google Sheets, Notion), CRMs, ad platforms, and email systems
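
As a rough illustration of the kind of checks described above, here is a minimal Python sketch, not an actual OpenTrain task or rubric. It assumes a hypothetical model-proposed field mapping from a CRM record to spreadsheet columns and a simple exponential backoff schedule for webhook retries. Every field name, function name, and value is made up for illustration.

```python
# Illustrative only: field names, the mapping, and the helpers are hypothetical,
# not taken from any real CRM, SaaS API, or OpenTrain rubric.

# Columns the (hypothetical) target sheet requires.
REQUIRED_TARGET_FIELDS = {"email", "full_name", "signup_date"}

# A mapping a model might propose: source (CRM) field -> target (sheet) column.
proposed_mapping = {
    "contact_email": "email",
    "name": "full_name",
}


def missing_target_fields(mapping, required):
    """Return the required target fields that the mapping never populates."""
    return sorted(required - set(mapping.values()))


def apply_mapping(record, mapping):
    """Rename source keys to target keys, dropping unmapped source fields."""
    return {
        target: record[source]
        for source, target in mapping.items()
        if source in record
    }


def backoff_delays(base_seconds, max_attempts):
    """Exponential backoff schedule for retries: base, 2*base, 4*base, ..."""
    return [base_seconds * (2 ** attempt) for attempt in range(max_attempts)]
```

A reviewer could use checks like these to flag that the proposed mapping never fills `signup_date`, or to confirm that a described retry policy actually backs off between attempts rather than retrying immediately.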

Requirements

You must be comfortable reading technical API documentation and have hands-on experience building integrations with REST APIs and webhooks. This role expects an intermediate level of experience connecting SaaS tools and business systems, plus practical skills in data mapping, schema validation, and designing data pipelines.

Candidates should be able to work independently and communicate clearly in English at B2 level or higher. The CV must be in English, state the candidate's English proficiency level, and include email and phone contact details.

Proven experience with REST APIs and webhooks integration
Ability to interpret technical API documentation and translate specs into integration steps
Hands-on experience with data mapping, schema validation, and pipeline design
Knowledge of real-time vs. batch processing tradeoffs
Skilled at troubleshooting API failures and data issues
Experience evaluating LLM outputs such as integration plans, payloads, and workflows
Hands-on text annotation, rubric-based QA, or evaluation experience
English proficiency: B2 or higher; CV must be in English and include email and phone

Who should apply

Apply if you have practical integration experience (connecting CRMs, databases, ad/email platforms) and enjoy evaluating technical content and LLM outputs. This role suits people with a mix of software-integration experience and annotation or QA experience who want remote, flexible, part-time work.

Candidates with demonstrated examples of complex system integrations and experience interpreting API docs will be prioritized.

Restricted locations for acquisition

Due to acquisition restrictions, applicants located in the places listed below cannot be accepted for this role. Please do not apply from these locations.

Countries: Iran, Cuba, North Korea, Syria, Sudan, Venezuela, Myanmar, Russia, Belarus, Palestine
Also excluded: Switzerland; China, Taiwan; Kenya
U.S. states excluded: Alaska, Arkansas, California, Connecticut, Delaware, Georgia, Hawaii, Illinois, Indiana, Kansas, Louisiana, Maine, Maryland, Massachusetts, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, Ohio, Oregon, Tennessee, Utah, Vermont, Washington, West Virginia
Territories and other places: Antarctica, Aruba, Åland Islands, Saint Barthélemy, Bonaire, Sint Eustatius and Saba, Bouvet Island, Cocos (Keeling) Islands, Democratic Republic of the Congo, Cook Islands, Christmas Island, Western Sahara, Falkland Islands (Malvinas), French Guiana, Guadeloupe, South

How to apply

To apply, submit your CV in English and include your English proficiency level, an email address, and a phone number. Provide concrete examples of relevant API integrations or links to documentation, projects, or descriptions of systems you’ve connected.

Applications are reviewed for technical fit and annotation/QC experience. OpenTrain connects contributors to projects that match their skills — creating an account is free and speeds future applications.

Submit CV in English and state your English proficiency level
Include email and phone number on your CV
List examples of integrations, APIs worked with, and any annotation or rubric-based evaluation experience