Skip to content
OpenTrain AI

API Integration Developer for AI Evaluation

Join OpenTrain to design prompts and evaluate LLM-generated API integration plans, payloads, and workflows for SaaS systems. Contractor, remote, 20+ hrs/week; pay up to $45/hr — apply with an English CV including your proficiency level, email, and phone.

OpenTrain AI

Generative Ai Rlhf

100% Remote Hourly · $15–$45/hr

$15–$45/hr

Compensation

Worldwide

Eligibility

Intermediate

Experience

Mar 29, 2026

Posted

Open worldwide

About OpenTrain

OpenTrain is the #1 platform for finding and building careers in AI training and data labeling. The platform helps people start and grow careers teaching AI by discovering projects, building a profile, and applying quickly — creating an OpenTrain account is free.

We connect experienced contributors with projects that shape how AI systems behave. This role is posted by OpenTrain AI and focuses on API integrations, interoperability scenarios, and LLM evaluation.

About AI training and why this work matters

AI training (also called data labeling, annotation, or human feedback work) is the human side of building modern models: people create examples, evaluate outputs, and shape model behavior. Contributors work on tasks like evaluating model responses, writing prompts, and reviewing technical outputs.

This type of work is often 100% remote and flexible, making it a strong fit for people who want part-time technical work that directly affects cutting-edge AI systems.

The role

We are seeking an Integration Developer (API Specialist) to generate prompts that challenge models to design integrations between business systems, and to evaluate AI-generated integration plans, payloads, and workflows. Typical domains include integrations between CRMs, ad platforms, email tools, and databases such as Airtable, Google Sheets, and Notion using REST APIs and webhooks.

This is a contractor, part-time role requiring 20+ hours per week. Work is remote and focused on text-based evaluation and annotation tasks (data type: TEXT) across evaluation, fine-tuning, RLHF, code/calling tasks, and programming-related review.

  • Employment: Contractor, Part-time
  • Time: 20+ hours/week
  • Pay: up to $45/hour (hourly range listed $15–$45) in USD
  • Label types: EVALUATION_RATING, FINE_TUNING, RLHF, COMPUTER_PROGRAMMING_CODING, FUNCTION_CALLING
  • Labeling software: OTHER

What you'll do

You will design prompts and evaluation tasks that ask models to produce integration architectures, API payloads, webhook flows, error-handling strategies, and data mappings. You will review AI outputs against rubrics, score solutions, and provide corrective annotations.

Work examples include reviewing model-generated request/response examples, validating schema mappings, checking webhook retry and error handling, comparing batch vs. real-time approaches, and suggesting improvements or failure mitigations.

  • Create and refine prompts that test API integration and interoperability scenarios
  • Evaluate LLM-generated integration plans, sample payloads, data mappings, and workflow diagrams
  • Annotate and rate outputs using rubrics and provide corrective feedback for fine-tuning
  • Identify and document API failure modes and propose troubleshooting steps
  • Work with examples involving REST APIs, webhooks, SaaS tools (Airtable, Google Sheets, Notion), CRMs, ad platforms, and email systems

Requirements

You must be comfortable reading technical API documentation and have hands-on experience building integrations with REST APIs and webhooks. This role expects an intermediate level of experience connecting SaaS tools and business systems, plus practical skills in data mapping, schema validation, and designing data pipelines.

Candidates should be able to work independently, communicate clearly in English at B2 level or higher, and submit a CV in English that shows your English proficiency and includes email and phone contact details.

  • Proven experience with REST APIs and webhooks integration
  • Ability to interpret technical API documentation and translate specs into integration steps
  • Hands-on experience with data mapping, schema validation, and pipeline design
  • Knowledge of real-time vs. batch processing tradeoffs
  • Skilled at troubleshooting API failures and data issues
  • Experience evaluating LLM outputs for legal reasoning quality
  • Hands-on text annotation, rubric-based QA, or evaluation experience
  • English proficiency: B2 or higher; CV must be in English and include email and phone

Who should apply

Apply if you have practical integration experience (connecting CRMs, databases, ad/email platforms) and enjoy evaluating technical content and LLM outputs. This role suits people with a mix of software-integration experience and annotation or QA experience who want remote, flexible, part-time work.

Candidates with demonstrated examples of complex system integrations and experience interpreting API docs will be prioritized.

Restricted locations for acquisition

Due to acquisition restrictions, applicants located in the places listed below cannot be accepted for this role. Please do not apply from these locations.

  • Countries: Iran, Cuba, North Korea, Syria, Sudan, Venezuela, Myanmar, Russia, Belarus, Palestine
  • Also excluded: Switzerland; China, Taiwan; Kenya
  • U.S. states excluded: Alaska, Arkansas, California, Connecticut, Delaware, Georgia, Hawaii, Illinois, Indiana, Kansas, Louisiana, Maine, Maryland, Massachusetts, Nebraska, Nevada, New Hampshire, New Jersey, New Mexico, Ohio, Oregon, Tennessee, Utah, Vermont, Washington, West Virginia
  • Territories and other places: Antarctica, Aruba, Åland Islands, Saint Barthélemy, Bonaire, Sint Eustatius and Saba, Bouvet Island, Cocos (Keeling) Islands, Democratic Republic of the Congo, Cook Islands, Christmas Island, Western Sahara, Falkland Islands (Malvinas), French Guiana, Guadeloupe, South

How to apply

To apply, submit your CV in English and include your English proficiency level, an email address, and a phone number. Provide concrete examples of relevant API integrations or links to documentation, projects, or descriptions of systems you’ve connected.

Applications are reviewed for technical fit and annotation/QC experience. OpenTrain connects contributors to projects that match their skills — creating an account is free and speeds future applications.

  • Submit CV in English and state your English proficiency level
  • Include email and phone number on your CV
  • List examples of integrations, APIs worked with, and any annotation or rubric-based evaluation experience