Skip to content
OpenTrain AI

Data Annotators for RL Environments

Join an entry-level, remote contractor role labeling text for RL environments: perform in-app actions and search for specific information. $11/hr, 20+ hours/week, worldwide — use internal tooling to produce question-answering labels.

OpenTrain AI

Generative Ai Rlhf

100% Remote Hourly · $11/hr

$11/hr

Compensation

Worldwide

Eligibility

Entry

Experience

Jul 22, 2025

Posted

Open worldwide

About OpenTrain

OpenTrain is the #1 platform for finding and building careers in AI training and data labeling. Creating an OpenTrain account is free, and the platform connects contributors with real projects where they can grow skills and income in a fast-growing industry.

About AI training work

AI training (also called data labeling or human feedback work) is the human side of building modern AI systems. Contributors create and review examples — here that means text-based labels — that help models learn how to act and respond.

This kind of work is often remote and flexible, accessible to entry-level contributors, and gives you a direct role in shaping state-of-the-art systems.

  • 100% remote — work from anywhere with an internet connection.
  • Flexible, part-time opportunities that can fit around other commitments.
  • Accessible to entry-level contributors; domain expertise can increase pay on specialized projects.

The role

We are hiring entry-level data annotators for RL environments to produce text labels used in reinforcement learning and model evaluation. This is a contractor, part-time role that requires 20+ hours per week.

Compensation is USD $11 per hour (paid per hour). Work is worldwide and done in our internal proprietary tooling. The primary label type for this project is question-answering applied to text tasks.

  • Employment type: Contractor, Part-time.
  • Time commitment: 20+ hours per week.
  • Pay: $11 USD per hour (pay-per-hour).
  • Tooling: Internal proprietary annotation software.
  • Data type: Text; Label type: Question answering.

What you'll do

You will follow task instructions to perform simple in-app actions or locate specific information inside an application, then record the results as structured question-answering labels.

Tasks are concrete and example-driven; they focus on correctness and completeness rather than creative writing or domain expertise.

  • Perform actions in the app (for example: create, update, or delete an invoice).
  • Find information in the app (for example: locate the invoice number for customer X).
  • Record answers and follow question-answering prompts in the annotation tool.
  • Maintain high accuracy and attention to detail for each labeled item.

Requirements

This is an entry-level position but requires strong attention to detail and the ability to follow precise instructions. You must be available for at least 20 hours per week and able to work remotely from anywhere in the world.

All work is completed in internal proprietary tooling, so you should be comfortable learning and using web-based annotation interfaces.

  • Experience level: Entry level.
  • Availability: 20+ hours per week.
  • Must demonstrate high attention to detail (explicit project requirement).
  • Work location: Worldwide / remote.
  • Comfortable working with text-based tasks and question-answering labels.

How it works and how to apply

Create a free OpenTrain account, build your profile, and apply to this project. OpenTrain connects you to the project and the project team will contact selected contributors with onboarding steps and access to the internal tooling.

As a contractor you will be assigned tasks, complete them in the annotation interface, and be paid per hour at the stated rate.

  • Step 1: Sign up for a free OpenTrain account and complete your profile.
  • Step 2: Apply to this project and indicate your weekly availability (20+ hrs).
  • Step 3: If selected, follow the onboarding instructions and begin labeling in the internal tool.