Freelance Software Developer – AI Trainer / QA (Python, JavaScript, TypeScript, Rust, Ruby) [AS-L]

OpenTrain AI · Remote · Worldwide · Posted Jun 9, 2026

About OpenTrain

OpenTrain aggregates data-labeling and AI-training opportunities from across the industry so contributors can find work in one place. Creating an OpenTrain account is free and applying takes only a few minutes.

This listing is for a freelance, contractor position in OpenTrain’s network focused on code-quality evaluation and AI-training tasks for code generation models.

About AI Training Work

AI training (a.k.a. data labeling or human feedback) is the human side of building machine learning systems: people prepare, review, and score examples that teach models to produce accurate, safe, and useful outputs.

In this project you’ll apply your software development and QA experience to review model-generated code, craft evaluation prompts, and provide structured feedback that helps improve large language models used for code generation and QA automation.

The Role

You will be paired with one primary language—Python, JavaScript/TypeScript, Rust, or Ruby—and focus on reviewing model-generated code, annotating results, and writing evaluation prompts.

This is a remote contractor, part-time role requiring 20+ hours per week. Pay is $15 USD per hour. The work is worldwide-remote and performed via the project’s labeling/review tools.

Position type: Contractor / Part-time
Time: 20+ hours/week
Pay: $15 USD per hour
Location: Remote (worldwide)

What You’ll Do

Work on short, focused tasks that require reading, executing mentally or locally, and evaluating code samples produced by models. Provide clear, structured feedback that helps model trainers understand correctness, bugs, and style issues.

Review model-generated code for correctness, bugs, logic errors, and style violations.
Annotate code with structured labels and explanations following detailed guidelines.
Create and refine evaluation prompts and test cases to probe model behavior.
Report reproducible issues and suggest corrections or improvements.
Switch between tasks quickly while following complex project instructions.

Requirements

All candidates must meet the core criteria below. You will be asked to state which primary language you are applying for—Python, JavaScript/TypeScript, Rust, or Ruby—and your responses and examples should align to that language.

Bachelor’s + in Computer Science or Software Engineering (or equivalent).
≥ 3 years professional development experience in the stated language.
Demonstrated code-review experience and familiarity with CI/CD workflows.
Advanced written English for clear, structured feedback.
Comfortable with rapid task-switching and following detailed guidelines.

Language-Specific Must-Haves

You must apply for one language and meet that language’s specific technical expectations.

Python: Fluency with pytest or unittest and automation in CI (GitHub Actions, Jenkins). Experience critiquing AI-generated Python or integrating LLM tools.
JavaScript / TypeScript: Hands-on with Jest, Mocha, Cypress, or Playwright; browser automation and bug-tracking proficiency.
Rust: Strong grasp of cargo test, quickcheck (property-based), Clippy, rustfmt; module/crate management and Rust tooling expertise.
Ruby: Experience with RSpec, Minitest, Capybara and automation pipelines; prior AI/ML or LLM-assisted QA in Ruby projects is desirable.

Bonus Signals

These are not required but will strengthen your candidacy.

Hackathon or competitive-coding participation.
Previous work on AI training, LLM fine-tuning, or code-evaluation projects.

How It Works / Interviewer Checklist

Follow this checklist when screening applicants and during your own application submission so reviewers can match you correctly.

Begin by asking: “Which primary language are you applying for—Python, JavaScript/TypeScript, Rust, or Ruby?”
Confirm language focus and request concrete examples of 3+ years’ work in that language.
Verify Bachelor’s + in CS/SE (or equivalent) and ask for code-review experience and CI/CD familiarity.
Assess advanced written English by asking for a short written critique of a code snippet or a past review.
Check for language-specific tooling experience (see Language-Specific Must-Haves).

Who Should Apply

Experienced developers who enjoy code review, QA, and improving model behavior should apply. Ideal applicants have solid testing and CI experience in one of the four languages and can produce clear, actionable written feedback.

If you meet the core criteria and can commit 20+ hours/week, specify your primary language in your application and include examples that demonstrate your experience.