Agent Evals Specialist (Knowledge Graph Review)

Remote · USA Full-time New today

Big part of Prox is AI agents that process complex technical documents into structured knowledge. The agents are right most of the time. When they're wrong, we need you to catch it. You'll work inside a review platform we built. Each task shows you the source material, what the agent produced, and the steps it took to get there. You compare them and grade the agent's work. The position is full-time (40+ hrs a week) compensation is $5-10/hr What you do per task 1. Read the source and the agent's output side by side. Verify the content was captured accurately. 2. Review what the agent did. What it created, changed, or left out. 3. Score a short rubric covering accuracy, coverage, organization, and rule adherence. Full rubric provided at onboarding. 4. Write detailed feedback about the mistake. This is the most important thing you produce since we use it to improve the agent. 5. Submit. Move to the next task. Conditions

Subject matter shifts over time. You don't need prior knowledge of the subjects. You need to be able to compare two documents carefully and spot where they disagree.
Rate is fixed for the engagement. If it changes, it goes up, and we tell you before your next task.
Work product owned by Prox (work-for-hire).
Standard NDA at offer stage.

Requirements

Strong written English
Can read dense technical content for hours without losing focus
Consistent — your scoring on Monday matches your scoring on Friday
Clear, specific feedback:"section 4 dropped the key requirement from page 17", not"this is confusing"
Reliable on committed hours

Preferred

Prior work as an AI trainer, tutor, or evaluator (Outlier, DataAnnotation, xAI, Surge, Mercor, Invisible, Toloka, etc.)
Technical writing, editing, QA, translation, paralegal, or research-assistant background
Markdown familiarity

The challenge below is the interview. We don't do resume screens or vibe calls. Everyone who applies takes the same ~30 min prescreen challenge. You will receive your prescreen challenge link after submitting this job application. You will review a real agent output, score it and communicate the feedback. We read every submission. If your submission is sharp, you start on paid tasks the same week after a short interview. Good luck! Apply To This Job

Apply

Agent Evals Specialist (Knowledge Graph Review)

Requirements

Related roles

Project Manager - Signage

Sign Systems Designer

Platform Support Engineer (Remote - Australian Capital Cities)

Business Development Representative

Go-to-Market Engineer

VP, Engineering

Strategic Services Lead

Senior Metering Project Engineer- E&C

Onboarding & Training Coordinator

Technical Customer Support - Product Expert

Experienced Front End Customer Support Specialist – Delivering Exceptional Member Experiences in a Dynamic Remote Environment

Vice President, Regulatory Partnerships

Pure Mathematics Specialist – Freelance AI Trainer Project

Online Animation Assistant - Work From Home, Training Provided

End-Point Protection Engineer - 4

Payroll Assistant â€“ Remote Contract

Experienced Customer Service Representative – Work From Home in Nebraska

Experienced Customer Success Representative I – Life Science and Healthcare Support

Experienced Customer Support Representative – Delivering Exceptional Air Travel Experiences from the Comfort of Your Own Home

Part-Time Remote Customer Service Representative – Online Support & Client Success Specialist (Flexible Schedule Available)