Manufacturing Expert - Quality Evaluator

Remote · USA Full-time New today

• *About The Job

*Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

*Benchmark**

,

*General Catalyst**

,

*Peter Thiel**

,

*Adam D'Angelo**

,

*Larry Summers**

, and

*Jack Dorsey**

.

*Position:**

AI Model Evaluation Specialist

*Type:
*Contract
Compensation:
$25–$35/hour
*Commitment:
*20 hours/week
*Role Responsibilities
Write realistic prompts reflecting professional and consumer domain-specific guidance.
Evaluate AI-generated responses for factual accuracy and practical usefulness.
Identify fabricated claims and misleading reasoning in model outputs.
Score and rank model responses using structured rubrics.
Provide written justifications with specific evidence for evaluations.
*Qualifications
*Must-Have
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.
*Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.
Complete the Model Response Evaluation assessment.
*Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Related roles

Senior Product Owner, IaaS (Remote)

Remote · USA Full-time

Staff Product Owner (Oracle Retail)

Remote · USA Full-time

Educational Technology AI Rater & Evaluator

Remote · USA Full-time

Vocational Evaluator

Remote · USA Full-time

AI Decision & Response Analyst

Remote · USA Full-time

NURSE EVALUATOR III, HEALTH SERVICES

Remote · USA Full-time

Finance Model Prompt Evaluator

Remote · USA Full-time

AI Quality Evaluator (Polish)

Remote · USA Full-time

Healthcare Research Evaluator (STEM) | $30/hr Remote

Remote · USA Full-time

Generative AI Evaluator (Russian) | $15/hr Remote

Remote · USA Full-time

Experienced Online Chat Representative – Delivering Exceptional Customer Service in a Dynamic Remote Environment

Remote · USA Full-time

Experienced Customer Service Representative - Multilingual Support Specialist (Email, Phone & Chat Support)

Remote · USA Full-time

Experienced Part-Time Pharmacy Technician & Customer Service Representative – Remote Opportunity at arenaflex

Remote · USA Full-time

Experienced Customer Service Representative – Delivering Exceptional Experiences at arenaflex

Remote · USA Full-time

Enterprise Account Executive, Agentforce & Data 360

Remote · USA Full-time

Vaccine Customer Representative - Oklahoma City SW, OK

Remote · USA Full-time

Experienced Full Stack Data Entry Clerk – Remote Work Opportunity with arenaflex

Remote · USA Full-time

Experienced Customer Support Representative – Remote, Part-Time Opportunity with arenaflex

Remote · USA Full-time

Business Analyst Payments - H/F

Remote · USA Full-time

Experienced Full Stack Customer Support Agent – Live Chat & Remote Work Opportunity at arenaflex

Remote · USA Full-time