Information Systems Expert - AI Evaluator

Remote · USA Full-time

**About The Job**

**Mercor** connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include **Benchmark**, **General Catalyst**, **Peter Thiel**, **Adam D'Angelo**, **Larry Summers**, and **Jack Dorsey**.

**Position:** AI Model Evaluation Specialist

**Type:** Contract

**Compensation:** $40–$60/hour

**Commitment:** 20 hours/week

**Role Responsibilities**

• Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance.
• Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
• Identify fabricated claims, incorrect references, and misleading reasoning in model outputs.
• Score and rank multiple model responses using structured rubrics across multiple dimensions.
• Provide written justifications with specific evidence for each evaluation.

**Qualifications**

**Must-Have**

• Master’s degree or higher in Computer Science, Information Systems, or a relevant professional field.
• Professional experience applying domain expertise in a practitioner or advisory capacity.
• Familiarity with industry-specific standards, regulations, or clinical guidelines.
• Strong written communication and critical reasoning skills.

**Application Process** (takes 20–30 minutes to complete)

• Submit your resume to begin.
• Complete the Model Response Evaluation assessment.

**Resources & Support**

• For details about the interview process and the platform, see: https://talent.docs.mercor.com/welcome/welcome
• For help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

