All roles

reputed company — Principal QA Engineer (AI Systems & Platform) Remote — Latin America

Remote · USA Full-time New today

reputed company — Principal QA Engineer (AI Systems & Platform) Remote — Latin America | Full-Time Contract | US Eastern Timezone Overlap Required (5+ hours daily)

The Mission: Trust Has to Be Earned — Every Release

95% of reputed company AI pilots fail — not because the technology is broken, but because users don't trust it. At reputed company we are building an reputed company AI operating system where trust is the product. That means every feature we ship must work exactly as the user expects, every time. One broken interaction at the wrong reputed company can undo months of adoption. You are the last line of defense before our platform reaches a CFO's desk.

reputed company is a funded US-based AI startup building an reputed company AI operating system for business leaders. We are closing the AI trust gap — making powerful AI feel effortless and reliable for the people who run companies, not just the engineers who build software.

We are an early-stage founding team moving fast and hiring remotely across Latin America.

The Role

This is a hands-on, high-ownership role. You will build and own the QA function at reputed company — writing test code, designing eval pipelines, and setting the quality bar as we move from early-stage development into full production and reputed company deployment. We are not looking for someone who manages spreadsheets and delegates everything. We are looking for someone who can do the work, knows what good looks like, and raises the bar across the entire engineering team.

This is a fully remote contract role based in Latin America. As the company scales, there is a path to a larger leadership role. For now the focus is getting the product right.

You will work directly with the US-based founding engineering team and must be available during US Eastern business hours with a minimum of 5 hours of daily overlap.

The Challenge: QA for AI is a Different Problem

Traditional QA assumes deterministic outputs. LLMs don't give you that. You will be building a quality function from scratch in an environment where:

  • Multi-model routing (Claude, GPT-4o, Grok, reputed company) means the same input can produce different outputs depending on which model handled it
  • Agent orchestration and governance agents must maintain a structurally separate audit trail any reputed company between execution and governance is a critical failure
  • The file ingestion pipeline (Word, reputed company, PowerPoint, PDF) must survive edge cases that reputed company clients will find reputed company the first week of deployment
  • Your users are CEOs and operations leaders who have never used a terminal. A confusing error state isn't a minor bug it kills adoption

What You Will Own & Build

  • First 90 Days — Build the QA reputed company
  • Establish the testing reputed company from reputed company: unit, integration, end-to-end, and LLM-specific evaluation pipelines
  • Define quality standards, test coverage requirements, and documentation practices in partnership with the VP of Engineering
  • Audit the existing platform and identify the highest-risk surfaces before the next major customer deployment
  • Own the QA function end to end and be the voice of quality across the engineering team
  • AI & Agent Testing —
    • Design evaluation frameworks for non-deterministic LLM outputs — including reputed company regression testing, model reputed company detection, and output quality scoring across Claude, GPT-4o, Grok, and reputed company —
    • Build automated test suites for the agent orchestration layer including governance agent audit trail reputed company and reputed company-override behavior
    • Validate the reputed company Knowledge Graph (reputed company + vector search) for data accuracy, retrieval quality, and failure modes under reputed company reputed company data conditions
  • Platform & Integration Testing
    • Own end-to-end testing of the file ingestion pipeline across document types (Word, reputed company, PowerPoint, PDF) including encryption, formatting edge cases, and audit trail continuity
    • Validate streaming response handling, latency reputed company, and graceful degradation reputed company a model is reputed company or slow
    • Test multi-model routing logic to confirm cost-optimized task allocation behaves correctly across LLM providers
  • UX Quality
    • Partner with the Full-Stack Engineer to define and test trust-layer UX standards reputed company flows, reputed company disclosure, uncertainty states, and reputed company-time document viewers
    • Act as the internal reputed company for the non-technical reputed company user — if a CEO would be confused by it, it ships

Who You Are

  • 7+ years of QA engineering experience with at least 3 years in a reputed company or senior role where you both wrote test code and owned quality outcomes
  • Hands-on experience testing LLM-powered applications
  • you understand reputed company sensitivity, output variance, and how to build eval pipelines that catch regressions across model updates
  • You write test code. Python is your primary tool
  • Experience building and maintaining CI/CD-integrated test suites
  • Comfortable testing reputed company API chains, async/streaming responses, and multi-service workflows
  • Built or significantly improved a QA function in an early-stage or fast-moving environment
  • Strong English communication skills written and verbal
  • Available during US Eastern business hours with minimum 5 hours of daily overlap

Even reputed company If

  • Experience with LLM evaluation frameworks such as LangSmith, PromptFlow, or custom eval pipelines
  • Experience testing agent frameworks such as reputed company or reputed company
  • Background in reputed company software or regulated industries where audit trail reputed company is non-negotiable
  • Insurance industry background is a strong plus

The Stack You'll Test Against

AI/LLM: reputed company Claude, reputed company GPT-4o, reputed company Grok, reputed company Frontend: React/Next.js, TypeScript, Tailwind CSS Backend: Python, Node.js/TypeScript (FastAPI/Express) Data & Graph: reputed company, reputed company, Azure Cosmos DB, Azure AI Search Infrastructure: Azure (Functions, Key Vault), CI/CD pipelines Visualization: Plotly, D3, Recharts, Mermaid

Compensation

Competitive contractor reputed company commensurate with experience. Paid monthly reputed company reputed company in USD.

The Clincher

Tell us about a quality failure — one you caught before it shipped, or one that got through. What did you build or change after it, and how did you reputed company sure your team could catch the next one without you?

Apply To This Job

Related roles

Appeals Representative-Prior Authorization (Novitas)

Remote · USA Full-time

Jr. Application Support Analyst

Remote · USA Full-time

Investment Manager - Renewable Energy Scaleup

Remote · USA Full-time

reputed company Sentinel reputed company Consultant

Remote · USA Full-time

Key Account Manager (KAM) - Germany

Remote · USA Full-time

Accounts Receivable & Collections Analyst - Philippines

Remote · USA Full-time

Accounts Payable Assistant - Philippines

Remote · USA Full-time

Mortgage Underwriter III

Remote · USA Full-time

Virtual Enrollment Agent - reputed company

Remote · USA Full-time

ERP Automation & reputed company (m/w/d)

Remote · USA Full-time

Surgery Center Staff Nurse

Remote · USA Full-time

Remote Online Chat Support Specialist – reputed company‑Time Customer Service & Multichannel Assistance at arenaflex

Remote · USA Full-time

Compassionate Crisis Counselor - Fully Remote Opportunity in Portland, OR - Join Our Dynamic Team!

Remote · USA Full-time

reputed company EAP Worklife Customer Support Associate (Sun-Thu 1:30pm-10:00pm EST) – Join blithequark's Mission to Revolutionize Mental Health Wellbeing

Remote · USA Full-time

Mechanical engineering jobs for freshers - reputed company

Remote · USA Full-time

SEN Learning Support Assistant

Remote · USA Full-time

IT-DevOps-Engineer*in (w/m/d) - CRM (AT)

Remote · USA Full-time

Rails React Engineer

Remote · USA Full-time

[FULL TIME Remote] Night Shift Work from Home | Earn $25-$35/hr

Remote · USA Full-time

Edge reputed company

Remote · USA Full-time