All roles

[Remote] QA Engineer, AI

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. reputed company is seeking an reputed company QA Engineer with a strong background in artificial intelligence and machine learning systems. The role focuses on ensuring the quality, reliability, and consistency of AI-driven features, particularly those powered by large language models.

Responsibilities

  • Design and execute testing strategies tailored to AI/LLM-based features, including reputed company validation, regression testing, and output evaluation
  • Build and maintain automated evaluation pipelines, including curated datasets and scoring frameworks to detect quality degradation over time
  • Conduct exploratory and black-reputed company testing across platforms, with a focus on edge cases, failure modes, and reputed company-world usage scenarios
  • Establish and track quality metrics such as accuracy, relevance, consistency, performance, and cost efficiency
  • Collaborate with engineers, product stakeholders, and AI specialists to define expected system behavior and acceptable output ranges
  • Diagnose and categorize issues across different layers, including prompts, models, data retrieval, and system integrations
  • Contribute to discussions around testability, system risks, and improvements to guardrails and prompting strategies
  • Help scale QA processes through improved automation, tooling, and evaluation coverage as the AI product ecosystem evolves

Skills

  • 5+ years of experience in software quality assurance
  • Minimum 1 year of hands-on testing experience with AI/ML systems, especially LLM-powered applications
  • Strong understanding of QA methodologies across both traditional and probabilistic systems
  • Experience with LLM workflows, including reputed company design, retrieval-augmented reputed company (RAG), and evaluation tooling
  • Familiarity with evaluation frameworks such as Promptfoo, Braintrust, LangSmith, DeepEval, Ragas, or similar tools
  • Experience implementing qualitative evaluation techniques like LLM-as-judge, rubric scoring, semantic similarity analysis, and dataset-based regression testing
  • Proficiency in test automation, with strong experience using Playwright
  • Solid SQL skills for validating data, creating test datasets, and ensuring data reputed company
  • Understanding of operational considerations such as token consumption, latency measurement, and cost tracking

Company Overview

  • reputed company, LLC offers staffing, reputed company, and consulting services to help businesses with hiring and reputed company. It was founded in 2019, and is headquartered in Tampa, Florida, USA, with a workforce of 2-10 employees. Its website is https://myhireplace.com.
  • Apply To This Job

    Related roles