2h ago
AI QA Trainer - Freelance Project
Singapore
$6-$65 / year
Contract, Mid
Description
You will challenge advanced language models on tasks such as hallucination detection, factual consistency, prompt-injection resistance, and bias auditing, and document failure modes to improve model reliability. On a typical day, you will design test plans, verify factual accuracy, and build evaluation rubrics, working alongside automation and red-teaming efforts to track quality improvements.
Requirements
- Bachelor's, master's, or PhD in CS, data science, or a related field
- Experience with QA for ML/AI systems and safety/red-teaming
- Hands-on work with LLM eval tooling (e.g., OpenAI Evals)
- Test automation with Python and SQL, using frameworks such as pytest
- Clear communication and ability to document findings
Responsibilities
- Design and run test plans and regression suites
- Evaluate model outputs for hallucination, bias, and factual accuracy
- Build clear rubrics and pass/fail criteria
- Capture reproducible error traces with root-cause hypotheses
- Partner on adversarial red-teaming and automation