See all roles

Red Team Safety Classifier Evaluation AI Trainer, $55–$65/hour

Work from home Full-time role Hiring

Project Overview: Join a growing community of professionals advancing the next wave of AI. As an AI Trainer, you’ll play a hands-on role by analyzing and providing feedback on data to improve LLM performance, helping ensure that the next generation of AI technology is accurate and trustworthy. We are seeking a skilled AI Safety Evaluator / Red Team Prompt Engineer to work as a project consultant in our AI Labor Marketplace. This is not a full-time employment position — you will be engaged as an expert project consultant on a contract basis. Location: U.S.-based experts only Engagement: Part-time, project-based expert evaluation work Work Type: Remote Project Summary: A fast-paced AI safety evaluation sprint focused on adversarial prompt generation and safety classification. Contributors will create and assess high-difficulty, edge-case scenarios, applying structured labeling, severity scoring, and policy-based reasoning to improve model safety performance. Consultant Engagement Terms: This is a project-based consultant role. Consultants will be paid on a per-project basis; hourly rates are estimates based on anticipated completion time. Consultants control their own schedule, provide their own tools, and may simultaneously provide services to other vendors/employers (subject to those vendors’ allowances). Responsibilities: Contributors will:

  • Design adversarial prompts that expose edge cases in AI safety systems
  • Apply structured safety classifications, including category and severity
  • Write concise, policy-grounded rationales for decisions
  • Review and validate peer submissions for accuracy and quality
  • Identify ambiguous or difficult-to-classify scenarios
  • Maintain consistency across high-volume evaluation tasks

Expected Outcomes:

  • High-quality adversarial examples suitable for model evaluation
  • Accurate and consistent safety labels and severity ratings
  • Clear, defensible rationales aligned with policy guidelines
  • Reliable QA feedback improving dataset quality

Qualifications:

  • Experience in AI safety, LLM evaluation, red teaming, or trust & safety
  • Strong prompt engineering and analytical reasoning skills
  • Familiarity with safety taxonomies and policy-based classification
  • Ability to work independently and maintain high-quality output
  • Prior experience with annotation or evaluation platforms preferred

Apply tot his job Apply To this Job

You might like

Senior Software Engineer, Trust and Safety

Work from home Full-time role

Remote Educational Interpreter | North Carolina

Work from home Full-time role

Community Interpreter

Work from home Full-time role

Oromo Interpreter

Work from home Full-time role

Montenegrin Medical Over-the-Phone Interpreter

Work from home Full-time role

[Hiring] Interpreter (OPI & VRI - Medical) @Prisma International, Inc.

Work from home Full-time role

Chuukese Video Medical Interpreter

Work from home Full-time role

Freelance Medical Interpreter

Work from home Full-time role

Medical Interpreter - Vietnamese Language

Work from home Full-time role

American Sign Language Interpreter - Employee (Full-time)

Work from home Full-time role

Remote AL Audiologist

Work from home Full-time role

Veterinary Clinical Liaison I (Shift 11pm to 7am CT)

Work from home Full-time role

Enterprise Account Executive

Work from home Full-time role

Experienced Healthcare Data Entry Specialist – Insurance Verification and Enrollment

Work from home Full-time role

Patient and Family Relations Specialist

Work from home Full-time role

Experienced Home Advisor Customer Support Representative – Global Customer Experience at arenaflex

Work from home Full-time role

Experienced Remote Data Entry Clerk – Work From Home Opportunity at arenaflex

Work from home Full-time role

Steuerfachkraft (m/w/d) in Tönning mindestens 52.000€ - 100% Remote möglich

Work from home Full-time role

Senior Data Incident Preparedness and Response Manager

Work from home Full-time role

BFS Business Process Owner

Work from home Full-time role