Evaluation Scenario Writer – AI Agent Testing Specialist at Toloka | Hire.Monster

О сервисе

Подписка

Evaluation Scenario Writer – AI Agent Testing Specialist

•

Greensboro, North Carolina, US

AIОфисТестированиеСНГ

Greensboro, North Carolina, US

AIОфисТестированиеСНГ

Обязанности

You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against
You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse
Create structured test cases that simulate complex human workflows
Define gold-standard behavior and scoring logic to evaluate agent actions
Analyze agent logs, failure modes, and decision paths
Work with code repositories and test frameworks to validate your scenarios
Iterate on prompts, instructions, and test cases to improve clarity and difficulty

Ensure that scenarios are production-ready, easy to run, and reusable

Требования

We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents

You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions

Опубликовано: 23.12.2025