Loading
Hire.Monster

Technical Program Manager, AI Evaluation Specialist

US
УдалённаяМенеджментСША

Обязанности

  • As part of Speech Analytics, you will own the human-in-the-loop review processes that measure model accuracy, reliability, and alignment with Chime’s standards for quality and member trust
  • Your work provides the trust layer that ensures models behave as expected — identifying gaps, failure modes, and opportunities for improvement
  • You’ll partner closely with Speech Analytics, Data teams, Enablement, and Model Owners to ensure AI systems operate safely and consistently in production
  • Own the Human-in-the-Loop evaluation process for all AI models supporting Operations
  • Run recurring sampling and reviews to assess accuracy, consistency, and failure modes
  • Score, tag, and document cases where AI systems misclassify, hallucinate, skip steps, or generate incomplete outputs
  • Maintain structured rubrics and guidelines to ensure reviewer alignment and scoring consistency
  • Conduct deeper investigations into error patterns and root causes
  • Translate insights into recommendations for model owners and partner teams
  • Track and report key evaluation metrics such as accuracy, recall, coverage, and error types
  • Maintain thorough documentation for evaluation procedures, sampling logic, and scoring definitions
  • Collaborate with cross-functional teams to integrate evaluation findings into dashboards and tuning workflows

Support scaling governance processes and strengthening model-health standards across Operations

Требования

  • Experience reviewing unstructured text and applying rubrics or scorecards
  • Understanding of how AI supports operations (classification, summarization, categorization, automation)
  • Ability to identify patterns, edge cases, and failure modes from qualitative and quantitative data
  • Familiarity with QA frameworks or content-review workflows
  • Experience with SQL, Looker, Snowflake (nice to have)
  • Strong attention to detail and high consistency standards
  • Clear communication and documentation skills

A passion for improving member experience by ensuring AI is safe, fair, and reliable

Навыки

3–5+ years in QA, evaluation, operational analytics, HITL programs, or model monitoring

Условия

  • The base salary offered for this role and level of experience will begin at $105,000 and up to $145,000
  • Full-time employees are also eligible for a bonus, competitive equity package, and benefits
  • The actual base salary offered may be higher, depending on your location, skills, qualifications, and experience

What we offer for our full-time, regular employees 🏢 Our in-office work policy is designed to keep you connected - with four days a week in the office and Fridays from home for those near one of our offices, plus team and company-wide events depending on location

Whether you’re coming in regularly or are part of our fully remote program, you’ll stay engaged with your work and teammates. 💻 In-office perks including backup child, elder, and/or pet care, plus a subsidized commuter benefit to support your regular commute 💰 Competitive salary based on experience ✨ 401k match plus great medical, dental, vision, life, and disability benefits 🏝 Generous vacation policy and company-wide Chime Days, bonus company-wide paid days off 🫂 1% of your time off to support local community organizations of your choice 👟 Annual wellness stipend to use towards eligible wellness related expenses 👶 Up to 24 weeks of paid parental leave for birthing parents and 12 weeks of paid parental leave for non-birthing parents 👪 Access to Maven, a family planning tool, with $15k lifetime reimbursement for egg freezing, fertility treatments, adoption, and more

🎉 In-person and virtual events to connect with your fellow Chimers—think cooking classes, guided meditations, music festivals, mixology classes, paint nights, etc., and delicious snack boxes, too! *Perks also available to Chime Interns

Опубликовано: 12.01.2026