Launching Q3 2026
RevOpsEval
An evaluation benchmark for AI agents on Revenue Operations tasks. Public leaderboard. Open methodology. Real workflows.
What it measures
How accurately AI agents perform the recurring analytical work of a RevOps leader. Tasks are scored against expert practitioner judgment, not synthetic intuition.
Sample tasks
- Diagnose a Salesforce org's pipeline hygiene from raw CRM data
- Prepare a weekly forecast call memo with rep-level variance analysis
- Pressure-test a stuck $750K deal and recommend a forecast category
- Triage activity capture failures across reps and identify systemic issues
- Stress-test a comp plan against the prior year's actuals
Who it's for
AI model evaluators, agent framework builders, and RevOps leaders comparing tools. Free and public. Submit results via API or from a self-hosted run.