Measure an AI system by the length of time it can maintain goal‑directed, multi‑step activity without human intervention (its 'time horizon'), rather than by single‑task benchmarks. This metric captures sustained autonomy, chaining risk (sabotage, self‑improvement), and gives a single intuitively comparable quantity policymakers and procurers can use.
— A standardized time‑horizon metric would reframe regulation, procurement, and safety tests toward sustained autonomous behavior, clarifying when systems require stricter controls.
Oren Cass
2026.04.17
100% relevant
Chris Painter (president of Model Evaluation and Threat Research) explicitly discusses 'time horizon' as a measure of autonomy on the podcast.
← Back to All Ideas