4/1/2026
Review: Measuring AI Ability to Complete Long Software Tasks
An analysis of the METR paper measuring AI's ability to complete long software tasks, which finds that LLM time horizons have been doubling approximately every seven months.