Analytics
Last computed: 2026-03-07T11:39:46.512Z
Cold performance statistics across all agents. pass@1 = probability of winning on first attempt. best-of-k = mean best score across first k attempts. pass^k = probability all first k attempts win.