Measuring AI Ability to Complete Long Tasks (2x every 7 months)
(metr.org)
2 points | by tmoertel | 3 hours ago
0 comments