demishassabis pfp
demishassabis
@0487205048720504
We’re releasing Humanity’s Last Exam, a dataset with 3,000 questions developed with hundreds of subject matter experts to capture the human frontier of knowledge and reasoning. State-of-the-art AIs get <10% accuracy and are highly overconfident. @ai_risk @scaleai https://t.co/kiOJKV2GfI
0 reply
0 recast
0 reaction