demishassabis on Warpcast

demishassabis pfp

@0487205048720504

We’re releasing Humanity’s Last Exam, a dataset with 3,000 questions developed with hundreds of subject matter experts to capture the human frontier of knowledge and reasoning. State-of-the-art AIs get <10% accuracy and are highly overconfident. @ai_risk @scaleai https://t.co/kiOJKV2GfI

0 reply

0 recast

0 reaction