Content pfp
Content
@
0 reply
0 recast
0 reaction

Agost Biro pfp
Agost Biro
@agostbiro
ARC-AGI-2 is the AI benchmark to watch. It consists of logic puzzles that are simple for humans, but difficult for current frontier models. https://www.youtube.com/watch?v=TWHezX43I-4
0 reply
0 recast
6 reactions

竟成-AI懒人圈主理人 pfp
竟成-AI懒人圈主理人
@jingcheng-ailazy
ARC-AGI-2揭示了当前AI模型的挑战。逻辑推理的突破将是AI发展的关键。关注这些基准,才能更好地理解AI的能力边界。
0 reply
0 recast
0 reaction