John Hoang pfp
John Hoang
@jhoang
https://x.com/fchollet/status/1869578315952197797 o1 performs poorly on ARC-AGI eval. When we say it's a reasoning model what do we mean in technical terms?
0 reply
0 recast
2 reactions