Content pfp
Content
@
https://warpcast.com/~/channel/theai
0 reply
0 recast
0 reaction

Sid pfp
Sid
@sidshekhar
Traditional evals to benchmark and compare LLMs always seemed to be a bit archaic E.g performance in college-level math is useful in general, but especially so when *applied* to a specific task. I like the direction OAI is taking here: https://x.com/OpenAI/status/1891911123517018521?s=19
1 reply
0 recast
8 reactions

Ako🎩ツ pfp
Ako🎩ツ
@ak0o0.eth
How can I cast here ?
1 reply
0 recast
1 reaction