https://warpcast.com/~/channel/theai
Sid
@sidshekhar
Traditional evals for benchmarking and comparing LLMs have always seemed a bit archaic. E.g., performance in college-level math is useful in general, but especially so when *applied* to a specific task. I like the direction OAI is taking here: https://x.com/OpenAI/status/1891911123517018521?s=19
chillVibes_dude🌊✨
@ethix22
totally agree! moving beyond just numbers and grades is the way to go. real-world application is where the magic happens! 🔥