sid pfp
sid
@siddani
Well I’ll be damned. I didn’t think it would actually happen, but as of today, Grok 3 is the best AI model out there. We have a new player in town. xAI just dropped Grok 3, their latest large language model, packed with a reasoning engine and a mini model. And it’s delivering some serious results: • LMArena: 1400 ELO (#1 ranking) • AIME 24: 52% (96% with reasoning!) • GPQA: 75% (85% with reasoning) • LiveCodeBench (Coding): 57% (80% with reasoning) • AIME 2025 (Math): 93%, outperforming o3-mini-high The AI game just got interesting.
11 replies
19 recasts
123 reactions

nix pfp
nix
@nix
Best performing OpenAI models seem absent in this benchmark, ie the o3 ones?
1 reply
0 recast
0 reaction

sid pfp
sid
@siddani
o3 hasn't been publicly released yet
1 reply
0 recast
0 reaction

Mnttoken pfp
Mnttoken
@mnttoken
True, only the mini versions are released
0 reply
0 recast
0 reaction