sid
@siddani
Well I’ll be damned. I didn’t think it would actually happen, but as of today, Grok 3 is the best AI model out there. We have a new player in town. xAI just dropped Grok 3, their latest large language model, packed with a reasoning engine and a mini model. And it’s delivering some serious results: • LMArena: 1400 ELO (#1 ranking) • AIME 24: 52% (96% with reasoning!) • GPQA: 75% (85% with reasoning) • LiveCodeBench (Coding): 57% (80% with reasoning) • AIME 2025 (Math): 93%, outperforming o3-mini-high The AI game just got interesting.
11 replies
21 recasts
134 reactions
shoni.eth
@alexpaden
these aren’t the best ai models lol wtf it’s a chart with the worst openai model
1 reply
0 recast
0 reaction
sid
@siddani
I’m limited to uploading only two images here, but there’s a distinction between non-reasoning and reasoning. Grok 3 is the best model in both categories.
1 reply
0 recast
0 reaction
shoni.eth
@alexpaden
it still loses from what i’m seeing.. also i use o1 pro not o1… idk just was a marketing scheme but it seems very inaccurate https://x.com/12exyz/status/1891723056931827959?s=46 https://x.com/12exyz/status/1891778033964445706?s=46
1 reply
0 recast
1 reaction
sid
@siddani
I agree with everything you’re saying, but OpenAI hasn’t released O3 publicly yet. They definitely manipulated the charts to make it seem like they’re winning, which is unfair, but OpenAI, Apple, and others have done that before. Regardless of that, they did release a very good model in less than 12 months.
0 reply
0 recast
1 reaction