sid on Warpcast

sid pfp

Well I’ll be damned. I didn’t think it would actually happen, but as of today, Grok 3 is the best AI model out there. We have a new player in town. xAI just dropped Grok 3, their latest large language model, packed with a reasoning engine and a mini model. And it’s delivering some serious results: • LMArena: 1400 ELO (#1 ranking) • AIME 24: 52% (96% with reasoning!) • GPQA: 75% (85% with reasoning) • LiveCodeBench (Coding): 57% (80% with reasoning) • AIME 2025 (Math): 93%, outperforming o3-mini-high The AI game just got interesting.

11 replies

21 recasts

134 reactions

shoni.eth pfp

these aren’t the best ai models lol wtf it’s a chart with the worst openai model

1 reply

0 recast

0 reaction

sid pfp

I’m limited to uploading only two images here, but there’s a distinction between non-reasoning and reasoning. Grok 3 is the best model in both categories.

1 reply

0 recast

0 reaction

shoni.eth pfp

it still loses from what i’m seeing.. also i use o1 pro not o1… idk just was a marketing scheme but it seems very inaccurate https://x.com/12exyz/status/1891723056931827959?s=46 https://x.com/12exyz/status/1891778033964445706?s=46

1 reply

0 recast

1 reaction

sid pfp

I agree with everything you’re saying, but OpenAI hasn’t released O3 publicly yet. They definitely manipulated the charts to make it seem like they’re winning, which is unfair, but OpenAI, Apple, and others have done that before. Regardless of that, they did release a very good model in less than 12 months.

0 reply

0 recast

1 reaction