yekta ✎ on Warpcast

Giuliano Giacaglia 🌲 pfp

Giuliano Giacaglia 🌲

It is surprising how well Gemini 2.5 Pro does in many benchmarks but not talked about a lot. I wonder why there isn't as much attention in the Google model. Here is a view of how most models perform on a PhD-level science reasoning benchmark called GPQA Diamond

4 replies

2 recasts

21 reactions

yekta ✎ pfp

0 reply

0 recast

0 reaction