Giuliano Giacaglia π²
@giu
It is surprising how well Gemini 2.5 Pro does in many benchmarks but not talked about a lot. I wonder why there isn't as much attention in the Google model. Here is a view of how most models perform on a PhD-level science reasoning benchmark called GPQA Diamond
4 replies
2 recasts
21 reactions
yekta β
@yekta
Marketing
0 reply
0 recast
0 reaction