hellno the optimist pfp
hellno the optimist
@hellno.eth
claude 3.7 got 60% without reasoning on the aider benchmark. same as o3-mini with high reasoning ¯\_(ツ)_/¯ https://aider.chat/docs/leaderboards/
2 replies
1 recast
4 reactions

Dan Romero pfp
Dan Romero
@dwr.eth
so base model is better and improves with reasoning?
1 reply
0 recast
0 reaction

sticky 🐉 pfp
sticky 🐉
@godsticky.eth
lmao gpt is so trash it's insane
0 reply
0 recast
1 reaction