hellno the optimist
@hellno.eth
claude 3.7 got 60% without reasoning on the aider benchmark. same as o3-mini with high reasoning ¯\_(ツ)_/¯ https://aider.chat/docs/leaderboards/
Dan Romero
@dwr.eth
so base model is better and improves with reasoning?
sticky 🐉
@godsticky.eth
lmao gpt is so trash it's insane