Claude 3.7 is here 
https://www.anthropic.com/news/claude-3-7-sonnet

claude 3.7 got 60% without reasoning on the aider benchmark. same as o3-mini with high reasoning 

¯\_(ツ)_/¯

claude 3.7 got 60% without reasoning on the aider benchmark. same as o3-mini with high reasoning 

¯\_(ツ)_/¯ 

https://aider.chat/docs/leaderboards/

dev + founder | @vibesengineering.eth prev: @onsenbot @herocast

so base model is better and improves with reasoning?

Working on Farcaster and Warpcast. Longer thoughts at https://dwr.email

yes. it jumped to the level of reasoning models (R1, o1) without reasoning activated. 

claude 3.5 52% 
→ claude 3.7 no thinking 60% 
→ claude 3.7 with thinking ??? 
might be first model to get 70%

should be live in aider tomorrow