Content pfp
Content
@
https://warpcast.com/~/channel/aichannel
0 reply
0 recast
0 reaction

Dan Romero pfp
Dan Romero
@dwr.eth
Question for devs: is Claude still the best model for actual coding? If so, by how much?
22 replies
19 recasts
117 reactions

Ben Adamsky 💭 pfp
Ben Adamsky 💭
@ba
I've tried r1, o1, and o3 and the answer is still clearly yes Claude is no longer my go to for big architectural discussions, but it's the best at understanding a codebase and delivering a clear solve with minimal refactoring. Oh, and it's still blazing fast in comparison. So no for high level, yes for code
2 replies
1 recast
25 reactions

shoni.eth pfp
shoni.eth
@alexpaden
o3minihigh will be a great replacement very soon
1 reply
0 recast
3 reactions

Ben Adamsky 💭 pfp
Ben Adamsky 💭
@ba
We shall see, been surprisingly hard to beat sonnet even with the latest models though Expecting a big breakthrough in the next couple of months, but so far nothing has been worth making the switch even though sonnet is a mid level programmer at best
1 reply
0 recast
0 reaction

shoni.eth pfp
shoni.eth
@alexpaden
ah i don’t agree at all, imo sonnet is so bad now i sometimes just use 4o instead. total lack of instruction following on numerous occasions in a big codebase (elizaos, 800 line files) it literally didn’t do any code and just made a comment note changing a word or something ridiculous. are you using cursor agent?
1 reply
0 recast
1 reaction

Ben Adamsky 💭 pfp
Ben Adamsky 💭
@ba
If you want really good boilerplate based on a lengthy spec than there are better models than sonnet For me, I'm typically looking to prompt for specific high level output and claude has been better at that than any other models I've tried Tried cursor agent but have a better workflow w/ chat tbh
1 reply
0 recast
1 reaction

shoni.eth pfp
shoni.eth
@alexpaden
workflow of not switching to o1 or wdym? agent really just automates cli stuff i have to copy and paste otherwise re boilerplate: i am making core edits to major files and it just is not capable at this level. it was fine on smaller projects— idk i just have no interest in it anymore it takes more time to use than is worthwhile
1 reply
0 recast
1 reaction