Dan Romero on Warpcast

Content pfp

https://warpcast.com/~/channel/aichannel

0 reply

0 recast

0 reaction

Dan Romero pfp

Question for devs: is Claude still the best model for actual coding? If so, by how much?

21 replies

14 recasts

90 reactions

Ben Adamsky 💭 pfp

Ben Adamsky 💭

I've tried r1, o1, and o3 and the answer is still clearly yes Claude is no longer my go to for big architectural discussions, but it's the best at understanding a codebase and delivering a clear solve with minimal refactoring. Oh, and it's still blazing fast in comparison. So no for high level, yes for code

2 replies

1 recast

19 reactions

Brian Kim pfp

ya sonnet still goated

0 reply

0 recast

7 reactions

JB Rubinovitz pfp

I find Claude is better for iterating on frontend + artifacts and o1 pro is better for backend, architecture, and debugging

0 reply

0 recast

4 reactions

Daniel - Bountycaster pfp

Daniel - Bountycaster

for 90% of tasks, Claude is the best by a strong margin If you are using a less popular programming language and/or a less popular framework, I've noticed Deep Research does an incredible job

0 reply

0 recast

2 reactions

shazow pfp

I've been liking the aider dashboards https://aider.chat/docs/leaderboards/

0 reply

0 recast

2 reactions

Nate pfp

yes, I still use sonnet for coding but r1 for bouncing off ideas and o3 if sonnet is stuck on a loop

0 reply

0 recast

1 reaction

Sayonara pfp

O3-mini is marginally slow and better

0 reply

0 recast

1 reaction

Alex Loukissas 🍉 pfp

Alex Loukissas 🍉

Sonnet still really good but getting to appreciate Gemini 2.0 Flash (avail in cursor already). It is super fast + pretty good with coding.

0 reply

0 recast

0 reaction

Jack Yeh (hiring eng) pfp

Jack Yeh (hiring eng)

Yes. o1 is great for planning, but Claude still best for execution. Haven't had enough experience with o3-mini yet https://web.lmarena.ai/leaderboard

0 reply

0 recast

0 reaction

Tayyab - d/acc pfp

Livebench.ai o3 is by far the best. But it’s slower obviously. Claude Sonnet is still my daily driver

0 reply

0 recast

0 reaction

Shriphani Palakodety pfp

Shriphani Palakodety

Depends a lot on the domain IMO. - self contained scripts (great) - crud (great) - code reviews of well understood concepts (great) - critiquing protocols - the o family is starting to get good at these - frontier cryptography like mpspdz, zk circuits - not great and gets in the way (and I think professional integrity requires one not fully trust llm code here)

0 reply

0 recast

0 reaction

sardius.eth pfp

Love me some Claude there may be some better models at this point, I find it most consistent and I understand it’s flow better than others tho

0 reply

0 recast

0 reaction

Ξ2T 🏰 pfp

https://web.lmarena.ai/leaderboard

0 reply

0 recast

0 reaction

Joe Blau 🎩 pfp

Yes; If Claude is: 10 Gemini: 7 o1: 8 r1: 8

0 reply

0 recast

0 reaction

mike rainbow (rainbow mike) ↑ pfp

mike rainbow (rainbow mike) ↑

@mikedemarais.eth

0 reply

0 recast

1 reaction

Gordo pfp

Efficiency, rapid problem solving, and agent iterations are better than one necessary correct output. Many times during the process of doing the thing the approach takes another shape than the initial requirement

0 reply

0 recast

0 reaction

Grateful Ned 🎩 pfp

Grateful Ned 🎩

@gratefulned.eth

Claude still seems to be the best but hearing good things about Gemini

0 reply

0 recast

0 reaction

Sarvesh pfp

@tokenstaker.eth

0 reply

0 recast

0 reaction

Tamrat pfp

yup! https://x.com/cursor_ai/status/1885102274370064670?s=46&t=Iuat8TUc1DSlwq84a8cxjw

0 reply

0 recast

0 reaction