Katsuya on Warpcast

Content pfp

0 reply

0 recast

0 reaction

Greg pfp

I haven't been following the AI space very closely, especially outside of OpenAI stuff. How far behind are open source alternatives? I've heard about stuff like llama2 supposedly being decent for chat but I'm assuming the agents/function calling and whatnot is a different story?

9 replies

0 recast

11 reactions

Katsuya pfp

It depends on how you measure it/use cases, e.g. some OS models claim they are better than GPT-4 in some benchmark. But in general, my intuition is that OpenAI is ~1 year ahead than OS models. My building approach is: Just go with OpenAI if possible then optimize with other options if absolutely need to.

1 reply

0 recast

1 reaction

Kyle Mathews pfp

I switched from gpt3.5 to openhermes-2-mistral-7b for a side project as the gpt3.5 API was taking 35-85 seconds to finish where the oss LLM on https://www.perplexity.ai took 5-8 seconds with comparable quality. Gpt4 is mildly better quality wise but the huge difference in speed (and price) meant a lot more

3 replies

0 recast

3 reactions

Katsuya pfp

gpt3.5 taking 10x longer is surprising to me. Is it consistently that way? Going with smaller models for speed/price makes sense if that works for your use cases. In general though I think devs tend to spend too much time optimizing for these choices prematurely.

2 replies

0 recast

1 reaction

Katsuya pfp

(obviously, makes sense to go deep in these details if for side projects/fun :)

0 reply

0 recast

0 reaction

Kyle Mathews pfp

Hard to say. I saw it for several days and people on forums said it'd been happening for weeks at that point. So perhaps something internally has gotten overloaded 🤷‍♂️ I'm sure it's fixable. But switching took all of 10 mins and was immediately way better so stayed

0 reply

0 recast

1 reaction