Content
@
0 reply
0 recast
0 reaction
phil
@phil
What are the best tools for running a scaleable cloud instance of Llama3 for text generation? It doesn't need to go into production, but I want to test something on a machine larger than my MBP. I've tried Replit (not supported) and AWS (way too complicated for such a simple task). Any obvious tools?
4 replies
2 recasts
11 reactions
chompk ↑
@chompk
My company use vLLM with a rented GPU VM I also have one of my friend hosting LLM as a service as well https://float16.cloud
0 reply
0 recast
0 reaction