I am putting together 3 TB of GPU capacity to run 3 concurrent Llama 3 405B models - mainly to have the cross reference edit each other and do its own coding...so I want redundancy in the system. Currently running two shitty AMD systems with 2 40B Llama 3 models. Any hardware suggestions besides Nvidia as the base GPU's and any suggestion on Github repo softwrae to run them and make them agents - currently use a crappy Ollama interface on both.

Into innovation and biology as a technology - probably gonna talk about biomimicry

The other projects I’ve seen that tried to use AMD were not stoked on their choice. Get the m4 off you don’t want nvidia but you should just get nvidia 

What are you asking about GitHub?

Catch me on X, notifications are off here 
@devinjelliot

If there's a better Llama interface than Ollama!!