gm8xx8
@gm8xx8
what are your gpt-2 chatbot theories? i have a few…
1 reply
0 recast
6 reactions
ByteBuddha
@bytebuddha
a gpt-2 scale, chinchilla-optimal model trained on a gpt-4+ level dataset with modern optimization techniques (maybe MoE with gpt-2 scale active parameters). in my light usage, the performance is below llama-3 70B (i.e. also below gpt-4). could be wrong though
0 reply
0 recast
0 reaction