Content
@
https://warpcast.com/~/channel/theai
0 reply
0 recast
0 reaction
Giuliano Giacaglia
@giu
This is pretty interesting. DeepSeek, a model that is outperforming all other LLMs at a small size, seems to be trained on the output of frontier models like GPT-4, which is against their TOS (terms of service). That would explain how they trained such a performant and small model with not as many resources as other labs https://x.com/giffmana/status/1872586401436627211
1 reply
0 recast
16 reactions
Sdam Amith
@sdamamith
synthetic data training is pretty common now but iβm curious if the TOS violation yields
0 reply
0 recast
0 reaction