shoni.eth
@alexpaden
so i'm running all the data analysis off my mac studio because compute is too expensive for my budget, and i guess all i can provide as a service is Hugging Face releases. this week i'll be releasing the cast/threads topical summary table with 1536-dimensional embeddings on Hugging Face. it'll be slightly under 13m rows (spam label 2 only) and should provide a solid foundation for clustering and semantic search in the open social data arena. license: Creative Commons Attribution 4.0 (?)
3 replies
2 recasts
22 reactions
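For readers wondering what "semantic search" over such a table could look like, here is a minimal sketch, assuming each row carries an id and a 1536-dimensional embedding. The rows below are synthetic random vectors, not the actual release, and the helper names (normalize, top_k) are hypothetical. It ranks rows by cosine similarity to a query vector.

```python
import math
import random

EMBEDDING_DIM = 1536  # embedding size stated in the post


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def top_k(query, rows, k=3):
    """Return the k (score, row_id) pairs most similar to the query."""
    q = normalize(query)
    scored = [
        (sum(a * b for a, b in zip(q, normalize(emb))), row_id)
        for row_id, emb in rows
    ]
    return sorted(scored, reverse=True)[:k]


# Synthetic stand-in for the table: (row_id, embedding) pairs.
rng = random.Random(0)
rows = [(i, [rng.gauss(0, 1) for _ in range(EMBEDDING_DIM)]) for i in range(200)]

# Query with a slightly perturbed copy of row 42; it should rank first.
query = [x + rng.gauss(0, 0.1) for x in rows[42][1]]
results = top_k(query, rows)
```

A brute-force scan like this is fine for a few thousand rows; at roughly 13m rows an approximate nearest-neighbor index would likely be needed, which is outside the scope of this sketch.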
yesyes
@yesyes
Doing god's work. I will also be restarting machine learning work, so I think that dataset might prove useful for some of the stuff I want to try.
1 reply
0 recast
0 reaction
Apurv
@apurvkaushal
Embeddings fine-tuned for social data? Sorry, it's difficult to understand in one go.
0 reply
0 recast
0 reaction
tytyu
@inngo
cool
0 reply
0 recast
0 reaction