Varun Srinivasan pfp
Varun Srinivasan
@v
A public good that would be really useful is a public dataset of embeddings for casts. Assuming it's legal to do, would drastically reduce the cost of building embeddings based models on Farcaster
11 replies
64 recasts
119 reactions

borodutch pfp
borodutch
@warpcastadmin.eth
like get a cast, it's embeds, and return head og tags for each embed (url)?
1 reply
0 recast
0 reaction

Varun Srinivasan pfp
Varun Srinivasan
@v
no, i mean like vector embeddings for the text
2 replies
0 recast
1 reaction

Harry pfp
Harry
@harrytrananh
words to vectors you mean?
0 reply
0 recast
0 reaction

not parzival pfp
not parzival
@shoni.eth
What are you using the embeddings for in training- it's typically search and classification llm fine tuning and new training uses raw text i can provide snapshots if someone would like but custom transformations then embedded are where the value would be.. embedding every raw cast is a lot less useful imo
0 reply
0 recast
0 reaction