Varun Srinivasan
@v
We've been working on improving our spam detection. A big source of alpha has been taking algos used to rank content on the web and modifying them to work in Farcaster-space. @akshaan and @notawizard collaborated to add:
- PageRank
- Hyperlink-Induced Topic Search (HITS)
- Louvain clustering
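To make the PageRank idea concrete, here is a minimal sketch of the textbook algorithm run over a follow graph instead of web links. It is not Warpcast's implementation: the account names, the `follows` edge list, and the damping and iteration values are all assumptions chosen for illustration, and the post does not say how the algorithm was modified for Farcaster.

```python
def pagerank(edges, damping=0.85, iterations=50):
    """Textbook PageRank over (follower, followed) pairs via power iteration."""
    nodes = sorted({name for edge in edges for name in edge})
    out_links = {name: [] for name in nodes}
    for follower, followed in edges:
        out_links[follower].append(followed)

    n = len(nodes)
    rank = {name: 1.0 / n for name in nodes}
    for _ in range(iterations):
        # Rank held by accounts that follow nobody is spread evenly over everyone.
        dangling = sum(rank[name] for name in nodes if not out_links[name])
        new_rank = {name: (1 - damping) / n + damping * dangling / n for name in nodes}
        for follower in nodes:
            if out_links[follower]:
                share = damping * rank[follower] / len(out_links[follower])
                for followed in out_links[follower]:
                    new_rank[followed] += share
        rank = new_rank
    return rank


# Hypothetical follow graph: "spam" follows alice, but nobody follows "spam".
follows = [
    ("alice", "bob"),
    ("bob", "alice"),
    ("carol", "alice"),
    ("carol", "bob"),
    ("spam", "alice"),
]
ranks = pagerank(follows)
```

In this toy graph the account nobody follows ends up with the minimum possible rank, which is the kind of signal a spam model could consume alongside others.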
5 replies
27 recasts
81 reactions
Varun Srinivasan
@v
A quick primer on spam handling in Warpcast:
1. Accounts are categorized roughly as "definitely not spammy", "probably not spammy", "unknown", "maybe a little spammy" and "definitely spammy".
2. Roughly 5% of the network is manually labelled by the team, and this seed data is used to train an ML model.
3. The model looks at a lot of signals and gives the user a score. For example, if you like things 24 hours a day, you're likely not a human. Multiple "bad" signals like this move accounts closer to the "definitely spammy" label.
4. The model has gotten quite good and rarely misses. In the cases where it does, we manually override it and retrain it on misses periodically so it gets better.
5. The model also tries to re-evaluate users periodically, so as users get more active and there is more data it can update its opinion.
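The labelling flow above can be sketched roughly as follows. This is a hand-written rule counter standing in for the trained ML model, which the post does not describe. Only the five labels, the "likes 24 hours a day" signal, and the manual override come from the post; the `Signals` fields and every threshold are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

# The five categories from the primer, ordered from least to most spammy.
LABELS = [
    "definitely not spammy",
    "probably not spammy",
    "unknown",
    "maybe a little spammy",
    "definitely spammy",
]


@dataclass
class Signals:
    # All fields are hypothetical examples of per-account signals.
    active_hours_per_day: int  # distinct hours of the day with at least one like
    likes_per_day: float
    account_age_days: int
    trusted_followers: int     # followers already labelled as not spammy


def label_index(s: Signals) -> int:
    """Start at "unknown"; each bad signal moves one step toward spammy."""
    index = LABELS.index("unknown")
    if s.active_hours_per_day >= 22:  # liking around the clock (assumed threshold)
        index += 1
    if s.likes_per_day > 500:         # assumed threshold
        index += 1
    if s.account_age_days > 365:      # assumed "good" signal
        index -= 1
    if s.trusted_followers >= 10:     # assumed "good" signal
        index -= 1
    return max(0, min(index, len(LABELS) - 1))


def label_for(s: Signals, manual_override: Optional[str] = None) -> str:
    """A manual label from the team always wins over the computed one."""
    if manual_override is not None:
        return manual_override
    return LABELS[label_index(s)]


bot = Signals(active_hours_per_day=24, likes_per_day=2000, account_age_days=3, trusted_followers=0)
regular = Signals(active_hours_per_day=10, likes_per_day=20, account_age_days=800, trusted_followers=50)
newcomer = Signals(active_hours_per_day=3, likes_per_day=5, account_age_days=2, trusted_followers=0)
```

Calling `label_for` again as an account accumulates history mirrors the periodic re-evaluation in step 5: a newcomer starts at "unknown" and moves once more signals exist.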
3 replies
1 recast
30 reactions
Thumbs Up
@thumbsup.eth
Is data about how the team has labeled users publicly visible somewhere? If not, it ought to be
1 reply
0 recast
1 reaction
Varun Srinivasan
@v
Yeah, we’re going to do this soon. We want to have a common standard for sharing account labels across providers like Neynar, Warpcast and others.
2 replies
1 recast
3 reactions