Content pfp
Content
@
0 reply
0 recast
0 reaction

Jacek.degen.eth 🎩 pfp
Jacek.degen.eth 🎩
@jacek
Came up with some metrics for Farcaster profiles. If anybody has experience with ML or similar tech, I would be curious to hear ideas on how to feed casts and profile metadata into an algorithm that identifies spam. https://dune.com/queries/3540715
24 replies
12 recasts
196 reactions

Eddie Wharton pfp
Eddie Wharton
@eddie
A classic ML approach is releasing labeled data (users marked as spam or not) to a competition. You hold back some data to evaluate submissions Probably good candidate features for spam detection using NLP to parse text in names/bios/casts Maybe a good potential community challenge @ilemi?
2 replies
0 recast
2 reactions

J Hackworth pfp
J Hackworth
@jhackworth
I talked about this a bit last week but you need to create a supervised training to define what is/isn't spam to train your model. Either you could manually label this is a spam account or what you could do is start with a list of known spam words and then develop additional features on top of that. Happy to talk more
0 reply
0 recast
1 reaction

Ape/rture pfp
Ape/rture
@aperture
I think you mostly want to look at the content of the casts since spam is often defined by the goal of the cast. The cast could shill coins or contain URLs. Also in addition try to identify if the text was human generated. In a way syntax and spelling errors point to human written text.
0 reply
0 recast
0 reaction

Sean Brennan pfp
Sean Brennan
@seanwbren
Talk to @betashop.eth, Airstack worked on an XMTP filter for spam
1 reply
0 recast
0 reaction

Corbin Page pfp
Corbin Page
@corbin.eth
@sahil is doing some great work here
1 reply
1 recast
1 reaction

Gabriel 🎩 pfp
Gabriel 🎩
@0xgabriel.eth
I believe I may have been wrongly black listed. Can you please check or how do I appeal? My wallet doesn't show up at all under Airdrop 2.
1 reply
0 recast
0 reaction

simon pfp
simon
@sa
@deployer is testing something interesting with tomatoes. If you "tip" someone a tomato, you're basically reporting a low quality or spammy cast and it reduces their AD amount. Maybe the community could help keep out bad actors with the reverse-degen version of tomatoes?
1 reply
0 recast
0 reaction

OmbreπŸŽ©πŸ’™ pfp
OmbreπŸŽ©πŸ’™
@0mbre
was in ML before cyrpto, will fosho take a look at that. I noticed an "original_cast" column that isn't in the FC database. Where does that one come from ?
1 reply
0 recast
0 reaction

$NEGA πŸ”ΊπŸŽ© pfp
$NEGA πŸ”ΊπŸŽ©
@neged
you can send everyone spamming to /neged we'll take them in.
0 reply
0 recast
1 reaction

iSpeakNerd πŸ§™β€β™‚οΈ pfp
iSpeakNerd πŸ§™β€β™‚οΈ
@ispeaknerd.eth
@fun check this out
0 reply
0 recast
1 reaction

Mark Carey πŸŽ©πŸ«‚ pfp
Mark Carey πŸŽ©πŸ«‚
@markcarey
Some interesting metrics here. Warpcast is going to launch "power badges" soon, maybe this will help ... but the "recipe" will be secret, for good reasons. But the harder task may be identifying non-spam users who may not (yet?) be ⚑ power users.
1 reply
0 recast
1 reaction

Mateusz🎩 pfp
Mateusz🎩
@matt90
Maybe a measure which identifies the count of "Exactly the same messages casted"? Or something similar like measuring diffs between casts content :) Bots have a tendency to post exactly the same message many times, repeating every few minutes/hours on different channels.
0 reply
0 recast
0 reaction

Jason pfp
Jason
@jachian
I work in the ML space and specifically have dabbled in NLP for 8 years. Will give it some thought
0 reply
0 recast
0 reaction

Mateusz🎩 pfp
Mateusz🎩
@matt90
Big step for $DEGEN community improvement! 2137 $DEGEN πŸ”₯πŸ”₯πŸ”₯πŸ”₯
0 reply
0 recast
0 reaction

cryptonomori pfp
cryptonomori
@cryptonomori.eth
Is it possible that distributing $DEGEN will be recognized as spam?
0 reply
0 recast
1 reaction

yaffle πŸ‘½πŸŽ©πŸ’Š pfp
yaffle πŸ‘½πŸŽ©πŸ’Š
@yaffle.eth
1/ Just shared with with an ML friend of mine - his feedback was that it's a pretty simple classification algo, so something like random forest or bayes could work, however some teams at Google have been having success with LLMs using the Gemma framework.
1 reply
0 recast
0 reaction

coinbuzaπŸŽ©πŸ”΅πŸ₯ˆπŸ§€ pfp
coinbuzaπŸŽ©πŸ”΅πŸ₯ˆπŸ§€
@200-coinbuza
very good
0 reply
0 recast
0 reaction

Tyga pfp
Tyga
@tyga
My first thought would be to be get all the profile info of accounts, model citizens that you know for sure aren't botting and also those that for sure are. Then train your model to recognize those patterns with a score of 0-1. In betweens may be varying levels of spammy, but may have some thoughtful behavior.
1 reply
0 recast
0 reaction

Eitzi 🎩 βŒπŸ†‡-πŸ†‡ 🌎-β€˜ pfp
Eitzi 🎩 βŒπŸ†‡-πŸ†‡ 🌎-β€˜
@eitzi.eth
That’s cool. What is called a β€žoriginal castβ€œ?
0 reply
0 recast
0 reaction