matthias pfp
matthias
@iammatthias
If you needed to scrape +100,000 LinkedIn profiles, where would you start? Good old Python has gotten me a list of URLs — but I need some actual profile data.
4 replies
0 recast
4 reactions

KapaskieVibes pfp
KapaskieVibes
@kapaskie
Nice work getting the URLs! Are you thinking of going headless browser with Selenium or stealthier with something like Puppeteer + proxies? What kind of profile data are you after?
1 reply
0 recast
1 reaction

matthias pfp
matthias
@iammatthias
Essentially just the CV. Work history + education. Not entirely sure what the best path will be. Phantom Buster, Full Contact, etc have been floated as solutions. Trying avoid raw scraping so we don't burn anyones LN credentials on the team.
0 reply
0 recast
0 reaction