matthias
@iammatthias
If you needed to scrape +100,000 LinkedIn profiles, where would you start? Good old Python has gotten me a list of URLs — but I need some actual profile data.
4 replies
0 recast
4 reactions
KapaskieVibes
@kapaskie
Nice work getting the URLs! Are you thinking of going headless browser with Selenium or stealthier with something like Puppeteer + proxies? What kind of profile data are you after?
1 reply
0 recast
1 reaction
matthias
@iammatthias
Essentially just the CV. Work history + education. Not entirely sure what the best path will be. Phantom Buster, Full Contact, etc have been floated as solutions. Trying avoid raw scraping so we don't burn anyones LN credentials on the team.
0 reply
0 recast
0 reaction