Scrapping Connections' info on Linkedin

Last update: Feb 11, 2022

Overview

Scrap It!

! Disclaimer:

THIS CODE HAS BEEN IMPLEMENTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE INTERVIEW PROCESS OF MCI.IR AND INTERVIEWEES WERE SUPPOSED TO PUSH THE CODE ON THEIR GITHUB. CONTACT ME TO REMOVE THIS REPOSITORY, IN CASE IT IS AGAINST YOUR TOS.
IF ANY CONNECTION IS NOT OK TO THEIR CONTACT INFO BE HERE, CONTACT ME TO REMOVE THEM ASAP.

Functionalities:

This script automatically:

opens your Linkedin profile
accesses your connections page
crawls the page for grabbing their profile links
scraps each person's information and dumps it to Sqlite db
and simultaneously logs all necessary level of info into Linkedin.log

DataFlowDiagram

Enlisted desing patterns are (but not limited to):

Creator
Low Coupling
High Cohesion
Indirection
Modularization
Information Expert

Log/DB files:

Further develepments notes:

Check out other DBs that supports multithreading which anable us dumpping all information rows at once
change IP per request (You can find its code on my "Social Media Computing course" repository)
Sometimes you need to scroll down manually when "connection" page is being loaded. You can add one line code to scroll down for you.

References:

https://www.linkedin.com/pulse/how-easy-scraping-data-from-linkedin-profiles-david-craven

https://www.geeksforgeeks.org/scrape-linkedin-using-selenium-and-beautiful-soup-in-python/

https://stackoverflow.com/questions/28883769/remove-odd-indexed-elements-from-list-in-python#:~:text=Fun%20fact%3A%20to%20remove%20all,remove(x)%20.

https://stackoverflow.com/questions/34759787/fetch-all-href-link-using-selenium-in-python

https://www.tutorialspoint.com/fetch-all-href-link-using-selenium-in-python

https://stackoverflow.com/questions/64717302/deprecationwarning-executable-path-has-been-deprecated-selenium-python

https://chromedriver.chromium.org/home

https://www.youtube.com/watch?v=-ARI4Cz-awo

Scrapping Connections' info on Linkedin

Related tags

Overview

Scrap It!

Functionalities:

DataFlowDiagram

Enlisted desing patterns are (but not limited to):

Log/DB files:

Further develepments notes:

References:

Owner

MohammadReza Ardestani

script to scrape direct download links (ddls) from google drive index.

This is python to scrape overview and reviews of companies from Glassdoor.

Works very well and you can ask for the type of image you want the scrapper to collect.

tweet random sand cat pictures

Lovely Scrapper

Library to scrape and clean web pages to create massive datasets.

for those who dont want to pay $10/month for high school game footage with ads

Ebay Webscraper for Getting Average Product Price

Simple proxy scraper made by using ProxyScrape's api.

A Python package that scrapes Google News article data while remaining undetected by Google.

A web Scraper for CSrankings.com that scrapes University and Faculty list for a particular country

PS5 bot to find a console in france for chrismas 🎄🎅🏻 NOT FOR SCALPERS

Google Developer Profile Badge Scraper

Grab the changelog from releases on Github

Script used to download data for stocks.

This repo has the source code for the crawler and data crawled from auto-data.net

✂️🕷️ Spider-Cut is a Network Mapper Framework (NMAP Framework)

A simple Discord scraper for discord bots

mlscraper: Scrape data from HTML pages automatically with Machine Learning

A web scraper that exports your entire WhatsApp chat history.