A simple website crawler that asks the user for a website address, crawls it, and extracts specific data from the pages at that address.

Last update: Jan 10, 2022

Related tags

Web Crawling Website-Crawler-Python-

Overview

Website-Crawler-Python

This is a simple website crawler that asks the user for a website address, crawls it, and extracts specific data from the pages at that address. After receiving the address, it asks the user for a crawling depth, chosen within the number of links found at that address.

Website Crawler takes 3 inputs:

A website address
Integer value for the crawling depth
A user-specified regular expression for finding user-specific data
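As an illustration of the three inputs above, here is a minimal sketch of how they might be validated before crawling starts. The names `CrawlerInputs` and `parse_inputs` are hypothetical and are not taken from the project, and the checks shown (absolute http/https address, non-negative depth, compilable pattern) are assumptions about what reasonable validation looks like.

```python
import re
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class CrawlerInputs:
    """Hypothetical container for the three user inputs."""
    url: str
    depth: int
    pattern: re.Pattern


def parse_inputs(url: str, depth: str, pattern: str) -> CrawlerInputs:
    """Validate the three raw strings a user might type at the prompts."""
    cleaned_url = url.strip()
    parsed = urlparse(cleaned_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) address: {url!r}")

    depth_value = int(depth)  # raises ValueError for non-integer text
    if depth_value < 0:
        raise ValueError("crawling depth must not be negative")

    try:
        compiled = re.compile(pattern)
    except re.error as exc:
        raise ValueError(f"invalid regular expression: {exc}") from exc

    return CrawlerInputs(cleaned_url, depth_value, compiled)
```

In an interactive script, each argument could come from a call to `input()`, for example `parse_inputs(input("Website address: "), input("Crawling depth: "), input("Regular expression: "))`.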

General tasks:

Finds all Norwegian mobile numbers and saves them to a text file.
Finds all sub-links in the given website and saves them to a text file.
Saves the website's raw HTML code to a text file.
Finds all email addresses and saves them to a text file.
Finds all comments used in the website and saves them to a text file.
Finds the five most used words and prints them to the terminal.
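Several of the tasks above come down to running a pattern over the raw HTML. The sketch below shows one possible way to do that with `re` and `collections.Counter`. All patterns and function names are assumptions made for illustration, not the project's own: the mobile-number pattern only matches unspaced 8-digit numbers starting with 4 or 9 (optionally prefixed by +47 or 0047), and the email pattern is deliberately loose.

```python
import re
from collections import Counter

# Assumed patterns for illustration; the project's own expressions may differ.
NORWEGIAN_MOBILE = re.compile(r"(?<!\d)(?:(?:\+|00)47 ?)?([49]\d{7})(?!\d)")
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
HTML_COMMENT = re.compile(r"<!--(.*?)-->", re.DOTALL)
TAG = re.compile(r"<[^>]+>")
WORD = re.compile(r"[^\W\d_]+")  # runs of letters only


def find_mobile_numbers(html):
    """Return unique 8-digit numbers, without the country prefix."""
    return sorted(set(NORWEGIAN_MOBILE.findall(html)))


def find_emails(html):
    return sorted(set(EMAIL.findall(html)))


def find_comments(html):
    """Return the text of every <!-- ... --> comment."""
    return [body.strip() for body in HTML_COMMENT.findall(html)]


def top_words(html, n=5):
    """Count words in the text left after removing comments and tags.

    Script and style contents are not filtered out in this sketch.
    """
    text = TAG.sub(" ", HTML_COMMENT.sub(" ", html))
    return Counter(WORD.findall(text.lower())).most_common(n)


def save_lines(path, lines):
    """Write one result per line to a text file."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(lines))
```

A result list can then be written out with, for example, `save_lines("emails.txt", find_emails(html))`, where the file name is a placeholder.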

This is a Python-based project that depends on the following libraries:

RegEx
Urllib3
BeautifulSoup 4
Counter in Collections
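To show how a crawling depth can limit which pages are visited, here is a minimal breadth-first sketch. It makes several assumptions: depth is interpreted as the number of link-following hops from the start page, which may differ from how the project uses it; the download step is passed in as a `fetch` function so the example stays self-contained; and links are collected with the standard library's `html.parser` in place of BeautifulSoup. The names `LinkCollector`, `extract_links`, and `crawl` are hypothetical.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(base_url, html):
    """Return absolute http(s) links found in html, fragments removed."""
    collector = LinkCollector()
    collector.feed(html)
    absolute = (urldefrag(urljoin(base_url, href)).url for href in collector.links)
    return [url for url in absolute if url.startswith(("http://", "https://"))]


def crawl(start_url, max_depth, fetch):
    """Breadth-first crawl; returns {url: raw_html} for every page fetched."""
    pages = {}
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        try:
            html = fetch(url)
        except Exception:
            continue  # skip pages that fail to download
        pages[url] = html
        if depth >= max_depth:
            continue  # do not follow links beyond the requested depth
        for link in extract_links(url, html):
            if link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return pages
```

With urllib3 installed, one possible `fetch` is `lambda url: urllib3.PoolManager().request("GET", url).data.decode("utf-8", errors="replace")`. The returned dictionary holds each page's raw HTML, which can then be saved or searched.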

Owner

Faisal Ahmed

A helper library to scrape data from Instagram effortlessly, using the Influencer Hunters APIs.

Web-Scraping using Selenium Master

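Latest optimized version of a Moutai purchase-sniping script for Moutai flash sales, with an optimized purchase coroutine queue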

Divar.ir Ads scraper

This program will help you to properly scrape all data from a specific website

Dailyiptvlist.com Scraper With Python

A high-level distributed crawling framework.

Web Scraping Practica With Python

EBay-email-tracker - Scrapes an entire search page of a particular item on eBay and sends regular updates to an email address

Python-based web scraper that can discover JavaScript files and parse them for juicy information (API keys, IPs, hidden paths, etc.)

Scrapes proxies and saves them to a text file

A helper library to scrape data from TikTok in one line, using the Influencer Hunters APIs.

A dead simple crawler to get books information from Douban.

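Automatic quiz answering and score farming for the "Four Histories" on China University Students Online (currently supports only the Heroes chapter)

A crawler written in Python that automatically fetches the lyrics of all songs by a given artist on QQ Music

Shopee Scraper - A web scraper in Python that extracts sales, price, available stock, location and more for a given seller in Brazil

For those who don't want to pay $10/month for high school game footage with ads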

Scrapy-soccer-games - Scraping information about soccer games from a few websites

A python script to extract answers to any question on Quora (Quora+ included)

robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser.
