Scrapes Every Email Address of Every Society in Every University

Last update: Dec 14, 2022

Overview

society-email-scrape

Site Live at https://kcsoc.github.io/society-email-scrape/

How to automatically generate new data

Go to unis.yml
Add your uni's URL
Create a Pull Request
A GitHub Actions bot will automatically update the data as per your PR
When I approve the PR, your uni will automatically be loaded into the website:
https://kcsoc.github.io/society-email-scrape/

How to run yourself

Add any URLs to unis.yml, then run the commands below.

PS: Don't forget to leave a trailing newline at the end of unis.yml

git clone https://github.com/kcsoc/society-email-scrape.git
cd society-email-scrape
./main.sh
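
The trailing-newline reminder above can be checked mechanically before running main.sh. The following is a minimal sketch, not part of the repository: the helper name ensure_trailing_newline is hypothetical, and it assumes unis.yml is a plain text file in the current directory.

```python
from pathlib import Path


def ensure_trailing_newline(path: str) -> bool:
    """Append a newline if the file lacks one. Return True if the file was changed."""
    file = Path(path)
    data = file.read_bytes()
    # An empty file or one already ending in a newline needs no change
    if not data or data.endswith(b"\n"):
        return False
    file.write_bytes(data + b"\n")
    return True


if __name__ == "__main__":
    # "unis.yml" is the file named in this README; run from the repository root
    if Path("unis.yml").exists():
        changed = ensure_trailing_newline("unis.yml")
        print("newline added" if changed else "unis.yml already ends with a newline")
```

Working on bytes rather than text avoids any guess about the file's encoding or line-ending style.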

How to test for a single university

Choose a university URL (your own or one from unis.yml)
Run python main.py UNIVERSITY_URL
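
To try several candidate URLs in a row before adding them to unis.yml, the single-university command can be wrapped in a small script. This is a sketch under assumptions: the helper names build_command and run_for_urls are hypothetical, the script is assumed to run from the repository root, and nothing here relies on how main.py reports errors beyond its process exit code.

```python
import subprocess
import sys


def build_command(url: str, script: str = "main.py") -> list[str]:
    """Build the argument list equivalent to `python main.py UNIVERSITY_URL`."""
    # sys.executable reuses the interpreter that is running this script
    return [sys.executable, script, url]


def run_for_urls(urls: list[str]) -> dict[str, int]:
    """Run the scraper once per URL and collect each exit code."""
    results = {}
    for url in urls:
        # check=False: one failing site should not abort the remaining runs
        completed = subprocess.run(build_command(url), check=False)
        results[url] = completed.returncode
    return results


if __name__ == "__main__":
    # URLs are taken from the command line, one argument per university
    for url, code in run_for_urls(sys.argv[1:]).items():
        print(f"{url}: exit code {code}")
```

Saved under a name of your choice, it takes one URL per command-line argument and prints the exit code of each run.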

To run with debug mode

The program features a debug mode. To enable it, prefix the python command with DEBUG_MODE=true

For example:

DEBUG_MODE=true python3 main.py https://www.imperialcollegeunion.org/activities/a-to-z

Or set the DEBUG_MODE variable for your whole shell session with export DEBUG_MODE=true

Note: DEBUG_MODE is enabled whenever the variable is set. To disable it again, remove the variable with unset DEBUG_MODE in bash
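
The note above implies that only the presence of the variable matters, not its value. A minimal sketch of such a check follows; it is an assumption about the behaviour described here, not the code in main.py, and the function name debug_enabled is hypothetical.

```python
import os


def debug_enabled(environ=None) -> bool:
    """Return True when DEBUG_MODE is present in the environment, whatever its value."""
    env = os.environ if environ is None else environ
    # Presence check only: DEBUG_MODE=false would still count as enabled
    return "DEBUG_MODE" in env


if __name__ == "__main__":
    print("debug mode on" if debug_enabled() else "debug mode off")
```

Under this reading, setting DEBUG_MODE=false would not turn debug mode off, which is why the variable has to be unset.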

Owner

Krishna Consciousness Society
