A tool for scraping and organizing data from NewsBank API searches

Last update: Jun 17, 2021

Overview

nbscraper

Overview

This simple tool automates the process of copying, pasting, and organizing data from NewsBank API searches. Curerntly, nbscrape only searches print sources in the USA.

Requirements

Access to NewsBank (e.g. via your institution's library)
Python 3

Basic Usage

Call nbscrape function
- Arguments include "search", "date_from", and "date_to"
Output is a pandas dataframe, with all available metadata for each source

Disclaimer

This tool is to be used in compliance with terms of service outlined by your institution and NewsBank. As such, it is suggested that you use this tool for research purposes only, once you have settled on your final search terms. This is not an exploratory tool. The purpose of nbscraper is to alleviate the tedium of having to click through 50 pages one by one and to manually save sources' metadata.

Owner

GitHub Repository

A tool for scraping and organizing data from NewsBank API searches

Related tags

Overview

nbscraper

Overview

Requirements

Basic Usage

Disclaimer

Owner

河南工业大学完美校园自动校外打卡

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

A Telegram crawler to search groups and channels automatically and collect any type of data from them.

Scrape data on SpaceX: Capsules, Rockets, Cores, Roadsters, SpaceX Info

Scraping and visualising India's real-time COVID-19 data from the MOHFW dataset.

Scrapy-soccer-games - Scraping information about soccer games from a few websites

优化版本的京东茅台抢购神器

Automatically download and crop key information from the arxiv daily paper.

TikTok Username Swapper/Claimer/etc

Unja is a fast & light tool for fetching known URLs from Wayback Machine

Python script who crawl first shodan page and check DBLTEK vulnerability

A spider for Universal Online Judge(UOJ) system, converting problem pages to PDFs.

Python scrapper scrapping torrent website and download new movies Automatically.

爱奇艺会员,腾讯视频,哔哩哔哩,百度,各类签到

Html Content / Article Extractor, web scrapping lib in Python

Twitter Claimer / Swapper / Turbo - Proxyless - Multithreading

A web scraper for nomadlist.com, made to avoid website restrictions.

feapder 是一款简单、快速、轻量级的爬虫框架。以开发快速、抓取快速、使用简单、功能强大为宗旨。支持分布式爬虫、批次爬虫、多模板爬虫，以及完善的爬虫报警机制。

A simple code to fetch comments below an Instagram post and save them to a csv file

A tool for scraping and organizing data from NewsBank API searches

Related tags

Overview

nbscraper

Overview

Requirements

Basic Usage

Disclaimer

Owner

河南工业大学 完美校园 自动校外打卡

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

A Telegram crawler to search groups and channels automatically and collect any type of data from them.

Scrape data on SpaceX: Capsules, Rockets, Cores, Roadsters, SpaceX Info

Scraping and visualising India's real-time COVID-19 data from the MOHFW dataset.

Scrapy-soccer-games - Scraping information about soccer games from a few websites

优化版本的京东茅台抢购神器

Automatically download and crop key information from the arxiv daily paper.

TikTok Username Swapper/Claimer/etc

Unja is a fast & light tool for fetching known URLs from Wayback Machine

Python script who crawl first shodan page and check DBLTEK vulnerability

A spider for Universal Online Judge(UOJ) system, converting problem pages to PDFs.

Python scrapper scrapping torrent website and download new movies Automatically.

爱奇艺会员,腾讯视频,哔哩哔哩,百度,各类签到

Html Content / Article Extractor, web scrapping lib in Python

Twitter Claimer / Swapper / Turbo - Proxyless - Multithreading

A web scraper for nomadlist.com, made to avoid website restrictions.

feapder 是一款简单、快速、轻量级的爬虫框架。以开发快速、抓取快速、使用简单、功能强大为宗旨。支持分布式爬虫、批次爬虫、多模板爬虫，以及完善的爬虫报警机制。

A simple code to fetch comments below an Instagram post and save them to a csv file

河南工业大学完美校园自动校外打卡