A dead simple crawler to get books information from Douban.

Last update: Jan 10, 2022

Related tags

Web Crawling douban-books-crawler

Overview

Introduction

A dead simple crawler to get books information from Douban.

Pre-requesites

Python 3
Install dependencies from requirements.txt
(Optional) Install Anaconda to handle environment

Usage

Run get_tags to fetch all the trending tags.

# This will generate a file tags.csv
python app.py get_tags

Run crawl_books to start crawling the books by the tags from the previous step.

python app.py crawl_books -i tags.csv

Certainly, you can create the tags.csv without using the get_tags script. You might want to make sure the tags you specified can lead to any actual result of books.

License

MIT © mogita

Owner

Yun Wang

GitHub Repository

WebScraping - Scrapes Job website for python developer jobs and exports the data to a csv file

WebScraping Web scraping Pyton program that scrapes Job website for python devel

2 Jul 22, 2022

This is a python api to scrape search results from a url.

googlescrape Installation Installation is simple! # Stable version pip install googlescrape Examples from googlescrape import client scrapeClient=cli

1 Dec 15, 2022

Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages.

Video Games Web Scraper Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages. This

1 Jan 12, 2022

jd_maotai rpa 基于selenium驱动的jd抢购rpa机器人

jd_maotai rpa 基于selenium驱动的jd抢购rpa机器人, 照顾我们这样的马大哈, 不会忘记抢购了, 祝大家过年都能喝上茅台. 特别声明: 本仓库发布的jd_maotai_rpa项目定义为自动化rpa项目, 是用于防止忘记参与jd茅台的活动(由于本人时常忘记), 而不是为了秒杀和抢

35 Nov 18, 2022

A simplistic scraper made to download tons of random screenshots made by people.

printStealer 1.1 What is this tool? This tool is developed to show the insecurity of the screenshot utility called prnt sc. It is a site that stores s

4 Jul 26, 2022

河南工业大学完美校园自动校外打卡

HAUT-checkin 河南工业大学自动校外打卡由于github actions存在明显延迟，建议直接使用腾讯云函数特点多人打卡使用简单，仅需账号密码以及用于微信推送的uid 自动获取上一次打卡信息用于打卡向所有成员微信单独推送打卡状态完美校园服务器繁忙时造成打卡失败会自动重新打卡

36 Oct 27, 2022

Extract gene TSS site form gencode/ensembl/gencode database GTF file and export bed format file.

GetTss python Package extract gene TSS site form gencode/ensembl/gencode database GTF file and export bed format file. Install $ pip install GetTss Us

6 Nov 21, 2022

A tool for scraping and organizing data from NewsBank API searches

nbscraper Overview This simple tool automates the process of copying, pasting, and organizing data from NewsBank API searches. Curerntly, nbscrape onl

0 Jun 17, 2021

Quick Project made to help scrape Lexile and Atos(AR) levels from ISBN

Lexile-Atos-Scraper Quick Project made to help scrape Lexile and Atos(AR) levels from ISBN You will need to install the chrome webdriver if you have n

1 Feb 11, 2022

A universal package of scraper scripts for humans

Scrapera is a completely Chromedriver free package that provides access to a variety of scraper scripts for most commonly used machine learning and data science domains.

299 Dec 15, 2022

中国大学生在线四史自动答题刷分(现仅支持英雄篇)

中国大学生在线 “四史”学习教育竞答自动答题刷分 (现仅支持英雄篇，已更新可用) 若对您有所帮助，记得点个Star 🌟 ！！！中国大学生在线 “四史”学习教育竞答自动答题刷分 (现仅支持英雄篇，已更新可用) 🥰 🥰 🥰 依赖本项目依赖的第三方库: requests 在终端执行以下

229 Dec 12, 2022

Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc)

Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc).

6 Aug 26, 2022

A dead simple crawler to get books information from Douban.

Related tags

Overview

Introduction

Pre-requesites

Usage

License

Owner

Yun Wang

WebScraping - Scrapes Job website for python developer jobs and exports the data to a csv file

This is a python api to scrape search results from a url.

Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages.

jd_maotai rpa 基于selenium驱动的jd抢购rpa机器人

A simplistic scraper made to download tons of random screenshots made by people.

河南工业大学完美校园自动校外打卡

Extract gene TSS site form gencode/ensembl/gencode database GTF file and export bed format file.

A tool for scraping and organizing data from NewsBank API searches

Quick Project made to help scrape Lexile and Atos(AR) levels from ISBN

A universal package of scraper scripts for humans

中国大学生在线四史自动答题刷分(现仅支持英雄篇)

Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc)

Python framework to scrape Pastebin pastes and analyze them

优化版本的京东茅台抢购神器

用python爬取江苏几大高校的就业网站，并提供3种方式通知给用户，分别是通过微信发送、命令行直接输出、windows气泡通知。

Webservice wrapper for hhursev/recipe-scrapers (python library to scrape recipes from websites)

学习强国自动化百分百正确、瞬间答题，分值45分

Pro Football Reference Game Data Webscraper

Create crawler get some new products with maximum discount in banimode website

This is a webscraper for a specific website

A dead simple crawler to get books information from Douban.

Related tags

Overview

Introduction

Pre-requesites

Usage

License

Owner

Yun Wang

WebScraping - Scrapes Job website for python developer jobs and exports the data to a csv file

This is a python api to scrape search results from a url.

Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages.

jd_maotai rpa 基于selenium驱动的jd抢购rpa机器人

A simplistic scraper made to download tons of random screenshots made by people.

河南工业大学 完美校园 自动校外打卡

Extract gene TSS site form gencode/ensembl/gencode database GTF file and export bed format file.

A tool for scraping and organizing data from NewsBank API searches

Quick Project made to help scrape Lexile and Atos(AR) levels from ISBN

A universal package of scraper scripts for humans

中国大学生在线 四史自动答题刷分(现仅支持英雄篇)

Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc)

Python framework to scrape Pastebin pastes and analyze them

优化版本的京东茅台抢购神器

用python爬取江苏几大高校的就业网站，并提供3种方式通知给用户，分别是通过微信发送、命令行直接输出、windows气泡通知。

Webservice wrapper for hhursev/recipe-scrapers (python library to scrape recipes from websites)

学习强国 自动化 百分百正确、瞬间答题，分值45分

Pro Football Reference Game Data Webscraper

Create crawler get some new products with maximum discount in banimode website

This is a webscraper for a specific website

河南工业大学完美校园自动校外打卡

中国大学生在线四史自动答题刷分(现仅支持英雄篇)

学习强国自动化百分百正确、瞬间答题，分值45分