leetcode_scraper_and_pdf
A leetcode scraper to compile all questions in leetcode free tier to text file. pdf also available.
if new questions get added
Open google chrome, navigate to leetcode problems page click on inspect, navigate to dev console and paste the commands below. Copy question links and paste in text file named question_links.txt go to next page and repeat till you have all the questions links. Put the python script and question links file in same folder. run and wait to compile to text file, the convert to pdf online.
var x = document.querySelectorAll("a");
var myarray = []
for (var i=0; i
-1 && x[i].getAttribute('href').indexOf('solution') == -1 )
{
var cleanlink = x[i].href;
myarray.push([cleanlink]);
}
};
function make_table() {
var table = '
| Links |
|---|
'; for (var i=0; i
" myarray[i][0] ; }; var w="window.open("");" w.document.write(table); } make_table() < code>
Links
Lovely Scrapper
Lovely Scrapper
2 Jan 01, 2022
EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address
Introduction This is a project I built with the sole intent to learn more about
1 Jan 14, 2022
A Python library for automating interaction with websites.
Home page https://mechanicalsoup.readthedocs.io/ Overview A Python library for automating interaction with websites. MechanicalSoup automatically stor
4.3k Jan 07, 2023
Rottentomatoes, Goodreads and IMDB sites crawler. Semantic Web final project.
Crawler Rottentomatoes, Goodreads and IMDB sites crawler. Crawler written by beautifulsoup, selenium and lxml to gather books and films information an
1 Dec 30, 2021
Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.
NewsScraper A simple Python 3 module to get crypto or news articles and their content from various RSS feeds. 🔧 Installation Clone the repo locally.
3 Jan 02, 2022
The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.
The open-source web scrapers that feed the Los Angeles Times' California coronavirus tracker. Processed data ready for analysis is available at datade
51 Dec 14, 2022
一些爬虫相关的签名、验证码破解
cracking4crawling 一些爬虫相关的签名、验证码破解,目前已有脚本: 小红书App接口签名(shield)(2020.12.02) 小红书滑块(数美)验证破解(2020.12.02) 海南航空App接口签名(hnairSign)(2020.12.05) 说明: 脚本按目标网站、App命
90 Feb 09, 2021
Web Scraping Instagram photos with Selenium by only using a hashtag.
Web-Scraping-Instagram This project is used to automatically obtain images by web scraping Instagram with Selenium in Python. The required input will
3 Nov 24, 2022
A modern CSS selector implementation for BeautifulSoup
Soup Sieve Overview Soup Sieve is a CSS selector library designed to be used with Beautiful Soup 4. It aims to provide selecting, matching, and filter
151 Dec 23, 2022
A Python Covid-19 cases tracker that scrapes data off the web and presents the number of Cases, Recovered Cases, and Deaths that occurred because of the pandemic.
A Python Covid-19 cases tracker that scrapes data off the web and presents the number of Cases, Recovered Cases, and Deaths that occurred because of the pandemic.
1 Nov 13, 2021
a small library for extracting rich content from urls
A small library for extracting rich content from urls. what does it do? micawber supplies a few methods for retrieving rich metadata about a variety o
588 Dec 27, 2022
Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc)
Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc).
6 Aug 26, 2022
Grab the changelog from releases on Github
release-notes-scraper This simple script can be used to grab the release notes for projects from github that do not keep a CHANGELOG, but publish thei
4 Apr 01, 2022
Extract gene TSS site form gencode/ensembl/gencode database GTF file and export bed format file.
GetTss python Package extract gene TSS site form gencode/ensembl/gencode database GTF file and export bed format file. Install $ pip install GetTss Us
6 Nov 21, 2022
LSpider 一个为被动扫描器定制的前端爬虫
LSpider LSpider - 一个为被动扫描器定制的前端爬虫 什么是LSpider? 一款为被动扫描器而生的前端爬虫~ 由Chrome Headless、LSpider主控、Mysql数据库、RabbitMQ、被动扫描器5部分组合而成。
321 Dec 12, 2022
Web scrapping
Project Setup Table of Contents Project Setup Table of Contents Run project locally Install Requirements Run script Run project locally Install Requir
3 Feb 04, 2022
News, full-text, and article metadata extraction in Python 3. Advanced docs:
Newspaper3k: Article scraping & curation Inspired by requests for its simplicity and powered by lxml for its speed: "Newspaper is an amazing python li
12.3k Jan 07, 2023
Crawl BookCorpus
These are scripts to reproduce BookCorpus by yourself.
590 Jan 03, 2023
基于Github Action的定时HITsz疫情上报脚本,开箱即用
HITsz Daily Report 基于 GitHub Actions 的「HITsz 疫情系统」访问入口 定时自动上报脚本,开箱即用。 感谢 @JellyBeanXiewh 提供原始脚本和 idea。 感谢 @bugstop 对脚本进行重构并新增 Easy Connect 校内代理访问。
56 Nov 27, 2022
A web scraper that exports your entire WhatsApp chat history.
WhatSoup 🍲 A web scraper that exports your entire WhatsApp chat history. Table of Contents Overview Demo Prerequisites Instructions Frequen
87 Jan 06, 2023