Parse feeds in Python

Last update: Dec 30, 2022

Related tags

Web Crawling, feedparser

Overview

feedparser - Parse Atom and RSS feeds in Python.

feedparser is open source. See the LICENSE file for more information.

Installation

feedparser can be installed by running pip:

$ pip install feedparser
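
One way to confirm that the install succeeded is to ask Python for the installed version. This is a minimal sketch using the standard library's importlib.metadata (Python 3.8 or later); the installed_version() helper is a hypothetical name, not part of feedparser.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("feedparser")
    if found is None:
        print("feedparser is not installed in this environment")
    else:
        print(f"feedparser {found} is installed")
```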

Documentation

The feedparser documentation is available on the web at:

https://feedparser.readthedocs.io/en/latest/

It is also included in its source format, ReST, in the docs/ directory. To build the documentation you'll need the Sphinx package, which is available at:

https://www.sphinx-doc.org/

You can then build HTML pages using a command similar to:

$ sphinx-build -b html docs/ fpdocs

This will produce HTML documentation in the fpdocs/ directory.

Testing

Feedparser has an extensive test suite, powered by tox. To run it, type this:

$ python -m venv venv
$ source venv/bin/activate  # or "venv\Scripts\Activate.ps1" on Windows
(venv) $ python -m pip install --upgrade pip
(venv) $ python -m pip install poetry
(venv) $ poetry update
(venv) $ tox

This will spawn an HTTP server that will listen on port 8097. The tests will fail if that port is in use.
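
Because a busy port makes the tests fail, it can help to check port 8097 before running tox. The sketch below tries to bind the port and reports whether that worked. The port_is_free() helper is a hypothetical name, and checking only 127.0.0.1 is an assumption, since the text above does not say which interface the test server binds to.

```python
import socket

TEST_SERVER_PORT = 8097  # port named in the Testing section


def port_is_free(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if a TCP socket can currently bind to (host, port)."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            # Typically "address already in use": something else holds the port.
            return False
    return True


if __name__ == "__main__":
    if port_is_free(TEST_SERVER_PORT):
        print(f"Port {TEST_SERVER_PORT} looks free; the test server should be able to start.")
    else:
        print(f"Port {TEST_SERVER_PORT} is in use; free it before running tox.")
```

The check is only a snapshot: another process could still take the port between this check and the start of the test run.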

Owner

Kurt McKee

Cryptocurrency scraping

Simple library for exploring/scraping the web or testing a website you’re developing

👨🏼‍⚖️ reddit bot that turns comment chains into ace attorney scenes

Scrapes all articles and their headlines from theonion.com

The first public repository that provides a free BUBT website scraping API script on GitHub.

Scrape all the media from an OnlyFans account - Updated regularly

OSTA web scraper, for checking the status of school buses in Ottawa

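Uses Python to crawl the job-placement websites of several major universities in Jiangsu and notifies users in one of three ways: a WeChat message, direct command-line output, or a Windows balloon notification.

Latest optimized version of a JD.com Moutai purchase bot (JD.com Moutai flash sale), with an optimized purchase process queue.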

HappyScrapper - Google News web scraper with Python

Simple proxy scraper made using ProxyScrape's API.

Visual scraping for Scrapy

Transistor, a Python web scraping framework for intelligent use cases.

Web crawling framework based on asyncio.

Script to scrape direct download links (DDLs) from a Google Drive index.

A command-line program to download media, like and unlike posts, and more from creators on OnlyFans.

Screen scraping and web crawling framework

A simple reddit scraper to get memes (only images) from r/ProgrammerHumor.

This tool crawls a list of websites and downloads all PDF and office documents

An IpVanish Proxies Scraper