NewsScraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Last update: Jan 02, 2022

Overview

NewsScraper

A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

🔧 Installation

Clone the repository locally, then use the package manager pip to install the requirements:

pip install -r requirements.txt

✨ Basic Usage

import NewsScraper

all_data = NewsScraper.fetch_all()
news_data = NewsScraper.fetch_news_data()
crypto_data = NewsScraper.fetch_crypto_data()

fetch_all()

Returns a set of NewsScraper.Result objects fetched from all available RSS feeds.

Can include categories: GLOBAL, US, EU, CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

fetch_news_data()

Returns a set of NewsScraper.Result objects fetched from the CNN, ABC News, Yahoo News, and Fox News RSS feeds.

Can include categories: GLOBAL, US, EU.

fetch_crypto_data()

Returns a set of NewsScraper.Result objects fetched from the CoinJournal and Crypto Currency News RSS feeds.

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.
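Because each fetch function returns an unordered set, it is often convenient to group the results by their context attribute before using them. The following is a minimal sketch of that idea. The Result dataclass and the group_by_context helper are hypothetical stand-ins written for this example (they are not part of NewsScraper), so that the snippet runs without network access.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    """Hypothetical stand-in for NewsScraper.Result (same four attributes)."""
    context: str
    title: str
    summary: str
    content: str


def group_by_context(results):
    """Group results into {category: [Result, ...]}, normalizing case."""
    grouped = defaultdict(list)
    for result in results:
        grouped[result.context.upper()].append(result)
    return dict(grouped)


# In real use this would be: results = NewsScraper.fetch_all()
results = {
    Result("BTC", "Bitcoin update", "", "Full text A"),
    Result("US", "Election news", "Short summary", "Full text B"),
    Result("BTC", "Mining report", "Another summary", "Full text C"),
}

grouped = group_by_context(results)
btc_titles = sorted(r.title for r in grouped["BTC"])
print(btc_titles)
```

The case normalization is a precaution only; it assumes nothing about how the module itself capitalizes category names.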

🔨 Advanced Usage

NewsScraper.Result class

A class used to represent a returned article.

Attributes

context : str

A string describing the category of the article.

ex. "GLOBAL", "US", "BLOCKCHAIN", "BTC".

title : str

A string containing the name of the article.

summary : str

A string containing the summary of the article.

NOTE: this can be an empty string ("") when the RSS feed does not provide a summary.

content : str

A string containing the content of the article.
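Since summary may be an empty string, code that displays articles usually needs a fallback. The sketch below shows one way to handle that by falling back to a truncated content. The preview helper and the Result stand-in are hypothetical names introduced for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    """Hypothetical stand-in for NewsScraper.Result (same four attributes)."""
    context: str
    title: str
    summary: str
    content: str


def preview(result, max_chars=80):
    """Return the summary, or the start of the content when the summary is empty."""
    text = result.summary.strip() or result.content.strip()
    if len(text) <= max_chars:
        return text
    # Reserve three characters for the trailing ellipsis.
    return text[: max_chars - 3].rstrip() + "..."


with_summary = Result("US", "Headline", "A short summary.", "Long body text")
without_summary = Result("BTC", "Headline", "", "Body used as the fallback")

print(preview(with_summary))
print(preview(without_summary))
```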

Methods

Result.json()

Returns a dictionary containing the attributes of the instance, ready to be serialized as JSON.

ex.

{
  "context": "global",
  "title": "title of the article",
  "summary": "summary of the article",
  "content": "content of the article"
}
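Since json() returns a dictionary, a whole set of results can be written out with the standard json module. This is a minimal sketch under stated assumptions: the Result class here is a hypothetical stand-in whose json() simply returns its four attributes as a dict, and dump_results is a helper name invented for this example.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class Result:
    """Hypothetical stand-in for NewsScraper.Result, so this runs offline."""
    context: str
    title: str
    summary: str
    content: str

    def json(self):
        # Assumption: mirrors the documented json() by returning a plain dict.
        return asdict(self)


def dump_results(results):
    """Serialize an iterable of results to a JSON array string."""
    # Sets are unordered, so sort by title to get a stable output.
    records = sorted((r.json() for r in results), key=lambda rec: rec["title"])
    return json.dumps(records, indent=2, ensure_ascii=False)


# In real use this would be: results = NewsScraper.fetch_crypto_data()
results = {
    Result("ETH", "Ethereum upgrade", "", "Full text"),
    Result("BTC", "Bitcoin update", "Short summary", "Full text"),
}

payload = dump_results(results)
print(payload)
```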

News RSS Feeds

Each of these functions returns a set of NewsScraper.Result objects fetched from the corresponding RSS feed.

fetch_abc()
fetch_cnn()
fetch_yahoo()
fetch_fox_news()

Can include categories: GLOBAL, US, EU.

Alternatively, you can use fetch_news_data() to receive results from all of them.
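When only some of the feeds are needed, the individual functions can be called and their sets merged. The sketch below illustrates the pattern with a hypothetical fetch_from helper. The fake_* functions are stand-ins for the real fetch functions, and the error handling is an assumption: this README does not say whether the real functions raise on network failures.

```python
def fetch_from(fetchers):
    """Call each zero-argument fetch function and merge the returned sets."""
    merged = set()
    for fetch in fetchers:
        try:
            merged |= set(fetch())
        except Exception as exc:
            # Assumption: one failing feed should not abort the others.
            print(f"{fetch.__name__} failed: {exc}")
    return merged


# Hypothetical stand-ins for NewsScraper.fetch_abc, NewsScraper.fetch_cnn, etc.
def fake_abc():
    return {"abc article 1", "shared article"}


def fake_cnn():
    return {"cnn article 1", "shared article"}


def fake_broken():
    raise ConnectionError("feed unreachable")


# In real use: fetch_from([NewsScraper.fetch_abc, NewsScraper.fetch_cnn])
articles = fetch_from([fake_abc, fake_broken, fake_cnn])
print(sorted(articles))
```

Merging into a set removes exact duplicates; whether two Result objects for the same article compare equal depends on how the class is implemented, which is not documented here.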

Crypto RSS Feeds

Each of these functions returns a set of NewsScraper.Result objects fetched from the corresponding RSS feed.

fetch_coinjournal()
fetch_cryptocurrencynews()

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

Alternatively, you can use fetch_crypto_data() to receive results from all of them.

🤝 Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

📝 License

This project is licensed under the MIT license.

Owner

Rokas
