Web Scraping with Python

This project is a script that lets the user search the G1 website for the latest news about any term. It is written in Python and relies on three libraries:

selenium - Used to automate the browser and retrieve the content of the web page.
bs4 (BeautifulSoup) - Used to parse and manipulate the HTML content.
pandas - Used to build and export a dataframe with the collected information.
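The way the three libraries fit together can be sketched roughly as follows. This is a minimal illustration, not the project's actual main.py: the function names, the `a.result-link` selector, and the sample markup are all hypothetical, and G1's real page structure will differ. The selenium call assumes a Chrome driver is available on the machine.

```python
from bs4 import BeautifulSoup
import pandas as pd

# Assumption: illustrative selector only, not G1's real markup.
RESULT_SELECTOR = "a.result-link"


def fetch_html(url):
    """Open the page in Chrome through selenium and return the rendered HTML."""
    # Imported lazily so the parsing half can be tried without a browser.
    from selenium import webdriver

    driver = webdriver.Chrome()
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()


def parse_results(html):
    """Extract title/link pairs from the HTML and return them as a dataframe."""
    soup = BeautifulSoup(html, "html.parser")
    rows = [
        {"title": a.get_text(strip=True), "link": a.get("href")}
        for a in soup.select(RESULT_SELECTOR)
    ]
    return pd.DataFrame(rows, columns=["title", "link"])


# Hypothetical sample standing in for a downloaded search page.
SAMPLE_HTML = """
<ul>
  <li><a class="result-link" href="https://example.com/news/1">First headline</a></li>
  <li><a class="result-link" href="https://example.com/news/2">Second headline</a></li>
</ul>
"""

if __name__ == "__main__":
    print(parse_results(SAMPLE_HTML))
```

In a real run, the string passed to `parse_results` would come from `fetch_html` with the search URL for the chosen term.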

💻 Prerequisites

Before you begin, make sure you meet the following requirements:

Windows, Linux, or macOS.
Python installed on your machine.
Google Chrome installed on your machine, version 97.0.4692.71.
An Internet connection.

💻 Running

Install the required packages:

$ pip install -r requirements.txt

Run main.py and wait a few seconds. An XLSX spreadsheet and a CSV file containing the results will be generated.

License

MIT

Free Software, Hell Yeah!

Web Scraping Project - G1 Latest News

Owner

Eduardo Henrique
