webScrap

WebScraping first step.

Authors: Paulo, Claudio M.

First steps in Web Scraping. Project carried out for training in Web Scrapping. The export of information to a structured database (Pandas DataFrame) where the information was obtained by making a request() call from pages with known addresses. Find the information in the 'lxml' code formatted by BeautfullSoup, and finally exported in csv format.

How to automate the search for related words in OLX ads.
Can I use quartile analysis to find the best product at the best price?

Our Plan

Select the list of related words.
Use requests to download the page.
Use BSsoup to format the downloaded page in lxml.
Create a structured database with date and time of posting, ad title, product value, city and neighborhood where it is being advertised.
Filter the database by removing ads whose ad title does not contain the desired words.
Use the percentile and average value metric to find the average price of advertisements by cities (of Brazilian states).

Current progress

Data scraping was carried out and the database was created to analyze the average value by city.

Database formed by information in OLX Brasil website advertisements.

The code is with variables and comments in Portuguese, and the search for advertisements is carried out with words in the Portuguese language.

Web Scraping OLX with Python and Bsoup.

Related tags

Overview

webScrap

WebScraping first step.

Authors: Paulo, Claudio M.

Our Plan

Current progress

References

Owner

claudio paulo

Rottentomatoes, Goodreads and IMDB sites crawler. Semantic Web final project.

Using Selenium with Python to Web Scrap Popular Youtube Tech Channels.

A web crawler for recording posts in "sina weibo"

Scrap-mtg-top-8 - A top 8 mtg scraper using python

This tool can be used to extract information from any website

This tool crawls a list of websites and download all PDF and office documents

Script used to download data for stocks.

FilmMikirAPI - A simple rest-api which is used for scrapping on the Kincir website using the Python and Flask package

Web scraper for Zillow

Telegram group scraper tool

Basic-html-scraper - A complete how to of web scraping with Python for beginners

Scraping followers of an instagram account

Lovely Scrapper

Collection of code files to scrap different kinds of websites.

Web Content Retrieval for Humans™

An experiment to deploy a serverless infrastructure for a scrapy project.

A python tool to scrape NFT's off of OpenSea

Simple tool to scrape and download cross country ski timings and results from live.skidor.com

Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc)

New World Market Scraper