Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Last update: Dec 01, 2021

opendata

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format.

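import asyncio
from opendata.sources.bikeshare.bay_wheels import trips as bay_wheels

# async_load is a coroutine, so asyncio.run drives it to completion.
# It returns two values; only the trips dataframe is used here.
trips_df, _ = asyncio.run(bay_wheels.async_load(trip_sample_rate=1000))

# Number of trips in the sampled dataframe
len(trips_df.index)
# 8731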

trips_df.columns
# Index(['started_at', 'ended_at', 'start_station_id', 'end_station_id',
#        'start_station_name', 'end_station_name', 'rideable_type', 'ride_id',
#        'start_lat', 'start_lng', 'end_lat', 'end_lng', 'gender', 'user_type',
#        'bike_id', 'birth_year'],
#       dtype='object')

An example analysis can be found here: https://observablehq.com/@brady/bikeshare
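
Because every market is normalized to the same column names, an analysis step can be written once and reused. The following is a minimal sketch of that idea, not part of opendata itself. It assumes `started_at` and `ended_at` hold timestamps or parseable timestamp strings (the README does not state their dtypes). The rows, station names, and `user_type` values are made up for illustration, and `add_duration_minutes` is a hypothetical helper.

```python
import pandas as pd

# Made-up rows mimicking a few of the standardized columns listed above.
# These are NOT real trips; they only illustrate the column layout.
trips_df = pd.DataFrame(
    {
        "started_at": ["2021-06-01 08:00:00", "2021-06-01 09:15:00"],
        "ended_at": ["2021-06-01 08:12:00", "2021-06-01 09:45:00"],
        "start_station_name": ["Station A", "Station B"],
        "user_type": ["member", "casual"],
    }
)


def add_duration_minutes(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with a duration_minutes column (hypothetical helper)."""
    out = df.copy()
    # Parse both columns defensively in case they arrive as strings.
    started = pd.to_datetime(out["started_at"])
    ended = pd.to_datetime(out["ended_at"])
    out["duration_minutes"] = (ended - started).dt.total_seconds() / 60
    return out


with_duration = add_duration_minutes(trips_df)
print(with_duration[["start_station_name", "duration_minutes"]])
```

The same helper should work on a dataframe from any supported market, as long as that market populates both timestamp columns.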

Supports sampling and local file caching to improve performance.
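
The README does not describe how sampling or caching are implemented. As a rough sketch of the general idea only, the following uses the standard library to cache a download on disk and to keep one row in every N. The names `cached_fetch`, `sample_every_nth`, and `fake_fetch` are hypothetical and are not opendata's API. Reading `trip_sample_rate=1000` as "keep 1 in 1000 trips" is likewise an assumption.

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Callable, List, Sequence


def cached_fetch(url: str, fetch: Callable[[str], bytes], cache_dir: Path) -> bytes:
    """Return the bytes for url, calling fetch only on a cache miss."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Hash the URL so it can be used as a safe file name.
    key = hashlib.sha256(url.encode("utf-8")).hexdigest()
    path = cache_dir / key
    if path.exists():
        return path.read_bytes()
    data = fetch(url)
    path.write_bytes(data)
    return data


def sample_every_nth(rows: Sequence, rate: int) -> List:
    """Keep one row out of every `rate` rows (assumed meaning of a sample rate)."""
    if rate < 1:
        raise ValueError("rate must be >= 1")
    return list(rows[::rate])


# Stand-in for a real HTTP download, so the sketch runs offline.
calls = []


def fake_fetch(url: str) -> bytes:
    calls.append(url)
    return b"ride_id,started_at\n1,2021-06-01 08:00:00\n"


with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp)
    first = cached_fetch("https://example.com/trips.csv", fake_fetch, cache)
    second = cached_fetch("https://example.com/trips.csv", fake_fetch, cache)

print(len(calls))  # the second call is served from the cache
print(sample_every_nth(list(range(10)), 3))
```

Caching avoids downloading the same archive again on later runs, and sampling reduces the number of rows that need to be parsed and held in memory.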

Markets supported

import opendata.sources.bikeshare.bay_wheels
import opendata.sources.bikeshare.bixi
import opendata.sources.bikeshare.divvy
import opendata.sources.bikeshare.capital_bikeshare
import opendata.sources.bikeshare.citi_bike
import opendata.sources.bikeshare.cogo
import opendata.sources.bikeshare.niceride
import opendata.sources.bikeshare.bluebikes
import opendata.sources.bikeshare.metro_bike_share
import opendata.sources.bikeshare.indego
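
Since the output format is shared, dataframes from several markets can in principle be combined into one. The sketch below shows one way this might be done with `asyncio.gather` and `pandas.concat`. It uses stand-in loaders so that it runs without opendata installed. `load_markets` and the `market` column are hypothetical, and it is an assumption that every market module exposes a loader like the `async_load` shown for Bay Wheels.

```python
import asyncio
from typing import Awaitable, Callable, Dict

import pandas as pd

Loader = Callable[[], Awaitable[pd.DataFrame]]


async def load_markets(loaders: Dict[str, Loader]) -> pd.DataFrame:
    """Run all loaders concurrently and stack the results (hypothetical helper)."""
    names = list(loaders)
    frames = await asyncio.gather(*(loaders[name]() for name in names))
    # Tag each row with the market it came from before concatenating.
    tagged = [df.assign(market=name) for name, df in zip(names, frames)]
    return pd.concat(tagged, ignore_index=True)


# Stand-in loaders with made-up data; a real loader would wrap a market's
# own loading coroutine and return only the trips dataframe.
async def fake_bay_wheels() -> pd.DataFrame:
    return pd.DataFrame({"ride_id": ["a", "b"]})


async def fake_divvy() -> pd.DataFrame:
    return pd.DataFrame({"ride_id": ["c"]})


combined = asyncio.run(
    load_markets({"bay_wheels": fake_bay_wheels, "divvy": fake_divvy})
)
print(combined)
```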

Bootstrap

Set up your environment

brew install chromedriver
brew install python3
python3 -m pip install pre-commit

pre-commit install --install-hooks
python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Entering virtualenv

python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Usage

Try the test export to CSV:

python3 test.py

Updating pip requirements

pip-compile

Pre-commit setup

pre-commit install --install-hooks

Owner

Brady Law
