Jupyter notebook and datasets from the pandas Q&A video series

Overview

Python pandas Q&A video series

Read about the series, and view all of the videos on one page: Easier data analysis in Python with pandas.

Jupyter Notebooks

Videos (playlist)

  1. What is pandas? (Introduction to the Q&A series) (6:24)
  2. How do I read a tabular data file into pandas? (8:54)
  3. How do I select a pandas Series from a DataFrame? (11:10)
  4. Why do some pandas commands end with parentheses (and others don't)? (8:45)
  5. How do I rename columns in a pandas DataFrame? (9:36)
  6. How do I remove columns from a pandas DataFrame? (6:35)
  7. How do I sort a pandas DataFrame or a Series? (8:56)
  8. How do I filter rows of a pandas DataFrame by column value? (13:44)
  9. How do I apply multiple filter criteria to a pandas DataFrame? (9:51)
  10. Your pandas questions answered! (9:06)
  11. How do I use the "axis" parameter in pandas? (8:33)
  12. How do I use string methods in pandas? (6:16)
  13. How do I change the data type of a pandas Series? (7:28)
  14. When should I use a "groupby" in pandas? (8:24)
  15. How do I explore a pandas Series? (9:50)
  16. How do I handle missing values in pandas? (14:27)
  17. What do I need to know about the pandas index? (Part 1) (13:36)
  18. What do I need to know about the pandas index? (Part 2) (10:38)
  19. How do I select multiple rows and columns from a pandas DataFrame? (21:46)
  20. When should I use the "inplace" parameter in pandas? (10:18)
  21. How do I make my pandas DataFrame smaller and faster? (19:05)
  22. How do I use pandas with scikit-learn to create Kaggle submissions? (13:25)
  23. More of your pandas questions answered! (19:23)
  24. How do I create dummy variables in pandas? (13:13)
  25. How do I work with dates and times in pandas? (10:20)
  26. How do I find and remove duplicate rows in pandas? (9:47)
  27. How do I avoid a SettingWithCopyWarning in pandas? (13:29)
  28. How do I change display options in pandas? (14:55)
  29. How do I create a pandas DataFrame from another object? (14:25)
  30. How do I apply a function to a pandas Series or DataFrame? (17:57)
  31. Bonus: How do I use the MultiIndex in pandas? (25:00)
  32. Bonus: How do I merge DataFrames in pandas? (21:48)
  33. Bonus: 4 new time-saving tricks in pandas (14:50)
  34. Bonus: 5 new changes in pandas you need to know about (20:54)
  35. Bonus: My top 25 pandas tricks (27:37)
  36. Bonus: Data Science Best Practices with pandas (PyCon 2019) (1:44:16)
  37. Bonus: Your pandas questions answered! (webcast) (1:56:01)

Datasets

| Filename | Description | Raw File | Original Source | Other |
|---|---|---|---|---|
| chipotle.tsv | Online orders from the Chipotle restaurant chain | bit.ly/chiporders | The Upshot | Upshot article |
| drinks.csv | Alcohol consumption by country | bit.ly/drinksbycountry | FiveThirtyEight | FiveThirtyEight article |
| imdb_1000.csv | Top rated movies from IMDb | bit.ly/imdbratings | IMDb | Web scraping script |
| stocks.csv | Small dataset of stock prices | bit.ly/smallstocks | DataCamp | |
| titanic_test.csv | Testing set from Kaggle's Titanic competition | bit.ly/kaggletest | Kaggle | Data dictionary |
| titanic_train.csv | Training set from Kaggle's Titanic competition | bit.ly/kaggletrain | Kaggle | Data dictionary |
| u.data | Movie ratings by MovieLens users | bit.ly/movielensdata | GroupLens | Data dictionary |
| u.item | Movie information from MovieLens | bit.ly/movieitems | GroupLens | Data dictionary |
| u.user | Demographic information about MovieLens users | bit.ly/movieusers | GroupLens | Data dictionary |
| ufo.csv | Reports of UFO sightings from 1930-2000 | bit.ly/uforeports | National UFO Reporting Center | Web scraping script |
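
The datasets listed above are plain delimited text files, so they can be loaded with `pandas.read_csv`. The sketch below is illustrative only: the helper name `read_table` and the in-memory sample rows are made up for this example (they stand in for a tab-separated file such as chipotle.tsv so the snippet runs without a download), and the choice of separator for each real file is an assumption you should check against the file itself.

```python
import io

import pandas as pd


def read_table(source, sep=","):
    """Read a delimited text file (path, URL, or file-like object) into a DataFrame.

    Hypothetical helper for illustration; it only wraps pandas.read_csv.
    """
    return pd.read_csv(source, sep=sep)


# Made-up sample rows standing in for a tab-separated file.
# The column names here are placeholders, not taken from the real dataset.
sample_tsv = io.StringIO(
    "order_id\tquantity\titem_name\n"
    "1\t1\tChips\n"
    "1\t2\tSoda\n"
)

orders = read_table(sample_tsv, sep="\t")
print(orders.head())
```

For a real file, pass a local path or one of the raw file links from the table as `source`, adjusting `sep` (and any other `read_csv` options) to match the file's layout.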