The repo for mlbtradetrees.com. Analyze any trade in baseball history!

Last update: Nov 20, 2022

MLB Trade Trees

2.0.0 Release: November 24, 2021

www.mlbtradetrees.com allows you to view the trade tree of any player in MLB history.

What is a trade tree?

A trade tree shows everything a team ultimately received from a trade: the players it got back, plus whatever those players were later traded for. Let's use Hall of Fame candidate Cliff Lee for some examples, as he was traded multiple times throughout his career.

Here is the simplest form of his tree:

Cliff Lee was traded to the Mariners in 2009, and the Phillies received 3 players in return. All players the Phillies received in return either retired or became free agents, ending the tree with them.

Let's take a look at a more complicated example:

We can see the Mariners traded away Cliff Lee in 2010, receiving 4 players in return. Two players' lines end due to free agency and a waiver claim. The other two players' lines continue because they were traded away the next year. Some of the players received in those trades see their lines end, while others are traded away in turn, so the tree grows. The tree finally ends in 2014, when the last remaining player hits free agency.

Some of these trees can get pretty massive, spanning decades and dozens of trades. An example is Harry Simpson.
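The way a tree grows can be sketched in Python as a simple recursion. This is only a minimal illustration, not the site's actual code: the `Trade` and `Node` classes, the `build_tree` function, and the player names are all hypothetical, and the trade list is assumed to be sorted chronologically.

```python
from __future__ import annotations
from dataclasses import dataclass, field

@dataclass
class Trade:
    year: int
    from_team: str          # team that traded the player away
    player_out: str
    players_in: list[str]   # players from_team received in return

@dataclass
class Node:
    player: str
    outcome: str = "line ends"   # free agency, waivers, retirement, ...
    children: list[Node] = field(default_factory=list)

def build_tree(player, team, trades, start=0):
    """Follow `player` while he belongs to `team`.

    If the team later trades him, every player received in return
    becomes a child node, and each child is followed the same way.
    Only trades at index >= start are considered, so the recursion
    always moves forward in time and terminates.
    """
    node = Node(player)
    for i in range(start, len(trades)):
        t = trades[i]
        if t.player_out == player and t.from_team == team:
            node.outcome = f"traded in {t.year}"
            node.children = [build_tree(p, team, trades, i + 1)
                             for p in t.players_in]
            break
    return node

def render(node, depth=0):
    """Return the tree as a list of indented text lines."""
    lines = ["  " * depth + f"{node.player} ({node.outcome})"]
    for child in node.children:
        lines.extend(render(child, depth + 1))
    return lines

# Made-up trades, for illustration only.
trades = [
    Trade(2010, "SEA", "Ace", ["P1", "P2", "P3", "P4"]),
    Trade(2011, "SEA", "P1", ["P5"]),
    Trade(2011, "SEA", "P2", ["P6", "P7"]),
    Trade(2012, "SEA", "P6", ["P8"]),
]
tree = build_tree("Ace", "SEA", trades)
print("\n".join(render(tree)))
```

A player with no later trade by the same team is a leaf, which corresponds to a line ending in the diagrams.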

The Database

The transaction, team and player databases are thanks to Retrosheet. I will only update transactions when Retrosheet updates its database.

I have made some adjustments to the database that allow the search to run more smoothly:

Transaction database (data/sorted_transactions_final.csv)

NaN (missing) players involved in trades were changed to "PTBNL/Cash" (PTBNL stands for player to be named later). Most of the time, when you see this in a tree, it is a cash transaction.
Transactions in which a player was released or granted free agency and then signed back with the same team as his next transaction were deleted, as they caused trees to end prematurely.
Franchise tags were added to the database to ensure that a team name change doesn't end a tree.
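The first two adjustments above can be sketched roughly as follows. This is an illustration under stated assumptions, not the script used to build data/sorted_transactions_final.csv: the column names (`player`, `team`, `type`) and the type values (`release`, `free_agency`, `sign`, `trade`) are hypothetical, rows are assumed to be in chronological order, and the sketch assumes both the release and the matching re-signing are dropped. It uses plain Python rather than pandas to stay dependency-free.

```python
PLACEHOLDER = "PTBNL/Cash"
LEAVING = {"release", "free_agency"}   # hypothetical type values

def drop_resign_pairs(rows):
    """Drop a release / free-agency row when the same player's next
    transaction is a signing with the same team, plus that signing."""
    by_player = {}
    for i, row in enumerate(rows):
        by_player.setdefault(row["player"], []).append(i)

    drop = set()
    for idxs in by_player.values():
        # Compare each transaction with the player's next one.
        for a, b in zip(idxs, idxs[1:]):
            left, back = rows[a], rows[b]
            if (left["type"] in LEAVING
                    and back["type"] == "sign"
                    and left["team"] == back["team"]):
                drop.update((a, b))
    return [row for i, row in enumerate(rows) if i not in drop]

def fill_missing_players(rows):
    """Replace a missing player (None or empty string) with the placeholder."""
    return [{**row, "player": row["player"] or PLACEHOLDER} for row in rows]

# Made-up rows, for illustration only.
rows = [
    {"player": "Player A", "team": "SEA", "type": "trade"},
    {"player": "",         "team": "SEA", "type": "trade"},
    {"player": "Player B", "team": "SEA", "type": "free_agency"},
    {"player": "Player B", "team": "SEA", "type": "sign"},
    {"player": "Player C", "team": "SEA", "type": "release"},
    {"player": "Player C", "team": "TEX", "type": "sign"},
]
cleaned = fill_missing_players(drop_resign_pairs(rows))
for row in cleaned:
    print(row)
```

Player B's free agency and re-signing with the same team are removed, so his line would not end early. Player C signs with a different team, so his release is kept.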

Team database (data/teams.csv)

All teams in the database that belong to the same franchise received a shared franchise tag. Independent teams received a unique franchise code.
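As a rough sketch of why the tag helps, team codes can be compared through their franchise tag instead of directly. The mapping, the codes and the helper names below are illustrative assumptions and are not taken from data/teams.csv. The example uses the Montreal Expos, who became the Washington Nationals.

```python
# Hypothetical team code -> franchise tag mapping.
FRANCHISE = {
    "MON": "WSN",   # Montreal Expos
    "WAS": "WSN",   # Washington Nationals (same franchise)
    "SEA": "SEA",
}

def franchise_of(team):
    """Return the franchise tag for a team code.

    Assumption: a team missing from the mapping is treated as
    independent and uses its own code as its unique franchise code.
    """
    return FRANCHISE.get(team, team)

def same_franchise(team_a, team_b):
    """True when two team codes belong to the same franchise."""
    return franchise_of(team_a) == franchise_of(team_b)

print(same_franchise("MON", "WAS"))
print(same_franchise("MON", "SEA"))
```

Matching on the franchise tag lets a tree that starts under one team name continue through trades made under a later name.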

Player database (data/teams.csv)

Nothing changed; I just made a copy with each player's full name to make it easy to match the user input. (static/css/searchable_players.csv)

Installing Locally

If you want to run the website locally:

install flask
install pandas
install JSGlue (allows Jinja to work in a js file)

Run server.py

What am I working on?

Updated Nov. 24 2021

Some players don't display properly due to having very old teams not listed in the teams database. Usually these are players before 1920. I just need to update the transactions database to find all teams without the franchise tag.
Adding stat support with pybaseball. I'd like to add the total WAR contributed by the players in a trade on the tree.
Searching for and filtering trees based on team, year, players in a tree, length of trees, etc.
Various UI enhancements, like clickable nodes to get a player's tree and collapsible nodes for easier readability.
