List of Implementations:

Currently, the reimplementation of the DeepAR paper(DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks https://arxiv.org/abs/1704.04110) is available in PyTorch. More papers will be coming soon.

Authors:

Yunkai Zhang([email protected]) - University of California, Santa Barbara
Qiao Jiang - Brown University
Xueying Ma - Columbia University
Acknowledgement: Professor Xifeng Yan's group at UC Santa Barbara. Part of the work was done at WeWork.

To run:

Install all dependencies listed in requirements.txt. Note that the model has only been tested in the versions shown in the text file.
Download the dataset and preprocess the data:
```
python preprocess_elect.py
```
Start training:
```
python train.py
```
- If you want to perform ancestral sampling,
```
python train.py --sampling
```
- If you do not want to do normalization during evaluation,
```
python train.py --relative-metrics
```
Evaluate a set of saved model weights:
```
python evaluate.py
```
Perform hyperparameter search:
```
 python search_params.py
```

Results

The model is evaluated on the electricity dataset, which contains the electricity consumption of 370 households from 2011 to 2014. Under hourly frequency, we use the first week of September, 2014 as the test set and all time steps prior to that as the train set. Following the experiment design in DeepAR, the window size is chosen to be 192, where the last 24 is the forecasting horizon. History (number of time steps since the beginning of each household), month of the year, day of the week, and hour of the day are used as time covariates. Notice that some households started at different times, so we only use windows that contain non-missing values.

Under Gaussian likelihood, we use the Adam optimizer with early stopping to train the model for 20 epoches. The same set of hyperparameters is used as outlined in the paper. Weights with the best ND value is selected, where ND = 0.06349, RMSE = 0.452, rou90 = 0.034 and rou50 = 0.063.

Sample results on electricity. The top 10 plots are sampled from the test set with the highest 10% ND values, whereas the bottom 10 plots are sampled from the rest of the test set.

Implementation of deep learning models for time series in PyTorch.

Related tags

Overview

List of Implementations:

Authors:

To run:

Results

Owner

Yunkai Zhang

An AutoML survey focusing on practical systems.

Bayesian optimization in JAX

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

A Python Module That Uses ANN To Predict A Stocks Price And Also Provides Accurate Technical Analysis With Many High Potential Implementations!

Pyomo is an object-oriented algebraic modeling language in Python for structured optimization problems.

Titanic Traveller Survivability Prediction

A scikit-learn based module for multi-label et. al. classification

pandas, scikit-learn, xgboost and seaborn integration

Official code for HH-VAEM

Hierarchical Time Series Forecasting using Prophet

A framework for building (and incrementally growing) graph-based data structures used in hierarchical or DAG-structured clustering and nearest neighbor search

Python module for machine learning time series:

BioPy is a collection (in-progress) of biologically-inspired algorithms written in Python

MiniTorch - a diy teaching library for machine learning engineers

A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.

MLOps pipeline project using Amazon SageMaker Pipelines

Markov bot - A Writing bot based on Markov Chain for Data Structure Lab

The project's goal is to show a real world application of image segmentation using k means algorithm

My capstone project for Udacity's Machine Learning Nanodegree