Python module for machine learning time series:

Last update: Dec 29, 2022

Overview

seglearn

Seglearn is a Python package for machine learning on time series or sequences. It provides an integrated pipeline for segmentation, feature extraction, feature processing, and a final estimator. Seglearn offers a flexible approach to multivariate time series and related contextual (meta) data for classification, regression, and forecasting problems. Support and examples are provided for learning time series with classical machine learning and deep learning models. It is compatible with scikit-learn.
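To make the segmentation and feature extraction stages concrete, here is a minimal sketch in plain Python. It does not use seglearn's API: the function names (sliding_windows, window_features, segment_and_featurize) and the choice of statistics are hypothetical, chosen only to illustrate how a series can be cut into fixed-width windows and summarized as one feature row per window.

```python
from statistics import mean, pstdev


def sliding_windows(series, width, step):
    """Split a sequence into fixed-width windows, advancing by `step` samples."""
    if width <= 0 or step <= 0:
        raise ValueError("width and step must be positive")
    # Only full windows are kept; a trailing partial window is dropped.
    return [series[i:i + width] for i in range(0, len(series) - width + 1, step)]


def window_features(window):
    """Summarize one window with a few simple statistics (illustrative choice)."""
    return [mean(window), pstdev(window), min(window), max(window)]


def segment_and_featurize(series, width, step):
    """Segmentation followed by feature extraction: one feature row per window."""
    return [window_features(w) for w in sliding_windows(series, width, step)]


# Hypothetical univariate series with 8 samples.
series = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
rows = segment_and_featurize(series, width=4, step=2)
```

Each row is a fixed-length feature vector, so the resulting table has the shape that a scikit-learn style estimator expects as input. How seglearn itself implements these stages is described in its documentation.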

Documentation

Installation instructions, API documentation, and examples can be found in the documentation.

Dependencies

seglearn is tested to work under Python 3.5. The dependency requirements are based on the last scikit-learn release:

scipy (>=0.17.0)
numpy (>=1.11.0)
scikit-learn (>=0.21.3)
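One way to confirm that an environment meets these minimums is sketched below. It is an illustration, not part of seglearn: the helper names (version_tuple, check_minimums) are hypothetical, and it assumes Python 3.8 or newer, where importlib.metadata is in the standard library.

```python
from importlib import metadata

# Minimum versions copied from the dependency list above.
MINIMUMS = {"scipy": "0.17.0", "numpy": "1.11.0", "scikit-learn": "0.21.3"}


def version_tuple(version):
    """Turn '1.11.0' into (1, 11, 0), ignoring any non-numeric suffix such as 'rc1'."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    # Pad so that '1.11' compares equal to '1.11.0'.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def check_minimums(minimums=MINIMUMS):
    """Return {package: (installed_version_or_None, meets_minimum)}."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = (None, False)
            continue
        report[name] = (installed, version_tuple(installed) >= version_tuple(minimum))
    return report


if __name__ == "__main__":
    for name, (installed, ok) in check_minimums().items():
        print(name, installed or "not installed", "OK" if ok else "needs attention")
```

The comparison is deliberately simple and only looks at the leading numeric components, so pre-release or local version tags are ignored.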

Additionally, to run the examples, you need:

matplotlib (>=2.0.0)
keras (>=2.1.4) for the neural network examples
pandas

In order to run the test cases, you need:

pytest

The neural network examples were tested on keras using the tensorflow-gpu backend, which is recommended.

Installation

seglearn is currently available on PyPI, and you can install it via pip:

pip install -U seglearn

or, if you use Python 3:

pip3 install -U seglearn

If you prefer, you can clone the repository and install from source. Use the following commands to get a copy from GitHub and install seglearn along with its dependencies:

git clone https://github.com/dmbee/seglearn.git
cd seglearn
pip install .

Or install using pip and GitHub:

pip install -U git+https://github.com/dmbee/seglearn.git

Testing

After installation, you can use pytest to run the test suite from seglearn's root directory:

pytest

Change Log

Version history can be viewed in the Change Log.

Development

Development of this scikit-learn-contrib project follows the practices of the scikit-learn community, so you can refer to their Development Guide.

Please submit new pull requests on the dev branch, with unit tests and an example that demonstrates any new functionality or API changes.

Citing seglearn

If you use seglearn in a scientific publication, we would appreciate citations to the following paper:

@article{arXiv:1803.08118,
author  = {David Burns and Cari Whyne},
title   = {Seglearn: A Python Package for Learning Sequences and Time Series},
journal = {arXiv},
year    = {2018},
url     = {https://arxiv.org/abs/1803.08118}
}

If you use the seglearn test data in a scientific publication, we would appreciate citations to the following paper:

@article{arXiv:1802.01489,
author  = {David Burns and Nathan Leung and Michael Hardisty and Cari Whyne and Patrick Henry and Stewart McLachlin},
title   = {Shoulder Physiotherapy Exercise Recognition: Machine Learning the Inertial Signals from a Smartwatch},
journal = {arXiv},
year    = {2018},
url     = {https://arxiv.org/abs/1802.01489}
}

Owner

David Burns