Sequence learning toolkit for Python

Last update: Dec 27, 2022

Related tags

Overview

seqlearn

seqlearn is a sequence classification toolkit for Python. It is designed to extend scikit-learn and offer as similar as possible an API.

Compiling and installing

Get NumPy >=1.6, SciPy >=0.11, Cython >=0.20.2 and a recent version of scikit-learn. Then issue:

python setup.py install

to install seqlearn.

If you want to use seqlearn from its source directory without installing, you have to compile first:

python setup.py build_ext --inplace

Getting started

The easiest way to start using seqlearn is to fetch a dataset in CoNLL 2000 format. Define a task-specific feature extraction function, e.g.:

>>> def features(sequence, i):
...     yield "word=" + sequence[i].lower()
...     if sequence[i].isupper():
...         yield "Uppercase"
...

Load the training file, say train.txt:

>>> from seqlearn.datasets import load_conll
>>> X_train, y_train, lengths_train = load_conll("train.txt", features)

Train a model:

>>> from seqlearn.perceptron import StructuredPerceptron
>>> clf = StructuredPerceptron()
>>> clf.fit(X_train, y_train, lengths_train)

Check how well you did on a validation set, say validation.txt:

>>> X_test, y_test, lengths_test = load_conll("validation.txt", features)
>>> from seqlearn.evaluation import bio_f_score
>>> y_pred = clf.predict(X_test, lengths_test)
>>> print(bio_f_score(y_test, y_pred))

For more information, see the documentation.

Sequence learning toolkit for Python

Related tags

Overview

seqlearn

Compiling and installing

Getting started

Owner

Lars

Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores

Real-time domain adaptation for semantic segmentation

Probabilistic programming framework that facilitates objective model selection for time-varying parameter models.

Machine learning algorithms implementation

A chain of stores, 10 different stores and 50 different requests a 3-month demand forecast for its product.

Predicting India’s COVID-19 Third Wave with LSTM

This is a Cricket Score Predictor that predicts the first innings score of a T20 Cricket match using Machine Learning

Dragonfly is an open source python library for scalable Bayesian optimisation.

Deploy AutoML as a service using Flask

Greykite: A flexible, intuitive and fast forecasting library

Automatic extraction of relevant features from time series:

All-in-one web-based development environment for machine learning

Transpile trained scikit-learn estimators to C, Java, JavaScript and others.

onelearn: Online learning in Python

FLAML is a lightweight Python library that finds accurate machine learning models automatically, efficiently and economically

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

Nixtla is an open-source time series forecasting library.

A simple and lightweight genetic algorithm for optimization of any machine learning model

AutoOED: Automated Optimal Experiment Design Platform

Tribuo - A Java machine learning library

Sequence learning toolkit for Python

Related tags

Overview

seqlearn

Compiling and installing

Getting started

Owner

Lars

Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores

Real-time domain adaptation for semantic segmentation

Probabilistic programming framework that facilitates objective model selection for time-varying parameter models.

Machine learning algorithms implementation

A chain of stores, 10 different stores and 50 different requests a 3-month demand forecast for its product.

Predicting India’s COVID-19 Third Wave with LSTM

This is a Cricket Score Predictor that predicts the first innings score of a T20 Cricket match using Machine Learning

Dragonfly is an open source python library for scalable Bayesian optimisation.

Deploy AutoML as a service using Flask

﻿Greykite: A flexible, intuitive and fast forecasting library

Automatic extraction of relevant features from time series:

All-in-one web-based development environment for machine learning

Transpile trained scikit-learn estimators to C, Java, JavaScript and others.

onelearn: Online learning in Python

FLAML is a lightweight Python library that finds accurate machine learning models automatically, efficiently and economically

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

Nixtla is an open-source time series forecasting library.

A simple and lightweight genetic algorithm for optimization of any machine learning model

AutoOED: Automated Optimal Experiment Design Platform

Tribuo - A Java machine learning library

Greykite: A flexible, intuitive and fast forecasting library