Implementation of ETSformer, state of the art time-series Transformer, in Pytorch

Last update: Dec 30, 2022

Overview

ETSformer - Pytorch

Install

$ pip install etsformer-pytorch

Usage

import torch
from etsformer_pytorch import ETSFormer

model = ETSFormer(
    time_features = 4,
    model_dim = 512,                # in paper they use 512
    embed_kernel_size = 3,          # kernel size for 1d conv for input embedding
    layers = 2,                     # number of encoder and corresponding decoder layers
    heads = 8,                      # number of exponential smoothing attention heads
    K = 4,                          # num frequencies with highest amplitude to keep (attend to)
    dropout = 0.2                   # dropout (in paper they did 0.2)
)

timeseries = torch.randn(1, 1024, 4)

pred = model(timeseries, num_steps_forecast = 32) # (1, 32, 4) - (batch, num steps forecast, num time features)
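
The `K` argument above is described as the number of highest-amplitude frequencies to keep. As a rough illustration of that idea only, the sketch below picks the dominant frequency bins of a toy series with a naive plain-Python DFT. It is not the library's code: `dft` and `top_k_frequencies` are hypothetical helper names, and skipping the DC bin is an assumption made here for clarity.

```python
import cmath
import math

def dft(signal):
    """Naive O(n^2) discrete Fourier transform of a real-valued sequence."""
    n = len(signal)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal))
        for k in range(n)
    ]

def top_k_frequencies(signal, k):
    """Return the k frequency bins (excluding DC, up to Nyquist) with the largest amplitude."""
    spectrum = dft(signal)
    n = len(signal)
    # assumption: skip bin 0 (the mean) and use only the non-redundant half of the spectrum
    candidates = range(1, n // 2 + 1)
    ranked = sorted(candidates, key=lambda b: abs(spectrum[b]), reverse=True)
    return sorted(ranked[:k])

# toy series: two sinusoids with 4 and 10 cycles over 64 steps
n = 64
series = [
    math.sin(2 * math.pi * 4 * t / n) + 0.5 * math.sin(2 * math.pi * 10 * t / n)
    for t in range(n)
]
dominant = top_k_frequencies(series, k=2)  # the bins of the two sinusoids
```

On real tensors the same selection would more likely be done with an FFT routine and a top-k call; the loop form here is only meant to make the role of `K` concrete.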

To use ETSFormer for classification, wrap it in ClassificationWrapper, which applies cross attention pooling over all latents and the level output

import torch
from etsformer_pytorch import ETSFormer, ClassificationWrapper

etsformer = ETSFormer(
    time_features = 1,
    model_dim = 512,
    embed_kernel_size = 3,
    layers = 2,
    heads = 8,
    K = 4,
    dropout = 0.2
)

adapter = ClassificationWrapper(
    etsformer = etsformer,
    dim_head = 32,
    heads = 16,
    dropout = 0.2,
    level_kernel_size = 5,
    num_classes = 10
)

timeseries = torch.randn(1, 1024)

logits = adapter(timeseries) # (1, 10)

Citation

@misc{woo2022etsformer,
    title   = {ETSformer: Exponential Smoothing Transformers for Time-series Forecasting}, 
    author  = {Gerald Woo and Chenghao Liu and Doyen Sahoo and Akshat Kumar and Steven Hoi},
    year    = {2022},
    eprint  = {2202.01381},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Comments

What are your thoughts on using latents for additional classification task
Hi! I was wondering if you have thought about aggregating the seasonal and growth latents for additional tasks (for example, classification). In your opinion, what are the possible ways to bring the latents into a single feature vector? The easiest would be to take the mean along the layer and time dimensions, but that seems too naive. Another idea I had is to use a cross attention mechanism with a single query to aggregate the latents:

all_latents = torch.cat([latent_growths, latent_seasonals], dim=-1)
all_latents = rearrange(all_latents, 'b n l d -> (b l) n d')
# q = nn.Parameter(torch.randn(all_latents_dim))
q = repeat(q, 'd -> b 1 d', b = all_latents.shape[0])
agg_latent = cross_attention(query=q, context=all_latents)
agg_latent = rearrange(all_latents, '(b l) n d -> b (l n) d')
agg_latent = agg_latent.mean(dim=1) # may be we should have done it before cross attention?

Would be great to hear your thoughts
opened by inspirit 15
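
The pooling idea in the comment above can be sketched without any framework. This is a minimal single-head version under stated assumptions: there are no learned key/value projections, the query is a plain vector, and `cross_attention_pool` is a hypothetical name, not a function from the repository.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention_pool(query, latents):
    """Pool n latent vectors (each of length d) into one vector using a single query."""
    d = len(query)
    # scaled dot-product score between the query and every latent
    scores = [sum(q * x for q, x in zip(query, vec)) / math.sqrt(d) for vec in latents]
    weights = softmax(scores)
    # attention-weighted average of the latents
    pooled = [sum(w * vec[i] for w, vec in zip(weights, latents)) for i in range(d)]
    return pooled, weights

latents = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [0.0, 0.0]  # a zero query gives uniform weights, i.e. plain mean pooling
pooled, weights = cross_attention_pool(query, latents)
```

With a zero query the result reduces to the mean over latents, so the "naive" mean pooling mentioned above is a special case; a learned query lets the model weight some latents more heavily than others.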
Pre LayerNorm might be required for k,v?

https://github.com/lucidrains/ETSformer-pytorch/blob/2561053007e919409b3255eb1d0852c68799d24f/etsformer_pytorch/etsformer_pytorch.py#L440

In my early tests I see some instability in the training results. I was wondering if it might be a good idea to LayerNorm the latents before constructing the keys and values?

opened by inspirit 5
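
As a rough sketch of what the suggestion above means, the snippet below normalizes each latent vector before projecting it to keys and values. It is an illustration under assumptions, not the repository's code: `layer_norm` has no learned scale or shift, and `matvec` and `keys_and_values` are hypothetical helpers.

```python
import math

def layer_norm(vec, eps=1e-5):
    """Normalize one feature vector to zero mean and unit variance."""
    mean = sum(vec) / len(vec)
    var = sum((v - mean) ** 2 for v in vec) / len(vec)
    return [(v - mean) / math.sqrt(var + eps) for v in vec]

def matvec(matrix, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]

def keys_and_values(latents, w_k, w_v):
    """Pre-norm: normalize every latent, then project to keys and values."""
    normed = [layer_norm(x) for x in latents]
    keys = [matvec(w_k, x) for x in normed]
    values = [matvec(w_v, x) for x in normed]
    return keys, values

identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# two latents with the same shape but very different scales
latents = [[1.0, 2.0, 3.0], [100.0, 200.0, 300.0]]
keys, values = keys_and_values(latents, identity, identity)
```

After normalization the two latents produce nearly identical keys, which is the scale invariance that can make attention scores less sensitive to drifting latent magnitudes.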
growth_term calculation error

https://github.com/lucidrains/ETSformer-pytorch/blob/e1d8514b44d113ead523aa6307986833e68eecc5/etsformer_pytorch/etsformer_pytorch.py#L233-L235

It looks like you are not using growth and growth_smoothing_weights to calculate growth_term

opened by inspirit 4
Backward gradient error
Hello,

I was trying to run the provided class and got the following error: Function ScatterBackward0 returned an invalid gradient at index 1 - got [64, 4, 128] but expected shape compatible with [64, 33, 128]

model = ETSFormer(
    time_features = 9,
    model_dim = 128,
    embed_kernel_size = 3,
    layers = 2,
    heads = 4,
    K = 4,
    dropout = 0.2
)

input = torch.rand(64, 64, 9)
x = model(input, num_steps_forecast = 16)
opened by inspirit 3
Does ETS-Former allow adding features

@lucidrains Thanks for making the code of the model available!

In your paper, you state that the model infers seasonal patterns itself, so that there is no need to add time features like week, month, etc.

Still, to increase the applicability of your approach, does the current implementation allow adding other features (time-invariant or time-varying), e.g., categorical or numeric ones?

opened by StatMixedML 2
wrong order of arguments

https://github.com/lucidrains/ETSformer-pytorch/blob/2e0d465576c15fc8d84c4673f93fdd71d45b799c/etsformer_pytorch/etsformer_pytorch.py#L327

You pass the latents to the Level module in the wrong order: according to its forward method, growth should come first and then seasonal

opened by inspirit 1
Clarification regarding data pre-processing

Hello,

I was trying to run ETSformer on the ETT dataset. The paper mentions that the dataset is split 60/20/20 into train, validation and test sets. Could you give some insight into how the dataset split happens in the code?

Thank you.

opened by vageeshmaiya 2
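
The question above is not answered on this page, and the usage examples here do not include data loading. For reference, a 60/20/20 split of a time series is usually done chronologically rather than by random shuffling. The sketch below shows that generic approach; `chronological_split` is a hypothetical helper and is an assumption about how such a split could be done, not a description of the paper's pipeline.

```python
def chronological_split(series, train_frac=0.6, val_frac=0.2):
    """Split a sequence into train/val/test by time order, without shuffling."""
    n = len(series)
    train_end = int(n * train_frac)
    val_end = train_end + int(n * val_frac)
    # the test set takes whatever remains after train and validation
    return series[:train_end], series[train_end:val_end], series[val_end:]

series = list(range(100))  # stand-in for 100 time steps
train, val, test = chronological_split(series)
```

Keeping the order intact means the validation and test windows always lie after the training window, which avoids leaking future values into training.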

Releases(0.0.16)

0.0.16 (Mar 22, 2022)
0.0.15 (Mar 22, 2022)
0.0.14a (Mar 22, 2022)
0.0.12 (Mar 20, 2022)
0.0.11 (Mar 20, 2022)
0.0.10 (Mar 20, 2022)
0.0.9 (Mar 20, 2022)
0.0.8 (Mar 20, 2022)
0.0.7 (Mar 19, 2022)
0.0.6 (Mar 18, 2022)
0.0.5 (Mar 17, 2022)
0.0.4 (Mar 17, 2022)
0.0.3a (Mar 16, 2022)
0.0.1 (Mar 15, 2022)

Owner

Phil Wang

Working with Attention. It's all we need

