PyTorch implementation of DCT fast weight RNNs

Last update: Dec 24, 2022

Overview

DCT based fast weights

This repository contains the official code for the paper: Training and Generating Neural Networks in Compressed Weight Space.

The main code includes:

DCT LSTM: LSTMs whose weights are encoded by discrete cosine transform (DCT).
DCT fast weight RNN: RNNs whose weights are encoded by DCT, and the DCT coefficients are parameterized by LSTMs.

The language modeling experiments reported in the paper were produced by porting code (with minor changes due to some clean-up) of this repository in a fork of this toolkit.

Requirements

torch_dct (can be installed via pip install torch_dct)
PyTorch with a version compatible with torch_dct.

Our experiments were conducted using PyTorch version 1.6.0 . More recent versions are apparently not compatible with torch_dct (at least at the time of writing this file). We recommend to run python custom_layer.py to check the compatibility.

References

If you make use of this toolkit for your experiments, please cite:

@inproceedings{irie2021training,
  title={Training and Generating Neural Networks in Compressed Weight Space},
  author={Kazuki Irie and J{\"u}rgen Schmidhuber},
  booktitle={Neural Compression: From Information Theory to Applications -- Workshop @ ICLR 2021},
  year={2021},
  address={Virtual only},
  month=may
}

PyTorch implementation of DCT fast weight RNNs

Related tags

Overview

DCT based fast weights

Requirements

References

Owner

Kazuki Irie

VID-Fusion: Robust Visual-Inertial-Dynamics Odometry for Accurate External Force Estimation

CATE: Computation-aware Neural Architecture Encoding with Transformers

Open-World Entity Segmentation

FIRM-AFL is the first high-throughput greybox fuzzer for IoT firmware.

PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces

Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Simple image captioning model - CLIP prefix captioning.

Table-Extractor 表格抽取

Code for our NeurIPS 2021 paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains

Mall-Customers-Segmentation - Customer Segmentation Using K-Means Clustering

Code for Towards Streaming Perception (ECCV 2020) :car:

Official git for "CTAB-GAN: Effective Table Data Synthesizing"

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

Revisiting Self-Training for Few-Shot Learning of Language Model.

Semi-supervised Implicit Scene Completion from Sparse LiDAR

[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

Automatically replace ONNX's RandomNormal node with Constant node.

Point detection through multi-instance deep heatmap regression for sutures in endoscopy

Make Watson Assistant send messages to your Discord Server