Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

Last update: Nov 21, 2022

ON-LSTM

This repository contains the code used for the word-level language modeling and unsupervised parsing experiments in the paper Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks. It was originally forked from the LSTM and QRNN Language Model Toolkit for PyTorch. If you use this code or our results in your research, we'd appreciate it if you cite our paper as follows:

@article{shen2018ordered,
  title={Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks},
  author={Shen, Yikang and Tan, Shawn and Sordoni, Alessandro and Courville, Aaron},
  journal={arXiv preprint arXiv:1810.09536},
  year={2018}
}

Software Requirements

Python 3.6, NLTK and PyTorch 0.4 are required for the current codebase.
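
Before running anything, it can help to confirm that the interpreter and packages match the versions listed above. The following is a minimal sketch and not part of this repository: the helper names `version_matches` and `check_environment` are hypothetical, and it assumes that `torch` and `nltk` expose a `__version__` attribute and that any 0.4.x patch release of PyTorch is acceptable.

```python
import importlib
import sys

# Versions named in the requirements above. None means "any installed version".
# Accepting every 0.4.x patch release is an assumption of this sketch.
REQUIRED = {"torch": "0.4", "nltk": None}


def version_matches(installed, required_prefix):
    """Return True if `installed` equals `required_prefix` or is one of its patch releases."""
    if required_prefix is None:
        return True
    return installed == required_prefix or installed.startswith(required_prefix + ".")


def check_environment():
    """Collect human-readable problems instead of stopping at the first one."""
    problems = []
    if sys.version_info[:2] != (3, 6):
        problems.append("Python 3.6 expected, found %d.%d" % sys.version_info[:2])
    for name, required in REQUIRED.items():
        try:
            module = importlib.import_module(name)
        except ImportError:
            problems.append("%s is not installed" % name)
            continue
        installed = getattr(module, "__version__", "unknown")
        if not version_matches(installed, required):
            problems.append("%s %s expected, found %s" % (name, required, installed))
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

The check only warns. Newer versions may or may not work with this codebase, so treat a warning as a hint rather than a hard failure.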

Steps

Install PyTorch 0.4 and NLTK
Download PTB data. Note that the two tasks, i.e., language modeling and unsupervised parsing, share the same model structure but require different formats of the PTB data. For language modeling we need the standard 10,000-word Penn Treebank corpus data, and for parsing we need the Penn Treebank Parsed data.
Scripts and commands
- Train Language Modeling: python main.py --batch_size 20 --dropout 0.45 --dropouth 0.3 --dropouti 0.5 --wdrop 0.45 --chunk_size 10 --seed 141 --epoch 1000 --data /path/to/your/data
- Test Unsupervised Parsing: python test_phrase_grammar.py --cuda
The default setting in main.py achieves a perplexity of approximately 56.17 on the PTB test set and an unlabeled F1 of approximately 47.7 on the WSJ test set.
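
For readers unfamiliar with the two metrics reported above, here is a minimal sketch of how they are commonly defined. It is not the evaluation code from main.py or test_phrase_grammar.py: the function names are hypothetical, and the scripts in this repository may differ in details such as how trivial spans or punctuation are handled or how scores are averaged across sentences.

```python
import math


def perplexity(mean_nll):
    """Perplexity is exp of the mean per-token negative log-likelihood (in nats)."""
    return math.exp(mean_nll)


def unlabeled_f1(predicted_spans, gold_spans):
    """F1 between two collections of (start, end) constituent spans, ignoring labels."""
    predicted, gold = set(predicted_spans), set(gold_spans)
    if not predicted or not gold:
        return 0.0
    overlap = len(predicted & gold)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted)
    recall = overlap / len(gold)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Toy spans over a five-token sentence; they are made up for illustration.
    gold = [(0, 5), (0, 2), (2, 5), (3, 5)]
    predicted = [(0, 5), (0, 2), (2, 5), (2, 4)]
    print(unlabeled_f1(predicted, gold))  # 0.75 for these toy spans
    print(perplexity(0.0))  # 1.0: a model that is always certain and correct
```

Under this definition, a test perplexity of about 56.17 corresponds to a mean per-token loss of about ln(56.17) nats.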

Owner

Yikang Shen
