Grading tools for Advanced NLP (11-711)

Installation

You'll need docker and unzip to use this repo. For docker, visit the official guide to get started. For unzip, you can install it on ubuntu via sudo apt-get install unzip.

Install the python package by

git clone https://github.com/ProKil/anlp-grading-tools
cd anlp-grading-tools
pip install -e .

Usage

To evaluate your code, you'll need to change the environment variables in test.sh.

ANLP_TMP_DIR: mkdir a new folder, e.g. mkdir tmp, and point this variable to the absolute path of the tmp folder.

SUBMISSION_DIR: this should point to the folder containing your submission zip file. Note that the toolkit will automatically evaluate all zip files in the folder.

SCORES_DIR: this should point to an empty folder. Your score will be logged in a text file there.

DATA_DIR: this should point to the data folder of minnn-assignment. Please copy the original minnn-assignment/classifier.py to minnn-assignment/data/classifier_orig.py to test if your code can be executed with the original classifier.

Example code to prepare the folders:

mkdir tmp
mkdir scores
cp -r path/to/minnn-assignment/data ./
cp path/to/minnn-assignment/classifier.py data/classifier_orig.py
mkdir submission
cp your/submission.zip submission

Now you can evaluate your code through bash test.sh, after which your scores are at SCORES_DIR/andrewid. It is normal to get 0s for the last two (correct labels for the imdb test set are not available), but you should get reasonable accuracies for the first two (~40).

Troubleshooting

You may find writing files inside ANLP_TMP_DIR and SCORE_DIR requiring permission. You can either use sudo or log into docker through docker run -v FOLDER_TO_WRITE:/mnt -it --entrypoint /bin/bash anlp and cd /mnt to write those files.
You may experience other permission issues with docker. Please refer to this page to use docker without sudo.

Grading tools for Advanced NLP (11-711)Grading tools for Advanced NLP (11-711)

Related tags

Overview

Grading tools for Advanced NLP (11-711)

Installation

Usage

Troubleshooting

Owner

Hao Zhu

A fast, efficient universal vector embedding utility package.

Index different CKAN entities in Solr, not just datasets

CPC-big and k-means clustering for zero-resource speech processing

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

TaCL: Improve BERT Pre-training with Token-aware Contrastive Learning

用Resnet101+GPT搭建一个玩王者荣耀的AI

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.

Python module (C extension and plain python) implementing Aho-Corasick algorithm

ADCS cert template modification and ACL enumeration

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Deep Learning for Natural Language Processing - Lectures 2021

text to speech toolkit. 好用的中文语音合成工具箱，包含语音编码器、语音合成器、声码器和可视化模块。

This github repo is for Neurips 2021 paper, NORESQA A Framework for Speech Quality Assessment using Non-Matching References.

A flask application to predict the speech emotion of any .wav file.

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

Linking data between GBIF, Biodiverse, and Open Tree of Life

Speech to text streamlit app

Source code of the "Graph-Bert: Only Attention is Needed for Learning Graph Representations" paper

In this project, we aim to achieve the task of predicting emojis from tweets. We aim to investigate the relationship between words and emojis.