Codes for coreference-aware machine reading comprehension

Last update: Sep 29, 2022

Overview

Data and code for the paper "Tracing Origins: Coreference-aware Machine Reading Comprehension", published at ACL 2022.

Dataset

There are three folders, one for each model described in the paper: Coref_additive_spacy (Coref_additive_attention), Coref_dgl_spacy (GNN), and Coref_multiplication_spacy (Coref_multiplication_attention). Each folder contains the training set and the dev set under its quoref folder.

Each sample contains:

context: the paragraph text
context_id: the unique identifier of the context
qas: a list of questions
  question: the question text
  id: the unique identifier of the question
  answers: a list of answers to the question
    text: the answer text
    answer_start: the start position of the answer
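
The field layout above can be sketched in Python as follows. This is a minimal illustration, not code from the repository: the helper names iter_answers and find_misaligned_answers are hypothetical, the sample values are made up, and it assumes that answer_start is a character offset into context (the SQuAD convention), which the field list does not state explicitly.

```python
from typing import Any, Dict, Iterator, List, Tuple


def iter_answers(sample: Dict[str, Any]) -> Iterator[Tuple[str, str, str, int]]:
    """Yield (question_id, question, answer_text, answer_start) for one sample."""
    for qa in sample["qas"]:
        for answer in qa["answers"]:
            yield qa["id"], qa["question"], answer["text"], answer["answer_start"]


def find_misaligned_answers(sample: Dict[str, Any]) -> List[str]:
    """Return ids of questions whose answer text is not found at answer_start.

    Assumption: answer_start is a character offset into the context.
    """
    context = sample["context"]
    misaligned = []
    for question_id, _question, text, start in iter_answers(sample):
        if context[start:start + len(text)] != text:
            misaligned.append(question_id)
    return misaligned


# A made-up sample that follows the field layout; not taken from the dataset.
sample = {
    "context": "Anna met Tom. She gave him a book.",
    "context_id": "ctx-0",
    "qas": [
        {
            "question": "Who gave Tom a book?",
            "id": "q-0",
            "answers": [{"text": "Anna", "answer_start": 0}],
        }
    ],
}

for row in iter_answers(sample):
    print(row)
print("misaligned:", find_misaligned_answers(sample))
```

A check like find_misaligned_answers can be useful after any preprocessing that alters the context text, since shifted offsets would otherwise go unnoticed until training.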

Models

To use our trained model, download it from Google Drive.

Training

python run_quoref.py --train_file "quoref/train.json" --predict_file "quoref/dev.json" --model_type "roberta_multi" --model_name_or_path "roberta-large" --output_dir "out" --do_train --do_eval --eval_all_checkpoints --learning_rate 1e-5 --num_train_epochs 6 --overwrite_output_dir --per_gpu_train_batch_size 4 --save_steps 6000 --coref_weight 0.4

Note

There is an open issue regarding the compatibility between NeuralCoref and spaCy 3.0. If you intend to use the latest spaCy models, please watch the issue.
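
A quick way to see whether the installed spaCy falls in the affected range is sketched below. This is a hedged example rather than part of the repository: the function names are hypothetical, and it assumes that every spaCy release with major version 3 or higher is affected, which should be confirmed against the issue itself.

```python
from importlib import metadata
from typing import Optional


def spacy_major_version(version: str) -> int:
    """Extract the major version number from a version string such as '2.3.5'."""
    return int(version.split(".")[0])


def neuralcoref_compatible(version: str) -> bool:
    """Assumption: NeuralCoref only works with spaCy releases below 3.0."""
    return spacy_major_version(version) < 3


def installed_spacy_version() -> Optional[str]:
    """Return the installed spaCy version, or None if spaCy is not installed."""
    try:
        return metadata.version("spacy")
    except metadata.PackageNotFoundError:
        return None


version = installed_spacy_version()
if version is None:
    print("spaCy is not installed")
elif neuralcoref_compatible(version):
    print(f"spaCy {version}: expected to work with NeuralCoref")
else:
    print(f"spaCy {version}: see the open NeuralCoref compatibility issue")
```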

Cite

If you extend or use this work, please cite the paper where it was introduced:

@article{Huang2021TracingOC,
  title={Tracing Origins: Coreference-aware Machine Reading Comprehension},
  author={Baorong Huang and Zhuosheng Zhang and Hai Zhao},
  journal={ArXiv},
  year={2021},
  volume={abs/2110.07961}
}
