[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Last update: Jan 06, 2023

Overview

REval

Introduction
Overview
Requirements
Installation
Probing
Usage
Citation
License

🎓 Introduction

REval is a simple framework for probing sentence-level representations of Relation Extraction models.

✅ Requirements

REval is tested with:

Python 3.7

🚀 Installation

With pip

<TBD>

From source

git clone https://github.com/DFKI-NLP/REval
cd REval
pip install -r requirements.txt

🔬 Probing

Supported Datasets

SemEval 2010 Task 8 (CoreNLP annotated version) [LINK]
TACRED (obtained via LDC) [LINK]

Probing Tasks

Task	SemEval 2010	TACRED
ArgTypeHead	✔️	✔️
ArgTypeTail	✔️	✔️
Length	✔️	✔️
EntityDistance	✔️	✔️
ArgumentOrder		✔️
EntityExistsBetweenHeadTail	✔️	✔️
PosTagHeadLeft	✔️	✔️
PosTagHeadRight	✔️	✔️
PosTagTailLeft	✔️	✔️
PosTagTailRight	✔️	✔️
TreeDepth	✔️	✔️
SDPTreeDepth	✔️	✔️
ArgumentHeadGrammaticalRole	✔️	✔️
ArgumentTailGrammaticalRole	✔️	✔️

🔧 Usage

Step 1: create the probing task datasets from the original datasets.

SemEval 2010 Task 8

python reval.py generate-all-from-semeval \
    --train-file <SEMEVAL DIR>/train.json \
    --validation-file <SEMEVAL DIR>/dev.json \
    --test-file <SEMEVAL DIR>/test.json \
    --output-dir ./data/semeval/

TACRED

python reval.py generate-all-from-tacred \
    --train-file <TACRED DIR>/train.json \
    --validation-file <TACRED DIR>/dev.json \
    --test-file <TACRED DIR>/test.json \
    --output-dir ./data/tacred/

Step 2: Run the probing tasks on a model.

For example, download a Relation Extraction model trained with RelEx, e.g., the CNN trained on SemEval.

mkdir -p models/cnn_semeval
wget --content-disposition https://cloud.dfki.de/owncloud/index.php/s/F3gf9xkeb2foTFe/download -P models/cnn_semeval

python probing_task_evaluation.py \
    --model-dir ./models/cnn_semeval/ \
    --data-dir ./data/semeval/ \
    --dataset semeval2010 \
    --cuda-device 0 \
    --batch-size 64 \
    --cache-representations

After the run is completed, the results are stored to probing_task_results.json in the model-dir.

{
    "ArgTypeHead": {
        "acc": 75.82,
        "devacc": 78.96,
        "ndev": 670,
        "ntest": 2283
    },
    "ArgTypeTail": {
        "acc": 75.4,
        "devacc": 78.79,
        "ndev": 627,
        "ntest": 2130
    },
    [...]
}

📚 Citation

If you use REval, please consider citing the following paper:

@inproceedings{alt-etal-2020-probing,
    title={Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction},
    author={Christoph Alt and Aleksandra Gabryszak and Leonhard Hennig},
    year={2020},
    booktitle={Proceedings of ACL},
    url={https://arxiv.org/abs/2004.08134}
}

📘 License

REval is released under the terms of the MIT License.

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Related tags

Overview

REval

Table of Contents

🎓 Introduction

✅ Requirements

🚀 Installation

With pip

From source

🔬 Probing

Supported Datasets

Probing Tasks

🔧 Usage

Step 1: create the probing task datasets from the original datasets.

SemEval 2010 Task 8

TACRED

Step 2: Run the probing tasks on a model.

📚 Citation

📘 License

Owner

Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch

Implementation of H-UCRL Algorithm

Self-supervised learning on Graph Representation Learning (node-level task)

Multi-Joint dynamics with Contact. A general purpose physics simulator.

ESP32 python application to read data from a Tilt™ Hydrometer for homebrewing

Attention-guided gan for synthesizing IR images

Multi-objective gym environments for reinforcement learning.

Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)

Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge.

Code repository for our paper regarding the L3D dataset.

Creating a custom CNN hypertunned architeture for the Fashion MNIST dataset with Python, Keras and Tensorflow.

Supplemental Code for "ImpressionNet :A Multi view Approach to Predict Socio Facial Impressions"

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Learn about Spice.ai with in-depth samples

Enhancing Knowledge Tracing via Adversarial Training

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Weak-supervised Visual Geo-localization via Attention-based Knowledge Distillation

Pose Detection and Machine Learning for real-time body posture analysis during exercise to provide audiovisual feedback on improvement of form.

Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks