Knowledge Base Completion Meets Transfer Learning

This code accompanies the paper Knowledge Base Completion Meets Transfer Learning published at EMNLP 2021.

Setup

Following packages are needed to run the code

Python >=3.6
pytorch>=1.6
spacy>=2.0 and en_core_web_sm model
tqdm

Run setup.sh to download and transform GloVe embeddings and OlpBench dataset. Please note that this downloads 3.5GB of files which unzip into around 10GB of content.

Running the code

For full help, run python main.py -h, a couple of examples are given below:

For pre-training TuckER on OlpBench, run

python main.py -data Data/OlpBench -dim 300 -lr 1e-4 -batch 4096 -n_epochs 100 -embedding TuckER -dropout 0.3 -encoder GRU -hits [1,3,5,10,30,50] -output_dir TuckEROlpBench300 -dump_vocab -only_batch_negative

For pretraining, it is important to add -dump_vocab to store encoder vocabulary. Otherwise it is not possible to load the stored model for fine-tuning. For any large-scale pre-training it is important to add -only_batch_negative argument to avoid encoding all entities at every training step.

To fine-tune the model obtained with the above command on ReVerb20K using NoEncoder, use the command below.

python main.py -data Data/ReVerb20K -dim 300 -lr 3e-4 -batch 512 -n_epochs 500 -embedding TuckER -dropout 0.3 -encoder NoEncoder -hits [1,3,5,10,30,50] -output_dir TuckERReVerb20K -pretrained_dir TuckEROlpBench300

To train the with same setup but from a randomly-initialized model, just remove the -pretrained_dir argument.

python main.py -data Data/ReVerb20K -dim 300 -lr 3e-4 -batch 512 -n_epochs 500 -embedding TuckER -dropout 0.3 -encoder NoEncoder -hits [1,3,5,10,30,50] -output_dir TuckERReVerb20K

Reference

If you use the code from this repo, please cite the following work.

@inproceedings{kocijan2021KBCtransfer,
    title = "Knowledge Base Completion Meets Transfer Learning",
    author = "Kocijan, Vid  and
      Lukasiewicz, Thomas",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
}

Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"

Related tags

Overview

Knowledge Base Completion Meets Transfer Learning

Setup

Running the code

Reference

Owner

The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGIR2022

The repo for the paper "I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection".

Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

RL algorithm PPO and IRL algorithm AIRL written with Tensorflow.

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

LIMEcraft: Handcrafted superpixel selectionand inspection for Visual eXplanations

TensorFlow-based neural network library

Unofficial implementation of Pix2SEQ

Official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

small collection of functions for neural networks

Simple and ready-to-use tutorials for TensorFlow

Trajectory Prediction with Graph-based Dual-scale Context Fusion

Measures input lag without dedicated hardware, performing motion detection on recorded or live video

Deploy optimized transformer based models on Nvidia Triton server

Intrinsic Image Harmonization

STEAL - Learning Semantic Boundaries from Noisy Annotations (CVPR 2019)

a reccurrent neural netowrk that when trained on a peice of text and fed a starting prompt will write its on 250 character text using LSTM layers

Official PyTorch code for Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution (MANet, ICCV2021)

code for "Self-supervised edge features for improved Graph Neural Network training",

Codes for the ICCV'21 paper "FREE: Feature Refinement for Generalized Zero-Shot Learning"