Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Last update: Dec 29, 2022

Related tags

Overview

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Code repo for paper Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations.

Dependencies

torch=1.8.1
transformers=4.9.0
sentence-transformers=2.0.0

Please view `requirements.txt' for more details.

Train

Self-distillation:

>> bash train_self_distill.sh 0

0 denotes GPU device index.

Mutual-distillation (two GPUs needed):

>> bash train_mutual_distill.sh 1,2

Train with your custom corpus:

>> CUDA_VISIBLE_DEVICES=0,1 python src/mutual_distill_parallel.py \
         --batch_size_bi_encoder 128 \
         --batch_size_cross_encoder 64 \
         --num_epochs_bi_encoder 10 \
         --num_epochs_cross_encoder 1 \
         --cycle 3 \
         --bi_encoder1_pooling_mode cls \
         --bi_encoder2_pooling_mode cls \
         --init_with_new_models \
         --task custom \
         --random_seed 2021 \
         --custom_corpus_path CORPUS_PATH

CORPUS_PATH should point to your custom corpus in which every line should be a sentence pair in the form of sent1||sent2.

Evaluate

>> python src/eval.py

Authors

Fangyu Liu: Main contributor

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Related tags

Overview

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Dependencies

Train

Evaluate

Authors

Security

License

Owner

Amazon

Official code release for: EditGAN: High-Precision Semantic Image Editing

DGL-TreeSearch and the Gurobi-MWIS interface

Bridging the Gap between Label- and Reference based Synthesis(ICCV 2021)

Selfplay In MultiPlayer Environments

Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning

PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM

This is a simple plugin for Vim that allows you to use OpenAI Codex.

CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

This repository is an open-source implementation of the ICRA 2021 paper: Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling.

"Inductive Entity Representations from Text via Link Prediction" @ The Web Conference 2021

A trusty face recognition research platform developed by Tencent Youtu Lab

A Pytorch Implementation of [Source data‐free domain adaptation of object detector through domain

Convolutional Neural Network for Text Classification in Tensorflow

Snscrape-jsonl-urls-extractor - Extracts urls from jsonl produced by snscrape

You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

Deep-Learning-Book-Chapter-Summaries - Attempting to make the Deep Learning Book easier to understand.

COLMAP - Structure-from-Motion and Multi-View Stereo

quantize aware training package for NCNN on pytorch

Keyhole Imaging: Non-Line-of-Sight Imaging and Tracking of Moving Objects Along a Single Optical Path

Stochastic Normalizing Flows