PiRank: Learning to Rank via Differentiable Sorting

Overview

This repository provides a reference implementation for learning PiRank-based models as described in the paper:

PiRank: Learning to Rank via Differentiable Sorting
Robin Swezey, Aditya Grover, Bruno Charron and Stefano Ermon.
Paper: https://arxiv.org/abs/2012.06731

Requirements

The codebase is implemented in Python 3.7. To install the base requirements, run:

pip install -r requirements.txt

If you intend to use a GPU, modify requirements.txt to install tensorflow-gpu instead of tensorflow.

You will also need the NeuralSort implementation (https://github.com/ermongroup/neuralsort). Make sure it is added to your PYTHONPATH.
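Before launching a long run, you can optionally confirm that the NeuralSort code is importable from your environment. This is a minimal sanity check; the module name neuralsort is only a placeholder for however your checkout is laid out on PYTHONPATH:

# Optional sanity check: fail fast if the NeuralSort code cannot be imported.
# "neuralsort" is a placeholder module name; adapt it to your local checkout.
import importlib.util

if importlib.util.find_spec("neuralsort") is None:
    raise ImportError("NeuralSort not found; add its directory to PYTHONPATH")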

Datasets

PiRank was tested on the two following datasets:

  • MSLR-WEB30K
  • Yahoo! Learning to Rank Challenge (C14)

Additionally, the code is expected to work with any dataset stored in the standard LibSVM format used for LTR experiments (an example of the format is shown below).
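For reference, each line of a LibSVM-formatted LTR file holds one document: a relevance label, the query identifier as qid:<id>, and feature_index:value pairs. The two lines below use made-up values for illustration only (real MSLR-WEB30K rows carry 136 feature:value pairs per document):

2 qid:10 1:0.031 2:0.740 3:0.250
0 qid:10 1:0.078 2:0.121 3:0.990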

Scripts

Two scripts are provided:

  • pirank_simple.py implements the simple depth-1 PiRank loss (d=1). It is used in the experiments of sections 4.1 (benchmark evaluation on the MSLR-WEB30K and Yahoo! C14 datasets), 4.2.1 (effect of the temperature parameter), and 4.2.2 (effect of the training list size). A conceptual sketch of this loss is given after this list.

  • pirank_deep.py implements the deeper PiRank losses (d>=1). It is used for the experiments of section 4.2.3 and comes with a convenient synthetic data generator as well as more tuning options.
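To make the role of differentiable sorting concrete, the following NumPy sketch reconstructs the idea behind the depth-1 loss: scores are passed through the NeuralSort relaxation to obtain a soft permutation matrix, which is then used to compute a relaxed NDCG@k whose complement serves as the loss. This is a conceptual illustration under our own function names, not the repository's TensorFlow implementation:

import numpy as np

def neural_sort(scores, tau=5.0):
    # NeuralSort relaxation (Grover et al., 2019): row i of the returned
    # row-stochastic matrix approximates the indicator of the i-th largest score.
    n = scores.shape[0]
    pairwise = np.abs(scores[:, None] - scores[None, :])   # |s_j - s_k|
    b = pairwise.sum(axis=1)                                # sum_k |s_j - s_k|
    scaling = n + 1 - 2 * np.arange(1, n + 1)               # (n + 1 - 2i), i = 1..n
    logits = (scaling[:, None] * scores[None, :] - b[None, :]) / tau
    logits -= logits.max(axis=1, keepdims=True)             # numerically stable softmax
    p = np.exp(logits)
    return p / p.sum(axis=1, keepdims=True)

def relaxed_ndcg_loss(scores, relevance, k=15, tau=5.0):
    # 1 minus a relaxed NDCG@k: gains are soft-sorted with the NeuralSort
    # matrix and compared against the ideal DCG@k. Illustrative only.
    k = min(k, scores.shape[0])
    p_hat = neural_sort(scores, tau)
    gains = 2.0 ** relevance - 1.0
    discounts = 1.0 / np.log2(np.arange(2, k + 2))          # positions 1..k
    soft_dcg = discounts @ (p_hat[:k] @ gains)              # expected gain per top-k slot
    ideal_dcg = discounts @ np.sort(gains)[::-1][:k]
    return 1.0 - soft_dcg / max(ideal_dcg, 1e-10)

As the temperature decreases, the soft permutation approaches a hard sort and the surrogate approaches the true NDCG@k, at the cost of sharper, less informative gradients; the tau option below controls this trade-off.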

Options

Options are handled by Sacred (see the Examples section below).

pirank_simple.py and pirank_deep.py

PiRank-related:

Parameter | Default Value | Description
--- | --- | ---
loss_fn | pirank_simple_loss | The loss function to use (either a TFR RankingLossKey or a loss function from the script)
ste | False | Whether to use the straight-through estimator (a generic sketch follows this table)
ndcg_k | 15 | NDCG@k cutoff when using the NS-NDCG loss
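The ste flag refers to straight-through estimation over the relaxed permutation: in that pattern, the forward pass typically uses a hard, row-wise argmax permutation while gradients flow through the soft NeuralSort output. A generic TensorFlow sketch of the pattern follows; it is not necessarily the exact code path used by the scripts:

import tensorflow as tf

def straight_through(p_soft):
    # Forward pass: hard permutation matrix from a row-wise argmax.
    # Backward pass: gradients of the relaxed matrix, via stop_gradient.
    depth = tf.shape(p_soft)[-1]
    p_hard = tf.one_hot(tf.argmax(p_soft, axis=-1), depth, dtype=p_soft.dtype)
    return p_soft + tf.stop_gradient(p_hard - p_soft)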

NeuralSort-related:

Parameter | Default Value | Description
--- | --- | ---
tau | 5 | Temperature
taustar | 1e-10 | Temperature for trues and straight-through estimation

TensorFlow-Ranking and architecture-related:

Parameter | Default Value | Description
--- | --- | ---
hidden_layers | "256,tanh,128,tanh,64,tanh" | Hidden layers of the example-wise feedforward network, in the format size,activation,...,size,activation (see the parsing illustration after this table)
num_features | 136 | Number of features per document. The default is for MSLR-WEB30K; adjust it to the dataset (e.g. 700 for Yahoo!)
list_size | 100 | List size used for training
group_size | 1 | Group size used in the score function
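The hidden_layers string alternates layer sizes and activation names. A minimal illustration of how such a specification can be interpreted (the scripts' own parsing may differ):

spec = "256,tanh,128,tanh,64,tanh"
tokens = spec.split(",")
layers = list(zip(map(int, tokens[::2]), tokens[1::2]))
# [(256, 'tanh'), (128, 'tanh'), (64, 'tanh')]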

Training-related:

Parameter | Default Value | Description
--- | --- | ---
train_path | "/data/MSLR-WEB30K/Fold*/train.txt" | Input file path used for training
vali_path | "/data/MSLR-WEB30K/Fold*/vali.txt" | Input file path used for validation
test_path | "/data/MSLR-WEB30K/Fold*/test.txt" | Input file path used for testing
model_dir | None | Output directory for models
num_epochs | 200 | Number of epochs to train; set to 0 to only run testing
lr | 1e-4 | Initial learning rate
batch_size | 32 | Batch size for training
num_train_steps | None | Number of steps for training
num_vali_steps | None | Number of steps for validation
num_test_steps | None | Number of steps for testing
learning_rate | 0.01 | Learning rate for the optimizer
dropout_rate | 0.5 | Dropout rate before the output layer
optimizer | Adagrad | Optimizer for gradient descent

Sacred:

In addition, you can use regular parameters from Sacred (such as -m for logging the experiment to MongoDB).

pirank_deep.py only

Parameter | Default Value | Description
--- | --- | ---
merge_block_size | None | Block size used when merging; None if not merging (see the block-merge sketch after this table)
top_k | None | Use a different top-k for merging than the final NDCG@k used for the loss
straight_backprop | False | Backpropagate on scores only through the NeuralSort operator
full_loss | False | Use the complete loss at the end of the merge
tau_scheme | None | Scheme used for the temperature at deeper levels (default: constant)
data_generator | None | Data generator (default: TFR's libsvm); use this for synthetic data generation
num_queries | 30000 | Number of queries for the synthetic data generator
num_query_features | 10 | Number of columns used as factors for each query by the synthetic data generator
actual_list_size | None | Size of the actual list per query in synthetic data generation
train_path | "/data/MSLR-WEB30K/Fold*/train.txt" | Input file path used for training; alternatively, the seed value when using the data generator
vali_path | "/data/MSLR-WEB30K/Fold*/vali.txt" | Input file path used for validation; alternatively, the seed value when using the data generator
test_path | "/data/MSLR-WEB30K/Fold*/test.txt" | Input file path used for testing; alternatively, the seed value when using the data generator
with_opa | True | Include the pairwise OPA metric
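At a high level, the deeper losses avoid relaxing the full list at once: the list is split into blocks of merge_block_size, candidates are ranked within each block, the per-block top-k survivors are merged, and the procedure repeats until a single block remains. The sketch below illustrates only that block-wise top-k merge on plain scores; it is not the differentiable TensorFlow implementation, and the helper name is ours:

import numpy as np

def blockwise_topk_merge(scores, block_size=100, k=10):
    # Illustrative divide-and-merge pass: keep the top-k indices of each block,
    # merge the survivors, and repeat until one block remains.
    indices = np.arange(len(scores))
    while len(indices) > block_size:
        survivors = []
        for start in range(0, len(indices), block_size):
            block = indices[start:start + block_size]
            top = block[np.argsort(scores[block])[::-1][:k]]  # top-k within this block
            survivors.append(top)
        indices = np.concatenate(survivors)
    return indices[np.argsort(scores[indices])[::-1][:k]]     # final top-k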

Examples

Run the benchmark experiment of section 4.1 with PiRank simple loss on MSLR-WEB30K

cd pirank
python3 pirank_simple.py with loss_fn=pirank_simple_loss \
    ndcg_k=10 \
    tau=5 \
    list_size=80 \
    hidden_layers=256,relu,256,relu,128,relu,64,relu \
    train_path=/data/MSLR-WEB30K/Fold1/train.txt \
    vali_path=/data/MSLR-WEB30K/Fold1/vali.txt \
    test_path=/data/MSLR-WEB30K/Fold1/test.txt \
    num_features=136 \
    optimizer=Adam \
    learning_rate=0.00001 \
    num_epochs=100 \
    batch_size=16 \
    model_dir=/tmp/model

Run the benchmark experiment of section 4.1 with PiRank simple loss on Yahoo! C14

cd pirank
python3 pirank_simple.py with loss_fn=pirank_simple_loss \
    ndcg_k=10 \
    tau=5 \
    list_size=80 \
    hidden_layers=256,relu,256,relu,128,relu,64,relu \
    train_path=/data/YAHOO/set1.train.txt \
    vali_path=/data/YAHOO/set1.valid.txt \
    test_path=/data/YAHOO/set1.test.txt \
    num_features=700 \
    optimizer=Adam \
    learning_rate=0.00001 \
    num_epochs=100 \
    batch_size=16 \
    model_dir=/tmp/model

Run the benchmark experiment of section 4.1 with classic LambdaRank on MSLR-WEB30K

cd pirank
python3 pirank_simple.py with loss_fn=lambda_rank_loss \
    ndcg_k=10 \
    tau=5 \
    list_size=80 \
    hidden_layers=256,relu,256,relu,128,relu,64,relu \
    train_path=/data/MSLR-WEB30K/Fold1/train.txt \
    vali_path=/data/MSLR-WEB30K/Fold1/vali.txt \
    test_path=/data/MSLR-WEB30K/Fold1/test.txt \
    num_features=136 \
    optimizer=Adam \
    learning_rate=0.00001 \
    num_epochs=100 \
    batch_size=16 \
    model_dir=/tmp/model

Run the scaling ablation experiment of section 4.2.3 using synthetic data generation (d=2)

cd pirank
python3 pirank_deep.py with loss_fn=pirank_deep_loss \
    ndcg_k=10 \
    ste=True \
    merge_block_size=100 \
    tau=5 \
    taustar=1e-10 \
    tau_scheme=square \
    data_generator=synthetic_data_generator \
    actual_list_size=1000 \
    list_size=1000 \
    vali_list_size=1000 \
    test_list_size=1000 \
    full_loss=False \
    train_path=0 \
    vali_path=1 \
    test_path=2 \
    num_queries=1000 \
    num_features=25 \
    num_query_features=5 \
    hidden_layers=256,relu,256,relu,128,relu,128,relu,64,relu,64,relu \
    optimizer=Adam \
    learning_rate=0.00001 \
    num_epochs=100 \
    batch_size=16

Help

If you need help, reach out to Robin Swezey or raise an issue.

Citing

If you find PiRank useful in your research, please consider citing the following paper:

@inproceedings{swezey2020pirank,
  title={PiRank: Learning to Rank via Differentiable Sorting},
  author={Robin Swezey and Aditya Grover and Bruno Charron and Stefano Ermon},
  year={2020},
  url={},
}
