Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations

Last update: Sep 09, 2022

Related tags

Overview

HierarchicyBandit

Introduction

This is the implementation of WSDM 2022 paper : Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations
The reference codes for HCB and pHCB, which are based on three different base bandit algorithms.

LinUCB from A contextual-bandit approach to personalized news article recommendation
epsilon-Greedy [This strategy, with random exploration on an epsilon fraction of the traffic and greedy exploitation on the rest]
Thompson Sampling from Thompson Sampling for Contextual Bandits with Linear Payoffs

Files in the folder

data/
- MIND/ and TaoBao/
  - item_info.pkl: processed item file, including item id, item feature and embeddings for simulator;
  - user_info.pkl: processed user file, including user id, and embeddings for simulator;
  - item_info_ts.pkl: processed item file for Thompson sampling;
algs/: implementations of PCB and pHCB based on LinUCB.
algsE/: implementations of PCB and pHCB based on epsilon-Greedy.
algsTS/: implementations of PCB and pHCB based on Thompson Sampling.

Note

Before testing the algorithms, you should modify the settings in config.py.
For thompson sampling, we provide another 16 dimensonal feature vectors to run the experiments, since it can be faster . The original feature vectors are also work with the algorithms.
the user_info.pkl and item_info.pkl is formated as dictionary type.
The implementation of ConUCB is released at ConUCB. HMAB and ICTRUCB are specical case of CB-Category and CB-Leaf.

Usage:

Download the HierarchicyBandit.zip and unzip. You will get five folders, they are algs/, algsE/, algsTS/, data/, and logger/.

Parameters:
The config.py file contains:

dataset: is the dataset used in the experiment, it can be 'MIND' or 'TaoBao';  
T: the number of rounds of each bandit algorithm;  
k: the number of items recommended to user at each round, default is 1;  
activate_num: the hyper-papamter p for pHCB;  
activate_prob: the hyper-papamter q for pHCB;  
epsilon: the epsilon value for greedy-based algorithms;  
new_tree_file: the tree file name;  
noise_scale: the standard deviation of environmental noise;  
keep_prob: sample ratio; default is 1.0, which means testing all users.
linucb_para: the hyper-parameters for linucb algorithm;
ts_para: the hyper-parameters for thompson sampling algorithm;
poolsize: the size of candidate pool;
random_choice: whether random choice an item to user;

Environment: python 3.6 with Anaconda To run the bandit codes based on LinUCB:

$ cd algs
$ python simulator_multi_process.py

To run the bandit codes based on epsilon-Greedy:

$ cd algsE
$ python simulator_multi_process.py

To run the bandit codes based on Thompson sampling:

$ cd algsTS
$ python simulator_multi_process.py

Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations

Related tags

Overview

HierarchicyBandit

Introduction

Files in the folder

Usage:

Owner

yu song

BiSeNet based on pytorch

This repo holds the code of TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation

RuleBERT: Teaching Soft Rules to Pre-Trained Language Models

Python implementation of ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images, AAAI2022.

A vanilla 3D face modeling on pose-invariant and multi-lightning image data

Official Pytorch implementation of Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference (ICLR 2022)

计算机视觉中用到的注意力模块和其他即插即用模块PyTorch Implementation Collection of Attention Module and Plug&Play Module

Title: Heart-Failure-Classification

Official implementation of GraphMask as presented in our paper Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking.

Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.

KinectFusion implemented in Python with PyTorch

Face and Body Tracking for VRM 3D models on the web.

FIRA: Fine-Grained Graph-Based Code Change Representation for Automated Commit Message Generation

Implementation of Geometric Vector Perceptron, a simple circuit for 3d rotation equivariance for learning over large biomolecules, in Pytorch. Idea proposed and accepted at ICLR 2021

PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

CNNs for Sentence Classification in PyTorch

Artstation-Artistic-face-HQ Dataset (AAHQ)

Ensemble Visual-Inertial Odometry (EnVIO)

DRIFT is a tool for Diachronic Analysis of Scientific Literature.

Towards uncontrained hand-object reconstruction from RGB videos