A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.

Last update: Nov 23, 2022

Overview

SOFA

This repository is the implementation of SOFA, the Simulator for OFfline leArning and evaluation.

Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems. Jin Huang, Harrie Oosterhuis, Maarten de Rijke, Herke van Hoof. Recsys 2020.

The framework shows how RL4Rec typically interacts with a simulation-based environment. A state is user historical interactions, an action is an item being recommended bytheRS, and a reward is related to user feedback.

As a solution to the effect of bias present in logged data, we introduce a debiasing step in the simulation pipeline, which corrects for the biases present in the logged data before it is used to simulate user behavior.

Running the code

$ cd examples
$ python run_dqn.py

More details

We provide the details of DQN-based Policy used in experiments and the related hyperparamters (See Appendix). And we also provide the slide used for presentation in recsys 2020.

Cite

If you use our code, please cite our paper:

@inproceedings{huang2020keeping,
  title={Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems},
  author={Huang, Jin and Oosterhuis, Harrie and de Rijke, Maarten and van Hoof, Herke},
  booktitle={Fourteenth ACM Conference on Recommender Systems},
  pages={190--199},
  year={2020}
}

A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.

Related tags

Overview

SOFA

Running the code

More details

Cite

Owner

A Jupyter notebook to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.

Deployment of PyTorch chatbot with Flask

This is a simple backtesting framework to help you test your crypto currency trading. It includes a way to download and store historical crypto data and to execute a trading strategy.

PyTorch implementations of the paper: "Learning Independent Instance Maps for Crowd Localization"

Universal Probability Distributions with Optimal Transport and Convex Optimization

PyTorch implementation of our ICCV 2021 paper Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.

AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE

Automatic meme generation model using Tensorflow Keras.

PyTorch implementation for 3D human pose estimation

Official page of Struct-MDC (RA-L'22 with IROS'22 option); Depth completion from Visual-SLAM using point & line features

Fast and Easy Infinite Neural Networks in Python

Mixed Transformer UNet for Medical Image Segmentation

MultiMix: Sparingly Supervised, Extreme Multitask Learning From Medical Images (ISBI 2021, MELBA 2021)

A curated (most recent) list of resources for Learning with Noisy Labels

Official repository of "DeepMIH: Deep Invertible Network for Multiple Image Hiding", TPAMI 2022.

OverFeat is a Convolutional Network-based image classifier and feature extractor.

A basic neural network for image segmentation.

PyTorch implementation of "VRT: A Video Restoration Transformer"

Implementation for paper "Towards the Generalization of Contrastive Self-Supervised Learning"

This code reproduces the results of the paper, "Measuring Data Leakage in Machine-Learning Models with Fisher Information"