The source code and dataset for the RecGURU paper (WSDM 2022)

Last update: Jan 07, 2023

Overview

RecGURU

About The Project

Source code and baselines for the paper "RecGURU: Adversarial Learning of Generalized User Representations for Cross-Domain Recommendation" (WSDM 2022).

Code Structure

RecGURU
├── README.md                                 Read me file
├── data_process                              Data processing methods
│   ├── __init__.py                           Package initialization file
│   ├── amazon_csv.py                         Code for processing the Amazon data (in .csv format)
│   ├── business_process.py                   Code for processing the collected data
│   ├── item_frequency.py                     Calculate item frequency in each domain
│   └── run.sh                                Shell script to perform data processing
├── GURU                                      Scripts for modeling, training, and testing
│   ├── data                                  Dataloader package
│   │   ├── __init__.py                       Package initialization file
│   │   └── data_loader.py                    Customized dataloaders
│   ├── tools                                 Tools such as loss functions, evaluation metrics, etc.
│   │   ├── __init__.py                       Package initialization file
│   │   ├── lossfunction.py                   Customized loss functions
│   │   ├── metrics.py                        Evaluation metrics
│   │   ├── plot.py                           Plot function
│   │   └── utils.py                          Other tools
│   ├── Transformer                           Transformer package
│   │   ├── __init__.py                       Package initialization file
│   │   └── transformer.py                    Transformer module
│   ├── AutoEnc4Rec.py                        Autoencoder-based sequential recommender
│   ├── AutoEnc4Rec_cross.py                  Cross-domain recommender modules
│   ├── config_auto4rec.py                    Model configuration file
│   ├── gan_training.py                       Training methods of the GAN framework
│   ├── train_auto.py                         Main function for training and testing the single-domain sequential recommender
│   └── train_gan.py                          Main function for training and testing the cross-domain sequential recommender
└── .gitignore                                gitignore file

Dataset

Public dataset: the Amazon review dataset, available at https://nijianmo.github.io/amazon/index.html
Collected datasets: https://drive.google.com/file/d/1NbP48emGPr80nL49oeDtPDR3R8YEfn4J/view
Data processing:

Amazon dataset:

```shell
cd ../data_process
python amazon_csv.py   
```

Collected dataset:

```shell
cd ../data_process
python business_process.py --rate 0.1  # fraction of overlapping users = 0.1
```
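
To make the `--rate` option more concrete, here is a minimal sketch of one plausible reading of it: only the given fraction of users shared by both domains is kept as "overlapping". This is an assumption for illustration, not the logic of `business_process.py`; the helper `split_overlap` and the dict-of-sequences input format are hypothetical, and what the script does with the remaining shared users is not specified here.

```python
import random


def split_overlap(users_a, users_b, rate, seed=0):
    """Hypothetical helper (not part of the repository).

    users_a / users_b map a user id to that user's interaction sequence
    in domain a / domain b. Returns two sets of user ids:
    (shared users kept as overlapping, shared users held out).
    """
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must be in [0, 1]")
    # Users that appear in both domains; sorted so sampling is reproducible.
    shared = sorted(set(users_a) & set(users_b))
    n_keep = int(round(rate * len(shared)))
    rng = random.Random(seed)
    kept = set(rng.sample(shared, n_keep))
    held_out = set(shared) - kept
    return kept, held_out
```

With 100 shared users and `rate=0.1`, the sketch keeps 10 of them as overlapping and holds out the other 90.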

After data processing, each cross-domain scenario has its own dataset folder:

."a_domain"-"b_domain"
├── a_only.pickle         # users in domain a only
├── b_only.pickle         # users in domain b only
├── a.pickle              # all users in domain a
├── b.pickle              # all users in domain b
└── a_b.pickle            # users present in both domain a and domain b

Note: see the code for processing details and make modifications accordingly.
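
For a quick check of a processed folder, the five files listed above can be read back with the standard `pickle` module. The following is a minimal sketch, assuming the files are named exactly as in the listing; the helper `load_scenario` is hypothetical, and the structure of each unpickled object depends on the processing scripts, so inspect it before use.

```python
import pickle
from pathlib import Path

# File stems taken from the folder listing above.
SCENARIO_FILES = ("a_only", "b_only", "a", "b", "a_b")


def load_scenario(folder):
    """Hypothetical helper: load every pickle of one cross-domain scenario.

    Returns a dict mapping the file stem (e.g. "a_b") to the unpickled object.
    Raises FileNotFoundError if any of the expected files is missing.
    """
    folder = Path(folder)
    data = {}
    for name in SCENARIO_FILES:
        with open(folder / f"{name}.pickle", "rb") as f:
            data[name] = pickle.load(f)
    return data
```

Calling `load_scenario` on a scenario folder and printing `type(value)` for each entry is an easy way to confirm what the processing step produced. Only unpickle files you generated yourself or obtained from a trusted source, since `pickle.load` can execute arbitrary code.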

Run

Single-domain Methods:

# SAS
python train_auto.py --sas "True"
# AutoRec (ours)
python train_auto.py

Cross-Domain Methods:

# RecGURU
python train_gan.py --cross "True"

Owner

Chenglin Li
