PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Last update: Dec 08, 2022

Related tags

Overview

MIRCO

PyTorch implementation for paper: Latent Structures Mining with Contrastive Modality Fusion for Multimedia Recommendation

Dependencies

Python 3.6
torch==1.5.0
scikit-learn==0.24.2
torch-scatter==2.0.8

Dataset Preparation

Download 5-core reviews data, meta data, and image features from Amazon product dataset. Put data into the directory data/meta-data/.

Install sentence-transformers and download pretrained models to extract textual features. Unzip pretrained model into the directory sentence-transformers/:

├─ data/: 
    ├── sports/
    	├── meta-data/
    		├── image_features_Sports_and_Outdoors.b
    		├── meta-Sports_and_Outdoors.json.gz
    		├── reviews_Sports_and_Outdoors_5.json.gz
    ├── sentence-transformers/
        	├── stsb-roberta-large

Run python build_data.py to preprocess data.
Run python cold_start.py to build cold-start data.
We provide processed data Baidu Yun (access code: m37q), Google Drive.

Usage

Start training and inference as:

cd codes
python main.py --dataset {DATASET}

For cold-start settings:

python main.py --dataset {DATASET} --core 0 --verbose 1 --lr 1e-5

Citation

If you want to use our codes in your research, please cite:

@article{MICRO21,
  title     = {Latent Structures Mining with Contrastive Modality Fusion for Multimedia Recommendation},
  author    = {Zhang, Jinghao and 
               Zhu, Yanqiao and 
               Liu, Qiang and
               Zhang, Mengqi and
               Wu, Shu and 
               Wang, Liang},
  journal = {arXiv.org},
  year={2021},
  eprint={2111.00678},
  archivePrefix={arXiv},
  primaryClass={cs.IR}
}

Acknowledgement

The structure of this code is largely based on LightGCN. Thank for their work.

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Related tags

Overview

MIRCO

Dependencies

Dataset Preparation

Usage

Citation

Acknowledgement

Owner

Big Data and Multi-modal Computing Group, CRIPAC

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Synthetic Humans for Action Recognition, IJCV 2021

A proof of concept ai-powered Recaptcha v2 solver

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

PyTorch implementation of MulMON

Addition of pseudotorsion caclulation eta, theta, eta', and theta' to barnaba package

Official code for paper Exemplar Based 3D Portrait Stylization.

Cupytorch - A small framework mimics PyTorch using CuPy or NumPy

Analysis of Smiles through reservoir sampling & RDkit

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Multiview 3D object detection on MultiviewC dataset through moft3d.

FMA: A Dataset For Music Analysis

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Graph Neural Networks with Keras and Tensorflow 2.

Multi-modal Vision Transformers Excel at Class-agnostic Object Detection

A library that can print Python objects in human readable format

SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

Toward Realistic Single-View 3D Object Reconstruction with Unsupervised Learning from Multiple Images (ICCV 2021)

Answer a series of contextually-dependent questions like they may occur in natural human-to-human conversations.