The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Last update: Apr 20, 2022

Related tags

Overview

The Code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Setting up and using the repo

Get the dataset. Follow the steps in data/README.md. This includes the steps to get the pretrained BERT embeddings and visual representations.
Install cuda 11.0 if it's not available already.
Install anaconda if it's not available already, and create a new environment. You need to install a few things, namely, pytorch 1.7.1, torchvision, and allennlp.

wget https://repo.anaconda.com/archive/Anaconda3-5.2.0-Linux-x86_64.sh
conda update -n base -c defaults conda
conda create --name MCC python=3.6
source activate MCC

conda install numpy pyyaml setuptools cmake cffi tqdm pyyaml scipy ipython mkl mkl-include cython typing h5py pandas nltk spacy numpydoc scikit-learn jpeg

conda install pytorch==1.7.1 torchvision==0.8.2 cudatoolkit=11.0 -c pytorch

pip install -r allennlp-requirements.txt
pip install --no-deps allennlp==0.8.0
python -m spacy download en_core_web_sm


# this one is optional but it should help make things faster
pip uninstall pillow && CC="cc -mavx2" pip install -U --force-reinstall pillow-simd

That's it! Now to set up the environment, run source activate MCC.

Train/Evaluate models

Please refer to models/README.md.

Acknowledgement

We refer to the repo r2c and tab-vcr for preprocessing codes.

Cite

@inproceedings{zhang2021multi,
  title={Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning},
  author={Zhang, Xi and Zhang, Feifei and Xu, Changsheng},
  booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
  pages={1793--1802},
  year={2021}
}

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Related tags

Overview

The Code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Setting up and using the repo

Train/Evaluate models

Acknowledgement

Cite

Owner

DCA - Official Python implementation of Delaunay Component Analysis algorithm

PyTorch implementation for 3D human pose estimation

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals.

Image-to-image translation with conditional adversarial nets

Model serving at scale

SE-MSCNN: A Lightweight Multi-scaled Fusion Network for Sleep Apnea Detection Using Single-Lead ECG Signals

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

style mixing for animation face

[CVPR 2022] Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

Make a surveillance camera from your raspberry pi!

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

Build Graph Nets in Tensorflow

DeconvNet : Learning Deconvolution Network for Semantic Segmentation

Implementation of the pix2pix model on satellite images

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Official implementation of MSR-GCN (ICCV 2021 paper)

GPU Programming with Julia - course at the Swiss National Supercomputing Centre (CSCS), ETH Zurich

Make differentially private training of transformers easy for everyone

Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Related tags

Overview

The Code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Setting up and using the repo

Train/Evaluate models

Acknowledgement

Cite

Owner

DCA - Official Python implementation of Delaunay Component Analysis algorithm

PyTorch implementation for 3D human pose estimation

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals.

Image-to-image translation with conditional adversarial nets

Model serving at scale

SE-MSCNN: A Lightweight Multi-scaled Fusion Network for Sleep Apnea Detection Using Single-Lead ECG Signals

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

style mixing for animation face

[CVPR 2022] Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

Make a surveillance camera from your raspberry pi!

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen*, Kaixiong Zhou*, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

Build Graph Nets in Tensorflow

DeconvNet : Learning Deconvolution Network for Semantic Segmentation

Implementation of the pix2pix model on satellite images

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Official implementation of MSR-GCN (ICCV 2021 paper)

GPU Programming with Julia - course at the Swiss National Supercomputing Centre (CSCS), ETH Zurich

Make differentially private training of transformers easy for everyone

Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

[Preprint] "Bag of Tricks for Training Deeper Graph Neural Networks A Comprehensive Benchmark Study" by Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang