HOI Transformer

Code for CVPR 2021 accepted paper End-to-End Human Object Interaction Detection with HOI Transformer.

Reproduction

We recomend you to setup in the following steps:

1.Clone the repo.

git clone https://github.com/bbepoch/HoiTransformer.git

2.Download the MS-COCO pretrained DETR model.

cd data/detr_coco && bash download_model.sh

3.You are supposed to make a soft link named 'images' in 'data/hico/' to refer to your HICO-DET path, or your will have to modify the data path manually in hico.py.

ln -s /path-to-your-hico-det-dataset/hico_20160224_det/images images

4.Train a model.

python3 -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --epochs=250 --lr_drop=200 --dataset_file=hico --batch_size=2 --backbone=resnet50

5.Test a model.

python3 test.py --dataset_file=hico --batch_size=1 --log_dir=./ --model_path=your_model_path

Citation

@inproceedings{zou2021_hoitrans,
author = {Zou, Cheng and Wang, Bohan and Hu, Yue and Liu, Junqi and Wu, Qian and Zhao, Yu and Li, Boxun and Zhang, Chenguang and Zhang, Chi and Wei, Yichen and Sun, Jian},
title = {End-to-End Human Object Interaction Detection with HOI Transformer},
booktitle={CVPR},
year = {2021},
}

Acknowledgement

We sincerely thank all previous works, especially DETR, PPDM, iCAN, for some of the codes are built upon them.

This is the code for HOI Transformer

Related tags

Overview

HOI Transformer

Reproduction

Citation

Acknowledgement

Owner

BigBangEpoch

Official code for ICCV2021 paper "M3D-VTON: A Monocular-to-3D Virtual Try-on Network"

Attentional Focus Modulates Automatic Finger‑tapping Movements

Python PID Tuner - Makes a model of the System from a Process Reaction Curve and calculates PID Gains

Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

SASM - simple crossplatform IDE for NASM, MASM, GAS and FASM assembly languages

project page for VinVL

Bayesian Optimization Library for Medical Image Segmentation.

A fuzzing framework for SMT solvers

YuNetのPythonでのONNX、TensorFlow-Lite推論サンプル

Physics-informed convolutional-recurrent neural networks for solving spatiotemporal PDEs

This repository contains the code for the paper "PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization"

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Grounding Representation Similarity with Statistical Testing

Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)

Hyperbolic Procrustes Analysis Using Riemannian Geometry

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Pytorch implementation of ProjectedGAN

The open source code of SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation.

CC-GENERATOR - A python script for generating CC