Robust Partial Matching for Person Search in the Wild

Last update: Dec 18, 2022

Related tags

Overview

APNet for Person Search

Introduction

This is the code of Robust Partial Matching for Person Search in the Wild accepted in CVPR2020. The Align-to-Part Network(APNet) is proposed to alleviate the misalignment problem occurred in pedestrian detector, facilitating the downstream re-identification task. The code is based on maskrcnn-benchmark.

Quick start

Installation

Please follow the offical installation INSTALL.md. This code does not support the mixed precision training, so feel free to skip the installation of apex.

NOTE: If you meet some problems during the installation, you may find a solution in issues of official maskrcnn-benchmark.

Install APNet

git clone https://github.com/zhongyingji/APNet.git
cd APNet
rm -rf build/
python setup.py build develop

Dataset Preparation

Make sure you have downloaded the dataset of person search like PRW-v16.04.20.

Since the training of APNet relies on the keypoint annotation, we provide the keypoint estimation file by AlphaPose in keypoint_pred/. Copy all the files into the root dir of dataset, like /path_to_prw_dataset/PRW-v16.04.20/:

cp keypoint_pred/* /path_to_prw_dataset/PRW-v16.04.20/

Symlink the path to the dataset to datasets/ as follows:

ln -s /path_to_prw_dataset/PRW-v16.04.20/ maskrcnn_benchmark/datasets/PRW-v16.04.20

Training

APNet composes of three modules, OIM, RSFE and BBA. To train the entire network, you can simply run:

./train.sh

which contains the training scripts of the three modules.

NOTE: Both RSFE and BBA are required to be intialised with the trained OIM. For more details, please check train.sh.

You can alter the scripts in train.sh in the following aspects:

We train OIM on 2 GPUS with batchsize 4. If you encounter out-of-memory (OOM) error, reduce the batchsize by setting SOLVER.IMS_PER_BATCH to a smaller number.
If you want to use 1 GPU, replace the command of OIM with single GPU training script:

python tools/train_net.py --config-file "configs/reid/prw_R_50_C4.yaml" SOLVER.IMS_PER_BATCH 2 TEST.IMS_PER_BATCH 8 OUTPUT_DIR "models/prw_oim"

Test

After each of the module has been trained, you can run exactly the same training script of that module to test the performance.

Citation

If you find this work or code is helpful in your research, please consider citing:

Robust Partial Matching for Person Search in the Wild

Related tags

Overview

APNet for Person Search

Introduction

Quick start

Installation

Dataset Preparation

Training

Test

Citation

Owner

Yingji Zhong

A High-Level Fusion Scheme for Circular Quantities published at the 20th International Conference on Advanced Robotics

A vanilla 3D face modeling on pose-invariant and multi-lightning image data

Simulation environments for the CrazyFlie quadrotor: Used for Reinforcement Learning and Sim-to-Real Transfer

Create Data & AI apps in 20 lines of code with Shimoku

Code for ICCV 2021 paper "HuMoR: 3D Human Motion Model for Robust Pose Estimation"

Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)

Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch

A project to make Amazon Echo respond to sign language using your webcam

PyTorch implementation of Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Dyalog-apl-docset - Dyalog APL Dash Docset Generator

PyTorch implementation of Pointnet2/Pointnet++

This repository contains the accompanying code for Deep Virtual Markers for Articulated 3D Shapes, ICCV'21

Discovering and Achieving Goals via World Models

A dual benchmarking study of visual forgery and visual forensics techniques

Pytorch implementation for "Implicit Semantic Response Alignment for Partial Domain Adaptation"

Implementation of Memformer, a Memory-augmented Transformer, in Pytorch

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

PyTorch Implementation of Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation.

GeoTransformer - Geometric Transformer for Fast and Robust Point Cloud Registration

A Pytorch Implementation of a continuously rate adjustable learned image compression framework.