The official codes for the ICCV2021 presentation "Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting"

Last update: Dec 14, 2022

Related tags

Deep Learning CrowdCounting-UEPNet

Overview

UEPNet (ICCV2021 Poster Presentation)

This repository contains codes for the official implementation in PyTorch of UEPNet as described in Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting.

The codes is tested with PyTorch 1.5.0. It may not run with other versions.

Visualized results for UEPNet

The network

The network structure of the proposed UEPNet. It consists of a simple encoderdecoder network for feature extraction and an Interleaved Prediction Head to classify each patch into certain interval.

Comparison with state-of-the-art methods

The UEPNet achieved state-of-the-art performance on several challenging datasets with various densities, although using a quite simple network structure.

Installation

Clone this repo into a directory named UEPNet_ROOT
Organize your datasets as required
Install Python dependencies. We use python 3.6.5 and pytorch 1.5.0

pip install -r requirements.txt

Organize the counting dataset

We use a list file to collect all the images and their ground truth annotations in a counting dataset. When your dataset is organized as recommended in the following, the format of this list file is defined as:

train/scene01/img01.jpg train/scene01/img01.txt
train/scene01/img02.jpg train/scene01/img02.txt
...
train/scene02/img01.jpg train/scene02/img01.txt

Dataset structures:

DATA_ROOT/
        |->train/
        |    |->scene01/
        |    |->scene02/
        |    |->...
        |->test/
        |    |->scene01/
        |    |->scene02/
        |    |->...
        |->train.list
        |->test.list

DATA_ROOT is your path containing the counting datasets.

Annotations format

For the annotations of each image, we use a single txt file which contains one annotation per line. Note that indexing for pixel values starts at 0. The expected format of each line is:

x1 y1
x2 y2
...

Testing

A trained model (with an MAE of 54.64) on SHTechPartA is available at "./ckpt", run the following commands to conduct an evaluation:

CUDA_VISIBLE_DEVICES=0 python3 test.py \
    --train_lists $DATA_ROOT/train.list \
    --test_lists $DATA_ROOT/test.list \
    --dataset_mode shtechparta \
    --checkpoints_dir ./ckpt/ \
    --dataroot $DATA_ROOT \
    --model uep \
    --phase test \
    --vgg_post_pool \
    --gpu_ids 0

Acknowledgements

Part of codes are borrowed from the pytorch-CycleGAN-and-pix2pix.

Citing UEPNet

If you find UEPNet is useful in your project, please consider citing us:

@inproceedings{wang2021uniformity,
  title={Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting},
  author={Wang, Changan and Song, Qingyu and Zhang, Boshen and Wang, Yabiao and Tai, Ying and Hu, Xuyi and Wang, Chengjie and Li, Jilin and Ma, Jiayi and Wu, Yang},
  journal={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year={2021}
}

Related works from Tencent Youtu Lab

[AAAI2021] To Choose or to Fuse? Scale Selection for Crowd Counting. (paper link & codes)
[ICCV2021] Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework. (paper link & codes)

The official codes for the ICCV2021 presentation "Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd Counting"

Related tags

Overview

UEPNet (ICCV2021 Poster Presentation)

Visualized results for UEPNet

The network

Comparison with state-of-the-art methods

Installation

Organize the counting dataset

Dataset structures:

Annotations format

Testing

Acknowledgements

Citing UEPNet

Related works from Tencent Youtu Lab

Owner

Tencent YouTu Research

Traffic4D: Single View Reconstruction of Repetitious Activity Using Longitudinal Self-Supervision

HeartRate detector with ArduinoandPython - Use Arduino and Python create a heartrate detector.

Location-Sensitive Visual Recognition with Cross-IOU Loss

Stitch it in Time: GAN-Based Facial Editing of Real Videos

Athena is the only tool that you will ever need to optimize your portfolio.

Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)

Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks

시각 장애인을 위한 스마트 지팡이에 활용될 딥러닝 모델 (DL Model Repo)

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

Create Data & AI apps in 20 lines of code with Shimoku

DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

Noise Conditional Score Networks (NeurIPS 2019, Oral)

Style transfer between images was performed using the VGG19 model

This is a simple face recognition mini project that was completed by a team of 3 members in 1 week's time

Apache Flink

Non-stationary GP package written from scratch in PyTorch

Codecov coverage standard for Python

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Deep Residual Learning for Image Recognition

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.