Unifying Global-Local Representations in Salient Object Detection with Transformer

Last update: Aug 24, 2022

Overview

GLSTR (Global-Local Saliency Transformer)

This is the official implementation of the paper "Unifying Global-Local Representations in Salient Object Detection with Transformer" by Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, and Shengfeng He.

Prerequisites

The whole training process can be run on eight RTX 2080 Ti GPUs or four RTX 3090 GPUs.

PyTorch 1.6

Datasets

Training Set

We use the training set of DUTS (DUTS-TR) to train our model.

/path/to/DUTS-TR/
   img/
      img1.jpg
   label/
      label1.png
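As a minimal sketch of how this layout could be read, the helper below pairs every file in img/ with a mask in label/. It is not taken from the repository: the function name `pair_images_and_labels` is hypothetical, and it assumes that the file names in the tree above are placeholders and that each image and its mask share the same base name (for example `0001.jpg` and `0001.png`).

```python
from pathlib import Path
from typing import List, Tuple


def pair_images_and_labels(root: str) -> List[Tuple[Path, Path]]:
    """Pair each img/<name>.jpg with label/<name>.png under root.

    Assumption (not confirmed by the README): an image and its mask
    share the same base name and differ only in directory and extension.
    """
    root_dir = Path(root)
    img_dir = root_dir / "img"
    label_dir = root_dir / "label"

    pairs = []
    # Sort so the order is deterministic across runs and file systems.
    for img_path in sorted(img_dir.glob("*.jpg")):
        label_path = label_dir / (img_path.stem + ".png")
        if not label_path.is_file():
            raise FileNotFoundError(f"No label found for {img_path.name}")
        pairs.append((img_path, label_path))
    return pairs
```

Calling `pair_images_and_labels("/path/to/DUTS-TR")` before training is a quick way to catch a missing or misnamed mask early, since the helper raises as soon as an image has no matching label.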

Testing Set

We test our model on the testing set of DUTS (DUTS-TE), ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD.

Training

Download the transformer backbone pretrained on ImageNet.

# Set the paths to the training data and the pretrained backbone in train.sh
bash train.sh

Testing

Download the pretrained model from Baidu Pan (code: uo0a) or Google Drive, and put it in ./ckpt/

python test.py

Evaluation

The precomputed saliency maps (DUTS-TE, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD) can be found at Baidu Pan (code: uo0a) or Google Drive.

After the paper was submitted, we retrained the model and its performance improved. Feel free to use either the results reported in our paper or the precomputed saliency maps.
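This README does not say which metrics or scripts were used for evaluation. Mean absolute error (MAE) is one metric commonly reported for salient object detection, and the sketch below shows how it could be computed for a single saliency map against its ground-truth mask. The function name `mean_absolute_error` is hypothetical, both inputs are assumed to be same-sized 8-bit grayscale maps (values 0 to 255), and plain nested lists are used only to keep the example dependency-free.

```python
from typing import Sequence


def mean_absolute_error(
    pred: Sequence[Sequence[float]],
    gt: Sequence[Sequence[float]],
) -> float:
    """MAE between two same-sized 8-bit maps, after scaling both to [0, 1]."""
    if len(pred) != len(gt) or any(len(p) != len(g) for p, g in zip(pred, gt)):
        raise ValueError("prediction and ground truth must have the same shape")

    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            # Scale 8-bit values to [0, 1] before taking the difference.
            total += abs(p / 255.0 - g / 255.0)
            count += 1
    if count == 0:
        raise ValueError("maps must contain at least one pixel")
    return total / count
```

A dataset-level score would average this value over all image pairs. In practice the maps would be loaded from the PNG files with an image library rather than written out as lists.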

Contact

If you have any questions, feel free to email Sucheng Ren :) ([email protected])

Citation

Please cite our paper if you find the code or the paper helpful.

@article{ren2021unifying,
  title={Unifying Global-Local Representations in Salient Object Detection with Transformer},
  author={Ren, Sucheng and Wen, Qiang and Zhao, Nanxuan and Han, Guoqiang and He, Shengfeng},
  journal={arXiv preprint arXiv:2108.02759},
  year={2021}
}
