Transparent Transformer Segmentation

Last update: Jan 02, 2023

Introduction

This repository contains the data and code for the IJCAI 2021 paper "Segmenting Transparent Object in the Wild with Transformer".

Environments

python 3
torch = 1.4.0
torchvision
pyyaml
Pillow
numpy
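
The requirements above can be checked before installing. The following is a minimal sketch, not part of the repository: it assumes Python 3.8+ (for `importlib.metadata`), it reads `torch = 1.4.0` as an exact version pin, and the name `check_environment` is hypothetical.

```python
from importlib import metadata  # standard library, Python 3.8+

# Packages from the list above; only torch has a version stated there.
REQUIRED = {
    "torch": "1.4.0",
    "torchvision": None,
    "pyyaml": None,
    "Pillow": None,
    "numpy": None,
}


def check_environment(get_version=metadata.version):
    """Return a list of problems; an empty list means all packages were found.

    `get_version` maps a package name to its installed version string and is
    a parameter only so that the check can be exercised without installing
    anything.
    """
    problems = []
    for name, pinned in REQUIRED.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            problems.append(f"{name}: not installed")
            continue
        # Ignore a local build suffix such as "+cu100" when comparing.
        if pinned is not None and found.split("+")[0] != pinned:
            problems.append(f"{name}: found {found}, expected {pinned}")
    return problems
```

Calling `check_environment()` with no arguments inspects the active Python environment and returns one message per missing or mismatched package.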

Install

python setup.py develop --user

Data Preparation

Create the directory './datasets/transparent/Trans10K_v2'.
Put the train/validation/test data under './datasets/transparent/Trans10K_v2'. The data structure is shown below.

Trans10K_v2
├── test
│   ├── images
│   └── masks_12
├── train
│   ├── images
│   └── masks_12
└── validation
    ├── images
    └── masks_12

Download the dataset from Google Drive or Baidu Drive (extraction code: oqms).
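
The expected layout can be created or verified with a few lines of Python. This is a minimal sketch based only on the tree shown in this section; the helper names `create_layout` and `missing_dirs` are hypothetical and do not exist in the repository.

```python
from pathlib import Path

# Split and subdirectory names taken from the Trans10K_v2 tree in this section.
SPLITS = ("train", "validation", "test")
SUBDIRS = ("images", "masks_12")


def create_layout(root):
    """Create the empty Trans10K_v2 directory skeleton under `root`."""
    root = Path(root)
    for split in SPLITS:
        for sub in SUBDIRS:
            (root / split / sub).mkdir(parents=True, exist_ok=True)


def missing_dirs(root):
    """Return the expected subdirectories that do not exist under `root`."""
    root = Path(root)
    return [
        str(Path(split) / sub)
        for split in SPLITS
        for sub in SUBDIRS
        if not (root / split / sub).is_dir()
    ]
```

For example, `missing_dirs("./datasets/transparent/Trans10K_v2")` returns an empty list once the downloaded data has been unpacked into the expected folders.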

Network Definition

The network pipeline is implemented in segmentron/models/trans2seg.py.

The Transformer encoder-decoder is implemented in segmentron/modules/transformer.py.

Train

Our experiments were run on a single machine with 8 V100 GPUs (32 GB of memory each). Training takes about 1 hour.

bash tools/dist_train.sh $CONFIG_FILE $GPUS

For example:

bash tools/dist_train.sh configs/trans10kv2/trans2seg/trans2seg_medium.yaml 8

Test

bash tools/dist_train.sh $CONFIG_FILE $GPUS --test TEST.TEST_MODEL_PATH $MODEL_PATH
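
Training and testing use the same launcher script and differ only in the trailing `--test TEST.TEST_MODEL_PATH` override. The sketch below assembles either command as an argument list so it can be launched from Python; `build_command` is a hypothetical helper, not part of the repository, and it only mirrors the command lines shown in this README.

```python
def build_command(config_file, gpus, model_path=None):
    """Build the argument list for tools/dist_train.sh.

    With `model_path` left as None this is the training command; passing a
    checkpoint path appends the test override shown in the Test section.
    """
    cmd = ["bash", "tools/dist_train.sh", str(config_file), str(gpus)]
    if model_path is not None:
        cmd += ["--test", "TEST.TEST_MODEL_PATH", str(model_path)]
    return cmd
```

From the repository root, the result can be passed to `subprocess.run(cmd, check=True)` to start the job.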

Citations

If this project helps your research, please consider citing our paper. The BibTeX reference is as follows.

@article{xie2021segmenting,
  title={Segmenting transparent object in the wild with transformer},
  author={Xie, Enze and Wang, Wenjia and Wang, Wenhai and Sun, Peize and Xu, Hang and Liang, Ding and Luo, Ping},
  journal={arXiv preprint arXiv:2101.08461},
  year={2021}
}

Owner

Enze Xie (谢恩泽)