Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.

Last update: Dec 28, 2022

Related tags

Overview

Implicit Feature Refinement for Instance Segmentation

This repository is an official implementation of the ACM Multimedia 2021 paper Implicit Feature Refinement for Instance Segmentation.

Introduction

TL; DR. Implicit feature refinement (IFR) enjoys several advantages: 1) simulates an infinite-depth refinement network while only requiring parameters of single residual block; 2) produces high-level equilibrium instance features of global receptive field; 3) serves as a general plug-and-play module easily extended to most object recognition frameworks.

Get Started

Install cvpods following the instructions

# Install cvpods
git clone https://github.com/Megvii-BaseDetection/cvpods.git
cd cvpods 
## build cvpods (requires GPU)
python3 setup.py build develop
## preprare data path
mkdir datasets
ln -s /path/to/your/coco/dataset datasets/coco

To save the training and testing time, the explicit form of our IFR, annotated with "weight_sharing", is provided on mask_rcnn to achieve competitive performance.
For fast evaluation, please download trained model from here.
Run the project

git clone https://github.com/lufanma/IFR.git

# for example(e.g. mask_rcnn.ifr)
cd IFR/mask_rcnn.ifr.res50.fpn.coco.multiscale.1x/

# train
sh pods_train.sh

# test
sh pods_test.sh
# test with provided weights
sh pods_test.sh \
    MODEL.WEIGHTS /path/to/your/save_dir/ckpt.pth # optional
    OUTPUT_DIR /path/to/your/save_dir # optional

Results

Model	AP	AP50	AP75	APs	APm	APl	Link
mask_rcnn.ifr.res50.fpn.coco.multiscale.1x	36.3	56.8	39.2	17.3	39.0	52.2	download
mask_rcnn.res50.fpn.coco.multiscale.weight_sharing.1x	35.9	56.7	38.5	17.1	38.5	51.8	download
cascade_rcnn.ifr.res50.fpn.coco.800size.1x	36.9	57.1	39.8	17.4	39.3	54.6	download

Citing IFR

If you find IFR useful to your research, please consider citing:

@inproceedings{ma2021implicit,
  title={Implicit Feature Refinement for Instance Segmentation},
  author={Ma, Lufan and Wang, Tiancai and Dong, Bin and Yan, Jiangpeng and Li, Xiu and Zhang, Xiangyu},
  booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
  pages={3088--3096},
  year={2021}
}

Given thanks to the open source of DEQ and MDEQ, our IFR is developed based on them.

Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.

Related tags

Overview

Implicit Feature Refinement for Instance Segmentation

Introduction

Get Started

Results

Citing IFR

Owner

Lufan Ma

Python Actor concurrency library

docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Miscellaneous and lightweight network tools

Official repository for "Intriguing Properties of Vision Transformers" (2021)

An implementation of "Learning human behaviors from motion capture by adversarial imitation"

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

A multilingual version of MS MARCO passage ranking dataset

Paddle implementation for "Highly Efficient Knowledge Graph Embedding Learning with Closed-Form Orthogonal Procrustes Analysis" (NAACL 2021)

Vignette is a face tracking software for characters using osu!framework.

Unsupervised MRI Reconstruction via Zero-Shot Learned Adversarial Transformers

Model-based Reinforcement Learning Improves Autonomous Racing Performance

code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification

My usage of Real-ESRGAN to upscale anime, some test and results in the test_img folder

Semi-supervised Implicit Scene Completion from Sparse LiDAR

Real-Time-Student-Attendence-System - Real Time Student Attendence System

Leibniz is a python package which provide facilities to express learnable partial differential equations with PyTorch

Benchmarks for Model-Based Optimization

Posterior temperature optimized Bayesian models for inverse problems in medical imaging

[ACMMM 2021, Oral] Code release for "Elastic Tactile Simulation Towards Tactile-Visual Perception"

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis