RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth, in ICCV 2021 (oral)

Last update: Dec 15, 2022

Overview

RINDNet

RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth
Mengyang Pu, Yaping Huang, Qingji Guan and Haibin Ling
ICCV 2021 (oral)

Please refer to the supplementary material (code: p86d, ~60 MB) for more results.

Benchmark --- 🔥 🔥 BSDS-RIND 🔥 🔥

BSDS-RIND is the first public benchmark dedicated to studying four edge types simultaneously, namely Reflectance Edge (RE), Illumination Edge (IE), Normal Edge (NE) and Depth Edge (DE). It was created by carefully labeling images from BSDS500. The data can be downloaded from:

Original images: BSDS500
Our annotations: BSDS-RIND (Baidu Netdisk, code: e7rg; Google Drive)

Abstract

As a fundamental building block in computer vision, edges can be categorised into four types according to the discontinuity in surface-Reflectance, Illumination, surface-Normal or Depth. While great progress has been made in detecting generic or individual types of edges, the comprehensive study of all four edge types together remains under-explored. In this paper, we propose a novel neural network solution, RINDNet, to jointly detect all four types of edges. Taking into consideration the distinct attributes of each edge type and the relationships between them, RINDNet learns effective representations for each of them and works in three stages. In stage I, RINDNet uses a common backbone to extract features shared by all edges. Then, in stage II, it branches to prepare discriminative features for each edge type with a corresponding decoder. In stage III, an independent decision head for each type aggregates the features from the previous stages to predict the initial results. Additionally, an attention module learns attention maps for all types to capture the underlying relations between them, and these maps are combined with the initial results to generate the final edge detection results. For training and evaluation, we construct the first public benchmark, BSDS-RIND, with all four types of edges carefully annotated. In our experiments, RINDNet yields promising results in comparison with state-of-the-art methods.
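
The three-stage data flow described in the abstract can be sketched schematically as below. This is only an illustrative, dependency-free sketch and not the released implementation: every function name (`backbone`, `make_decoder`, `decision_head`, `attention`, `rindnet_forward`) is hypothetical, each stage body is a trivial placeholder rather than a learned layer, and combining attention maps with initial results by element-wise product is an assumption, since the abstract does not say how they are combined.

```python
from typing import Callable, Dict, List

# The four edge types studied in BSDS-RIND.
EDGE_TYPES = ("reflectance", "illumination", "normal", "depth")

Map2D = List[List[float]]  # toy single-channel image or feature map


def map2d(f: Callable[..., float], *maps: Map2D) -> Map2D:
    """Apply f element-wise across same-sized 2D lists."""
    return [[f(*vals) for vals in zip(*rows)] for rows in zip(*maps)]


def backbone(image: Map2D) -> Map2D:
    # Stage I: features shared by all edge types (placeholder: identity).
    return image


def make_decoder(scale: float) -> Callable[[Map2D], Map2D]:
    # Stage II: one decoder per edge type (placeholder: a fixed scaling).
    return lambda feats: map2d(lambda v: v * scale, feats)


def decision_head(shared: Map2D, specific: Map2D) -> Map2D:
    # Stage III: aggregate features from the previous stages into an
    # initial result (placeholder: a plain average).
    return map2d(lambda a, b: 0.5 * (a + b), shared, specific)


def attention(shared: Map2D) -> Dict[str, Map2D]:
    # One attention map per edge type (placeholder: constant 1.0).
    return {t: map2d(lambda v: 1.0, shared) for t in EDGE_TYPES}


def rindnet_forward(image: Map2D) -> Dict[str, Map2D]:
    shared = backbone(image)
    scales = (1.0, 0.5, 0.25, 0.0)  # arbitrary placeholder values
    decoders = {t: make_decoder(s) for t, s in zip(EDGE_TYPES, scales)}
    specific = {t: decoders[t](shared) for t in EDGE_TYPES}
    initial = {t: decision_head(shared, specific[t]) for t in EDGE_TYPES}
    attn = attention(shared)
    # Assumption: element-wise product of initial result and attention map.
    return {t: map2d(lambda p, a: p * a, initial[t], attn[t])
            for t in EDGE_TYPES}


if __name__ == "__main__":
    toy_image = [[0.0, 1.0], [1.0, 0.0]]
    for edge_type, edge_map in rindnet_forward(toy_image).items():
        print(edge_type, edge_map)
```

The point of the sketch is the wiring: one shared feature extractor, four parallel decoder and head branches, and a separate attention path whose per-type maps modulate the per-type initial predictions.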

Code and Main results ----- Coming Soon...

Acknowledgments

This work was partially done while Mengyang was at Stony Brook University.
We thank the anonymous reviewers for valuable and inspiring comments and suggestions.
