we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks.

Last update: Aug 28, 2022

Related tags

Deep Learning FARNet

Overview

Feature Aggregation and Refinement Network for 2D Anatomical Landmark Detection

Overview

Localization of anatomical landmarks is essential for clinical diagnosis, treatment planning, and research. In this paper, we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks. To alleviate the problem of limited training data in the medical domain, our network adopts a CNN pre-trained on natural images as the backbone network and several popular networks have been compared. Our FARNet also includes a multi-scale feature aggregation module for multiscale feature fusion and a feature refinement module for high-resolution heatmap regression. Coarse-to-fine supervisions are applied to the two modules to facilitate the endto-end training. We further propose a novel loss function named Exponential Weighted Center loss for more accurate heatmap regression, which focuses on the losses from the pixels near landmarks and suppresses the ones from far away. Our network has been evaluated on three publicly available anatomical landmark detection datasets, including cephalometric radiographs, hand radiographs, and spine radiographs, and achieves state-of-art performances on all three datasets.

Data

In this paper, we evaluate our landmark detection network on three public benchmark data sets, a cephalometric X-rays dataset [1], a hand X-rays dataset [2] and a Spinal AnteriorPosterior (AP) X-rays dataset [3].

How to use

Dependencies

This tutorial depends on the following libraries:

pytorch = 1.0.1
numpy = 1.18.5
python >= 3.6
xlwt

config.py

You should set the image path in config by yourself

Run main.py

Run main.py to train the model and test its performance

Some results

Reference

[1] C.-W. Wang, C.-T. Huang, J.-H. Lee, C.-H. Li, S.-W. Chang, M.-J.Siao, T.-M. Lai, B. Ibragimov, T. Vrtovec, O. Ronneberger, et al., “A benchmark for comparison of dental radiography analysis algorithms,” Medical image analysis, vol. 31, pp. 63–76, 2016.
[2] C. Payer, D. ˇStern, H. Bischof, and M. Urschler, “Integrating spatial configuration into heatmap regression based cnns for landmark localization,” Medical Image Analysis, vol. 54, pp. 207–219, 2019.
[3] H. Wu, C. Bailey, P. Rasoulinejad, and S. Li, “Automatic landmark estimation for adolescent idiopathic scoliosis assessment using boostnet,” in International Conference on Medical Image Computing and ComputerAssisted Intervention, 2017.

we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks.

Related tags

Overview

Feature Aggregation and Refinement Network for 2D Anatomical Landmark Detection

Overview

Data

How to use

Dependencies

config.py

Run main.py

Some results

Reference

Owner

aoyueyuan

The code of paper "Block Modeling-Guided Graph Convolutional Neural Networks".

Meta Representation Transformation for Low-resource Cross-lingual Learning

Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021.

Simulator for FRC 2022 challenge: Rapid React

PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].

Code for Subgraph Federated Learning with Missing Neighbor Generation (NeurIPS 2021)

[NeurIPS 2019] Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

High-quality implementations of standard and SOTA methods on a variety of tasks.

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

Code for "Causal autoregressive flows" - AISTATS, 2021

Repositório da disciplina de APC, no segundo semestre de 2021

A PyTorch implementation of Implicit Q-Learning

SpeechNAS Better Trade off between Latency and Accuracy for Large Scale Speaker Verification

Codebase for BMVC 2021 paper "Text Based Person Search with Limited Data"

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

A program that uses computer vision to detect hand gestures, used for controlling movie players.

A Dying Light 2 (DL2) PAKFile Utility for Modders and Mod Makers.

PASSL包含 SimCLR，MoCo，BYOL，CLIP等基于对比学习的图像自监督算法以及 Vision-Transformer，Swin-Transformer，BEiT，CVT，T2T，MLP_Mixer等视觉Transformer算法

Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception, IROS 2021

Code associated with the paper "Deep Optics for Single-shot High-dynamic-range Imaging"