Vision Transformer for 3D medical image registration (PyTorch).

Last update: Dec 20, 2022

Overview

ViT-V-Net: Vision Transformer for Volumetric Medical Image Registration

keywords: vision transformer, convolutional neural networks, image registration

This is a PyTorch implementation of my short paper:

Chen, Junyu, et al. "ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration." arXiv, 2021.

train.py is the training script. models.py contains the ViT-V-Net model.
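For readers new to unsupervised registration: methods in this family (VoxelMorph, listed under Reference, is one) typically have the network predict a dense displacement field, which is then used to resample the moving volume. The sketch below shows only that warping step. It is not taken from models.py; `warp_volume` is a hypothetical helper, and the channel order of `flow` (displacements along depth, height, width, in voxels) is an assumption made for illustration.

```python
import torch
import torch.nn.functional as F


def warp_volume(moving: torch.Tensor, flow: torch.Tensor) -> torch.Tensor:
    """Resample `moving` (N, C, D, H, W) at positions shifted by `flow` (N, 3, D, H, W).

    Hypothetical helper: flow[:, 0], flow[:, 1], flow[:, 2] are assumed to be
    voxel displacements along D, H and W. Each size must be greater than 1.
    """
    _, _, d, h, w = moving.shape
    # Identity grid in voxel coordinates, shape (3, D, H, W).
    axes = [torch.arange(s, dtype=moving.dtype, device=moving.device) for s in (d, h, w)]
    identity = torch.stack(torch.meshgrid(*axes, indexing="ij"))
    # Sampling location for every output voxel: its own position plus the displacement.
    coords = identity.unsqueeze(0) + flow
    # grid_sample expects coordinates in [-1, 1] (align_corners=True convention).
    sizes = torch.tensor([d, h, w], dtype=moving.dtype, device=moving.device)
    coords = 2.0 * coords / (sizes.view(1, 3, 1, 1, 1) - 1) - 1.0
    # grid_sample wants shape (N, D, H, W, 3) with the last axis ordered (x, y, z) = (W, H, D).
    coords = coords.permute(0, 2, 3, 4, 1).flip(-1)
    # For 5D inputs, mode="bilinear" performs trilinear interpolation.
    return F.grid_sample(moving, coords, mode="bilinear", align_corners=True)


moving = torch.rand(1, 1, 4, 5, 6)
flow = torch.zeros(1, 3, 4, 5, 6)  # zero displacement: the volume should come back unchanged
warped = warp_volume(moving, flow)
```

Because the resampling is differentiable, an image-similarity loss between `warped` and the fixed volume can be backpropagated to whatever network produced `flow`, which is what lets this kind of training proceed without ground-truth deformations.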

Pretrained ViT-V-Net: pretrained model

Dataset: Due to restrictions, we cannot distribute our brain MRI data. However, several brain MRI datasets are publicly available online: IXI, ADNI, OASIS, ABIDE, etc. Note that these datasets may not include segmentation labels. To generate labels, you can use FreeSurfer, an open-source software package for processing and analyzing brain MRI images. Here are some useful FreeSurfer commands: Brain MRI preprocessing and subcortical segmentation using FreeSurfer.
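The text above does not say how the labels are used. A common use in registration work is to score anatomical overlap between the fixed segmentation and the warped moving segmentation, for example with the Dice coefficient. The sketch below assumes that use; `dice_per_label` is a hypothetical helper and the tiny arrays are made-up toy data, not results.

```python
import numpy as np


def dice_per_label(seg_a: np.ndarray, seg_b: np.ndarray, labels) -> dict:
    """Dice overlap 2|A and B| / (|A| + |B|) for each label id (hypothetical helper)."""
    scores = {}
    for label in labels:
        a = seg_a == label
        b = seg_b == label
        denom = a.sum() + b.sum()
        # A label absent from both volumes has no defined overlap.
        scores[label] = 2.0 * np.logical_and(a, b).sum() / denom if denom > 0 else float("nan")
    return scores


# Toy 4x4x4 label maps: two structures, with one slice mislabeled after "registration".
fixed_seg = np.zeros((4, 4, 4), dtype=int)
fixed_seg[:2] = 1
fixed_seg[2:] = 2
warped_seg = fixed_seg.copy()
warped_seg[2] = 1

scores = dice_per_label(fixed_seg, warped_seg, labels=[1, 2])
```

When warping a label map for this purpose, nearest-neighbor interpolation is usually preferred over linear interpolation so that label ids are not blended.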

Model Architecture:

Vision Transformer Architecture:

Example Results:

Quantitative Results:

Reference:

TransUNet

ViT-pytorch

VoxelMorph

If you find this code useful in your research, please consider citing:

@misc{chen2021vitvnet,
title={ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration}, 
author={Junyu Chen and Yufan He and Eric C. Frey and Ye Li and Yong Du},
year={2021},
eprint={2104.06468},
archivePrefix={arXiv},
primaryClass={eess.IV}
}
