SegFormer_Segmentation

The code uses SegFormer for Semantic Segmentation on Drone Dataset.
The details for the SegFormer can be obtained from the following cited paper and the drone dataset can be downloaded from the link below.
Alternatively, you can also download the dataset from Kaggle, the link is mentioned below.
Clone the repository and install all the packages mentioned in the requirement.txt file.
If you just want to infer the semantic segmentation, open the segformer_inf.py, change the image file name you want to test and run the code.
Make sure the trained model is in the model folder. You can download the model at https://drive.google.com/file/d/1zsHyMlGJCpPZrDB0v3ZeaogTcUULmUVB/view?usp=sharing.
Alternatively, you can train the model and save it, locally, by running segformer_train.py.

If you want to train the SegFormer on the drone dataset. Make sure that the directory structure is as follows:
root
| drone_dataset
|---images
|----|---test
|----|---train
|---mask
|----|---test
|----|---train
|---class_dict_seg.csv

Demo Inference

Citations and References

SegFormer
@article{xie2021segformer,
  title={SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers},
  author={Xie, Enze and Wang, Wenhai and Yu, Zhiding and Anandkumar, Anima and Alvarez, Jose M and Luo, Ping},
  journal={arXiv preprint arXiv:2105.15203},
  year={2021}
}

Drone Dataset
http://dronedataset.icg.tugraz.at/

https://www.kaggle.com/bulentsiyah/semantic-drone-dataset

The code uses SegFormer for Semantic Segmentation on Drone Dataset.

Related tags

Overview

SegFormer_Segmentation

Citations and References

Owner

Dr. Sander Ali Khowaja

PyTorch implementation of probabilistic deep forecast applied to air quality.

Interactive Visualization to empower domain experts to align ML model behaviors with their knowledge.

Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

SatelliteNeRF - PyTorch-based Neural Radiance Fields adapted to satellite domain

GNPy: Optical Route Planning and DWDM Network Optimization

BARTScore: Evaluating Generated Text as Text Generation

Object detection on multiple datasets with an automatically learned unified label space.

Code for "Unsupervised State Representation Learning in Atari"

This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing at EMNLP 2021.

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos

Multiview Dataset Toolkit

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" [BMVC 2021].

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

An Intelligent Self-driving Truck System For Highway Transportation

Material for my PyConDE & PyData Berlin 2022 Talk "5 Steps to Speed Up Your Data-Analysis on a Single Core"

A hybrid SOTA solution of LiDAR panoptic segmentation with C++ implementations of point cloud clustering algorithms. ICCV21, Workshop on Traditional Computer Vision in the Age of Deep Learning

adversarial_multi_armed_bandit_variable_plays

A collection of educational notebooks on multi-view geometry and computer vision.

This is Unofficial Repo. Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection (CVPR 2021)