Code for the paper: Fusformer: A Transformer-based Fusion Approach for Hyperspectral Image Super-resolution

Last update: Dec 12, 2022

Related tags

Computer Vision Fusformer

Overview

Fusformer

Code for the paper: "Fusformer: A Transformer-based Fusion Approach for Hyperspectral Image Super-resolution"

Plateform

Python 3.8.5 + Pytorch 1.7.1

Data

demo_cave.h5: A testing image from the CAVE dataset, containing "GT" ( 512x512x31), "RGB" (128x128x3) and "LRHSI" (128x128x31).
demo_cave_patches_h5: If your GPU memory is too small to run with the whole testing image, we cut the testing image into 64 patches with the size 64x64x31 of GT, 63x64x3 of RGB, and 16x16x31 of LRHSI, respectively.

Download link: https://github.com/J-FHu/Fusformer/releases/tag/Pytorch

Code

data.py: The dataloader of the training and testing data.
main.py: The training and testing function of our Fusformer.
model.py: The whole model of our Fusformer.
reshape2big.m : If you use the "demo_cave_patches.h5" as the testing data, the output ("PatchOutput-cave.mat") generated by the network will be 64 patches with the size of 64x64x31, which should be reshaped to the original size of 512x512x31.

Comments

HSI-->MSI

Thanks for sharing, it is a really great job! This paper claims that HR-MSI is obtained by Canon EOS 5D Mark II coupled with the HR-HSI. Although we can get the spectral response function of Canon EOS 5D Mark II, the band range represented by each channel of the HSI is not available. As a result, how I can get the HR-MSI? I am looking forward to your reply. Thank you very much!

opened by ZhilingGuo 1

Releases(Pytorch)

Pytorch(Jan 4, 2022)
Testing data for "Fusformer: A Transformer-based Fusion Approach for Hyperspectral Image Super-resolution"

demo_cave.h5: A testing image from the CAVE dataset, containing "GT" ( 512x512x31), "RGB" (128x128x3) and "LRHSI" (128x128x31).

demo_cave_patches_h5: If your GPU memory is too small to run with the whole testing image, we cut the testing image into 64 patches with the size 64x64x31 of GT, 63x64x3 of RGB, and 16x16x31 of LRHSI, respectively.

Source code(tar.gz)
Source code(zip)
demo_cave.h5(71.87 MB)
demo_cave_patches.h5(71.87 MB)

This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, Alan Yuille).

TransFG: A Transformer Architecture for Fine-grained Recognition Official PyTorch code for the paper: TransFG: A Transformer Architecture for Fine-gra

307 Jan 3, 2023

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

DewarpNet This repository contains the codes for DewarpNet training. Recent Updates [May, 2020] Added evaluation images and an important note about Ma

354 Jan 1, 2023

Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

PPE ✨ Repository for our CVPR'2022 paper: Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-

34 Nov 28, 2022

Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrieval.

Dual Encoding for Video Retrieval by Text Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding

81 Dec 1, 2022

Code for paper "Role-based network embedding via structural features reconstruction with degree-regularized constraint"

Role-based network embedding via structural features reconstruction with degree-regularized constraint Train python main.py --dataset brazil-flights

1 Jun 28, 2022

Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an output, you get an image rotated so that the lines are horizontal.

Deskew by Marek Mauder https://galfar.vevb.net/deskew https://github.com/galfar/deskew v1.30 2019-06-07 Overview Deskew is a command line tool for des

127 Dec 3, 2022

A facial recognition device is a device that takes an image or a video of a human face and compares it to another image faces in a database.

A facial recognition device is a device that takes an image or a video of a human face and compares it to another image faces in a database. The structure, shape and proportions of the faces are compared during the face recognition steps.

4 Mar 19, 2022

An advanced 2D image manipulation with features such as edge detection and image segmentation built using OpenCV

OpenCV-ToothPaint3-Advanced-Digital-Image-Editor This application named ‘Tooth Paint’ version TP_2020.3 (64-bit) or version 3 was developed within a w

1 Nov 5, 2021

This pyhton script converts a pdf to Image then using tesseract as OCR engine converts Image to Text

Script_Convertir_PDF_IMG_TXT Este script de pyhton convierte un pdf en Imagen luego utilizando tesseract como motor OCR convierte la Imagen a Texto. p

1 Jan 27, 2022

Code for the paper: Fusformer: A Transformer-based Fusion Approach for Hyperspectral Image Super-resolution

Related tags

Overview

Fusformer

You might also like...

This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, Alan Yuille).

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrieval.

Code for paper "Role-based network embedding via structural features reconstruction with degree-regularized constraint"

Deskew is a command line tool for deskewing scanned text documents. It uses Hough transform to detect "text lines" in the image. As an output, you get an image rotated so that the lines are horizontal.

A facial recognition device is a device that takes an image or a video of a human face and compares it to another image faces in a database.

An advanced 2D image manipulation with features such as edge detection and image segmentation built using OpenCV

This pyhton script converts a pdf to Image then using tesseract as OCR engine converts Image to Text

Comments

HSI-->MSI

Releases(Pytorch)

Pytorch(Jan 4, 2022)

Owner

Jin-Fan Hu (胡锦帆)

Aloception is a set of package for computer vision: aloscene, alodataset, alonet.

Machine Leaning applied to denoise images to improve OCR Accuracy

DouZero is a reinforcement learning framework for DouDizhu - 斗地主AI

Provides OCR (Optical Character Recognition) services through web applications

Semantic-based Patch Detection for Binary Programs

A facial recognition device is a device that takes an image or a video of a human face and compares it to another image faces in a database.

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

Document blur detection based on Laplacian operator and text detection.

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

Simple SDF mesh generation in Python

Face Anonymizer - FaceAnonApp v1.0

Python bindings for JIGSAW: a Delaunay-based unstructured mesh generator.

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

Discord QR Scam Code Generator + Token grab mobile device.

Shape Detection - It's a shape detection project with OpenCV and Python.

Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.

Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper

Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts

An interactive interface for using OpenCV's GrabCut algorithm for image segmentation.