This repository contains a CBIR system that uses swin transformer to extract image's feature.

Last update: Nov 17, 2022

Related tags

Overview

Swin-transformer based CBIR

This repository contains a CBIR(content-based image retrieval) system. Here we use Swin-transformer to extract query image's feature, and retrieve similar ones from image database. Notably, our program achieves intelligent user interaction, including selecting an image by opening explorer dialog and cropping interested region by drafting mouse.

Structure

SWIN_CBIR/
|-- checkpoints/
|
|-- database/
|   |-- data/
|   |   |-- 1.jpg
|   |   |-- 2.jpg
|   |  
|   |-- DB.npz
|   |-- index.txt
|
|-- models/
|   |-- __init__.py
|   |-- build.py
|   |-- swin_transformer.py
|
|-- scripts/
|   |-- generate_DB.sh
|
|-- test/
|
|-- config.py
|-- database.py
|-- generate_DB.py
|-- main.py
|-- requirements.txt
|-- README

Getting Started

Prepare images database

Just find out some images and put them into database/data/.
run ./script/generate_DB.sh in linux machine to extract features of all images and package them into DB.npz.
run main.py, open an image and select interested region, then program will find similar images in database automatically!

Results

Here we show two image retrieval results. Two images in the first row are original image and cropped image respectively while the others are retrieval results (have been sorted by similarity).

Note: all images are resize to square for visual requirement, so there would be distorted in some of the images.

Acknowledgments

Part of code in this repository are copied from Swin-transformer, thank the authors for their exquiste code.

This repository contains a CBIR system that uses swin transformer to extract image's feature.

Related tags

Overview

Swin-transformer based CBIR

Structure

Getting Started

Results

Acknowledgments

Owner

JsHou

This repository contains code to run experiments in the paper "Signal Strength and Noise Drive Feature Preference in CNN Image Classifiers."

An image base contains 490 images for learning (400 cars and 90 boats), and another 21 images for testingAn image base contains 490 images for learning (400 cars and 90 boats), and another 21 images for testing

COD-Rank-Localize-and-Segment (CVPR2021)

URIE: Universal Image Enhancementfor Visual Recognition in the Wild

Python-kafka-reset-consumergroup-offset-example - Python Kafka reset consumergroup offset example

Implementation for "Seamless Manga Inpainting with Semantics Awareness" (SIGGRAPH 2021 issue)

CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation (ACMMM'21 Oral Paper)

Sample code from the Neural Networks from Scratch book.

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

Unrolled Generative Adversarial Networks

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

DRIFT is a tool for Diachronic Analysis of Scientific Literature.

Easy to use and customizable SOTA Semantic Segmentation models with abundant datasets in PyTorch

project page for VinVL

A deep learning framework for historical document image analysis

A more easy-to-use implementation of KPConv

Supporting code for the Neograd algorithm

A set of tests for evaluating large-scale algorithms for Wasserstein-2 transport maps computation.

Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"