Tree-based Search Graph for Approximate Nearest Neighbor Search

Last update: Dec 27, 2022

Overview

TBSG: Tree-based Search Graph for Approximate Nearest Neighbor Search.

TBSG is a graph-based algorithm for approximate nearest neighbor search (ANNS). It is built on the Cover Tree and is also an approximation of the Monotonic Search Network (MSNET). It is designed for efficient search with high precision.
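
To make the idea of searching a neighbor graph concrete, here is a minimal sketch of the generic best-first search that graph-based ANNS methods commonly use. It is not TBSG's actual search routine: the function name `greedy_search`, the adjacency-dict graph layout, and the `pool_size` parameter are assumptions made for illustration.

```python
import heapq
import math


def greedy_search(graph, points, query, entry, pool_size):
    """Best-first search over a neighbor graph (illustrative, not TBSG's code).

    graph:  dict mapping node id -> list of neighbor ids
    points: sequence of coordinate tuples, indexed by node id
    Returns up to pool_size node ids, closest to the query first.
    """
    def dist(i):
        return math.dist(points[i], query)

    visited = {entry}
    # Min-heap of nodes still to expand, keyed by distance to the query.
    frontier = [(dist(entry), entry)]
    # Max-heap (negated distances) holding the best pool_size nodes seen so far.
    pool = [(-dist(entry), entry)]

    while frontier:
        d, node = heapq.heappop(frontier)
        # Stop when the closest unexpanded node is worse than the worst pooled one.
        if len(pool) >= pool_size and d > -pool[0][0]:
            break
        for nb in graph[node]:
            if nb in visited:
                continue
            visited.add(nb)
            d_nb = dist(nb)
            if len(pool) < pool_size or d_nb < -pool[0][0]:
                heapq.heappush(frontier, (d_nb, nb))
                heapq.heappush(pool, (-d_nb, nb))
                if len(pool) > pool_size:
                    heapq.heappop(pool)  # drop the farthest pooled node
    return [i for _, i in sorted((-neg_d, i) for neg_d, i in pool)]
```

In a monotonic search network, every node has a path to the query's nearest neighbor along which the distance to the query keeps decreasing, which is the property that lets a walk like this one reach the target without backtracking.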

Benchmark datasets

Dataset | No. of base vectors | Dimension | No. of queries | Download link
Sift | 1,000,000 | 128 | 10,000 | (http://corpus-texmex.irisa.fr/)
Gist | 1,000,000 | 960 | 1,000 | (http://corpus-texmex.irisa.fr/)
Glove | 1,183,514 | 100 | 10,000 | (http://downloads.zjulearning.org.cn/data/glove-100.tar.gz)
Crawl | 1,989,995 | 300 | 10,000 | (http://commoncrawl.org/)
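
The Sift and Gist files on the TEXMEX site are distributed in the `.fvecs` layout: each vector is stored as a little-endian 32-bit integer giving the dimension, followed by that many 32-bit floats. The sketch below reads such a file with only the standard library, which can help when checking a dataset before indexing. The helper name `read_fvecs` is hypothetical, and it is an assumption that every dataset listed here has been converted to this layout.

```python
import struct


def read_fvecs(path):
    """Read an .fvecs file into a list of float lists (illustrative helper)."""
    vectors = []
    with open(path, "rb") as f:
        while True:
            header = f.read(4)
            if not header:
                break  # clean end of file
            if len(header) < 4:
                raise ValueError("truncated dimension header")
            (dim,) = struct.unpack("<i", header)
            if dim <= 0:
                raise ValueError(f"invalid dimension {dim}")
            payload = f.read(4 * dim)
            if len(payload) != 4 * dim:
                raise ValueError("truncated vector payload")
            vectors.append(list(struct.unpack(f"<{dim}f", payload)))
    return vectors
```

Calling `len(vectors)` and `len(vectors[0])` on the result gives the number of vectors and the dimension, which can be compared against the table above.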

How to use TBSG

1) compile

Prerequisites: OpenMP, CMake, Eigen3

$ cd /path/to/project  
$ cmake . && make

2) build an approximate kNNG

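We use efanna_graph to build the approximate k-nearest-neighbor graph (kNNG). The resulting graph file is passed as nnfile in the next step.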

3) create a TBSG index

$ cd /path/to/project/  
$ ./TBSG_index data_path M S MP nnfile save_path

data_path is the path of the base data.
M is the maximum number of neighbors per node.
S is the candidate set size used when building the TBSG.
MP is the minimum value of min_prob.
nnfile is the file containing the k-nearest-neighbor graph.
save_path is the path where the index is saved.

4) search with TBSG index

$ cd /path/to/project/
$ ./TBSG_search data_path query_path groundtruth_path save_path step

data_path is the path of the base data.
query_path is the path of the query data.
groundtruth_path is the path of the ground-truth data.
save_path is the path of the saved index.
step is the step size used to expand the search pool.
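
The ground-truth file is what makes it possible to score search results. This README does not state which metric TBSG_search reports, so the following is only a sketch of one common choice, recall@k: the fraction of the true k nearest neighbors that appear in the returned top k. The function name `recall_at_k` and the list-of-id-lists input layout are assumptions.

```python
def recall_at_k(results, groundtruth, k):
    """Average fraction of the true top-k ids found in the returned top-k ids.

    results and groundtruth are lists with one list of ids per query.
    """
    if not results or len(results) != len(groundtruth):
        raise ValueError("results and groundtruth must be non-empty and aligned")
    hits = 0
    for found, truth in zip(results, groundtruth):
        hits += len(set(found[:k]) & set(truth[:k]))
    return hits / (k * len(results))
```

Plotting this value against query time while increasing the search pool is the usual way to compare settings for a graph index.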

Parameters used for four datasets

parameters for building kNNG

Dataset	K	L	iter	S	R
Sift	200	200	12	10	100
Gist	400	400	12	15	100
Glove	400	420	12	20	300
Crawl	400	420	12	20	100

parameters for building index

Dataset	M	S	MP
Sift	50	100	0.53
Gist	70	200	0.515
Glove	80	300	0.53
Crawl	50	200	0.53
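
To avoid retyping these values, the table can be kept in a small script that assembles the TBSG_index command line in the argument order given above (data_path M S MP nnfile save_path). This is a sketch only: `INDEX_PARAMS`, `index_command`, and the file names in the usage note are hypothetical.

```python
# Index-building parameters copied from the table above: (M, S, MP).
INDEX_PARAMS = {
    "Sift": (50, 100, 0.53),
    "Gist": (70, 200, 0.515),
    "Glove": (80, 300, 0.53),
    "Crawl": (50, 200, 0.53),
}


def index_command(dataset, data_path, nnfile, save_path):
    """Return the argv list for ./TBSG_index with the table's parameters."""
    m, s, mp = INDEX_PARAMS[dataset]
    return ["./TBSG_index", data_path, str(m), str(s), str(mp), nnfile, save_path]
```

For example, `index_command("Sift", "sift_base.fvecs", "sift.knng", "sift.tbsg")` yields a list that can be handed to `subprocess.run(..., check=True)` from the project directory.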


Owner

Fanxbin
