Hyperbolic Image Segmentation, CVPR 2022

Last update: Dec 29, 2022

Overview

This is the implementation of the paper Hyperbolic Image Segmentation (CVPR 2022).

Repository structure

assets : images and other supporting assets
datasets : integer-to-class dictionaries and the JSON files that define the hierarchies used
hesp : the core code, containing layers, models, losses, etc.
samples : helper files, bash scripts, and train.py

The code is not complete yet.

How to use the code?

To install, first run pip install -e . to register the package.

Then, run sh requirements.sh to install the requirements.

The code requires TensorFlow 1; the experiments were performed with TensorFlow 1.14. The script installs tensorflow-cpu. To run on a GPU, change the commands in the script so that they install a GPU build of TensorFlow instead.
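Because the code depends on TensorFlow 1, it can help to confirm which version is active in the environment before training. The following is a minimal sketch and not part of the repository; the helper name is_tf1 is hypothetical.

```python
def is_tf1(version):
    """Return True when a version string such as '1.14.0' is a 1.x release."""
    return version.split(".")[0] == "1"


if __name__ == "__main__":
    try:
        import tensorflow as tf
    except ImportError:
        print("TensorFlow is not installed in this environment")
    else:
        status = "OK" if is_tf1(tf.__version__) else "not a 1.x release"
        print("TensorFlow {}: {}".format(tf.__version__, status))
```

Run it with the same Python interpreter that will be used for train.py.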

To train a model, run the following command from the samples directory:

python train.py --mode segmenter --batch_size 5 --dataset coco --geometry hyperbolic --dim 256 --c 0.1 --freeze_bn --train --test --backbone_init Path_to_resnet/resnet_v2_101_2017_04_14/resnet_v2_101.ckpt --output_stride 16 --segmenter_ident check

This command trains and tests a hyperbolic model on the COCO-Stuff dataset with batch size 5, dimension 256, curvature 0.1, frozen batch normalization, and output stride 16. The results are saved in the samples directory, in a folder named poincare-hesp/save/segmenter/hierarchical_coco_d256_hyperbolic_c0.1_os16_resnet_v2_101_bs5_lr0.001_fbnTrue_fbbFalse_check.
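The folder name appears to encode the run settings. The sketch below rebuilds the name from those settings so that the output of a given run can be located. The pattern is inferred from this single example and is an assumption, not the repository's actual naming code. The function run_name and its parameter names are hypothetical, the learning rate 0.001 and the fbb value are taken from the example name rather than from the command, and reading fbb as "freeze backbone" is a guess.

```python
def run_name(dataset, dim, geometry, c, output_stride, backbone,
             batch_size, lr, freeze_bn, freeze_backbone, ident,
             prefix="hierarchical"):
    """Join run settings into a folder name, in the order seen in the example."""
    parts = [
        prefix,
        dataset,
        "d{}".format(dim),
        geometry,
        "c{}".format(c),
        "os{}".format(output_stride),
        backbone,
        "bs{}".format(batch_size),
        "lr{}".format(lr),
        "fbn{}".format(freeze_bn),
        "fbb{}".format(freeze_backbone),  # assumed meaning: freeze backbone
        ident,
    ]
    return "_".join(parts)


# Settings of the example command above.
name = run_name(
    dataset="coco", dim=256, geometry="hyperbolic", c=0.1,
    output_stride=16, backbone="resnet_v2_101", batch_size=5,
    lr=0.001, freeze_bn=True, freeze_backbone=False, ident="check",
)
print(name)
```

If the repository changes its naming scheme, the order or prefixes in parts would need to be adjusted accordingly.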

Citation

Please consider citing this work using the following BibTeX entry:

@article{ghadimiatigh2022hyperbolic,
  title={Hyperbolic Image Segmentation},
  author={GhadimiAtigh, Mina and Schoep, Julian and Acar, Erman and van Noord, Nanne and Mettes, Pascal},
  journal={arXiv preprint arXiv:2203.05898},
  year={2022}
}

Owner

Mina Ghadimi Atigh
