Generalized Category Discovery

This repo is a placeholder for code for our paper: Generalized Category Discovery

Abstract: In this paper, we consider a highly general image recognition setting wherein, given a labelled and unlabelled set of images, the task is to categorize all images in the unlabelled set. Here, the unlabelled images may come from labelled classes or from novel ones. Existing recognition methods are not able to deal with this setting, because they make several restrictive assumptions, such as the unlabelled instances coming only from known (or only from unknown) classes and the number of unknown classes being known a priori. We address the more unconstrained setting, naming it 'Generalized Category Discovery', and challenge all these assumptions. We first establish strong baselines by taking state-of-the-art algorithms from novel category discovery and adapting them for this task. Next, we propose the use of vision transformers with contrastive representation learning for this open-world setting. We then introduce a simple yet effective semi-supervised $k$-means method to cluster the unlabelled data into seen and unseen classes automatically, substantially outperforming the baselines. Finally, we also propose a new approach to estimate the number of classes in the unlabelled data. We thoroughly evaluate our approach on public datasets for generic object classification including CIFAR10, CIFAR100 and ImageNet-100, and for fine-grained visual recognition including CUB, Stanford Cars and Herbarium19, benchmarking on this new setting to foster future research.
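Since the official code is not released yet, the following is only a minimal sketch of what a semi-supervised k-means step of the kind mentioned in the abstract might look like; it is not the authors' implementation. It assumes precomputed feature vectors, integer labels `0..n_known-1` for the labelled set, and a known total number of clusters `k`. The function name `semi_supervised_kmeans`, its parameters, and the k-means++-style seeding of the extra centroids are all illustrative assumptions.

```python
import numpy as np


def semi_supervised_kmeans(x_lab, y_lab, x_unlab, k, n_iter=100, seed=0):
    """Hypothetical sketch: k-means where labelled points stay pinned to their class.

    Assumes y_lab holds integer class ids in 0..n_known-1 and k >= n_known.
    Returns cluster ids for the unlabelled points and the final centroids.
    """
    rng = np.random.default_rng(seed)
    n_known = int(y_lab.max()) + 1
    if k < n_known:
        raise ValueError("k must be at least the number of labelled classes")

    # Centroids for known classes start at the labelled class means.
    centroids = np.stack([x_lab[y_lab == c].mean(axis=0) for c in range(n_known)])

    # Extra centroids for novel classes: k-means++-style seeding on unlabelled points
    # (an assumption of this sketch, not something stated in the abstract).
    for _ in range(k - n_known):
        d2 = ((x_unlab[:, None, :] - centroids[None, :, :]) ** 2).sum(-1).min(axis=1)
        total = d2.sum()
        probs = d2 / total if total > 0 else np.full(len(x_unlab), 1.0 / len(x_unlab))
        pick = rng.choice(len(x_unlab), p=probs)
        centroids = np.vstack([centroids, x_unlab[pick]])

    x_all = np.vstack([x_lab, x_unlab])
    assign_unlab = None
    for _ in range(n_iter):
        # Assignment step: only unlabelled points are free to move between clusters.
        d2 = ((x_unlab[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
        new_assign = d2.argmin(axis=1)
        if assign_unlab is not None and np.array_equal(new_assign, assign_unlab):
            break
        assign_unlab = new_assign

        # Update step: labelled points always count towards their own class centroid.
        assign_all = np.concatenate([y_lab, assign_unlab])
        for c in range(k):
            members = x_all[assign_all == c]
            if len(members) > 0:
                centroids[c] = members.mean(axis=0)

    return assign_unlab, centroids
```

In this sketch, cluster ids below `n_known` correspond to labelled ("seen") classes and the remaining ids to candidate novel ("unseen") classes. Estimating `k` itself, which the abstract also addresses, is outside the scope of this example.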


Code Coming Soon!
