Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Last update: Dec 25, 2022

Related tags

Overview

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Overview

This project is a Torch implementation for our CVPR 2016 paper, which performs jointly unsupervised learning of deep CNN and image clusters. The intuition behind this is that better image representation will facilitate clustering, while better clustering results will help representation learning. Given a unlabeled dataset, it will iteratively learn CNN parameters unsupervisedly and cluster images.

Disclaimer

This is a torch version reimplementation to the code used in our CVPR paper. There is a slight difference between the code used to report the results in our paper. The Caffe version code can be found here.

License

This code is released under the MIT License (refer to the LICENSE file for details).

Citation

If you find our code is useful in your researches, please consider citing:

@inproceedings{yangCVPR2016joint,
    Author = {Yang, Jianwei and Parikh, Devi and Batra, Dhruv},
    Title = {Joint Unsupervised Learning of Deep Representations and Image Clusters},
    Booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
    Year = {2016}
}

Dependencies

Torch. Install Torch by:

$ curl -s https://raw.githubusercontent.com/torch/ezinstall/master/install-deps | bash
$ git clone https://github.com/torch/distro.git ~/torch --recursive
$ cd ~/torch; 
$ ./install.sh      # and enter "yes" at the end to modify your bashrc
$ source ~/.bashrc

After installing torch, you may also need install some packages using LuaRocks:

$ luarocks install nn
$ luarocks install image

It is preferred to run the code on GPU. Thus you need to install cunn:

$ luarocks install cunn

lua-knn. It is used to compute the distance between neighbor samples. Go into the folder, and then compile it with:
```
$ luarocks make
```

Typically, you can run our code after installing the above two packages. Please let me know if error occurs.

Installation Using Nvidia-Docker

Run docker build -t .
Run nvidia-docker run -it /bin/bash

Train model

It is very simple to run the code for training model. For example, if you want to train on USPS dataset, you can run:
```
$ th train.lua -dataset USPS -eta 0.9
```
Note that it runs on fast mode by default. You can change it to regular mode by setting "-use_fast 0". In the above command, eta is the unfolding rate. For face dataset, we recommand 0.2, while for other datasets, it is set to 0.9 to save training time. During training, you will see the normalize mutual information (NMI) for the clustering results.
You can train multiple models in parallel by:
```
$ th train.lua -dataset USPS -eta 0.9 -num_nets 5
```
By this way, you weill get 5 different models, and thus 5 possible different results. Statistics such as mean and stddev can be computed on these results.
You can also get the clustering performance when using raw image data and random CNN by
```
$ th train.lua -dataset USPS -eta 0.9 -updateCNN 0
```
You can also change other hyper parameters for model training, such as K_s, K_c, number of epochs in each partial unrolled period, etc.

Datasets

We upload six small datasets: COIL-20, USPS, MNIST-test, CMU-PIE, FRGC, UMist. The other large datasets, COIL-100, MNIST-full and YTF can be found in my google drive here.

Train on your own datasets

Alternatively, you can train the model on your own dataset. As preparations, you need:

Create a hdf5 file with size of NxCxHxW, where N is the total number of images, C is the number of channels, H is the height of image, and W the width of image. Then move it to datasets/dataset_name/data4torch.h5
Create a lua file to define the network architecture for your dataset. Put it in models_def/dataset_name.lua.
Afterwards, you can run train.lua by specifying the dataset name as your own dataset. That's it!

Compared Approaches

We upload the code for the compared approaches in matlab folder. Please refer to the original paper for details and cite them properly. In this foler, we also attach the evaluation code for two metric: normalized mutual information (NMI) and clustering accuracy (AC).

Q&A

You are welcome to send message to (jw2yang at vt.edu) if you have any issue on this code.

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Related tags

Overview

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Overview

Disclaimer

License

Citation

Dependencies

Installation Using Nvidia-Docker

Train model

Datasets

Train on your own datasets

Compared Approaches

Q&A

Owner

Jianwei Yang

Code for the paper "Can Active Learning Preemptively Mitigate Fairness Issues?" presented at RAI 2021.

This repository contains the code for the paper ``Identifiable VAEs via Sparse Decoding''.

This is the repository for the paper "Have I done enough planning or should I plan more?"

A spatial genome aligner for analyzing multiplexed DNA-FISH imaging data.

Transfer Learning library for Deep Neural Networks.

Citation Intent Classification in scientific papers using the Scicite dataset an Pytorch

Official Pytorch implementation of RePOSE (ICCV2021)

StrongSORT: Make DeepSORT Great Again

A style-based Quantum Generative Adversarial Network

UMich 500-Level Mobile Robotics Course

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

CaLiGraph Ontology as a Challenge for Semantic Reasoners ([email protected]'21)

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Official implementation for "Low-light Image Enhancement via Breaking Down the Darkness"

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

This is the Pytorch implementation of Progressive Attentional Manifold Alignment.

Train Yolov4 using NBX-Jobs

Gym environments used in the paper: "Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors"

Captcha-tensorflow - Image Captcha Solving Using TensorFlow and CNN Model. Accuracy 90%+

Keras Implementation of Neural Style Transfer from the paper "A Neural Algorithm of Artistic Style"

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Related tags

Overview

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters.

Overview

Disclaimer

License

Citation

Dependencies

Installation Using Nvidia-Docker

Train model

Datasets

Train on your own datasets

Compared Approaches

Q&A

Owner

Jianwei Yang

Code for the paper "Can Active Learning Preemptively Mitigate Fairness Issues?" presented at RAI 2021.

This repository contains the code for the paper ``Identifiable VAEs via Sparse Decoding''.

This is the repository for the paper "Have I done enough planning or should I plan more?"

A spatial genome aligner for analyzing multiplexed DNA-FISH imaging data.

Transfer Learning library for Deep Neural Networks.

Citation Intent Classification in scientific papers using the Scicite dataset an Pytorch

Official Pytorch implementation of RePOSE (ICCV2021)

StrongSORT: Make DeepSORT Great Again

A style-based Quantum Generative Adversarial Network

UMich 500-Level Mobile Robotics Course

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

CaLiGraph Ontology as a Challenge for Semantic Reasoners ([email protected]'21)

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Official implementation for "Low-light Image Enhancement via Breaking Down the Darkness"

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

​ This is the Pytorch implementation of Progressive Attentional Manifold Alignment.

Train Yolov4 using NBX-Jobs

Gym environments used in the paper: "Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors"

Captcha-tensorflow - Image Captcha Solving Using TensorFlow and CNN Model. Accuracy 90%+

Keras Implementation of Neural Style Transfer from the paper "A Neural Algorithm of Artistic Style"

This is the Pytorch implementation of Progressive Attentional Manifold Alignment.