Implementations of paper Controlling Directions Orthogonal to a Classifier

Last update: Dec 01, 2022

Classifier Orthogonalization

Implementation of the paper Controlling Directions Orthogonal to a Classifier (ICLR 2022), by Yilun Xu, Hao He, Tianxiao Shen, and Tommi Jaakkola.

Let's construct orthogonal classifiers for controlled style transfer, domain adaptation with label shifts, and fairness problems 🤠!

Outline

Controlled Style Transfer
- Prepare CelebA-GH dataset
- Train classifiers and CycleGAN
Domain Adaptation with label shifts
- Prepare dataset pairs
- Training
Fairness

Controlled Style Transfer

Prepare CelebA-GH dataset:

python style_transfer/celeba_dataset.py --data_dir {path}

path: path to the CelebA dataset

bash example: python style_transfer/celeba_dataset.py --data_dir ./data

One can modify the domain_fn dictionary in the style_transfer/celeba_dataset.py file to create new groups 💡

Step 1: Train principal, full and oracle orthogonal classifiers

sh style_transfer/train_classifiers.sh {gpu} {path} {dataset} {alg}

gpu: index of the GPU to use (e.g. 0)
path: path to the dataset (Celeba or MNIST)
dataset: dataset (Celeba | CMNIST)
alg: ERM, Fish, TRM or MLDG

CMNIST bash example: sh style_transfer/train_classifiers.sh 0 ./data CMNIST ERM
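
The three classifiers trained in this step are related through classifier orthogonalization. The sketch below is a minimal illustration, assuming the construction described in the paper: the orthogonal classifier's class probabilities are the full classifier's probabilities divided class by class by the principal classifier's, then renormalized. The function name `orthogonalize` and the example probabilities are hypothetical and are not taken from this repository.

```python
def orthogonalize(full_probs, principal_probs, eps=1e-12):
    """Divide full-classifier probabilities by principal-classifier
    probabilities class by class, then renormalize so they sum to 1.

    Assumption: both inputs are probability vectors over the same classes.
    """
    if len(full_probs) != len(principal_probs):
        raise ValueError("both classifiers must score the same classes")
    # eps guards against division by zero when the principal classifier
    # assigns (numerically) zero probability to a class.
    ratios = [f / max(p, eps) for f, p in zip(full_probs, principal_probs)]
    total = sum(ratios)
    return [r / total for r in ratios]


# Hypothetical predictions for one input with two classes.
full = [0.9, 0.1]        # full classifier
principal = [0.6, 0.4]   # principal classifier
orth = orthogonalize(full, principal)
print(orth)
```

When the full and principal classifiers agree on an input, the ratio is constant across classes and the result is uniform, which matches the intuition that the orthogonal direction carries no extra information there.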

Step 2: Train controlled CycleGAN

python style_transfer/train_cyclegan.py --data_dir {path} --dataset {dataset} \
  --obj {obj} --name {name}

path: path to the dataset (Celeba or MNIST)
dataset: dataset (Celeba | CMNIST)
obj: training objective (vanilla | orthogonal)
name: name of the model

CMNIST bash example: python style_transfer/train_cyclegan.py --data_dir ./data --dataset CMNIST --obj orthogonal --name cmnist

To view training results and loss plots, run python -m visdom.server and click the URL http://localhost:8097

Evaluation and Generation

python style_transfer/generate.py --data_dir {path} --dataset {dataset} --name {name} \
 --obj {obj} --out_path {out_path} --resume_epoch {epoch} (--save)

path: path to the dataset (Celeba or MNIST)
dataset: dataset (Celeba | CMNIST)
name: name of the model
obj: training objective (vanilla | orthogonal)
out_path: output path
epoch: resuming epoch of checkpoint

Images will be saved to style_transfer/generated_images/{out_path}

CMNIST bash example: python style_transfer/generate.py --data_dir ./data --dataset CMNIST --name cmnist --obj orthogonal --out_path cmnist_out --resume_epoch 5
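
To compare several checkpoints, it can help to build the generation command programmatically. The following is a small sketch that only assembles command strings from the flags documented above; the helper name `generate_command` and the list of epochs are hypothetical, and nothing here calls into the repository's code.

```python
import shlex


def generate_command(data_dir, dataset, name, obj, out_path, resume_epoch, save=False):
    """Assemble the generate.py command line from the documented flags."""
    argv = [
        "python", "style_transfer/generate.py",
        "--data_dir", data_dir,
        "--dataset", dataset,
        "--name", name,
        "--obj", obj,
        "--out_path", out_path,
        "--resume_epoch", str(resume_epoch),
    ]
    if save:
        argv.append("--save")  # optional flag shown in the usage line
    # shlex.join quotes arguments so the string is safe to paste into a shell.
    return shlex.join(argv)


# Hypothetical epochs to evaluate; adjust to the checkpoints you actually have.
for epoch in (1, 3, 5):
    print(generate_command("./data", "CMNIST", "cmnist", "orthogonal", "cmnist_out", epoch))
```

Each printed line can be run as is, or passed to `subprocess.run(shlex.split(cmd), check=True)`.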

Domain Adaptation (DA) with label shifts

Prepare src/tgt pairs with label shifts

Please cd into da/data (relative to the repository root) and run

python {dataset}.py --r {r0} {r1}

r0: subsample ratio for the first half classes (default=0.7)
r1: subsample ratio for the second half classes (default=0.3)
dataset: mnist | mnistm | svhn | cifar | stl | signs | digits

For the SynthDigits / SynthSigns datasets, please download them at link_digits / link_signs. All the other datasets will be automatically downloaded 😉
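
The effect of the two ratios can be sketched as follows. This is an illustration only, assuming that r0 is the keep probability for the first half of the classes and r1 the keep probability for the second half; the function `subsample_label_shift` and the toy label list are hypothetical and do not reproduce the repository's dataset scripts.

```python
import random


def subsample_label_shift(labels, num_classes, r0=0.7, r1=0.3, seed=0):
    """Return the indices kept after class-dependent subsampling.

    Assumption: classes 0 .. num_classes//2 - 1 are kept with probability r0,
    the remaining classes with probability r1.
    """
    rng = random.Random(seed)
    half = num_classes // 2
    kept = []
    for idx, y in enumerate(labels):
        ratio = r0 if y < half else r1
        if rng.random() < ratio:
            kept.append(idx)
    return kept


# Toy balanced label list: 10 classes, 1000 examples each.
labels = [i % 10 for i in range(10000)]
kept = subsample_label_shift(labels, num_classes=10)
first = sum(1 for i in kept if labels[i] < 5)
second = len(kept) - first
# About 70% of the 5000 first-half examples and 30% of the 5000
# second-half examples survive, which skews the label distribution.
print(first, second)
```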

Training

python da/vada_train.py --r {r0} {r1} --src {source} --tgt {target}  --seed {seed} \
 (--iw) (--orthogonal) (--source_only)

r0: subsample ratio for the first half classes (default=0.7)
r1: subsample ratio for the second half classes (default=0.3)
source: source domain (mnist | mnistm | svhn | cifar | stl | signs | digits)
target: target domain (mnist | mnistm | svhn | cifar | stl | signs | digits)
seed: random seed
--source_only: vanilla ERM on the source domain
--iw: use importance-weighted domain adaptation algorithm [1]
--orthogonal: use orthogonal classifier
--vada: vanilla VADA [2]
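
For intuition about the --iw option, the sketch below computes plain label-shift importance weights, w(y) = p_target(y) / p_source(y), from known label lists. This is a generic illustration and not the algorithm of [1]: in unsupervised domain adaptation the target labels are unavailable and the target label distribution has to be estimated. The function name and the toy labels are hypothetical.

```python
from collections import Counter


def class_importance_weights(source_labels, target_labels, num_classes):
    """Per-class weights p_target(y) / p_source(y) from label frequencies.

    Assumption: target labels are known here, purely for illustration.
    """
    src = Counter(source_labels)
    tgt = Counter(target_labels)
    n_src, n_tgt = len(source_labels), len(target_labels)
    weights = []
    for c in range(num_classes):
        p_src = src[c] / n_src
        p_tgt = tgt[c] / n_tgt
        # A class absent from the source cannot be reweighted; use 0.
        weights.append(p_tgt / p_src if p_src > 0 else 0.0)
    return weights


# Toy two-class example mirroring the 0.7 / 0.3 default ratios.
source = [0] * 70 + [1] * 30
target = [0] * 30 + [1] * 70
w = class_importance_weights(source, target, num_classes=2)
print(w)
```

Multiplying each source example's loss by the weight of its class makes the reweighted source label distribution match the target one.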

Fairness

python fairness/methods/train.py --data {data} --gamma {gamma} --sigma {sigma} \
 (--orthogonal) (--laftr) (--mifr) (--hsic)

data: dataset (adult | german)
gamma: hyper-parameter for MIFR, HSIC, LAFTR
sigma: hyper-parameter for HSIC (kernel width)
--orthogonal: use orthogonal classifier
--mifr: use L-MIFR algorithm [3]
--hsic: use ReBias algorithm [4]
--laftr: use LAFTR algorithm [5]
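
To make the role of sigma concrete, here is a minimal sketch of the standard biased empirical HSIC estimator with a Gaussian kernel of width sigma, written for scalar inputs. It is a generic textbook estimator, offered under the assumption that the kernel width enters as exp(-d^2 / (2 sigma^2)); it is not claimed to match the code behind --hsic. All function names are hypothetical.

```python
import math


def gaussian_gram(values, sigma):
    """Gram matrix K[i][j] = exp(-(v_i - v_j)^2 / (2 * sigma^2))."""
    return [[math.exp(-((a - b) ** 2) / (2.0 * sigma ** 2)) for b in values]
            for a in values]


def center(gram):
    """Double-center a Gram matrix, i.e. compute H K H with H = I - 11^T / n."""
    n = len(gram)
    row_means = [sum(row) / n for row in gram]
    col_means = [sum(gram[i][j] for i in range(n)) / n for j in range(n)]
    total_mean = sum(row_means) / n
    return [[gram[i][j] - row_means[i] - col_means[j] + total_mean
             for j in range(n)] for i in range(n)]


def hsic(xs, ys, sigma=1.0):
    """Biased HSIC estimate tr(K H L H) / (n - 1)^2 for paired samples."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must be paired samples of equal length")
    n = len(xs)
    kc = center(gaussian_gram(xs, sigma))
    lc = center(gaussian_gram(ys, sigma))
    trace = sum(kc[i][j] * lc[j][i] for i in range(n) for j in range(n))
    return trace / (n - 1) ** 2


xs = [0.0, 1.0, 2.0, 3.0, 4.0]
dependent = hsic(xs, xs)             # strictly positive: ys is a copy of xs
independent = hsic(xs, [1.0] * 5)    # zero: a constant carries no information
print(dependent, independent)
```

A smaller sigma makes the kernel more local, so the estimate reacts to finer-grained dependence; a very large sigma flattens the Gram matrix and pushes the estimate toward zero.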

References

[1] Remi Tachet des Combes, Han Zhao, Yu-Xiang Wang, and Geoffrey J. Gordon. Domain adaptation with conditional distribution matching and generalized label shift. ArXiv, abs/2003.04475, 2020.

[2] Rui Shu, Hung H. Bui, Hirokazu Narui, and Stefano Ermon. A DIRT-T approach to unsupervised domain adaptation. ArXiv, abs/1802.08735, 2018.

[3] Jiaming Song, Pratyusha Kalluri, Aditya Grover, Shengjia Zhao, and Stefano Ermon. Learning controllable fair representations. In AISTATS, 2019.

[4] Hyojin Bahng, Sanghyuk Chun, Sangdoo Yun, Jaegul Choo, and Seong Joon Oh. Learning de-biased representations with biased representations. In ICML, 2020.

[5] David Madras, Elliot Creager, Toniann Pitassi, and Richard Zemel. Learning adversarially fair and transferable representations. In ICML, 2018.

The implementation of this repo is based on / inspired by:

https://github.com/facebookresearch/DomainBed (code structure)
https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix (code structure)
https://github.com/ozanciga/dirt-t (VADA code)
https://github.com/Britefury/self-ensemble-visual-domain-adapt (data generation)
