Implementation of "Adversarial purification with Score-based generative models", ICML 2021

Last update: Dec 15, 2022

Related tags

Overview

Adversarial Purification with Score-based Generative Models

by Jongmin Yoon, Sung Ju Hwang, Juho Lee

This repository includes the official PyTorch implementation of our paper:

Adversarial Purification with Score-based Generative Models

Jongmin Yoon, Sung Ju Hwang, Juho Lee

the 38th International Conference for Machine Learning (ICML 2021)

ArXiv: https://arxiv.org/abs/2106.06041

What does our work do?

We propose a method that gives adversarial robustness to a neural network model against (stochastic) adversarial attacks by using an Energy-based Model (EBM) trained with Denoising Score Matching (DSM), which is called Adversarial denosing purification (ADP).

Running Codes

Dependency

Run the following command to install some necessary python packages to run our code.

pip install -r requirements.txt

Running code

To run the experiments with adp.py or adp_decision.py, enter the following command.

python main.py --config <config-file>

For example, we provide the example configuration file configs/cifar10_bpda_eot_sigma025_eot15.yml in the repository.

Attack and defense

For adversarial attacks, the classifier PGD attack and BPDA+EOT attack are implemented in attacks/clf_pgd.py and attacks/bpda_strong.py, respectively. At the configuration file, setting the attack.attack_method into clf_pgd or bpda_strong will run these attacks, respectively. For defense, we implemented the main ADP algorithm and ADP after detecting adversarial examples (Appendix F.) in purification/adp.py and purification/adp_decision.py, respectively.

Main components

File name	Explanation
`main.py`	Execute the main code, with initializing configurations and loggers.
`runners/empirical.py`	Attacks and purifies the image to show empirical adversarial robustness.
`attacks/bpda_strong.py`	Code for BPDA+EOT attack.
`purification/adp.py`	Code for adversarial purification.
`ncsnv2/*`	Code for training the EBM, i.e., NCSNv2 (paper, code).
`networks/*`	Code for used classifier network architectures.
`utils/*`	Utility files.

Notes

For the configuration files, we use the pixel ranges [0, 255] for the perturbation scale attack.ptb and the one-step attack scale attack.alpha. And the main experiments are performed within the pixel range [0, 1] after being rescaled during execution.
For training the EBM and classifier models, we primarily used the pre-existing methods such as NCSNv2 and WideResNet classifier. Here is the repository we used for training the WideResNet classifier. Nevertheless, other classifiers, such as the pre-trained adversarially robust classifier implemented in here can be used.

Reference

If you find our work useful for your research, please consider citing this.

@inproceedings{
yoon2021advpur,
title={Adversarial Purification with Score-based Generative Models},
author={Jongmin Yoon and Sung Ju Hwang and Juho Lee},
booktitle={Proceedings of The 38th International Conference on Machine Learning (ICML 2021)},
year={2021},
}

Contact

For further details, please contact [email protected].

License

MIT

Implementation of "Adversarial purification with Score-based generative models", ICML 2021

Related tags

Overview

Adversarial Purification with Score-based Generative Models

by Jongmin Yoon, Sung Ju Hwang, Juho Lee

What does our work do?

Running Codes

Dependency

Running code

Attack and defense

Main components

Notes

Reference

Contact

License

Owner

Code for "Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments".

Code for producing Japanese GPT-2 provided by rinna Co., Ltd.

PIZZA - a task-oriented semantic parsing dataset

PyWorld3 is a Python implementation of the World3 model

ADCS cert template modification and ACL enumeration

Tokenizer - Module python d'analyse syntaxique et de grammaire, tokenization

ASCEND Chinese-English code-switching dataset

Python functions for summarizing and improving voice dictation input.

Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)

text to speech toolkit. 好用的中文语音合成工具箱，包含语音编码器、语音合成器、声码器和可视化模块。

A python project made to generate code using either OpenAI's codex or GPT-J (Although not as good as codex)

Open source annotation tool for machine learning practitioners.

Exploring dimension-reduced embeddings

PhoNLP: A BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition and dependency parsing

InferSent sentence embeddings

Based on 125GB of data leaked from Twitch, you can see their monthly revenues from 2019-2021

Finally, some decent sample sentences

NeoDays-based tileset for the roguelike CDDA (Cataclysm Dark Days Ahead)

My implementation of Safaricom Machine Learning Codility test. The code has bugs, logical I guess I made errors and any correction will be appreciated.

Repo for Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization