NeurIPS-2021: Neural Auto-Curricula in Two-Player Zero-Sum Games.

Last update: Nov 11, 2022

Related tags

Overview

NAC

Official PyTorch implementation of NAC from the paper:

Neural Auto-Curricula in Two-Player Zero-Sum Games.

We release code for: Gradient based oracle(Game of skills/2d-rps), Implicit gradient(2d-rps), RL(IMP) and ES based training for Kuhn-poker.

How to run

We set our hyperparameters in the python file so you just need to run model_train.py in the corresponding directory. We also offer our pretrain model for direct test.

We use wandb to log experimental results, you may need to register for an account before running the code.

How to test

Run test.py and you can check the comment in test.py for different test configurations.

2D-RPS visualization

Visualisation results can be tested in:

2d-rps-gradient/visualisation/visualization_2d_rps.ipynb.

Kuhn->Leduc Generalization

we provide a local implementation in which one can reproduce the results of generalising our models trained on Kuhn Poker to Leduc Poker.

cd leduc_poker
# To reproduce the approximate best-response results
python3 kuhn_to_leduc.py --br_type 'approx_br_rand'
# To reproduce the exact best-response results
python3 kuhn_to_leduc.py --br_type 'exact_br'

Cite

Please cite our paper if you use the code or datasets in your own work:

@article{feng2021NAC,
  title={Neural Auto-Curricula},
  author={Feng, Xidong and Slumbers, Oliver and Yang, Yaodong and Wan, Ziyu and Liu, Bo and McAleer, Stephen and Wen, Ying and Wang, Jun},
  journal={arXiv preprint arXiv:2106.02745},
  year={2021}
}

NeurIPS-2021: Neural Auto-Curricula in Two-Player Zero-Sum Games.

Related tags

Overview

NAC

How to run

How to test

2D-RPS visualization

Kuhn->Leduc Generalization

Cite

Owner

Xidong Feng

Pytorch Implementations of large number classical backbone CNNs, data enhancement, torch loss, attention, visualization and some common algorithms.

The code for our paper submitted to RAL/IROS 2022: OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition.

[ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, Yingyan Lin

SuRE Evaluation: A Supplementary Material

Codes and Data Processing Files for our paper.

Pytorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting

Deep deconfounded recommender (Deep-Deconf) for paper "Deep causal reasoning for recommendations"

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

Intent parsing and slot filling in PyTorch with seq2seq + attention

TinyML Cookbook, published by Packt

labelpix is a graphical image labeling interface for drawing bounding boxes

A library for augmentation of a YOLO-formated dataset

Extremely easy multi instancing software for minecraft speedrunning.

Official code release for: EditGAN: High-Precision Semantic Image Editing

Code accompanying the paper "ProxyFL: Decentralized Federated Learning through Proxy Model Sharing"

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

Deep Reinforcement Learning based autonomous navigation for quadcopters using PPO algorithm.

Diffusion Normalizing Flow (DiffFlow) Neurips2021

End-to-end speech secognition toolkit