PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

Last update: Aug 30, 2022

SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

This is the official PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching.

Tabular Experiments

Offline Imitation Learning from Mismatched Experts

python smodice_tabular/run_tabular_mismatched.py

Offline Imitation Learning from Examples

python smodice_tabular/run_tabular_example.py

Deep IL Experiments

Setup

Create conda environment and activate it:

conda env create -f environment.yml
conda activate smodice
pip install --upgrade numpy
pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio===0.7.2 -f https://download.pytorch.org/whl/torch_stable.html
git clone https://github.com/rail-berkeley/d4rl
cd d4rl
pip install -e .

Offline IL from Observations

Run the following command with the variable ENV set to one of hopper, walker2d, halfcheetah, ant, or kitchen.

python run_oil_observations.py --env_name $ENV
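To sweep all five environments without retyping the command, the command lines can be assembled programmatically first. The following is a minimal sketch, not part of this repository: `ENVS` and `build_observation_commands` are hypothetical names, and only the script name and the `--env_name` flag are taken from the command above. It prints the commands as a dry run and does not launch anything.

```python
import shlex

# Environment names listed in this README for run_oil_observations.py.
ENVS = ["hopper", "walker2d", "halfcheetah", "ant", "kitchen"]


def build_observation_commands(envs=ENVS):
    """Return one argv list per environment (hypothetical helper)."""
    return [
        ["python", "run_oil_observations.py", "--env_name", env]
        for env in envs
    ]


if __name__ == "__main__":
    for argv in build_observation_commands():
        # Dry run: print the shell-quoted command instead of executing it.
        print(" ".join(shlex.quote(arg) for arg in argv))
```

Passing each list to `subprocess.run(argv, check=True)` instead of printing it would run the experiments one after another.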

For the AntMaze environment, first generate the random dataset:

cd envs
python generate_antmaze_random.py --noise

Then, run

python run_oil_antmaze.py

Offline IL from Mismatched Experts

For halfcheetah and ant, run

python run_oil_observations.py --env_name halfcheetah --dataset 0.5 --mismatch True

and

python run_oil_observations.py --env_name ant --dataset disabled --mismatch True

respectively.

For AntMaze, run

python run_oil_antmaze.py --mismatch True

Offline IL from Examples

For the PointMass-4Direction task, run

python run_oil_examples_pointmass.py

For the AntMaze task, run

python run_oil_antmaze.py --mismatch False --example True

For the Franka Kitchen based tasks, run

python run_oil_examples_kitchen.py --dataset $DATASET

where DATASET is either microwave or kettle.

Baselines

For any task, the BC baseline can be run by appending --disc_type bc to the above commands.

For RCE-TD3-BC and ORIL baselines, on the appropriate tasks, append --algo_type $ALGO where ALGO can be one of rce, oril.
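The flag-appending rule above can be sketched as a small helper. This is an illustration under stated assumptions, not code from this repository: `BASELINE_FLAGS` and `with_baseline` are hypothetical names, and the flag values (`--disc_type bc`, `--algo_type rce`, `--algo_type oril`) are the ones given in this section. The helper does not check whether a baseline is appropriate for a given task; that is left to the user.

```python
# Flags this README says to append for each baseline.
BASELINE_FLAGS = {
    "bc": ["--disc_type", "bc"],
    "rce": ["--algo_type", "rce"],
    "oril": ["--algo_type", "oril"],
}


def with_baseline(base_argv, baseline):
    """Return a copy of base_argv with the baseline's flags appended.

    Hypothetical helper: it only builds the argument list and does not
    verify that the baseline is supported on the chosen task.
    """
    if baseline not in BASELINE_FLAGS:
        raise ValueError("unknown baseline: %r" % (baseline,))
    return list(base_argv) + BASELINE_FLAGS[baseline]


if __name__ == "__main__":
    base = ["python", "run_oil_antmaze.py", "--mismatch", "True"]
    print(" ".join(with_baseline(base, "bc")))
```

The base command is copied rather than modified, so the same list can be reused to build the SMODICE run and each baseline run side by side.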

Citation

If you find this repository useful for your research, please cite

@article{ma2022smodice,
      title={SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching}, 
      author={Yecheng Jason Ma and Andrew Shen and Dinesh Jayaraman and Osbert Bastani},
      year={2022},
      url={https://arxiv.org/abs/2202.02433}
}

Contact

If you have any questions regarding the code or paper, feel free to contact me at [email protected].

Acknowledgment

This codebase is partially adapted from optidice, rce, relay-policy-learning, and d4rl. We thank the authors and contributors for open-sourcing their code.

Owner

Jason Ma
