Dynamic Bottleneck for Robust Self-Supervised Exploration

Last update: Nov 14, 2022

Related tags

Deep Learning DB

Overview

Dynamic Bottleneck

Introduction

This is a TensorFlow based implementation for our paper on

"Dynamic Bottleneck for Robust Self-Supervised Exploration". NeurIPS 2021

Prerequisites

python3.6 or 3.7, tensorflow-gpu 1.x, tensorflow-probability, openAI baselines, openAI Gym

Installation and Usage

Atari games

The following command should train a pure exploration agent on "Breakout" with default experiment parameters.

python run.py --env BreakoutNoFrameskip-v4

Atari games with Random-Box noise

The following command should train a pure exploration agent on "Breakout" with randomBox noise.

python run.py --env BreakoutNoFrameskip-v4 --randomBoxNoise

Atari games with Gaussian noise

The following command should train a pure exploration agent on "Breakout" with Gaussian noise.

python run.py --env BreakoutNoFrameskip-v4 --pixelNoise

Atari games with sticky actions

The following command should train a pure exploration agent on "sticky Breakout" with a probability of 0.25

python run.py --env BreakoutNoFrameskip-v4 --stickyAtari

Baselines

ICM: We use the official code of "Curiosity-driven Exploration by Self-supervised Prediction, ICML 2017" and "Large-Scale Study of Curiosity-Driven Learning, ICLR 2019".
Disagreement: We use the official code of "Self-Supervised Exploration via Disagreement, ICML 2019".
CB: We use the official code of "Curiosity-Bottleneck: Exploration by Distilling Task-Specific Novelty, ICML 2019".

Dynamic Bottleneck for Robust Self-Supervised Exploration

Related tags

Overview

Dynamic Bottleneck

Introduction

Prerequisites

Installation and Usage

Atari games

Atari games with Random-Box noise

Atari games with Gaussian noise

Atari games with sticky actions

Baselines

Owner

Bai Chenjia

Code to use Augmented Shapiro Wilks Stopping, as well as code for the paper "Statistically Signifigant Stopping of Neural Network Training"

Dynamica causal Bayesian optimisation

Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

Demystifying How Self-Supervised Features Improve Training from Noisy Labels

Relaxed-machines - explorations in neuro-symbolic differentiable interpreters

Free like Freedom

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Brain Tumor Detection with Tensorflow Neural Networks.

Gym-TORCS is the reinforcement learning (RL) environment in TORCS domain with OpenAI-gym-like interface.

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

MEDS: Enhancing Memory Error Detection for Large-Scale Applications

Code for Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing(ICCV21)

CAR-API: Cityscapes Attributes Recognition API

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

Experiment about Deep Person Re-identification with EfficientNet-v2

Find-Lane-Line - Use openCV library and Python to detect the road-lane-line

A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle detector......

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"