Reinforcement Learning with the Q-Learning Algorithm on Gym's FrozenLake Environment, Implemented in Python

Overview

Reinforcement Learning with the Q-Learning Algorithm

A Q-learning agent is trained on Gym's FrozenLake environment.
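
To make the idea concrete, here is a minimal sketch of tabular Q-learning. It is not the code from this repository. To stay runnable without extra dependencies it does not call gym at all; instead it assumes a hand-written, deterministic (non-slippery) copy of the standard 4x4 FrozenLake map. The helper names (`step`, `train`, `greedy_rollout`), the action ordering, and all hyperparameter values are illustrative assumptions.

```python
import random

# Standard 4x4 FrozenLake layout: S=start, F=frozen, H=hole, G=goal.
# This is a hand-written stand-in for the gym environment (assumption).
LAKE = ["SFFF", "FHFH", "FFFH", "HFFG"]
N_ROWS, N_COLS = 4, 4
N_STATES, N_ACTIONS = N_ROWS * N_COLS, 4
# Assumed action order: 0=left, 1=down, 2=right, 3=up.
MOVES = [(0, -1), (1, 0), (0, 1), (-1, 0)]


def step(state, action):
    """Deterministic transition; returns (next_state, reward, done)."""
    row, col = divmod(state, N_COLS)
    d_row, d_col = MOVES[action]
    # Moving into a wall leaves the agent where it is.
    row = min(max(row + d_row, 0), N_ROWS - 1)
    col = min(max(col + d_col, 0), N_COLS - 1)
    tile = LAKE[row][col]
    # Reward 1 only at the goal; holes and the goal both end the episode.
    return row * N_COLS + col, float(tile == "G"), tile in "HG"


def train(episodes=5000, alpha=0.1, gamma=0.95, epsilon=0.3,
          max_steps=100, seed=0):
    """Learn a Q-table with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0  # every episode starts on the S tile
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(N_ACTIONS)  # explore
            else:
                # Exploit, breaking ties between equal values at random.
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(N_ACTIONS) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning target: r + gamma * max_a' Q(s', a'),
            # with no bootstrap term on terminal transitions.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_rollout(q, max_steps=100):
    """Follow the greedy policy from the start; return the total reward."""
    state, total = 0, 0.0
    for _ in range(max_steps):
        action = max(range(N_ACTIONS), key=lambda a: q[state][a])
        state, reward, done = step(state, action)
        total += reward
        if done:
            break
    return total
```

Calling `greedy_rollout(train())` trains the table and then checks whether the greedy policy reaches the goal. With the real gym environment the same update rule applies, but the `reset` and `step` return values differ between gym versions, so the loop would need to be adapted to the installed version.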

Libraries Used

  • gym
  • NumPy
  • tqdm
  • PyTorch Deep Learning Framework

Install Requirements

Clone the repository or download the 'requirements.txt' file, then open a terminal in the working directory and run
'pip install -r requirements.txt'
to install all the requirements for this project.

Demo Video

Q-learning.mp4