Reinforcement Learning with the Q-Learning Algorithm on Gym's FrozenLake Environment, Implemented in Python

Overview

This project applies reinforcement learning with the Q-learning algorithm: a Q-learning agent is trained on Gym's FrozenLake environment.
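For readers new to the method, the following is a minimal sketch of tabular Q-learning on a FrozenLake-style grid. It is not the repository's code. To stay self-contained it uses only the standard library and a hand-written, deterministic copy of the standard 4x4 map (roughly what Gym's FrozenLake gives with slipping disabled) instead of Gym, NumPy, or PyTorch. The function names and the hyperparameters (alpha, gamma, epsilon, number of episodes) are assumptions chosen for illustration.

```python
import random

# Standard 4x4 FrozenLake layout: S=start, F=frozen, H=hole, G=goal.
LAKE = ["SFFF", "FHFH", "FFFH", "HFFG"]
SIZE = 4
N_STATES, N_ACTIONS = SIZE * SIZE, 4
# Action order follows Gym's FrozenLake: 0=left, 1=down, 2=right, 3=up.
MOVES = [(0, -1), (1, 0), (0, 1), (-1, 0)]


def step(state, action):
    """Deterministic (non-slippery) transition: (next_state, reward, done)."""
    row, col = divmod(state, SIZE)
    d_row, d_col = MOVES[action]
    # Moving into a wall leaves the agent where it is.
    row = min(max(row + d_row, 0), SIZE - 1)
    col = min(max(col + d_col, 0), SIZE - 1)
    tile = LAKE[row][col]
    # Reward 1 only on the goal; holes and the goal end the episode.
    return row * SIZE + col, float(tile == "G"), tile in "HG"


def train(episodes=5000, alpha=0.1, gamma=0.95, epsilon=0.2,
          max_steps=100, seed=0):
    """Learn a Q-table with epsilon-greedy exploration (assumed settings)."""
    rng = random.Random(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(N_ACTIONS)  # explore
            else:
                # Exploit, breaking ties at random so an all-zero row
                # does not always pick the same action.
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(N_ACTIONS) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning update:
            # Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_rollout(q, max_steps=100):
    """Follow the greedy policy from the start; return (path, final reward)."""
    state, path = 0, [0]
    for _ in range(max_steps):
        action = max(range(N_ACTIONS), key=lambda a: q[state][a])
        state, reward, done = step(state, action)
        path.append(state)
        if done:
            return path, reward
    return path, 0.0


if __name__ == "__main__":
    q_table = train()
    path, reward = greedy_rollout(q_table)
    print("greedy path:", path, "reward:", reward)
```

With Gym's default slippery dynamics the transitions are stochastic, so the learned policy would succeed only part of the time, and more episodes or a decaying epsilon would likely be needed.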

Libraries Used

  • gym
  • NumPy
  • tqdm
  • PyTorch deep learning framework

Install Requirements

Clone the repository or download the 'requirements.txt' file, then open a terminal in the working directory and run
'pip install -r requirements.txt'
to install all the requirements for this project.

Demo Video

Q-learning.mp4