Reinforcement learning with the Q-learning algorithm on Gym's FrozenLake environment, implemented in Python

Overview

A Q-learning agent is trained on Gym's FrozenLake environment.
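As a rough illustration of the idea, the sketch below runs tabular Q-learning on a small hand-written grid. It is not this repository's code: the `GRID` layout, the deterministic `step` function, and the hyperparameter values are assumptions chosen so the example runs with the Python standard library alone, while Gym's own FrozenLake is slippery by default.

```python
import random

# Hypothetical stand-in for a 4x4 FrozenLake map (S=start, F=frozen, H=hole, G=goal).
# Moves are deterministic here, which is an assumption made for this sketch.
GRID = ["SFFF", "FHFH", "FFFH", "HFFG"]
N = 4
MOVES = {0: (0, -1), 1: (1, 0), 2: (0, 1), 3: (-1, 0)}  # left, down, right, up


def step(state, action):
    """Return (next_state, reward, done) for one move on the grid."""
    row, col = divmod(state, N)
    d_row, d_col = MOVES[action]
    row = min(max(row + d_row, 0), N - 1)  # bumping into a wall keeps the agent in place
    col = min(max(col + d_col, 0), N - 1)
    cell = GRID[row][col]
    return row * N + col, float(cell == "G"), cell in "HG"


def train(episodes=2000, alpha=0.1, gamma=0.95, epsilon=0.1, max_steps=100, seed=0):
    """Learn a Q-table with epsilon-greedy exploration (illustrative hyperparameters)."""
    rng = random.Random(seed)
    q = [[0.0] * 4 for _ in range(N * N)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(4)  # explore
            else:
                best = max(q[state])  # exploit, breaking ties at random
                action = rng.choice([a for a in range(4) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_rollout(q, max_steps=100):
    """Follow the greedy policy from the start state and return the total reward."""
    state, total = 0, 0.0
    for _ in range(max_steps):
        action = max(range(4), key=lambda a: q[state][a])
        state, reward, done = step(state, action)
        total += reward
        if done:
            break
    return total


if __name__ == "__main__":
    q_table = train()
    print("greedy return:", greedy_rollout(q_table))
```

The same update rule applies when the stand-in `step` function is replaced by calls to a real environment; only the way states, rewards, and episode ends are obtained changes.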

Libraries Used

  • gym
  • NumPy
  • tqdm
  • PyTorch deep learning framework

Install Requirements

Clone the repository or download the 'requirements.txt' file, then open a terminal in the working directory and run
'pip install -r requirements.txt'
to install all the requirements for this project.

Demo Video

Q-learning.mp4