Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    Visyerres sgdf woob - Modules Woob pour l'intranet et autres sites Scouts et Guides de France

    Vis'Yerres SGDF - Modules Woob Vous avez le sentiment que l'intranet des Scouts

    Thomas Touhey (pas un pseudonyme) 3 Dec 24, 2022
    A python3 tool to take a 360 degree survey of the RF spectrum (hamlib + rotctld + RTL-SDR/HackRF)

    RF Light House (rflh) A python script to use a rotor and a SDR device (RTL-SDR or HackRF One) to measure the RF level around and get a data set and be

    Pavel Milanes (CO7WT) 11 Dec 13, 2022
    Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra

    850-Safra-DS-ModuloI Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra Para aprender mais Git https://learngitbranc

    Brian Nunes 7 Dec 10, 2022
    I tried to apply the CAM algorithm to YOLOv4 and it worked.

    YOLOV4:You Only Look Once目标检测模型在pytorch当中的实现 2021年2月7日更新: 加入letterbox_image的选项,关闭letterbox_image后网络的map得到大幅度提升。 目录 性能情况 Performance 实现的内容 Achievement

    55 Dec 05, 2022
    Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

    MediumVC MediumVC is an utterance-level method towards any-to-any VC. Before that, we propose SingleVC to perform A2O tasks(Xi → Ŷi) , Xi means utter

    谷下雨 47 Dec 25, 2022
    Clean and readable code for Decision Transformer: Reinforcement Learning via Sequence Modeling

    Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

    Nikhil Barhate 104 Jan 06, 2023
    Official Repository of NeurIPS2021 paper: PTR

    PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning Figure 1. Dataset Overview. Introduction A critical aspect of human vis

    Yining Hong 32 Jun 02, 2022
    Why Are You Weird? Infusing Interpretability in Isolation Forest for Anomaly Detection

    Why, hello there! This is the supporting notebook for the research paper — Why Are You Weird? Infusing Interpretability in Isolation Forest for Anomal

    2 Dec 14, 2021
    PyTorch deep learning projects made easy.

    PyTorch Template Project PyTorch deep learning project made easy. PyTorch Template Project Requirements Features Folder Structure Usage Config file fo

    Victor Huang 3.8k Jan 01, 2023
    git《Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction》(ECCV 2020) GitHub:

    Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction Code for the ECCV 2020 paper by Yiming Qian and Yasutaka Furukawa Getting

    37 Dec 04, 2022
    GNPy: Optical Route Planning and DWDM Network Optimization

    GNPy is an open-source, community-developed library for building route planning and optimization tools in real-world mesh optical networks

    Telecom Infra Project 140 Dec 19, 2022
    ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm

    ManipulaTHOR: A Framework for Visual Object Manipulation Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs, Eric Kolve, Aniruddha

    AI2 65 Dec 30, 2022
    Deconfounding Temporal Autoencoder: Estimating Treatment Effects over Time Using Noisy Proxies

    Deconfounding Temporal Autoencoder (DTA) This is a repository for the paper "Deconfounding Temporal Autoencoder: Estimating Treatment Effects over Tim

    Milan Kuzmanovic 3 Feb 04, 2022
    EMNLP 2021 paper The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers.

    Codebase for training transformers on systematic generalization datasets. The official repository for our EMNLP 2021 paper The Devil is in the Detail:

    Csordás Róbert 57 Nov 21, 2022
    The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classifier')

    The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography

    James 135 Dec 23, 2022
    QuadTree Attention for Vision Transformers (ICLR2022)

    This repository contains codes for quadtree attention. This repo contains codes for feature matching, image classficiation, object detection and seman

    tangshitao 222 Dec 28, 2022
    Face recognize and crop them

    Face Recognize Cropping Module Source 아이디어 Face Alignment with OpenCV and Python Requirement 필요 라이브러리 imutil dlib python-opence (cv2) Usage 사용 방법 open

    Cho Moon Gi 1 Feb 15, 2022
    3D Avatar Lip Syncronization from speech (JALI based face-rigging)

    visemenet-inference Inference Demo of "VisemeNet-tensorflow" VisemeNet is an audio-driven animator centric speech animation driving a JALI or standard

    Junhwan Jang 17 Dec 20, 2022
    Code for our work "Activation to Saliency: Forming High-Quality Labels for Unsupervised Salient Object Detection".

    A2S-USOD Code for our work "Activation to Saliency: Forming High-Quality Labels for Unsupervised Salient Object Detection". Code will be released upon

    15 Dec 16, 2022
    A tool to estimate time varying instantaneous reproduction number during epidemics

    EpiEstim A tool to estimate time varying instantaneous reproduction number during epidemics. It is described in the following paper: @article{Cori2013

    MRC Centre for Global Infectious Disease Analysis 78 Dec 19, 2022