Reinforcement Learning with the Q-Learning Algorithm on Gym's FrozenLake Environment, Implemented in Python

Overview

Reinforcement Learning with the Q-Learning Algorithm

A Q-learning agent is trained on Gym's FrozenLake environment.
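
The repository's training code is not reproduced in this README, so the following is only a minimal sketch of tabular Q-learning, not the project's implementation. To stay dependency-free it does not call gym: the `LAKE` map, the deterministic (non-slippery) `step` function, the helper names and all hyperparameters are assumptions chosen for illustration. The action order (left, down, right, up) mirrors FrozenLake's.

```python
import random

# 4x4 layout in the style of FrozenLake: S = start, F = frozen, H = hole, G = goal.
LAKE = ["SFFF",
        "FHFH",
        "FFFH",
        "HFFG"]
N = 4
# Actions as (row delta, column delta): 0 = left, 1 = down, 2 = right, 3 = up.
ACTIONS = [(0, -1), (1, 0), (0, 1), (-1, 0)]


def step(state, action):
    """Hypothetical deterministic transition: returns (next_state, reward, done)."""
    row, col = divmod(state, N)
    d_row, d_col = ACTIONS[action]
    row = min(max(row + d_row, 0), N - 1)  # moves off the grid leave the agent in place
    col = min(max(col + d_col, 0), N - 1)
    tile = LAKE[row][col]
    return row * N + col, float(tile == "G"), tile in "HG"


def train(episodes=5000, alpha=0.1, gamma=0.95, epsilon=0.1, max_steps=100, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in range(N * N)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))  # explore
            else:
                best = max(q[state])  # exploit, breaking ties at random
                action = rng.choice([a for a, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, action)
            # Q-learning target: reward plus discounted best value of the next state.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_rollout(q, max_steps=100):
    """Follow the greedy policy from the start; returns (visited states, final reward)."""
    state, path = 0, [0]
    for _ in range(max_steps):
        action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        state, reward, done = step(state, action)
        path.append(state)
        if done:
            return path, reward
    return path, 0.0
```

Calling `greedy_rollout(train())` returns the states visited by the learned greedy policy and the reward of the final step. With gym's own environment, the `step` function above would be replaced by the environment's reset and step calls, and the default slippery dynamics would generally need more episodes and a decaying exploration rate.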

Libraries Used

  • gym
  • NumPy
  • tqdm
  • PyTorch (deep learning framework)

Install Requirements

Clone the repository or download the 'requirements.txt' file, then open a terminal in the working directory and run
'pip install -r requirements.txt'
to install all the requirements for this project.

Demo Video

Q-learning.mp4