Reinforcement Learning with the Q-Learning Algorithm on gym's FrozenLake environment, implemented in Python

Overview

A Q-learning agent is trained on gym's FrozenLake environment.
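
The tabular Q-learning update can be sketched as below. This is a minimal illustration, not the code from this repository: to stay runnable without gym installed, a small deterministic grid (the hypothetical `TinyLake` class, using the standard 4x4 FrozenLake layout without slipping) stands in for the real environment, and the hyperparameters (`alpha`, `gamma`, the epsilon schedule, the episode count) are assumed values chosen for the example.

```python
import random

# Hypothetical stand-in for gym's FrozenLake (4x4, non-slippery) so the sketch
# runs without gym installed. S = start, F = frozen, H = hole, G = goal.
LAKE = ["SFFF", "FHFH", "FFFH", "HFFG"]
N_ROWS, N_COLS = len(LAKE), len(LAKE[0])
N_STATES, N_ACTIONS = N_ROWS * N_COLS, 4
MOVES = [(0, -1), (1, 0), (0, 1), (-1, 0)]  # left, down, right, up


class TinyLake:
    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        row, col = divmod(self.state, N_COLS)
        d_row, d_col = MOVES[action]
        # Moves off the grid leave the agent where it is.
        row = min(max(row + d_row, 0), N_ROWS - 1)
        col = min(max(col + d_col, 0), N_COLS - 1)
        self.state = row * N_COLS + col
        tile = LAKE[row][col]
        reward = 1.0 if tile == "G" else 0.0
        done = tile in "HG"  # holes and the goal both end the episode
        return self.state, reward, done


def greedy_action(q_row, rng=None):
    # Pick a highest-valued action; break ties randomly if rng is given.
    best = max(q_row)
    candidates = [a for a, q in enumerate(q_row) if q == best]
    return rng.choice(candidates) if rng else candidates[0]


def train(episodes=5000, alpha=0.5, gamma=0.9, max_steps=100, seed=0):
    rng = random.Random(seed)
    env = TinyLake()
    q_table = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for episode in range(episodes):
        # Exploration rate decays linearly from 1.0 and is floored at 0.1.
        epsilon = max(0.1, 1.0 - episode / (episodes / 2))
        state = env.reset()
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(N_ACTIONS)
            else:
                action = greedy_action(q_table[state], rng)
            next_state, reward, done = env.step(action)
            # Q-learning target: reward plus discounted best next-state value.
            target = reward if done else reward + gamma * max(q_table[next_state])
            q_table[state][action] += alpha * (target - q_table[state][action])
            state = next_state
            if done:
                break
    return q_table


def run_greedy(q_table, max_steps=100):
    # Roll out the learned policy without exploration; return the total reward.
    env = TinyLake()
    state, total = env.reset(), 0.0
    for _ in range(max_steps):
        state, reward, done = env.step(greedy_action(q_table[state]))
        total += reward
        if done:
            break
    return total


q_table = train()
print("greedy episode reward:", run_greedy(q_table))
```

With gym's default slippery FrozenLake, transitions are stochastic, so a smaller learning rate and more episodes than assumed here would likely be needed.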

Libraries Used

  • gym
  • NumPy
  • tqdm
  • PyTorch (deep learning framework)

Install Requirements

Clone the repository or download the 'requirements.txt' file, then open a terminal in the working directory and run
'pip install -r requirements.txt'
to install all the requirements for this project.

Demo Video

Q-learning.mp4