Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    [CVPR 2021] "Multimodal Motion Prediction with Stacked Transformers": official code implementation and project page.

    mmTransformer Introduction This repo is official implementation for mmTransformer in pytorch. Currently, the core code of mmTransformer is implemented

    DeciForce: Crossroads of Machine Perception and Autonomy 232 Dec 31, 2022
    Explainable Zero-Shot Topic Extraction

    Zero-Shot Topic Extraction with Common-Sense Knowledge Graph This repository contains the code for reproducing the results reported in the paper "Expl

    D2K Lab 56 Dec 14, 2022
    Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks

    Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks This is a Pytorch-Lightning implementation of the paper "Self-s

    Photogrammetry & Robotics Bonn 111 Dec 06, 2022
    Code for the paper "Query Embedding on Hyper-relational Knowledge Graphs"

    Query Embedding on Hyper-Relational Knowledge Graphs This repository contains the code used for the experiments in the paper Query Embedding on Hyper-

    DimitrisAlivas 19 Jul 26, 2022
    Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

    SA-AutoAug Scale-aware Automatic Augmentation for Object Detection Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia [Paper] [Bi

    DV Lab 182 Dec 29, 2022
    Rohit Ingole 2 Mar 24, 2022
    DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment

    DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment This repository is related to the paper DEEPAGÉ: Answering Questions in Por

    0 Dec 10, 2021
    Simulator for FRC 2022 challenge: Rapid React

    rrsim Simulator for FRC 2022 challenge: Rapid React out-1.mp4 Usage In order to run the simulator use the following: python3 rrsim.py [config_path] wh

    1 Jan 18, 2022
    LSUN Dataset Documentation and Demo Code

    LSUN Please check LSUN webpage for more information about the dataset. Data Release All the images in one category are stored in one lmdb database fil

    Fisher Yu 426 Jan 02, 2023
    Source code for Zalo AI 2021 submission

    zalo_ltr_2021 Source code for Zalo AI 2021 submission Solution: Pipeline We use the pipepline in the picture below: Our pipeline is combination of BM2

    128 Dec 27, 2022
    KITTI-360 Annotation Tool is a framework that developed based on python(cherrypy + jinja2 + sqlite3) as the server end and javascript + WebGL as the front end.

    KITTI-360 Annotation Tool is a framework that developed based on python(cherrypy + jinja2 + sqlite3) as the server end and javascript + WebGL as the front end.

    86 Dec 12, 2022
    Fiddle is a Python-first configuration library particularly well suited to ML applications.

    Fiddle Fiddle is a Python-first configuration library particularly well suited to ML applications. Fiddle enables deep configurability of parameters i

    Google 227 Dec 26, 2022
    RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching

    RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching This repository contains the source code for our paper: RAFT-Stereo: Multilevel

    Princeton Vision & Learning Lab 328 Jan 09, 2023
    基于PaddleClas实现垃圾分类,并转换为inference格式用PaddleHub服务端部署

    百度网盘链接及提取码: 链接:https://pan.baidu.com/s/1HKpgakNx1hNlOuZJuW6T1w 提取码:wylx 一个垃圾分类项目带你玩转飞桨多个产品(1) 基于PaddleClas实现垃圾分类,导出inference模型并利用PaddleHub Serving进行服务

    thomas-yanxin 22 Jul 12, 2022
    A fast and easy to use, moddable, Python based Minecraft server!

    PyMine PyMine - The fastest, easiest to use, Python-based Minecraft Server! Features Note: This list is not always up to date, and doesn't contain all

    PyMine 144 Dec 30, 2022
    The code for our CVPR paper PISE: Person Image Synthesis and Editing with Decoupled GAN, Project Page, supp.

    PISE The code for our CVPR paper PISE: Person Image Synthesis and Editing with Decoupled GAN, Project Page, supp. Requirement conda create -n pise pyt

    jinszhang 110 Nov 21, 2022
    EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings

    SimCSE: Simple Contrastive Learning of Sentence Embeddings This repository contains the code and pre-trained models for our paper SimCSE: Simple Contr

    Princeton Natural Language Processing 2.5k Dec 29, 2022
    BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search

    BossNAS This repository contains PyTorch evaluation code, retraining code and pretrained models of our paper: BossNAS: Exploring Hybrid CNN-transforme

    Changlin Li 127 Dec 26, 2022
    Python implementation of NARS (Non-Axiomatic-Reasoning-System)

    Python implementation of NARS (Non-Axiomatic-Reasoning-System)

    Bowen XU 11 Dec 20, 2022
    Tensorflow implementation of "Learning Deep Features for Discriminative Localization"

    Weakly_detector Tensorflow implementation of "Learning Deep Features for Discriminative Localization" B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and

    Taeksoo Kim 363 Jun 29, 2022