Reinforcement Learning with the Q-Learning Algorithm on Gym's FrozenLake Environment, Implemented in Python

Overview

This project trains a Q-learning agent on Gym's FrozenLake environment.
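As a rough illustration of the Q-learning update, here is a minimal sketch. It is not the repository's code: it uses no gym or PyTorch, and it replaces the environment with a hand-coded, deterministic (non-slippery) 4x4 map in the style of FrozenLake. The map, the helper names (`step`, `greedy`, `train`, `rollout`), and all hyperparameters are assumptions chosen for illustration.

```python
import random

# Hand-coded stand-in for a 4x4 FrozenLake-style map (assumption, not gym itself).
# S = start, F = frozen (safe), H = hole (episode ends), G = goal (reward 1).
LAKE = ["SFFF",
        "FHFH",
        "FFFH",
        "HFFG"]
N = 4
ACTIONS = 4  # 0 = left, 1 = down, 2 = right, 3 = up


def step(state, action):
    """Deterministic transition: returns (next_state, reward, done)."""
    row, col = divmod(state, N)
    if action == 0:
        col = max(col - 1, 0)
    elif action == 1:
        row = min(row + 1, N - 1)
    elif action == 2:
        col = min(col + 1, N - 1)
    else:
        row = max(row - 1, 0)
    tile = LAKE[row][col]
    return row * N + col, float(tile == "G"), tile in "HG"


def greedy(q, state):
    """Index of the highest-valued action in this state (first one on ties)."""
    return max(range(ACTIONS), key=lambda a: q[state][a])


def train(episodes=10000, alpha=0.5, gamma=0.95, eps_min=0.1,
          max_steps=100, seed=0):
    """Tabular Q-learning with a linearly decaying epsilon-greedy policy."""
    rng = random.Random(seed)
    q = [[0.0] * ACTIONS for _ in range(N * N)]
    for episode in range(episodes):
        # Explore heavily at first, then settle at eps_min.
        eps = max(eps_min, 1.0 - 2.0 * episode / episodes)
        state = 0
        for _ in range(max_steps):
            if rng.random() < eps:
                action = rng.randrange(ACTIONS)
            else:
                action = greedy(q, state)
            nxt, reward, done = step(state, action)
            # Q-learning target: bootstrap from the best next action,
            # except on terminal transitions.
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def rollout(q, max_steps=100):
    """Follow the greedy policy from the start; return the visited states."""
    state, path = 0, [0]
    for _ in range(max_steps):
        state, _, done = step(state, greedy(q, state))
        path.append(state)
        if done:
            break
    return path


q_table = train()
print(rollout(q_table))
```

The real FrozenLake environment is slippery by default, so transitions are stochastic and a smaller learning rate is usually a better fit than the `alpha=0.5` assumed here.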

Libraries Used

  • gym
  • NumPy
  • tqdm
  • PyTorch deep learning framework

Install Requirements

Clone the repository or download the 'requirements.txt' file, then open a terminal in the working directory and run

    pip install -r requirements.txt

to install all the requirements for this project.

Demo Video

Q-learning.mp4