用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

Last update: Dec 17, 2022

Overview

用强化学习玩合成大西瓜

代码地址：https://github.com/Sharpiless/play-daxigua-using-Reinforcement-Learning

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本、PARL（paddle）版本和pytorch版本。

B站：https://space.bilibili.com/470550823

CSDN：https://blog.csdn.net/weixin_44936889

AI Studio：https://aistudio.baidu.com/aistudio/personalcenter/thirdview/67156

Github：https://github.com/Sharpiless

1. 打开游戏：

这里使用pygame重写了大西瓜游戏，并封装为适合RL环境的代码。

解压图片素材：

unzip res.zip

运行：

python Main.py

即可开始游戏：

2. 训练RL模型：

RL算法采用DQN算法，其中Keras版本使用了简单的卷积神经网络来计算Q值，PRAL版本使用ResNet。

运行：

python train_keras.py

或者

python train_paddle.py

或者

python train_torch.py

开始训练：

关注我的公众号：

感兴趣的同学关注我的公众号——可达鸭的深度学习教程：

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

Related tags

Overview

用强化学习玩合成大西瓜

1. 打开游戏：

2. 训练RL模型：

关注我的公众号：

Owner

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

GANsformer: Generative Adversarial Transformers Drew A

data/code repository of "C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer"

LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

pybaum provides tools to work with pytrees which is a concept burrowed from JAX.

The official codes for the ICCV2021 Oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"

JstDoS - HTTP Protocol Stack Remote Code Execution Vulnerability

PyTorch implementation of Trust Region Policy Optimization

Blender add-on: Add to Cameras menu: View → Camera, View → Add Camera, Camera → View, Previous Camera, Next Camera

RCT-ART is an NLP pipeline built with spaCy for converting clinical trial result sentences into tables through jointly extracting intervention, outcome and outcome measure entities and their relations.

Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.

MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.

Human segmentation models, training/inference code, and trained weights, implemented in PyTorch

Meli Data Challenge 2021 - First Place Solution

Facilitating Database Tuning with Hyper-ParameterOptimization: A Comprehensive Experimental Evaluation

RETRO-pytorch - Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Improving adversarial robustness by a coupling rejection strategy

A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning

Massively parallel Monte Carlo diffusion MR simulator written in Python.