Self-driving car env with PPO algorithm from stable baseline3

Last update: Dec 22, 2022

Related tags

Deep Learning Self-Driving-car

Overview

Self-driving car with RL stable baseline3

Most of the project develop from https://github.com/GerardMaggiolino/Gym-Medium-Post Please check it out!

This project focus on training self-driving car env by implementing PPO algorithm from stable baseline3

Installation

Clone the project

git clone https://github.com/SornsiriP/Self-Driving-car

Then run Gym-Medium-Post/main.py

Update

Wrap env to change observation space from box to RGB image

from simple_driving.resources.wrapper import ProcessFrame84

env = ProcessFrame84(env)

Using PPO with CNN policy instead of TRPO

from stable_baselines3 import PPO

model = PPO('CnnPolicy', env, verbose=1,learning_rate = 0.00025,tensorboard_log="./Simple-driving/",n_steps=10000,batch_size=1000,gamma=0.9995)
model.learn(total_timesteps=150000)

Normalize action space

def map_action(self, action):
  speed_range = [0,1]
  steer_range = [-0.6,0.6]
  new_speed = np.interp(action[0],[-1,1],speed_range)
  new_steer = np.interp(action[0],[-1,1],steer_range)
  return [new_speed, new_steer]

Add limited timestep reset condition

if self.current_step >1000:
  self.current_step = 0
  self.done = True

Normalize distance in reward function

previous_dist_to_goal = np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, self.prev_pos)))
current_dist_to_goal =  np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, car_ob[0:2])))

Reference

https://github.com/GerardMaggiolino/Gym-Medium-Post

https://www.etedal.net/2020/04/pybullet-panda_3.html

Contributing

Sornsiri Promma

Thanks original project from Gerard Maggiolino

Please make sure to update tests as appropriate.

Self-driving car env with PPO algorithm from stable baseline3

Related tags

Overview

Self-driving car with RL stable baseline3

Installation

Update

Reference

Contributing

Owner

Sornsiri.P

End-to-end image segmentation kit based on PaddlePaddle.

The Easy-to-use Dialogue Response Selection Toolkit for Researchers

Official implementation of EfficientPose

Neuron class provides LNU (Linear Neural Unit), QNU (Quadratic Neural Unit), RBF (Radial Basis Function), MLP (Multi Layer Perceptron), MLP-ELM (Multi Layer Perceptron - Extreme Learning Machine) neurons learned with Gradient descent or LeLevenberg–Marquardt algorithm

The codebase for Data-driven general-purpose voice activity detection.

Self-attentive task GAN for space domain awareness data augmentation.

Research on Tabular Deep Learning (Python package & papers)

Spontaneous Facial Micro Expression Recognition using 3D Spatio-Temporal Convolutional Neural Networks

[CVPR'21] DeepSurfels: Learning Online Appearance Fusion

Codes of paper "Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling"

Parametric Contrastive Learning (ICCV2021)

This application is the basic of automated online-class-joiner(for YıldızEdu) within the right time. Gets the ZOOM link by scheduled date and time.

Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation

Mixup for Supervision, Semi- and Self-Supervision Learning Toolbox and Benchmark

LOFO (Leave One Feature Out) Importance calculates the importances of a set of features based on a metric of choice,

Pytorch implementation of few-shot semantic image synthesis

This is a repo of basic Machine Learning!

[SIGGRAPH Asia 2021] Pose with Style: Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN

Codebase for Inducing Causal Structure for Interpretable Neural Networks

Marvis is Mastouri's Jarvis version of the AI-powered Python personal assistant.