Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

Last update: Nov 01, 2022

Overview

Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

Supplementary Materials for Kentaro Matsuura, Junya Honda, Imad El Hanafi, Takashi Sozu, Kentaro Sakamaki "Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study" Statistics in Medicine 202x; (doi:xxxxx)

How to Setup

We recommend using Linux or WSL on Windows, because the Ray package in Python is more stable on Linux. For example, in Ubuntu 20.04 (Python 3.8 was already installed), I was able to install the necessary packages with the following commands.

Install Ray

sudo apt update
sudo apt upgrade
sudo apt install python3-pip
sudo pip3 install tensorflow numpy pandas gym
sudo apt install cmake
sudo pip3 install -U ray
sudo pip3 install 'ray[rllib]'

Install R and RPy2

echo -e "\n## For R package"  | sudo tee -a /etc/apt/sources.list
echo "deb https://cloud.r-project.org/bin/linux/ubuntu $(lsb_release -cs)-cran40/" | sudo tee -a /etc/apt/sources.list
sudo apt-key adv --keyserver keyserver.ubuntu.com --recv-keys E298A3A825C0D65DFD57CBB651716619E084DAB9
sudo apt update
sudo apt install make g++ r-base
sudo apt install libxml2-dev libssl-dev libcurl4-openssl-dev
sudo pip3 install rpy2

Install `DoseFinding` package in R

install.packages('DoseFinding')

How to Use

Change simulation settings

To change the simulation settings, it is necessary to understand MCPMod/envs/MCPModEnv.py. This part is a bit difficult because of the interaction between R and Python. Therefore, we have a plan to create an R package to use our method easily.

Obtain adaptive allocation rule

To obtain RL-MAE by learning, please run learn_RL-MAE.py like:

nohup python3 learn_RL-MAE.py > std.log 2> err.log &

To obtain other RL-methods, please change the reward_type in line 25 in learn_RL-MAE.py to something like score_TD, then run the modified file.

When we used c2-standard-4（vCPUx4, RAM16GB) on Google Cloud Platform, the learning was completed within a day.

Simulate single trial

After the learning, we will obtain a checkpoint in ~/ray_results/PPO_MCPMod-v0_[datetime]-[xxx]/checkpoint-[yyy]/. To simulate single trial using the obtained rule, please move the checkpoint files (checkpoint and checkpoint.tune_metadata) in the directory to checkpoint/ in this repository, and rename the files as you like (see the example files). Then, please run simulate-single-trial_RL-MAE.py like:

python3 simulate-single-trial_RL-MAE.py

Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

Related tags

Overview

Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

How to Setup

Install Ray

Install R and RPy2

Install `DoseFinding` package in R

How to Use

Change simulation settings

Obtain adaptive allocation rule

Simulate single trial

Owner

Kentaro Matsuura

Demo notebooks for Qiskit application modules demo sessions (Oct 8 & 15):

TUPÃ was developed to analyze electric field properties in molecular simulations

DivNoising is an unsupervised denoising method to generate diverse denoised samples for any noisy input image. This repository contains the code to reproduce the results reported in the paper https://openreview.net/pdf?id=agHLCOBM5jP

Dynamic Slimmable Network (CVPR 2021, Oral)

Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank

The 1st place solution of track2 (Vehicle Re-Identification) in the NVIDIA AI City Challenge at CVPR 2021 Workshop.

[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Multispectral Object Detection with Yolov5

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

Efficient Speech Processing Tookit for Automatic Speaker Recognition

Code for paper Decoupled Dynamic Spatial-Temporal Graph Neural Network for Traffic Forecasting

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

PyTorch implementation of 'Gen-LaneNet: a generalized and scalable approach for 3D lane detection'

Code for the paper "Generative design of breakwaters usign deep convolutional neural network as a surrogate model"

Code for "Universal inference meets random projections: a scalable test for log-concavity"

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.

Robot Servers and Server Manager software for robo-gym

Sign Language Transformers (CVPR'20)

Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

Related tags

Overview

Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

How to Setup

Install Ray

Install R and RPy2

Install DoseFinding package in R

How to Use

Change simulation settings

Obtain adaptive allocation rule

Simulate single trial

Owner

Kentaro Matsuura

Demo notebooks for Qiskit application modules demo sessions (Oct 8 & 15):

TUPÃ was developed to analyze electric field properties in molecular simulations

DivNoising is an unsupervised denoising method to generate diverse denoised samples for any noisy input image. This repository contains the code to reproduce the results reported in the paper https://openreview.net/pdf?id=agHLCOBM5jP

Dynamic Slimmable Network (CVPR 2021, Oral)

Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank

The 1st place solution of track2 (Vehicle Re-Identification) in the NVIDIA AI City Challenge at CVPR 2021 Workshop.

[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Multispectral Object Detection with Yolov5

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

Efficient Speech Processing Tookit for Automatic Speaker Recognition

Code for paper Decoupled Dynamic Spatial-Temporal Graph Neural Network for Traffic Forecasting

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

PyTorch implementation of 'Gen-LaneNet: a generalized and scalable approach for 3D lane detection'

Code for the paper "Generative design of breakwaters usign deep convolutional neural network as a surrogate model"

Code for "Universal inference meets random projections: a scalable test for log-concavity"

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.

Robot Servers and Server Manager software for robo-gym

Sign Language Transformers (CVPR'20)

Install `DoseFinding` package in R