Agent57

This repository contains unofficial code reproducing Agent57, which outperformed humans in all Atari games.

Directory File

agent.py

define agent to play a supecific environment.
buffer.py

define buffer to store experiences with priorites.
learner.py

define learner to update parameter such as q networks and functions related to intrinsic reward.
main.py

run the main pipeline.
model.py

define some models such as q network and functions related to intrinsic reward.
segment_tree.py

define segment tree which decide segment index according to the priority.
tester.py

define tester which test performance of Agent57.
utils.py

define some classes and functions such as UCB and Retrace operator.

Requirement

python==3.9.5
matplotlib==3.4.2
ray==1.4.1
lz4==3.1.3
numpy==1.21.0
omegaconf==2.1.1
torch==1.9.0

Installation

pip install -r requirements.txt

Usage

python main.py

Citation

Agent57: Outperforming the Atari Human Benchmark

https://arxiv.org/abs/2003.13350

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

agent.py

agent.py

buffer.py

buffer.py

learner.py

learner.py

main.py

main.py

model.py

model.py

requirements.txt

requirements.txt

segment_tree.py

segment_tree.py

tester.py

tester.py

utils.py

utils.py

Repository files navigation

Agent57

Directory File

Requirement

Installation

Usage

Citation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
agent.py		agent.py
buffer.py		buffer.py
learner.py		learner.py
main.py		main.py
model.py		model.py
requirements.txt		requirements.txt
segment_tree.py		segment_tree.py
tester.py		tester.py
utils.py		utils.py

yuta0821/agent57_pytorch

Folders and files

Latest commit

History

Repository files navigation

Agent57

Directory File

Requirement

Installation

Usage

Citation

About

Resources

Stars

Watchers

Forks

Languages