An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

Last update: Jun 09, 2022

Overview

Agar.io_Q-Learning_AI

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions.

An image of the circle categorisation function in action. Food blobs are outlined in blue, edible cells in green and dangerous cells in red according to where our program detects them. Screen edges mess that up a bit. The agents action at this moment is labelled with the green arrow.

States are calculated using the shortest euclidian distance to each of the three circle types: food, edible cells and dangerous cells. These distances are measured and discretized according to which interval they fall within. The rulers in this image are to scale.

Currently the agent can't press any keyboard buttons, only move around using the mouse. It could be added without too much hassle, but it would require a rework of some aspects of the code and a ton training, which already takes ages. The q-learning part could also do with a proper implementation of stochastic q-learning instead of our generic iterative q-learning, if I knew how to do it. I look forward to learning that at a later point.

Feel free to ask any questions about the code or the project. I hope you enjoy!

The humans in the experiment were subject to the same move set as the bots and agents, so only mouse movement.

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

Related tags

Overview

Agar.io_Q-Learning_AI

Owner

UIUCTF 2021 Public Challenge Repository

A GUI for Face Recognition, based upon Docker, Tkinter, GPU and a camera device.

Evaluation suite for large-scale language models.

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

a grammar based feedback fuzzer

The official implementation for "FQ-ViT: Fully Quantized Vision Transformer without Retraining".

Yolo object detection - Yolo object detection with python

Neural network chess engine trained on Gary Kasparov's games.

Clustering is a popular approach to detect patterns in unlabeled data

Impelmentation for paper Feature Generation and Hypothesis Verification for Reliable Face Anti-Spoofing

Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization.

[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection(ICCV 2021)

Udacity's CS101: Intro to Computer Science - Building a Search Engine

Notification Triggers for Python

This project is for a Twitter bot that monitors a bird feeder in my backyard. Any detected birds are identified and posted to Twitter.

Docker containers of baseline agents for the Crafter environment

MMdet2-based reposity about lightweight detection model: Nanodet, PicoDet.

C3d-pytorch - Pytorch porting of C3D network, with Sports1M weights

Entity-Based Knowledge Conflicts in Question Answering.

Fast Neural Representations for Direct Volume Rendering