Description

QMario is a personal project to train an AI agent playing the SuperMarioBros. The environment is created by OpenAI gym. The specification of the environment can be found here. The framework of this project is Pytorch Lightning.

Model

The neural network is just a simple double deep q network. The replay buffer is a multistep replay buffer.

Getting started

You can install all the dependencies by using qmario.yml with conda.

Use train.py to start training.

Use test.py to test the trained model. You need to edit the checkpoint path in test.py by yourself.

Hint: You can add save_video=True when constructing the model to save videos. Don't forget to specify the value of fps. Due to the limitation of MoviePy, you need to create the folder first if you want to put the videos in a folder.

Result

Double Deep Q Network:

mario_episode0_reward6230.0.mp4

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
best_model		best_model
core		core
gifs		gifs
.gitignore		.gitignore
README.md		README.md
d3qn_train.py		d3qn_train.py
ddqn_train.py		ddqn_train.py
distd3qn_train.py		distd3qn_train.py
env_wrapper_test.py		env_wrapper_test.py
environment.yml		environment.yml
mp42gif.py		mp42gif.py
rainbow_train.py		rainbow_train.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Description

Model

Getting started

Result

Double Deep Q Network:

About

Languages

ArchiMickey/QMario

Folders and files

Latest commit

History

Repository files navigation

Description

Model

Getting started

Result

Double Deep Q Network:

About

Topics

Resources

Stars

Watchers

Forks

Languages