Overview
Taking on the role of an explorer grave robbing Tutankhamun’s tomb while exploring dozens of rooms, the player is chased by creatures such as asps, vultures, parrots, bats, dragons, and even curses, all that kill the player on contact. The explorer can fight back by firing lasers at the creatures, but he can only cover the left and right directions. The player is also endowed with a single screen-clearing “flash bomb” per room or life. Finally, each room has warp zones that teleport the player around the room, which enemies cannot use.
To progress, the player collects keys open locked doors throughout the rooms, searching for the large exit door. Optional treasures can be picked-up for bonus points. Each room has a timer; when it reaches zero the explorer can no longer fire lasers, and once a room is cleared the remaining time is converted to bonus points.
Description from Wikipedia
Performances of RL Agents
We list various reinforcement learning algorithms that were tested in this environment. These results are from RL Database. If this page was helpful, please consider giving a star!
Human Starts
No-op Starts
Normal Starts
Result | Algorithm | Source |
---|---|---|
280.8 | ACER | Proximal Policy Optimization Algorithm |
254.4 | PPO | Proximal Policy Optimization Algorithm |
206.8 | A2C | Proximal Policy Optimization Algorithm |