Overview

The game is based on the sport of bowling and can be played by a single player or by two players alternating turns.

In all six variations, a game lasts for 10 frames, or turns. At the start of each frame, the current player is given two chances to roll a bowling ball down an alley and knock down as many of the ten pins as possible. The bowler (on the left side of the screen) may move up and down his end of the alley to aim before releasing the ball, and in four of the game's six variations the ball can be steered after release, before it hits the pins. Knocking down every pin on the first roll is a strike, while knocking down every pin using both rolls is a spare. The player's score is determined by the number of pins knocked down across all 10 frames, as well as by the number of strikes and spares earned.

Description from Wikipedia
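
For reference, the environment itself is exposed through the Arcade Learning Environment. The snippet below is a minimal sketch of loading Bowling through the Gymnasium Atari interface and rolling out one episode with a random policy; the `ALE/Bowling-v5` id and the `ale_py` registration call are assumptions about the installed `gymnasium`/`ale-py` versions, not something specified on this page.

```python
# Minimal sketch (not from this page): load Atari Bowling via Gymnasium + ale-py
# and play one episode with a random policy.
import gymnasium as gym
import ale_py

gym.register_envs(ale_py)  # needed on recent gymnasium versions; older ones register on import

env = gym.make("ALE/Bowling-v5")  # assumed environment id; depends on installed versions
obs, info = env.reset(seed=0)

episode_return = 0.0
terminated = truncated = False
while not (terminated or truncated):
    action = env.action_space.sample()  # random baseline policy
    obs, reward, terminated, truncated, info = env.step(action)
    episode_return += reward

print(f"Random-policy episode return: {episode_return}")
env.close()
```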

Performances of RL Agents

We list various reinforcement learning algorithms that were tested in this environment. These results are from RL Database. If this page was helpful, please consider giving a star!


Human Starts

| Result | Algorithm | Source |
| --- | --- | --- |
| 146.5 | Human | Massively Parallel Methods for Deep Reinforcement Learning |
| 76.8 | Distributional DQN | Rainbow: Combining Improvements in Deep Reinforcement Learning |
| 69.6 | DDQN (tuned) | Deep Reinforcement Learning with Double Q-learning |
| 65.8 | Prioritized DDQN (prop, tuned) | Prioritized Experience Replay |
| 65.7 | DuDQN | Dueling Network Architectures for Deep Reinforcement Learning |
| 62.3 | DDQN | Deep Reinforcement Learning with Double Q-learning |
| 58.0 | Prioritized DQN (rank) | Prioritized Experience Replay |
| 53.95 | Gorila DQN | Massively Parallel Methods for Deep Reinforcement Learning |
| 52.0 | Prioritized DDQN (rank, tuned) | Prioritized Experience Replay |
| 50.4 | PDD DQN | Dueling Network Architectures for Deep Reinforcement Learning |
| 41.8 | A3C LSTM | Asynchronous Methods for Deep Reinforcement Learning |
| 41.2 | DQN | Massively Parallel Methods for Deep Reinforcement Learning |
| 39.4 | Rainbow | Rainbow: Combining Improvements in Deep Reinforcement Learning |
| 36.2 | A3C FF 1 day | Asynchronous Methods for Deep Reinforcement Learning |
| 35.2 | Random | Massively Parallel Methods for Deep Reinforcement Learning |
| 35.1 | A3C FF | Asynchronous Methods for Deep Reinforcement Learning |

No-op Starts

| Result | Algorithm | Source |
| --- | --- | --- |
| 160.7 | Human | Dueling Network Architectures for Deep Reinforcement Learning |
| 154.8 | Human | Human-level control through deep reinforcement learning |
| 86.5 | IQN | Implicit Quantile Networks for Distributional Reinforcement Learning |
| 85.3 | QR-DQN-0 | Distributional Reinforcement Learning with Quantile Regression |
| 81.8 | C51 | A Distributional Perspective on Reinforcement Learning |
| 81.0 | Reactor | The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning |
| 77.2 | QR-DQN-1 | Distributional Reinforcement Learning with Quantile Regression |
| 75.4 | Reactor | The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning |
| 74.1 | Distributional DQN | Rainbow: Combining Improvements in Deep Reinforcement Learning |
| 72 | DuDQN | Noisy Networks for Exploration |
| 71 | NoisyNet DQN | Noisy Networks for Exploration |
| 70.5 | DDQN | Deep Reinforcement Learning with Double Q-learning |
| 68.1 | DDQN | A Distributional Perspective on Reinforcement Learning |
| 68 | NoisyNet DuDQN | Noisy Networks for Exploration |
| 65.5 | DuDQN | Dueling Network Architectures for Deep Reinforcement Learning |
| 62 | DQN | Noisy Networks for Exploration |
| 59.92 | IMPALA (deep) | IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures |
| 59.3 | Reactor ND | The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning |
| 54.01 | Gorila DQN | Massively Parallel Methods for Deep Reinforcement Learning |
| 50.4 | DQN | A Distributional Perspective on Reinforcement Learning |
| 46.7 | PDD DQN | Dueling Network Architectures for Deep Reinforcement Learning |
| 43.9 | Linear | Human-level control through deep reinforcement learning |
| 42.4 | DQN | Human-level control through deep reinforcement learning |
| 42 | NoisyNet A3C | Noisy Networks for Exploration |
| 37 | A3C | Noisy Networks for Exploration |
| 36.4 | Contingency | Human-level control through deep reinforcement learning |
| 35.73 | IMPALA (shallow) | IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures |
| 31.06 | IMPALA (deep, multitask) | IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures |
| 30.0 | Rainbow | Rainbow: Combining Improvements in Deep Reinforcement Learning |
| 24.3 | ACKTR | Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation |
| 23.1 | Random | Human-level control through deep reinforcement learning |

Normal Starts

| Result | Algorithm | Source |
| --- | --- | --- |
| 72 | DQN Ours | Deep Recurrent Q-Learning for Partially Observable MDPs |
| 65.5 | DRQN | Deep Recurrent Q-Learning for Partially Observable MDPs |
| 62 | DRQN | Deep Recurrent Q-Learning for Partially Observable MDPs |
| 57.3 | DQN Ours | Deep Recurrent Q-Learning for Partially Observable MDPs |
| 40.1 | PPO | Proximal Policy Optimization Algorithms |
| 33.3 | ACER | Proximal Policy Optimization Algorithms |
| 30.1 | A2C | Proximal Policy Optimization Algorithms |