Skip to main content

The game of Hex implemented for reinforcement learning in the OpenAI gym framework. Optimized for rollout speed.

Project description

MiniHex

An OpenAI gym environment that allows an agent to play the game of Hex. The aim for this environment is to be lean and have fast rollouts, as well as, variable board size. With random actions it currently achieves ~340 games per second in a 11x11 grid (original size) on a single CPU (Intel Xenon E3-1230 @3.3GHz).

Hex is a two player game and needs to be converted into a "single agent environment" to fit into the gym framework. We achieve this by requiring a opponent_policy at creation time. Each move of the agent will be immediately followed by a move of the opponent. This is a function that takes as input a board state and outputs an action.

Installation

pip install minihex

Editable installation (if you wish to tweak the environment):

git clone https://github.com/FirefoxMetzger/minihex.git
pip install -e minihex/

Minimal Working Example

import gym
import minihex


env = gym.make("hex-v0",
               opponent_policy=minihex.random_policy,
               board_size=11)

state, info = env.reset()
done = False
while not done:
    board, player = state
    action = minihex.random_policy(board, player, info)
    state, reward, done, info = env.step(action)

env.render()

if reward == -1:
    print("Player (Black) Lost")
elif reward == 1:
    print("Player (Black) Won")
else:
    print("Draw")

Debug Mode

If the environment is instantiated with debug=True each step will check if a valid action is provided, and an IndexError will be raised if an invalid one is provided. This is very useful while writing agents, e.g., if the agent maintains it's own belief over the environment and may request invalid actions. When evaluating/running at scale, however, this check can cause significant slowdown. Hence, it is only performed if explicitly requested.

Limitations

Currently the enviornment is missing the following features to go to version 1.0

  • The swap action that is used to mitigate the disadvantage of playing second.
  • RGB rendering mode
  • add environment to pypi
  • no surrender action

Bugs and Contributing

If you encounter problems, check the GitHub issue page or open a new issue there.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

minihex-1.0.1.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

minihex-1.0.1-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file minihex-1.0.1.tar.gz.

File metadata

  • Download URL: minihex-1.0.1.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.0

File hashes

Hashes for minihex-1.0.1.tar.gz
Algorithm Hash digest
SHA256 be0f3fb98260e80b80a3b773d5b72eab29b009061b5214faa40e3cd9256f84a3
MD5 9e8831444b4991e2a5de83419c017616
BLAKE2b-256 d13f0fc8b958bca58a69643865afa81f92efb518bd554789ea9a25876f8fa08d

See more details on using hashes here.

File details

Details for the file minihex-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: minihex-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 5.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.0

File hashes

Hashes for minihex-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4249c0d049445abaf0e1893acd6077bd2de8ca74393899bad241691972e90ebe
MD5 815cbf3b868e5e04c509551babb3a301
BLAKE2b-256 ab1708225b23600ff23e9797031e20ad4dd975cdc2d03199dfa5bea80e19b254

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page