Implementing the AlphaZero algorithm for multiple games with PGX and MCTX

sr5434/AlphaZero

AlphaZero

Implementing the AlphaZero algorithm (but with the Gumbel MuZero policy) for multiple games with PGX and MCTX. Thanks to Google's TensorFlow Research Cloud for providing compute resources. The training script is modified from the sample AlphaZero script provided in the PGX GitHub repo.

Scripts:

  • main.py: AlphaZero for playing games (Atari, Go, Poker, etc.)
  • test.py: Test the model and output an SVG of it playing itself.
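
For readers unfamiliar with the Gumbel MuZero policy mentioned above, here is a minimal sketch of its first ingredient: Gumbel-Top-k sampling, which picks k distinct root actions by adding Gumbel noise to the policy logits and keeping the k largest. This is not code from this repository. In practice MCTX handles this step internally (for example in `mctx.gumbel_muzero_policy`), and the function name `gumbel_top_k`, the example logits, and the seed are hypothetical choices made for illustration. The sketch uses only the Python standard library and omits the Sequential Halving and Q-value steps that follow in the full policy.

```python
import math
import random


def gumbel_top_k(logits, k, rng):
    """Sample k distinct action indices with the Gumbel-Top-k trick.

    Hypothetical helper for illustration only; not part of PGX or MCTX.
    """
    if not 0 < k <= len(logits):
        raise ValueError("k must be between 1 and len(logits)")
    perturbed = []
    for action, logit in enumerate(logits):
        # Clamp away from 0 so that log(u) is always defined.
        u = max(rng.random(), 1e-12)
        # Standard Gumbel(0, 1) noise via inverse transform sampling.
        gumbel = -math.log(-math.log(u))
        perturbed.append((logit + gumbel, action))
    # Keeping the k largest perturbed logits yields a sample of
    # k actions without replacement from softmax(logits).
    perturbed.sort(reverse=True)
    return [action for _, action in perturbed[:k]]


if __name__ == "__main__":
    rng = random.Random(0)  # assumed seed, for reproducibility only
    example_logits = [2.0, 0.5, 0.1, -1.0]  # made-up policy logits
    print(gumbel_top_k(example_logits, k=2, rng=rng))
```

Because the noise is drawn once per action, the selected actions are always distinct, which is what lets the search spend a small simulation budget on several different candidate moves.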

Checkpoints (more coming soon):

Othello model plays itself (step 15145):

[Image: AI plays itself in a game of Othello]
