AlphaZero

An implementation of the AlphaZero algorithm (but with the Gumbel MuZero policy) for multiple games, built with PGX and MCTX. Thanks to Google's TensorFlow Research Cloud for providing compute resources. The training script is modified from the sample AlphaZero script provided in the PGX GitHub repo.

Scripts:

main.py: AlphaZero for playing games (Atari, Go, Poker, etc.)
test.py: Test the model and output an SVG of it playing itself.
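
As background on the Gumbel MuZero policy mentioned above: at the root of the search it picks a small set of candidate actions without replacement using the Gumbel-Top-k trick, then spends the simulation budget on those candidates. The sketch below illustrates only that sampling step in plain Python. The function name `gumbel_top_k` and its parameters are hypothetical, chosen for illustration; this is not code from main.py and not MCTX's API.

```python
import math
import random


def gumbel_top_k(logits, k, rng):
    """Sample k distinct action indices without replacement.

    Adds independent Gumbel(0, 1) noise to each logit and keeps the k
    largest perturbed values (the Gumbel-Top-k trick). Illegal actions
    can be masked by passing float("-inf") as their logit.
    """
    if not 0 < k <= len(logits):
        raise ValueError("k must be between 1 and the number of actions")
    perturbed = []
    for action, logit in enumerate(logits):
        # rng.random() is in [0, 1); clamp away from 0 so log(u) is defined.
        u = max(rng.random(), 1e-12)
        gumbel = -math.log(-math.log(u))
        perturbed.append((logit + gumbel, action))
    # Highest perturbed logit first; the first k entries are the sample.
    perturbed.sort(reverse=True)
    return [action for _, action in perturbed[:k]]


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical policy logits for a 4-action position.
    print(gumbel_top_k([0.5, 1.0, -0.3, 2.0], k=2, rng=rng))
```

Sampling with `k=1` reduces to the ordinary Gumbel-Max trick, which draws a single action from the softmax of the logits.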

Checkpoints (more coming soon):

Othello model plays itself (step 15145):

sr5434/AlphaZero

Folders and files

Latest commit

History

Repository files navigation

AlphaZero

About

Resources

License

Stars

Watchers

Forks

Languages