Policy Gradient Implementations

Implementations of reinforcement learning algorithms based on policy gradients.

Prerequisites

Python 3.6

Installation

Using poetry:

poetry install

Using pip:

pip install -r requirements.txt

Using pip to install development dependencies too:

pip install -r requirements.dev.txt

On Google Colab, to avoid conflicts with preinstalled packages:

pip install -r requirements.colab.txt

Running experiments

CLI

An executable is provided in ./bin. From the root directory run:

./bin/policy_gradients <algorithm>

To see the full list of options, including available algorithms:

./bin/policy_gradients --help

Several pre-trained models are provided in ./models. For example, to watch a pre-trained SAC agent operating in the InvertedPendulumBulletEnv-v0 environment, run:

./bin/policy_gradients sac -n 1 --env InvertedPendulumBulletEnv-v0 --eval --render --load_dir ./models
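If you want to launch evaluations from a script, the command above can be assembled as an argument list. The following is a minimal sketch, not part of the package: build_eval_command is a hypothetical helper, and it uses only the flags shown in the example above. The meaning of -n is assumed here to be a count such as the number of episodes; check ./bin/policy_gradients --help to confirm.

```python
import shlex
from typing import List


def build_eval_command(
    algorithm: str,
    env: str,
    load_dir: str,
    n: int = 1,
    render: bool = True,
) -> List[str]:
    """Build the argument list for an evaluation run (hypothetical helper).

    Only flags that appear in the README example are used. The value of
    `n` is passed to -n unchanged; its exact meaning is an assumption.
    """
    command = ["./bin/policy_gradients", algorithm, "-n", str(n), "--env", env, "--eval"]
    if render:
        command.append("--render")
    command.extend(["--load_dir", load_dir])
    return command


command = build_eval_command("sac", "InvertedPendulumBulletEnv-v0", "./models")
# Quote each part so the printed line can be pasted into a shell.
command_line = " ".join(shlex.quote(part) for part in command)
print(command_line)
```

The resulting list can be passed to subprocess.run from the repository root, which avoids shell quoting problems when an environment name or path contains unusual characters.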

Programmatic API

Call the exposed run function with an options dictionary. The options are combined with a set of default hyperparameters for the relevant algorithm. For example:

import policy_gradients

policy_gradients.run({
    "algorithm": "sac",
    "env_name": "LunarLanderContinuous-v2",
    "n_episodes": 250,
    "log_period": 10,
    "save_dir": "./models",
    "seed": 123456,
})

See parser.py for the full list of available options, and the hyperparameters.py file for the relevant algorithm to see which hyperparameters apply.
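To illustrate what combining options with defaults could look like, here is a minimal sketch. It is not the package's actual code: merge_options and EXAMPLE_DEFAULTS are hypothetical, the default values are made up for illustration, and the sketch assumes that user-supplied options take precedence over defaults.

```python
from typing import Any, Dict

# Made-up defaults for illustration only. The real values live in the
# hyperparameters.py file for each algorithm.
EXAMPLE_DEFAULTS: Dict[str, Any] = {
    "gamma": 0.99,
    "n_episodes": 1000,
    "log_period": 1,
}


def merge_options(defaults: Dict[str, Any], options: Dict[str, Any]) -> Dict[str, Any]:
    """Return a new dict in which user options override defaults.

    Assumption: user options win on conflict. Neither input is mutated.
    """
    merged = dict(defaults)
    merged.update(options)
    return merged


merged = merge_options(
    EXAMPLE_DEFAULTS,
    {"algorithm": "sac", "n_episodes": 250, "log_period": 10},
)
print(merged)
```

Under this assumption, any key you omit from the options dictionary falls back to the algorithm's default, so a call only needs to list the values you want to change.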

Notebooks

A set of notebooks in ./notebooks demonstrates how to train each algorithm in an appropriate environment using the programmatic API. Each notebook includes a link to open it in Google Colab. To run a notebook locally, start a Jupyter notebook server and open the relevant notebook in the browser window, which should open automatically:

jupyter notebook

Development

The following commands assume the requirements have been installed. If you are using poetry, either run them after poetry shell or prefix each one with poetry run.

Lint

pylint ./policy_gradients

Typecheck

mypy

Format

black ./policy_gradients

Generating requirements files

./scripts/generate_requirements.sh
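The commands in this section can be collected in one place and prefixed with poetry run when needed. This is a minimal sketch, not a script shipped with the repository: DEV_COMMANDS and with_poetry_run are hypothetical names, and the commands themselves are copied from the headings above.

```python
from typing import Dict, List

# Development commands as listed in this README.
DEV_COMMANDS: Dict[str, List[str]] = {
    "lint": ["pylint", "./policy_gradients"],
    "typecheck": ["mypy"],
    "format": ["black", "./policy_gradients"],
    "requirements": ["./scripts/generate_requirements.sh"],
}


def with_poetry_run(command: List[str], inside_poetry_shell: bool) -> List[str]:
    """Prefix a command with `poetry run` unless a poetry shell is active."""
    if inside_poetry_shell:
        return list(command)
    return ["poetry", "run"] + list(command)


for name, command in DEV_COMMANDS.items():
    # Print the form to use outside of a poetry shell.
    print(name, "->", " ".join(with_poetry_run(command, inside_poetry_shell=False)))
```

Each returned list can be passed to subprocess.run from the repository root if you prefer to drive these steps from Python.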

Troubleshooting

Poetry

I had trouble installing gym with Poetry because of its Pillow dependency, which failed on something related to zlib. Setting the environment variable PKG_CONFIG_PATH="/usr/local/opt/zlib/lib/pkgconfig" fixed the problem.


willclarktech/policy-gradient-implementations
