Differentiable Trust Region Layers for Deep Reinforcement Learning

This is the official code for the paper "Differentiable Trust Region Layers for Deep Reinforcement Learning" by Fabian Otto et al., accepted at ICLR 2021. The code allows users to reproduce and extend the results reported in the paper. Please cite the paper when reporting, reproducing, or extending these results.

[OpenReview][Arxiv]

Purpose of the project

This software is a research prototype, solely developed for and published as part of the publication "Differentiable Trust Region Layers for Deep Reinforcement Learning". It will neither be maintained nor monitored in any way.

Requirements, test, install, use, etc.

Prerequisites

If you plan to use MuJoCo environments, make sure MuJoCo is installed beforehand.

Installation

Clone the repository and change to the project root:

cd path/to/trust-region-layers

Create and activate a virtualenv, then install the required packages from the provided requirements.txt:

pip install -r requirements.txt

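Note: To use the KL projection, you also need to install the optimized C++ implementation by following the instructions linked from the original README. The C++ sources are presumably those in the repository's cpp_projection directory.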

Run Experiments

Hyperparameters can be found and adjusted in the corresponding config files.

To run a single experiment, execute e.g.

python3 main.py configs/pg/mujoco_config.json
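One way to adjust hyperparameters without editing the shipped config is to write a modified copy and pass that copy to main.py. The following is only a minimal sketch: the helper `override_config` is hypothetical and not part of this repository, and it assumes the config is a JSON object whose hyperparameters are top-level keys. Check the actual layout of the config file before relying on it.

```python
import json
from pathlib import Path


def override_config(src, dst, overrides):
    """Copy the JSON config at `src` to `dst`, replacing top-level keys.

    Hypothetical helper; assumes the config is a JSON object and that
    every key in `overrides` already exists at the top level.
    """
    config = json.loads(Path(src).read_text())

    # Reject keys the original config does not define, since a
    # misspelled key is the most likely cause.
    unknown = [key for key in overrides if key not in config]
    if unknown:
        raise KeyError(f"keys not present in {src}: {unknown}")

    config.update(overrides)

    dst = Path(dst)
    dst.parent.mkdir(parents=True, exist_ok=True)
    dst.write_text(json.dumps(config, indent=2))
    return config
```

The resulting file can then be passed to main.py in place of configs/pg/mujoco_config.json. The original config stays untouched, which keeps the reference settings available for comparison.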

To run multiple experiments, pass the directory containing the config files to main.py:

python3 main.py configs/pg/my_agent_configs/ --num-threads 10

Here, --num-threads sets the number of parallel jobs that process the queue of provided runs.
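A directory of config files for such a batch could be generated programmatically. The sketch below is an assumption-laden illustration, not code from this repository: `write_sweep` is a hypothetical helper, the file names `run_000.json`, `run_001.json`, ... are arbitrary, and the varied key is assumed to be a top-level entry of the config.

```python
import copy
import json
from pathlib import Path


def write_sweep(base_config, out_dir, key, values):
    """Write one JSON config per value of `key` into `out_dir`.

    Hypothetical helper; `base_config` is a dict loaded from an existing
    config file, and `key` is assumed to be a top-level entry.
    Returns the list of written paths.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    paths = []
    for i, value in enumerate(values):
        # Deep copy so nested structures are not shared between runs.
        config = copy.deepcopy(base_config)
        config[key] = value

        path = out_dir / f"run_{i:03d}.json"
        path.write_text(json.dumps(config, indent=2))
        paths.append(path)
    return paths
```

The output directory can then be passed to main.py together with --num-threads, as in the command above.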

Citation

If you use this work, please cite:

@inproceedings{otto_iclr2021,
  title={Differentiable Trust Region Layers for Deep Reinforcement Learning},
  author={Otto, Fabian and Becker, Philipp and Anh Vien, Ngo and Ziesche, Hanna Carolin and Neumann, Gerhard},
  booktitle={International Conference on Learning Representations},
  year={2021}
}

License

trust_region_layers is open-sourced under the AGPL-3.0 license. See the LICENSE file for details. For a list of other open source components included in trust_region_layers, see the file 3rd-party-licenses.txt.
