Attention-based Policy Gradient RL for learning to solve Travelling Salesperson Problem

A PyTorch implementation of the attention-based Policy Gradient RL for learning to solve Travelling Salesperson Problem based on the paper https://doi.org/10.1007/978-3-319-93031-2_12

This project is the PyTorch implementation of the attention-based policy gradient for solving TSP based on the paper Learning Heuristics for the TSP by Policy Gradient [PDF here]

The paper also incorporates a 2-opt heuristic to the RL algorithm, which we did not include here. Please refer to the paper for a more detailed explanation of the methodology.

Architecture

Neural (transformer) Encoder of the Policy Net

Neural (pointer) Decoder of the Policy Net

Training

The agent is trained using the REINFORCE (policy gradient) algorithm. refer to the REINFORCE original paper.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
agent_tour_1.gif		agent_tour_1.gif
agent_tour_2.gif		agent_tour_2.gif
attn_rl_tsp.ipynb		attn_rl_tsp.ipynb
model.pt		model.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

agent_tour_1.gif

agent_tour_1.gif

agent_tour_2.gif

agent_tour_2.gif

attn_rl_tsp.ipynb

attn_rl_tsp.ipynb

model.pt

model.pt

Repository files navigation

Attention-based Policy Gradient RL for learning to solve Travelling Salesperson Problem

Architecture

Neural (transformer) Encoder of the Policy Net

Neural (pointer) Decoder of the Policy Net

Training

About

Releases

Packages

Languages

ebrahimpichka/attn-PG-RL-tsp

Folders and files

Latest commit

History

Repository files navigation

Attention-based Policy Gradient RL for learning to solve Travelling Salesperson Problem

Architecture

Neural (transformer) Encoder of the Policy Net

Neural (pointer) Decoder of the Policy Net

Training

About

Topics

Resources

Stars

Watchers

Forks

Languages