RL-for-TSP

Overview

This project compares solutions to the Traveling Salesman Problem (TSP) using both Q-Learning and a Policy Gradient approach with a neural network. The Q-Learning algorithm is based on tabular Q-values, while the Policy Gradient approach utilizes a neural network to learn a policy.

Getting Started

Prerequisites

Python (>=3.6)
NumPy
TensorFlow (for Q-Learning)
PyTorch (for Policy Gradient)
Matplotlib

Installation

Clone the repository:

 ```bash
 git clone https://github.com/your-username/traveling-salesman.git
 cd traveling-salesman

Usage

Run the file by using the folllowing command

```bash
python QL_PG.py

play-around

Change the parameters in the script and observe the graph.

Q-Learning Parameters

num_cities, num_episodes, epsilon, alpha, gamma

Policay Gradient Parameters

num_cities, input_size, hidden_size, output_size, learning_rate, num_episodes

Results

View the results for loss over episdoes for Q-Learning and Policy Gradient Respectively.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Group_6_TSP_RL_Final.pptx		Group_6_TSP_RL_Final.pptx
QL_PG.ipynb		QL_PG.ipynb
QL_PG.py		QL_PG.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL-for-TSP

Overview

Table of Contents

Getting Started

Prerequisites

Installation

Usage

play-around

Q-Learning Parameters

Policay Gradient Parameters

Results

About

Releases

Packages

Languages

batisnim/RL-for-TSP

Folders and files

Latest commit

History

Repository files navigation

RL-for-TSP

Overview

Table of Contents

Getting Started

Prerequisites

Installation

Usage

play-around

Q-Learning Parameters

Policay Gradient Parameters

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages