In this article, we study the impact of reward criteria on the performance of a reinforcement learning agent for autonomous navigation. The chosen criterion is based on the percentage of positive and negative rewards that an agent can receive. Based on this criterion, three classes are formed: 'Balanced Class', 'Skewed Positive Class', and 'Skewed Negative Class'.
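As a rough illustration of this criterion, the sketch below classifies a reward scheme by the share of positive values among the rewards it can emit. The function name, the tolerance around a 50/50 split, and the example reward tables are illustrative assumptions, not the values used in the experiments or in the repository code.

```python
# Illustrative sketch of the reward-criterion classification.
# The 10% tolerance and the example reward tables are placeholders,
# not the values used in the experiments.

def classify_reward_scheme(rewards, tolerance=0.10):
    """Classify a reward scheme by the share of positive rewards it can emit.

    rewards   -- list of all reward values the environment can return
    tolerance -- how far from a 50/50 split still counts as 'balanced'
    """
    positive = sum(1 for r in rewards if r > 0)
    negative = sum(1 for r in rewards if r < 0)
    total = positive + negative
    if total == 0:
        return "undefined"
    positive_share = positive / total
    if abs(positive_share - 0.5) <= tolerance:
        return "Balanced Class"
    return "Skewed Positive Class" if positive_share > 0.5 else "Skewed Negative Class"


if __name__ == "__main__":
    # Hypothetical reward tables (goal reward, progress rewards, penalties).
    print(classify_reward_scheme([100, 1, -1, -100]))        # Balanced Class
    print(classify_reward_scheme([100, 5, 1, -10]))          # Skewed Positive Class
    print(classify_reward_scheme([100, -1, -5, -10, -100]))  # Skewed Negative Class
```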
Point Goal Navigation Task in Gazebo Simulation
- Set up TurtleBot3 and ROS 2 (Dashing) according to the manual: https://emanual.robotis.com/docs/en/platform/turtlebot3/quick-start/#pc-setup
- Set up the TurtleBot3 Simulation and Machine Learning packages (Step 6 and Step 9 in the above manual).
- Replace the dqn_agent.py and dqn_environment.py files in the TurtleBot3 machine learning package with the files provided in this repository (see the reward sketch after these steps).
After replacing the files, follow the steps described in Section 9.3.2 of the manual: https://emanual.robotis.com/docs/en/platform/turtlebot3/machine_learning/#machine-learning
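The replaced dqn_environment.py is where the class-dependent reward split takes effect. Purely as a hypothetical illustration (the terms, magnitudes, and function below are assumptions and do not reproduce the repository's actual code), a point-goal reward function typically combines a large terminal reward for reaching the goal, a large penalty for collision, and smaller per-step terms:

```python
# Hypothetical per-step reward for point-goal navigation; the terms and
# magnitudes are illustrative, not the repository's actual values.
def compute_reward(goal_distance, prev_goal_distance, min_obstacle_distance,
                   goal_reached, collided):
    if goal_reached:
        return 100.0                     # terminal positive reward
    if collided:
        return -100.0                    # terminal negative reward
    progress = prev_goal_distance - goal_distance
    reward = 5.0 * progress              # positive when moving toward the goal
    if min_obstacle_distance < 0.25:     # small penalty for driving near obstacles
        reward -= 1.0
    return reward
```

Changing which of these terms dominate (for example, shrinking the positive progress term or enlarging the penalties) is what shifts a scheme toward the skewed positive or skewed negative class studied here.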
Based on the experiments, the skewed negative class and the balanced class learned up to 74.6% and 72.6%, respectively, and showed a steady increase in average cumulative reward. The skewed positive class, on the other hand, showed no steady improvement in average cumulative reward even after training for a large number of episodes.
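The learning curves referred to above track the cumulative reward of each episode, usually smoothed over recent episodes. A minimal sketch of that smoothing is shown below; the function name, window size, and example values are assumptions for illustration only.

```python
def moving_average(episode_returns, window=100):
    """Smooth per-episode cumulative rewards with a simple moving average.

    episode_returns -- cumulative (summed) reward of each training episode
    window          -- number of recent episodes to average over (assumed value)
    """
    smoothed = []
    for i in range(len(episode_returns)):
        start = max(0, i - window + 1)
        recent = episode_returns[start:i + 1]
        smoothed.append(sum(recent) / len(recent))
    return smoothed


if __name__ == "__main__":
    returns = [-90, -50, -20, 10, 40, 80, 120]   # made-up example values
    print(moving_average(returns, window=3))
```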