Exploiting Reward Machines with Deep Reinforcement Learning in Continuous Action Domains

Sun, Haolin

Exploiting Reward Machines with Deep Reinforcement Learning in Continuous Action Domains

dc.contributor.advisor	Lesperance, Yves
dc.contributor.author	Sun, Haolin
dc.date.accessioned	2023-03-28T21:23:20Z
dc.date.available	2023-03-28T21:23:20Z
dc.date.copyright	2022-12-07
dc.date.issued	2023-03-28
dc.date.updated	2023-03-28T21:23:19Z
dc.degree.discipline	Computer Science
dc.degree.level	Master's
dc.degree.name	MSc - Master of Science
dc.description.abstract	Deep reinforcement learning can solve real-world robot control problems, such as autonomous driving and robotic arm manipulation. In deep reinforcement learning, an agent does not know the problem description and learns the optimal solution through trial-and-error. This method brings two major challenges when solving real-world problems: partial observability and learning efficiency. In this thesis, we address these two challenges and extend previous work. First, we use reward machines to address the problem of partial observability. Then, we focus on finding the existing cutting-edge deep reinforcement learning algorithms and integrating them with reward machines to enhance the learning efficiency. To test the performance of all the algorithms, we proposed a series of different tasks that can be used to mimic real-world robot control problems. Finally, based on the test results, we compare the performance of all the algorithms and analyze their advantages and disadvantages.
dc.identifier.uri	http://hdl.handle.net/10315/41039
dc.language	en
dc.rights	Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject	Computer science
dc.subject.keywords	Machine learning
dc.subject.keywords	Reinforcement learning
dc.subject.keywords	Reward machine
dc.subject.keywords	Deep reinforcement learning
dc.subject.keywords	Q-learning
dc.subject.keywords	DDPG
dc.subject.keywords	SAC
dc.subject.keywords	TD3
dc.subject.keywords	PPO
dc.subject.keywords	Partial observability
dc.subject.keywords	Robotic control
dc.title	Exploiting Reward Machines with Deep Reinforcement Learning in Continuous Action Domains
dc.type	Electronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Sun_Haolin_HS_2023_Masters.pdf
Size:: 2.9 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: license.txt
Size:: 1.87 KB
Format:: Plain Text
Description:

Download

Name:: YorkU_ETDlicense.txt
Size:: 3.39 KB
Format:: Plain Text
Description:

Download

Collections

Computer Science