Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14365/889
Title: Learning what to memorize: Using intrinsic motivation to form useful memory in partially observable reinforcement learning
Authors: Demir, Alper
Keywords: Memory
Intrinsic motivation
Partial observability
Reinforcement learning
Agents
Publisher: Springer
Abstract: Reinforcement Learning faces an important challenge in partially observable environments with long-term dependencies. In order to learn in an ambiguous environment, an agent has to keep previous perceptions in a memory. Earlier memory-based approaches use a fixed method to determine what to keep in the memory, which limits them to certain problems. In this study, we follow the idea of giving the control of the memory to the agent by allowing it to take memory-changing actions. Thus, the agent becomes more adaptive to the dynamics of an environment. Further, we formalize an intrinsic motivation to support this learning mechanism, which guides the agent to memorize distinctive events and enable it to disambiguate its state in the environment. Our overall approach is tested and analyzed on several partial observable tasks with long-term dependencies. The experiments show a clear improvement in terms of learning performance compared to other memory based methods.
URI: https://doi.org/10.1007/s10489-022-04328-z
https://hdl.handle.net/20.500.14365/889
ISSN: 0924-669X
1573-7497
Appears in Collections:Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection

Files in This Item:
File SizeFormat 
889.pdf497.44 kBAdobe PDFView/Open
Show full item record



CORE Recommender

SCOPUSTM   
Citations

2
checked on Oct 2, 2024

WEB OF SCIENCETM
Citations

1
checked on Oct 2, 2024

Page view(s)

44
checked on Sep 30, 2024

Download(s)

38
checked on Sep 30, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.