Learning What To Memorize: Using Intrinsic Motivation To Form Useful Memory in Partially Observable Reinforcement Learning

Loading...
Publication Logo

Date

2023

Authors

Demir, Alper

Journal Title

Journal ISSN

Volume Title

Publisher

Springer

Open Access Color

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

Reinforcement Learning faces an important challenge in partially observable environments with long-term dependencies. In order to learn in an ambiguous environment, an agent has to keep previous perceptions in a memory. Earlier memory-based approaches use a fixed method to determine what to keep in the memory, which limits them to certain problems. In this study, we follow the idea of giving the control of the memory to the agent by allowing it to take memory-changing actions. Thus, the agent becomes more adaptive to the dynamics of an environment. Further, we formalize an intrinsic motivation to support this learning mechanism, which guides the agent to memorize distinctive events and enable it to disambiguate its state in the environment. Our overall approach is tested and analyzed on several partial observable tasks with long-term dependencies. The experiments show a clear improvement in terms of learning performance compared to other memory based methods.

Description

Keywords

Memory, Intrinsic motivation, Partial observability, Reinforcement learning, Agents, FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Machine Learning (cs.LG)

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, 01 natural sciences, 0105 earth and related environmental sciences

Citation

WoS Q

Q2

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
1

Source

Applıed Intellıgence

Volume

53

Issue

Start Page

19074

End Page

19092
PlumX Metrics
Citations

Scopus : 3

Captures

Mendeley Readers : 8

SCOPUS™ Citations

3

checked on Mar 15, 2026

Web of Science™ Citations

2

checked on Mar 15, 2026

Page Views

3

checked on Mar 15, 2026

Downloads

2

checked on Mar 15, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.0

Sustainable Development Goals

17

PARTNERSHIPS FOR THE GOALS
PARTNERSHIPS FOR THE GOALS Logo