A Cultural Algorithm for Pomdps From Stochastic Inventory Control

dc.contributor.author Prestwich S.D.
dc.contributor.author Tarim S.A.
dc.contributor.author Rossi R.
dc.contributor.author Hnich B.
dc.date.accessioned 2023-06-16T14:58:01Z
dc.date.available 2023-06-16T14:58:01Z
dc.date.issued 2008
dc.description 5th International Workshop on Hybrid Metaheuristics, HM 2008 -- 8 October 2008 through 9 October 2008 -- Malaga -- 74367 en_US
dc.description.abstract Reinforcement Learning algorithms such as SARSA with an eligibility trace, and Evolutionary Computation methods such as genetic algorithms, are competing approaches to solving Partially Observable Markov Decision Processes (POMDPs) which occur in many fields of Artificial Intelligence. A powerful form of evolutionary algorithm that has not previously been applied to POMDPs is the cultural algorithm, in which evolving agents share knowledge in a belief space that is used to guide their evolution. We describe a cultural algorithm for POMDPs that hybridises SARSA with a noisy genetic algorithm, and inherits the latter's convergence properties. Its belief space is a common set of state-action values that are updated during genetic exploration, and conversely used to modify chromosomes. We use it to solve problems from stochastic inventory control by finding memoryless policies for nondeterministic POMDPs. Neither SARSA nor the genetic algorithm dominates the other on these problems, but the cultural algorithm outperforms the genetic algorithm, and on highly non-Markovian instances also outperforms SARSA. © 2008 Springer Berlin Heidelberg. en_US
dc.identifier.doi 10.1007/978-3-540-88439-2_2
dc.identifier.isbn 3540884386
dc.identifier.isbn 9783540884385
dc.identifier.issn 0302-9743
dc.identifier.scopus 2-s2.0-57049126222
dc.identifier.uri https://doi.org/10.1007/978-3-540-88439-2_2
dc.identifier.uri https://hdl.handle.net/20.500.14365/3397
dc.language.iso en en_US
dc.publisher Springer Verlag en_US
dc.relation.ispartof Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Genetic algorithms en_US
dc.subject Heuristic algorithms en_US
dc.subject Inventory control en_US
dc.subject Learning algorithms en_US
dc.subject Markov processes en_US
dc.subject Reinforcement learning en_US
dc.subject Stochastic systems en_US
dc.subject Convergence properties en_US
dc.subject Cultural Algorithm en_US
dc.subject Eligibility traces en_US
dc.subject Memoryless policy en_US
dc.subject Non-Markovian en_US
dc.subject Partially observable Markov decision process en_US
dc.subject Share knowledge en_US
dc.subject Stochastic inventory controls en_US
dc.subject Evolutionary algorithms en_US
dc.title A Cultural Algorithm for Pomdps From Stochastic Inventory Control en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.scopusid 7004234709
gdc.author.scopusid 35563636800
gdc.author.scopusid 6602458958
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.departmenttemp Prestwich, S.D., Cork Constraint Computation Centre, Ireland; Tarim, S.A., Department of Management, Hacettepe University, Turkey; Rossi, R., Cork Constraint Computation Centre, Ireland; Hnich, B., Faculty of Computer Science, Izmir University of Economics, Turkey en_US
gdc.description.endpage 28 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q3
gdc.description.startpage 16 en_US
gdc.description.volume 5296 LNCS en_US
gdc.description.wosquality N/A
gdc.identifier.openalex W1510469449
gdc.identifier.wos WOS:000260605500002
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.downloads 1
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.7058251E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Life Science
gdc.oaire.popularity 1.127E-9
gdc.oaire.publicfunded false
gdc.oaire.views 2
gdc.openalex.collaboration National
gdc.openalex.fwci 1.4125
gdc.openalex.normalizedpercentile 0.8
gdc.opencitations.count 4
gdc.plumx.crossrefcites 4
gdc.plumx.mendeley 12
gdc.plumx.scopuscites 5
gdc.scopus.citedcount 5
gdc.wos.citedcount 3
relation.isOrgUnitOfPublication e9e77e3e-bc94-40a7-9b24-b807b2cd0319
relation.isOrgUnitOfPublication.latestForDiscovery e9e77e3e-bc94-40a7-9b24-b807b2cd0319

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2503.pdf
Size:
390.5 KB
Format:
Adobe Portable Document Format