Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14365/3397
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Prestwich S.D. | - |
dc.contributor.author | Tarim S.A. | - |
dc.contributor.author | Rossi R. | - |
dc.contributor.author | Hnich B. | - |
dc.date.accessioned | 2023-06-16T14:58:01Z | - |
dc.date.available | 2023-06-16T14:58:01Z | - |
dc.date.issued | 2008 | - |
dc.identifier.isbn | 3540884386 | - |
dc.identifier.isbn | 9783540884385 | - |
dc.identifier.issn | 0302-9743 | - |
dc.identifier.uri | https://doi.org/10.1007/978-3-540-88439-2_2 | - |
dc.identifier.uri | https://hdl.handle.net/20.500.14365/3397 | - |
dc.description | 5th International Workshop on Hybrid Metaheuristics, HM 2008 -- 8 October 2008 through 9 October 2008 -- Malaga -- 74367 | en_US |
dc.description.abstract | Reinforcement Learning algorithms such as SARSA with an eligibility trace, and Evolutionary Computation methods such as genetic algorithms, are competing approaches to solving Partially Observable Markov Decision Processes (POMDPs) which occur in many fields of Artificial Intelligence. A powerful form of evolutionary algorithm that has not previously been applied to POMDPs is the cultural algorithm, in which evolving agents share knowledge in a belief space that is used to guide their evolution. We describe a cultural algorithm for POMDPs that hybridises SARSA with a noisy genetic algorithm, and inherits the latter's convergence properties. Its belief space is a common set of state-action values that are updated during genetic exploration, and conversely used to modify chromosomes. We use it to solve problems from stochastic inventory control by finding memoryless policies for nondeterministic POMDPs. Neither SARSA nor the genetic algorithm dominates the other on these problems, but the cultural algorithm outperforms the genetic algorithm, and on highly non-Markovian instances also outperforms SARSA. © 2008 Springer Berlin Heidelberg. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Springer Verlag | en_US |
dc.relation.ispartof | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Genetic algorithms | en_US |
dc.subject | Heuristic algorithms | en_US |
dc.subject | Inventory control | en_US |
dc.subject | Learning algorithms | en_US |
dc.subject | Markov processes | en_US |
dc.subject | Reinforcement learning | en_US |
dc.subject | Stochastic systems | en_US |
dc.subject | Convergence properties | en_US |
dc.subject | Cultural Algorithm | en_US |
dc.subject | Eligibility traces | en_US |
dc.subject | Memoryless policy | en_US |
dc.subject | Non-Markovian | en_US |
dc.subject | Partially observable Markov decision process | en_US |
dc.subject | Share knowledge | en_US |
dc.subject | Stochastic inventory controls | en_US |
dc.subject | Evolutionary algorithms | en_US |
dc.title | A cultural algorithm for POMDPs from stochastic inventory control | en_US |
dc.type | Conference Object | en_US |
dc.identifier.doi | 10.1007/978-3-540-88439-2_2 | - |
dc.identifier.scopus | 2-s2.0-57049126222 | en_US |
dc.authorscopusid | 7004234709 | - |
dc.authorscopusid | 35563636800 | - |
dc.authorscopusid | 6602458958 | - |
dc.identifier.volume | 5296 LNCS | en_US |
dc.identifier.startpage | 16 | en_US |
dc.identifier.endpage | 28 | en_US |
dc.identifier.wos | WOS:000260605500002 | en_US |
dc.relation.publicationcategory | Conference Item - International - Institutional Faculty Member | en_US |
dc.identifier.scopusquality | Q3 | - |
dc.identifier.wosquality | N/A | - |
item.grantfulltext | open | - |
item.openairetype | Conference Object | - |
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | - |
item.fulltext | With Fulltext | - |
item.languageiso639-1 | en | - |
item.cerifentitytype | Publications | - |
Appears in Collections: | Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection; WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection |
Scopus™ citations: 5 (checked on Nov 20, 2024)
Web of Science™ citations: 3 (checked on Nov 20, 2024)
Page view(s): 68 (checked on Nov 18, 2024)
Download(s): 20 (checked on Nov 18, 2024)
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.