Show simple item record

dc.contributor.authorLambrou, Ioannisen
dc.contributor.authorVassiliades, Vassilisen
dc.contributor.authorChristodoulou, Chris C.en
dc.creatorLambrou, Ioannisen
dc.creatorVassiliades, Vassilisen
dc.creatorChristodoulou, Chris C.en
dc.date.accessioned2019-11-13T10:40:54Z
dc.date.available2019-11-13T10:40:54Z
dc.date.issued2012
dc.identifier.issn0302-9743
dc.identifier.urihttp://gnosis.library.ucy.ac.cy/handle/7/54355
dc.description.abstractThis paper compares and investigates single-agent reinforcement learning (RL) algorithms on the simple and an extended taxi problem domain, and multiagent RL algorithms on a multiagent extension of the simple taxi problem domain we created. In particular, we extend the Policy Hill Climbing (PHC) and the Win or Learn Fast-PHC (WoLF-PHC) algorithms by combining them with the MAXQ hierarchical decomposition and investigate their efficiency. The results are very promising for the multiagent domain as they indicate that these two newly-created algorithms are the most efficient ones from the algorithms we compared. © 2012 Springer-Verlag.en
dc.source9th European Workshop on Reinforcement Learning, EWRL 2011en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84861673287&doi=10.1007%2f978-3-642-29946-9_26&partnerID=40&md5=2bb84cac41a9e8db20f54622e5007393
dc.subjectMulti agent systemsen
dc.subjectLearning algorithmsen
dc.subjectHierarchical decompositionsen
dc.subjectReinforcement learningen
dc.subjectProblem domainen
dc.subjectMulti-agent settingen
dc.subjectHierarchical Reinforcement Learningen
dc.subjectHill climbingen
dc.subjectMulti-agent reinforcement learningen
dc.subjectMultiagent Reinforcement Learningen
dc.subjectSingle-agenten
dc.subjectTaxi Domainen
dc.subjectTaxicabsen
dc.titleAn extension of a hierarchical reinforcement learning algorithm for multiagent settingsen
dc.typeinfo:eu-repo/semantics/article
dc.identifier.doi10.1007/978-3-642-29946-9_26
dc.description.volume7188 LNAIen
dc.description.startingpage261
dc.description.endingpage272
dc.author.faculty002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.departmentΤμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtypeArticleen
dc.description.notes<p>Sponsors: Artificial Intelligenceen
dc.description.notesAustralian National Universityen
dc.description.notesNICTAen
dc.description.notesPASCAL2en
dc.description.notesConference code: 89968</p>en
dc.source.abbreviationLect. Notes Comput. Sci.en
dc.contributor.orcidChristodoulou, Chris C. [0000-0001-9398-5256]
dc.gnosis.orcid0000-0001-9398-5256


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record