Show simple item record

dc.contributor.authorVassiliades, Vassilisen
dc.contributor.authorChristodoulou, Chris C.en
dc.creatorVassiliades, Vassilisen
dc.creatorChristodoulou, Chris C.en
dc.date.accessioned2019-11-13T10:42:57Z
dc.date.available2019-11-13T10:42:57Z
dc.date.issued2010
dc.identifier.isbn978-1-4244-6917-8
dc.identifier.urihttp://gnosis.library.ucy.ac.cy/handle/7/55133
dc.description.abstractIn this paper, we investigate the importance of rewards in Multiagent Reinforcement Learning in the context of the Iterated Prisoner's Dilemma. We use an evolutionary algorithm to evolve valid payoff structures with the aim of encouraging mutual cooperation. An exhaustive analysis is performed by investigating the effect of: i) the lower and upper bounds of the search space of the payoff values, ii) the reward sign, iii) the population size, and iv) the mutation operators used. Our results indicate that valid structures that encourage cooperation can quickly be obtained, while their analysis shows that: i) they should contain a mixture of positive and negative values and ii) the magnitude of the positive values should be much smaller than the magnitude of the negative values. © 2010 IEEE.en
dc.sourceProceedings of the International Joint Conference on Neural Networksen
dc.source2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-79959432977&doi=10.1109%2fIJCNN.2010.5596937&partnerID=40&md5=6c11543164b6a46dda112b343a1245bb
dc.subjectMulti agent systemsen
dc.subjectNeural networksen
dc.subjectLower and upper boundsen
dc.subjectMathematical operatorsen
dc.subjectPopulation statisticsen
dc.subjectNegative valuesen
dc.subjectReinforcement learningen
dc.subjectSearch spacesen
dc.subjectMulti-agent reinforcement learningen
dc.subjectIterated prisoner's dilemmaen
dc.subjectMutation operatorsen
dc.subjectPopulation sizesen
dc.subjectPositive valueen
dc.titleMultiagent reinforcement learning in the iterated prisoner's dilemma: Fast cooperation through evolved payoffsen
dc.typeinfo:eu-repo/semantics/conferenceObject
dc.identifier.doi10.1109/IJCNN.2010.5596937
dc.author.faculty002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.departmentΤμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtypeConference Objecten
dc.description.notes<p>Conference code: 85188en
dc.description.notesCited By :2</p>en
dc.contributor.orcidChristodoulou, Chris C. [0000-0001-9398-5256]
dc.gnosis.orcid0000-0001-9398-5256


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record