Show simple item record

dc.contributor.authorVassiliades, Vassilisen
dc.contributor.authorCleanthous, A.en
dc.contributor.authorChristodoulou, Chris C.en
dc.creatorVassiliades, Vassilisen
dc.creatorCleanthous, A.en
dc.creatorChristodoulou, Chris C.en
dc.date.accessioned2019-11-13T10:42:57Z
dc.date.available2019-11-13T10:42:57Z
dc.date.issued2009
dc.identifier.issn0302-9743
dc.identifier.urihttp://gnosis.library.ucy.ac.cy/handle/7/55135
dc.description.abstractThis paper investigates Multiagent Reinforcement Learning (MARL) in a general-sum game where the payoffs' structure is such that the agents are required to exploit each other in a way that benefits all agents. The contradictory nature of these games makes their study in multiagent systems quite challenging. In particular, we investigate MARL with spiking and non-spiking agents in the Iterated Prisoner's Dilemma by exploring the conditions required to enhance its cooperative outcome. According to the results, this is enhanced by: (i) a mixture of positive and negative payoff values and a high discount factor in the case of non-spiking agents and (ii) having longer eligibility trace time constant in the case of spiking agents. Moreover, it is shown that spiking and non-spiking agents have similar behaviour and therefore they can equally well be used in any multiagent interaction setting. For training the spiking agents, a novel and necessary modification enhances competition to an existing learning rule based on stochastic synaptic transmission. © 2009 Springer Berlin Heidelberg.en
dc.source19th International Conference on Artificial Neural Networks, ICANN 2009en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-70350582964&doi=10.1007%2f978-3-642-04274-4_76&partnerID=40&md5=5b93685289c75a4983eb6ec5afcf87a4
dc.subjectBackpropagationen
dc.subjectNeural networksen
dc.subjectTime constantsen
dc.subjectReinforcementen
dc.subjectFertilizersen
dc.subjectReinforcement learningen
dc.subjectMulti-agent reinforcement learningen
dc.subjectIterated prisoner's dilemmaen
dc.subjectDiscount factorsen
dc.subjectEligibility tracesen
dc.subjectMulti-agent interactionen
dc.subjectSynaptic transmissionen
dc.subjectLearning rulesen
dc.titleMultiagent reinforcement learning with spiking and non-spiking agents in the iterated prisoner's dilemmaen
dc.typeinfo:eu-repo/semantics/article
dc.identifier.doi10.1007/978-3-642-04274-4_76
dc.description.volume5768 LNCSen
dc.description.issuePART 1en
dc.description.startingpage737
dc.description.endingpage746
dc.author.faculty002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.departmentΤμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtypeArticleen
dc.description.notes<p>Conference code: 77563en
dc.description.notesCited By :4</p>en
dc.source.abbreviationLect. Notes Comput. Sci.en
dc.contributor.orcidChristodoulou, Chris C. [0000-0001-9398-5256]
dc.gnosis.orcid0000-0001-9398-5256


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record