Multiagent reinforcement learning in the iterated prisoner's dilemma: Fast cooperation through evolved payoffs

Vassiliades, Vassilis; Christodoulou, Chris C.

doi:10.1109/IJCNN.2010.5596937

dc.contributor.author	Vassiliades, Vassilis	en
dc.contributor.author	Christodoulou, Chris C.	en
dc.creator	Vassiliades, Vassilis	en
dc.creator	Christodoulou, Chris C.	en
dc.date.accessioned	2019-11-13T10:42:57Z
dc.date.available	2019-11-13T10:42:57Z
dc.date.issued	2010
dc.identifier.isbn	978-1-4244-6917-8
dc.identifier.uri	http://gnosis.library.ucy.ac.cy/handle/7/55133
dc.description.abstract	In this paper, we investigate the importance of rewards in Multiagent Reinforcement Learning in the context of the Iterated Prisoner's Dilemma. We use an evolutionary algorithm to evolve valid payoff structures with the aim of encouraging mutual cooperation. An exhaustive analysis is performed by investigating the effect of: i) the lower and upper bounds of the search space of the payoff values, ii) the reward sign, iii) the population size, and iv) the mutation operators used. Our results indicate that valid structures that encourage cooperation can quickly be obtained, while their analysis shows that: i) they should contain a mixture of positive and negative values and ii) the magnitude of the positive values should be much smaller than the magnitude of the negative values. © 2010 IEEE.	en
dc.source	Proceedings of the International Joint Conference on Neural Networks	en
dc.source	2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010	en
dc.source.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-79959432977&doi=10.1109%2fIJCNN.2010.5596937&partnerID=40&md5=6c11543164b6a46dda112b343a1245bb
dc.subject	Multi agent systems	en
dc.subject	Neural networks	en
dc.subject	Lower and upper bounds	en
dc.subject	Mathematical operators	en
dc.subject	Population statistics	en
dc.subject	Negative values	en
dc.subject	Reinforcement learning	en
dc.subject	Search spaces	en
dc.subject	Multi-agent reinforcement learning	en
dc.subject	Iterated prisoner's dilemma	en
dc.subject	Mutation operators	en
dc.subject	Population sizes	en
dc.subject	Positive value	en
dc.title	Multiagent reinforcement learning in the iterated prisoner's dilemma: Fast cooperation through evolved payoffs	en
dc.type	info:eu-repo/semantics/conferenceObject
dc.identifier.doi	10.1109/IJCNN.2010.5596937
dc.author.faculty	002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences
dc.author.department	Τμήμα Πληροφορικής / Department of Computer Science
dc.type.uhtype	Conference Object	en
dc.description.notes	<p>Conference code: 85188	en
dc.description.notes	Cited By :2</p>	en
dc.contributor.orcid	Christodoulou, Chris C. [0000-0001-9398-5256]
dc.gnosis.orcid	0000-0001-9398-5256

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Τμήμα Πληροφορικής / Department of Computer Science [1952]

Show simple item record