dc.contributor.author | Vassiliades, Vassilis | en |
dc.contributor.author | Christodoulou, Chris C. | en |
dc.creator | Vassiliades, Vassilis | en |
dc.creator | Christodoulou, Chris C. | en |
dc.date.accessioned | 2019-11-13T10:42:57Z | |
dc.date.available | 2019-11-13T10:42:57Z | |
dc.date.issued | 2010 | |
dc.identifier.isbn | 978-1-4244-6917-8 | |
dc.identifier.uri | http://gnosis.library.ucy.ac.cy/handle/7/55133 | |
dc.description.abstract | In this paper, we investigate the importance of rewards in Multiagent Reinforcement Learning in the context of the Iterated Prisoner's Dilemma. We use an evolutionary algorithm to evolve valid payoff structures with the aim of encouraging mutual cooperation. An exhaustive analysis is performed by investigating the effect of: i) the lower and upper bounds of the search space of the payoff values, ii) the reward sign, iii) the population size, and iv) the mutation operators used. Our results indicate that valid structures that encourage cooperation can quickly be obtained, while their analysis shows that: i) they should contain a mixture of positive and negative values and ii) the magnitude of the positive values should be much smaller than the magnitude of the negative values. © 2010 IEEE. | en |
dc.source | Proceedings of the International Joint Conference on Neural Networks | en |
dc.source | 2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010 | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-79959432977&doi=10.1109%2fIJCNN.2010.5596937&partnerID=40&md5=6c11543164b6a46dda112b343a1245bb | |
dc.subject | Multi agent systems | en |
dc.subject | Neural networks | en |
dc.subject | Lower and upper bounds | en |
dc.subject | Mathematical operators | en |
dc.subject | Population statistics | en |
dc.subject | Negative values | en |
dc.subject | Reinforcement learning | en |
dc.subject | Search spaces | en |
dc.subject | Multi-agent reinforcement learning | en |
dc.subject | Iterated prisoner's dilemma | en |
dc.subject | Mutation operators | en |
dc.subject | Population sizes | en |
dc.subject | Positive value | en |
dc.title | Multiagent reinforcement learning in the iterated prisoner's dilemma: Fast cooperation through evolved payoffs | en |
dc.type | info:eu-repo/semantics/conferenceObject | |
dc.identifier.doi | 10.1109/IJCNN.2010.5596937 | |
dc.author.faculty | 002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences | |
dc.author.department | Τμήμα Πληροφορικής / Department of Computer Science | |
dc.type.uhtype | Conference Object | en |
dc.description.notes | <p>Conference code: 85188 | en |
dc.description.notes | Cited By :2</p> | en |
dc.contributor.orcid | Christodoulou, Chris C. [0000-0001-9398-5256] | |
dc.gnosis.orcid | 0000-0001-9398-5256 | |