Multiagent reinforcement learning in the iterated prisoner's dilemma: Fast cooperation through evolved payoffs

Vassiliades, Vassilis; Christodoulou, Chris C.

doi:10.1109/IJCNN.2010.5596937

Conference Object

Date

2010

Author

Vassiliades, Vassilis

Christodoulou, Chris C.

ISBN

978-1-4244-6917-8

Source

Proceedings of the International Joint Conference on Neural Networks
2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010

Keyword(s):

Multi agent systems

Neural networks

Lower and upper bounds

Mathematical operators

Population statistics

Negative values

Reinforcement learning

Search spaces

Multi-agent reinforcement learning

Iterated prisoner's dilemma

Mutation operators

Population sizes

Positive value

Abstract

In this paper, we investigate the importance of rewards in Multiagent Reinforcement Learning in the context of the Iterated Prisoner's Dilemma. We use an evolutionary algorithm to evolve valid payoff structures with the aim of encouraging mutual cooperation. An exhaustive analysis is performed by investigating the effect of: i) the lower and upper bounds of the search space of the payoff values, ii) the reward sign, iii) the population size, and iv) the mutation operators used. Our results indicate that valid structures that encourage cooperation can quickly be obtained, while their analysis shows that: i) they should contain a mixture of positive and negative values and ii) the magnitude of the positive values should be much smaller than the magnitude of the negative values. © 2010 IEEE.
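As a rough illustration of what evolving "valid payoff structures" can involve, the following minimal sketch assumes that validity means the standard Iterated Prisoner's Dilemma conditions T > R > P > S and 2R > T + S. The function names, the rejection-sampling initialisation, and the clipped Gaussian mutation are illustrative assumptions and are not taken from the paper, whose actual operators and parameters are not given in this record.

```python
import random


def is_valid_ipd(t, r, p, s):
    """Standard IPD conditions (assumed here): T > R > P > S and 2R > T + S."""
    return t > r > p > s and 2 * r > t + s


def random_valid_payoffs(low, high, rng):
    """Rejection-sample a payoff tuple (T, R, P, S) within [low, high].

    Hypothetical initialisation step; the bounds correspond to the
    lower and upper bounds of the search space mentioned in the abstract.
    """
    while True:
        s, p, r, t = sorted(rng.uniform(low, high) for _ in range(4))
        if is_valid_ipd(t, r, p, s):
            return t, r, p, s


def mutate(payoffs, sigma, low, high, rng):
    """Hypothetical mutation operator: add Gaussian noise to each payoff,
    clip to [low, high], and retry until the child is a valid IPD structure."""
    while True:
        child = tuple(
            min(high, max(low, value + rng.gauss(0.0, sigma)))
            for value in payoffs
        )
        if is_valid_ipd(*child):
            return child


if __name__ == "__main__":
    rng = random.Random(0)
    parent = random_valid_payoffs(-10.0, 10.0, rng)
    child = mutate(parent, 0.5, -10.0, 10.0, rng)
    print("parent (T, R, P, S):", parent)
    print("child  (T, R, P, S):", child)
```

A full evolutionary loop would additionally need a fitness measure, for example how quickly a pair of learning agents reaches mutual cooperation under a given payoff tuple; that part is omitted here because the record does not describe it.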