Evolving Cooperation in the Iterated Prisoner’s Dilemma - Dr. David Mathias

Thursday, January 16, 2020 11 a.m. to noon

Abstract: 

The Prisoner’s Dilemma is a simple two-player game in which two accomplices have been captured by the police and are being interrogated separately. Each has two choices: to cooperate with their accomplice and refuse to confess; or to defect by confessing and implicating their accomplice. The reward (symmetrically, penalty) that each receives depends not only on their own action but also that of the other player. In the Iterated Prisoner’s Dilemma (IPD), the players play multiple rounds of the game allowing them to learn about their opponent’s actions and act accordingly.

The Iterated Prisoner's Dilemma has been studied in great detail in fields as diverse as economics, computer science, psychology, politics, and environmental studies. This is due, in part, to the intriguing property that its Nash Equilibrium is not a globally optimal solution. Many researchers have used evolutionary computation to evolve effective strategies for IPD. Typically treated as a single-objective problem, a player's goal is to maximize their total reward. Mittal and Deb created a multi-objective version of the game by including minimization of the opponent's reward as an additional objective. 

Here, we explore the role of mutual cooperation in IPD player performance. We implement a multi-objective genetic algorithm in which each member of the population belongs to one of four sub-populations: selfish, communal, cooperative, and selfless, the last three of which use mutual cooperation as an optimization objective. Game play occurs among all members, without regard to sub-population, while crossover and selection occur only within a sub-population. Our evolved players are tested against a population of Axelrod's strategies and the Gradual strategy. Testing solely using self score, we find that players evolved with mutual cooperation as an objective perform very well. In some cases, our cooperative players completely dominate the competition. Thus, learning to play nicely with others is a successful strategy for maximizing personal reward.

Read More

Location:

L3Harris Engineering Center: HEC 356

Contact:


Calendar:

Events at UCF

Category:

Speaker/Lecture/Seminar

Tags:

n/a