Game Theory Course – Page 2 – Immutable Distribution

February 27, 2019March 12, 2019

Game Theory 4: Non-Cooperative Games

NON-COOPERATIVE GAMES

In studying the dynamics of cooperation and competition between actors, understanding the structure of the game that is being played is central to understanding the system of interest.

In game theory, a primary distinction is made between those game structures that are cooperative and those that are non-cooperative.

As we will see the fundamental dynamics surrounding the whole game are altered as we go from games whose structure is innately competitive to those games where cooperation is the default position.

A cooperative game is one wherein the agents are able to resort to some institution or third party in order to enable cooperation and optimal results for all.

A game is noncooperative if players cannot form the structures required to enable cooperation.

For example, we might think about two people wishing to make a commercial transaction online. Given two anonymous people interacting without some institution to enable cooperation, there is no reason for either to think that the other will carry through with the transaction as promised.

The seller is incentivised to take the money and not send the item while the buyer is likewise incentivised to take the product without sending the money. In the absence of some cooperative structure that would enable each party to trust the other and thus cooperate, the game would naturally gravitate towards defection and the potentially valuable transaction would not take place.

Thus we can see how in the absence of cooperative mechanisms each player may follow the course that renders them the best payoff without regard for what the other does, or what is optimal for the overall system and this can result in suboptimal outcomes for all.

In non-cooperative games, each agent in the game is assumed to act in their self-interest, and this self-interested agent is the primary unit of analysis within noncooperative games because there is no cooperative structure.

This is in contrast to cooperative game theory that treats groups or subgroups of agents as the unit of analysis and assumes they can achieve certain outcomes among themselves through binding cooperative agreements.

Game theory historically has been very much focused on non-cooperative games and trying to find optimal strategies within such a context. This is likely because non-cooperative games are very much amenable to our standard mathematical framework and thus offer nice closed form solutions.

But it is important to note that the real world is made up of situations that are sometimes cooperative, sometimes non-cooperative, and often involve elements of both.

As previously mentioned, non-cooperative games arise due to a number of factors. Firstly the game may be inherently zero-sum, meaning what one wins the other loses and thus there is an inherent dynamic of competition.

Many sports games are specifically designed to be zero-sum in their structure, so as to create a dynamic of competition. In such a case there is only one prize, and if someone else gets it, you don’t. There is no incentive for cooperation and every incentive for competition and thus the best option is for the actor to focus on maximizing their payoff irrespective of all else.

This is called a strictly competitive game. A strictly competitive game is a game in which the interests of each player are diametrically opposed.

Likewise, a game may be non-competitive due to the incapacity to create cooperative structures. Most people, when engaged in a game, will wish to not only optimize their own payoff but will wish to optimize the overall outcome as well.

In general, people do not like the idea of waste or of unfairness and we typically search for some optimal solution given both our own interests and some consideration for the overall organization.

The real world of social interaction is full of all sorts of informal social and cultural institutions designed to enable trust, cooperation and optimal outcomes for all.

Almost as soon as two people start to interact they will start to look for commonalities and shared interests that enable them to develop trust and cooperation.

Thus, non-cooperative games are typically those where the actors can not interact and form the trust required for cooperation. Indeed, there will be certain games that we construct where we specifically want competition and we do that by not allowing the players to cooperate, such as in a competitive market.

Lastly non-cooperative games can be a product of an incapacity to enforce binding contracts. If there is a third party involved to ensure optimal outcomes for the overall organization through sanctions and incentives, this can form a solid basis for cooperation – in the way that a government does by enforcing laws.

This is famously captured in Thomas Hobbes’ conception of the state of nature. Where he pondered “What was life like before civil society?” He went on to write “during the time men live without a common power to keep them all in awe, they are in that condition which is called war, and such a war as is of every man against every man.”

In this state, every person has a natural right or liberty to do anything one thinks necessary for preserving one’s own life.

Hobbes’ ideas illustrate vividly how in the absence of a third party to enforce cooperation, competition can prevail.

EQUILIBRIUM ANALYSIS

Non-cooperative games create a specific dynamic within a game, where we are taking the individual and their payoff as the basic unit of analysis. In such a circumstance we do not need to consider what is best for all if given some form of cooperation because this is not possible within the context.

We are solely interested in how the individuals will act.

The question of how should they act to optimize their own payoff, and given the assumption that both are performing this optimization what will be a stable solution to the game.

Given these assumptions, both players should search for a strategy that optimizes their payoff, and where those strategies of the players interact we should have a stable outcome, that we should be able to predict will occur.

This stable outcome is what we call an equilibrium.

Where equilibrium, in the general sense, means a state in which opposing forces are balanced, thus creating a point of stability and stasis.

When we see a ball at the bottom of a bowl it is in a state of equilibrium, because if we put it anywhere else in the bowl the force of gravity would act on it to pull it back to this static point. This is the same for the actors in a non-cooperative game because they are both trying to optimize their payoff they will both naturally gravitate towards the strategy that gives them the highest payoff.

But because their payoff is dependent on what strategy the other chooses and because they can not depend upon cooperation between them, they have to choose the best strategy assuming that the other will work to optimize their payoff without cooperating.

This point of equilibrium in a game is called the Nash equilibrium after the famous mathematician John Nash.

In game theory, the Nash equilibrium is a solution concept of a non-cooperative game involving two or more players in which each player is assumed to know the equilibrium strategies of the other players, and no player has anything to gain by changing only his or her own strategy. If each player has chosen a strategy and no player can benefit by changing strategies while the other players keep theirs unchanged, then the current set of strategy choices and the corresponding payoffs constitute a Nash equilibrium.

The Nash equilibrium is one of the foundational concepts in game theory.

The basic intuition of the Nash equilibrium is in predicting what others will do given their self-interest only and then choosing your optimal strategy given that assumption.

Nash equilibrium is a point where all players are doing their best given the absence of cooperation. It is a law that no one would want to change in the absence of some effective overall structure for coordination.

PRISONERS GAME

Nash equilibrium is best illustrated through the prisoner’s dilemma game.

The prisoner’s dilemma game is a classic two player game that is often used to present the concept of Nash equilibrium in a payoff matrix form.

Conceive of two prisoners detained in separate cells, interrogated simultaneously and offered deals in the form of lighter jail sentences for betraying the other criminal. They have the option to “cooperate” with the other prisoner by not telling on them, or “defect” by betraying the other.

However, if both players defect, then they will both serve a longer sentence than if neither said anything. Lower jail sentences are here interpreted as higher payoffs.

The prisoner’s dilemma has a similar matrix as depicted for the coordination game, but the maximum reward for each player is obtained only when the players’ decisions are different. Each player improves their own situation by switching from “cooperating” to “defecting”, given the knowledge that the other player’s best decision is to “defect”. The prisoner’s dilemma thus has a single Nash equilibrium: where both players choose to defect.

What has long made this an interesting case to study is the fact that this scenario is globally inferior to “both cooperating”. That is, both players would be better off if they both chose to “cooperate” instead of both choosing to defect. However, each player could improve their own situation by breaking the mutual cooperation, no matter how the other player changes their decision.

PREDICTION

The central aim of non-cooperative game theory then is in trying to predict people’s actions within a game by finding the Nash equilibria and assuming they will play that because it is their best option.

It is then legitimate for us to ask does equilibrium analysis give us any predictive capacity over what happens in the real world? Often the outcome of experiments is not an equilibrium as predicted by the theory. This is mainly because people do not fully reason through the game in a fully logically consistent fashion.

Equilibrium is a point where everyone has figured out what everyone else will do, thus behaviorally it often does not predict what people will do the first time they play the game.

Equilibrium should more be interpreted as what will happen over a number of iterations within a non-cooperative game, as players come to better understand the game and how to reason through it.

Similar to putting a ball in a bowl, it takes time before it arrives at an equilibrium and this is what is seen in game experiments they tend over time towards the equilibrium.

For example, in a game, people are asked to choose a number between 0 to 100, with the winner being the person who is able to guess what will be 2/3 of the average figure proposed by others.

So everyone is being asked to guess a bit below the average number proposed.

In this game, only a small percentage choose the equilibrium point – which is zero – and because other people did not act rationally in this game they were wrong.

In many ways then choosing this equilibrium as a prediction of what would happen is not a good option. And this clearly diverges dramatically from what the theory tells us.

However, overtime, as the game is iterated upon the numbers chosen by people does move towards the equilibrium. Thus it tells us something about statistical averages of the system but not very much about how it will behave in the real world the first iteration of the game.

February 24, 2019February 22, 2019

Game Theory 3: Elements of Games

ELEMENTS OF GAMES

Games in game theory involve a number of central elements which we can identify as players, strategies, and payoffs. In this chapter we are going to zoom in to better understand each of these different elements to a game, talking first about the players and rationality, then strategies and payoffs.

PLAYERS

As we touched upon in a previous videos agents are abstract models of individuals or organizations which have agency. Agency means the capacity of actors to make choices and to act independently on those choices to affect the state of their environment and they do this in order to improve their state within that environment.

In order to act and make choices, agents need a value system and need some set of rules under which to make their choices so as to improve their state with respect to their value system.

A big idea here is that of rationality, and we have to be careful how we defined this idea of rationality. A dictionary definition of rationality would read something like this “based on or in accordance with reason or logic”. Rationality simply means acting according to a consistent set of rules, that are based upon some value system that provides the reason for acting.

To act rationally is to have some value system and to act in accordance with that value system.

When a for-profit business tries to sell more products, it is acting in a rational fashion, because it is acting under a set of rules to generate more of what it values.

When a person who values their community does community work, they are acting rationally. Because their actions are in accordance with their value system and thus they have a reason for acting in that fashion.

Standard game theory makes a number of quite strong assumptions about the agents involved in games. A central assumption of classical game theory is that players act according to a limited form of rationality, what is sometimes call hyperrationality.

A player is rational in this sense if it consistently acts to improve its payoff without the possibility of making mistakes, has full knowledge of other players’ interactions and the actions available to them, and has an infinite capacity to calculate a priori all possible refinements in an attempt to find the “best one.” If a game involves only rational agents, each of whom believes all other agents to be rational, then theoretical results offer accurate predictions of the games outcomes.

Agents have a single conception of value, i.e. all value is reduced to a single homogeneous form called utility. Preferences and value are well defined.

Rational agents have unlimited rationality, the idea of omnipotence, i.e. they know all relevant information when making a choice, they can compute this information and all of its consequences. Within this model, agents have perfect information, and any uncertainty can be reduced to some probability distribution. The agent’s behavior is then seen to be an optimization algorithm over their set of possibilities.

Game theory is a young field of study—less than a century old. In that time, it has made remarkable advances, but it remains far from complete.

Traditional game theory assumes that the players of games are hyperrational — that they act in best accordance with their own desires given their knowledge and beliefs. This assumption does not always appear to be a reasonable one. In certain situations, the predictions of game theory and the observed behavior of real people differ dramatically.

People in the real world operate according to a multiplicity of motives, some of the time people are in a situation where they are simply trying to optimize a single metric, but more often they are not. They are embedded within a context where they are trying to optimize according to a number of different metrics.

The fact that people aren’t always optimizing according to a single metric is illustrated in the many games where people don’t choose actions that give them the greatest payoff within that single value system.

The best empirical examples of this are taken from the dictator game. The dictator game is a very simple game, where one person is given a sum of money, say 100 dollars, this person plays the role of “the dictator,” and is then told that they must offer some amount of that money to the second participant, even if that amount is zero. Whatever amount the dictator offers to the second participant must be accepted. The second participant, should they find the amount unsatisfactory, cannot then punish the dictator in any way.

Standard economic theory assumes that all individuals act solely out of self-interest. Under this assumption, the predicted result of the dictator game is that the “dictator should keep 100% of the cake, and give nothing to the other player.” This effectively assigns the value of what the dictator shares with the second player to zero.

The actual results of this game, however, differ sharply from the predicted results. With a “standard” dictator game setup, “only 40% of the experimental subjects playing the role of dictator keep the whole sum.” In research by Robert Forsythe, et al, they found the average amount given, under these standard conditions, to be around 20% of the allocated money.

In any case, in the majority of these game trials, the dictator assigns the second player a non-zero amount.

The obvious reason for this is that the dictator is not simply trying to optimize according to a single monetary value – that a strict conception of rationality would posit – but is acting rationally to optimize according to a number of different value systems.

They want the money, yes, but they are also optimizing according to cultural and social capital that motivates them to act in accordance with some conception of fairness and it is out of the interaction of these different value systems that we get the empirical results.

What agents value can be simple or it can be complex.

A financial algorithm is a form of agent that acts according to some set of rules designed to create a financial profit; this is an example of a very simple value system.

In contrast, what a human being value is typically many things. People value social capital, that is to say, their relationships with other people and their roles within social groups. They care about cultural capital, how they perceive themselves and how others perceive them. They care about financial capital and natural capital. They often care about their natural environment to a greater or lesser extent.

Likewise, the set of instructions or rules can be based on some simple linear cause and effect model – what may be called an algorithm – or they may be much more complex models – what may be called a schema.

Thus when we say that someone is acting rationally and maximizing their value payoff, this can be in many different contexts. A person helps an old lady onto the bus, not because they are going to get paid for this, but what they do get from this is some sense of being a decent person and they gain some payoff in that sense.

Thus it is not the concept of rationality or that people try to optimize their payoff that needs to be revised. It is the narrow definition of rationality as optimizing according to a single metric that needs to be expanded within many contexts that involve social interaction.

The classical conception of strict rationality based upon a single metric will apply in certain circumstances. It will be relevant to many games in ecology, where creatures have a simple conception of value maximization.

Likewise, it will often be relevant to computer algorithms and software systems and sometimes relevant for socioeconomic interactions, or at least partially relevant.

As the influential biologist Maynard Smith, in the preface to the book Evolution and the Theory of Games, “paradoxically, it has turned out that game theory is more readily applied to biology than to the field of economic behavior for which it was originally designed.”

If we want an empirically accurate theory of games between more complex agents it will need to be expansive in its conception of value and rationality to include the more complex set of value systems and reasoning processes that are engendered in such games. We have spent quite a bit of time talking about this idea of rationality as it is a major unresolved flaw within standard game theory, one that is important to be aware of.

GAME STRATEGIES

Strategy is the choice of one’s actions.

In game theory, player’s strategy is any of the options they can choose in a setting where the outcome depends on the action of others. A strategy, in the practical sense, is then, a complete algorithm for playing the game, telling a player what to do for every possible situation throughout the game.

For example, the game might be a business entering a new market and trying to gain market share against other players. This will not just happen overnight but they will have to take a series of actions that are all coordinated towards their desired end result. They might first have to organize production processes and logistics, then advertising, then pricing etc. Each of these actions we would call a move in the game, and the overall strategy consists of a set of moves.

A player’s strategy set defines what strategies are available for them to play. For instance, in a single game of rock-paper-scissors, each player has the finite strategy set of rock, paper, scissors.

Likewise, a player’s strategy set can be infinite, for example in choosing how much to pay when making an offer to purchase an item in a process of bartering, this could be potentially infinite, it could be any increment.

PURE / MIXED STRATEGY

In some games, there will not be one primary strategy that an agent will always choose but in many circumstances, they may have a number of options and choose between them with some given probability. This will often be the case when they don’t want the other player to know in advance which move they will take.

For example, in smuggling goods across the Vietnam-Chinese border, the smugglers have many different points of entry available to them and the police have many different points that they could secure. In such a case neither side wants always to choose the same location, they want some degree of randomness in the strategy that they choose.

This gives us a distinction in games between those with strategies that one will always play and those that one will play only with a given probability. This distinction is captured in the terms mixed and pure strategy.

Pure strategies are ones which do not involve randomness and tell us what to do in every situation. A pure strategy provides a complete definition of how a player will play a game. In particular, it determines the move a player will make for any situation they may face.

Strategies that are not pure—that depend on an element of chance—are called “mixed strategies.” In mixed strategies, you have a number of different options and you ascribe a probability to the likelihood of playing them. As such we can think about a mixed strategy as a probability distribution over the actions players have available to them.

PAYOFFS

For every strategy taken within a game, there is a payoff associated with that strategy.

A player’s payoff defines how much they like the outcome of the game.

The payoffs for a particular player reflect what that player cares about, not what another player thinks they should care about. Payoffs must reflect the actual preferences of the players, not preferences anyone else ascribes to them.

Game theorists often describe payoffs in terms of utility — the general happiness a player gets from a given outcome. Payoffs can represent any type of value, but only the factors that are incorporated into the model. Thus we have to be careful in asking what do the agents really value.

Payoffs are then essentially numbers which represent the motivations of players. In general, the payoffs for different players cannot be directly compared, because they are to a certain extent subjective.

Payoffs may have numerical values associated with them or they may simply be a set of ranking preferences. If the payoff scale is only a ranking, the payoffs are called “ordinal payoffs.” For example, we might say that Kate likes apples more than oranges and oranges more than grapes.

However if the scale measures how much a player prefers one option to another, the payoffs are called “cardinal payoffs.” So if the game was simply one for money then we could ascribe a value to each payoff, that would be the quantity of money gained.

In many games all that matters is the ordinal payoffs, all we need to know is which options they prefer without actually knowing how much they prefer them. This is useful because in reality people don’t really go around ascribing specific values to how much they like things, but they do think about whether they prefer one thing or another. Kate may know that she likes apples more than oranges but she would probably laugh if you asked her to put values on how much more she likes them.

In the next section, we start to play some games, looking at how to solve games, how we find the best strategies and talk about the important idea of equilibrium.

February 22, 2019February 20, 2019

Game Theory 2: Types of Games

GAMES MODELS

As we talked about in the previous module, a game within game theory is any situation involving interdependency between adaptive agents.

Games are fundamentally different from decisions made in a context with only one adaptive agent. To illustrate this point, think of the difference between the decisions of a bricklayer and those of a business person. When the bricklayer decides how he might go about building a house he does not expect the bricks and mortar to fight back. We could call this a neutral environment.

But when the business person tries to enter a new market, they must take note of the actions and of the other actors in that market in order to best understand the viable options available to them.

A situation that depends only on the actions of one actor is best understood as one of decision theory not so much game theory.

Like the business person, all players engaged in a game must recognize their interaction with other intelligent and purposeful agents. Their own choices must allow both for conflict and for possibilities for cooperation. So a game really tries to capture this dynamic where autonomous agents that have their own goals are interdependent in effecting some joint outcome.

A game has three major components: players, strategies, and payoffs.

A player is a decision maker in a game.
A strategy is a specification of a decision for each possible situation in which a player may find themselves.
A payoff is a reward or loss that players experience when all the players follow their respective strategies.

GAME REPRESENTATION

Games are represented in either a matrix form or as a tree graph.

The matrix form models a game without time involved where players must choose their strategies simultaneously.
A tree graph model involves time as an element allowing for choices to be made in a sequential process over a course of time, thus forming a tree-like representation that captures the choices made by agents at each stage in the game.

The matrix model is the most common method for representing a game and is called in game theory normal-form representation. The normal-form representation to a game associates the players with the axes to the matrix, with each column or row along the axis corresponding to one unique strategy for the player.

Where the players’ different strategies interact in the matrix, a value is placed to represent the associated payoffs for each player if those given strategies are played.

In simultaneous games, the players don’t have to move at the same time. The only restriction is that no players can know the other players’ decisions when they make their own choice.

The normal form is a condensed form of the game, stripped of all features but the possible options of each player and their payoffs during one iteration of the game. This makes it more convenient to analyze.

A game where choices are made sequentially over time is represented as a decision tree graph that branches out with each iteration of the game as time goes forward and players have to make choices. An example of this extensive form of game would be chess where players move in a sequential process with each move of one player creating a multiplicity for possible new moves of the other as they branch out into the future.

Players engaged in a sequential game then have to look ahead and reason back as each player tries to figure out how the other players will respond to his current move, how he will respond in turn, and so on. The player anticipates where his initial decisions will ultimately lead and uses this information to calculate his current best choice.

INFORMATION

Agents within a game are making their choices based on the information available to them. Thus we can identify information as a second important factor in the makeup of the game.

In any given game agents can have complete information meaning each player has knowledge of the payoffs and possible strategies of other players, or incomplete information referring to situations in which the payoffs and strategies of other players are not completely known.

An example of a game of perfect information would be one that is called the ultimatum game where one player receives a sum of money and proposes how to divide the sum with the other player. The second player chooses to either accept or reject this proposal. If the second player accepts, the money is split according to the proposal. If the second player rejects, neither player receives any money. In this game, all information is available to all players.

In contrast many real world games involve imperfect information. For example, prisoner dilemma games only make sense if given imperfect information where you are choosing without knowing how the other has chosen.

Information plays an important role in real-world games and it can work as an advantage or disadvantage to the players. When one player knows something that others do not, sometimes the player will wish to conceal this. For example in playing poker, and at other times they will want to reveal it, for example, companies offering guarantees for their products is a display of the information that they have that their product is not going to break down soon, and they want customers to know this information.

SYMMETRY

This reveals also how games can be asymmetrical. Meaning the payoffs to individuals for the different possible actions may not be the same. If the identities of the players can be changed without changing the payoff to the strategies, then a game is symmetric.

Many of the commonly studied 2 × 2 games are symmetric.

Games of coordination are typically symmetrical. Take for example the case of people choosing the sides of the road upon which to drive. In a simplified example, assume that two drivers meet on a narrow dirt road. Both have to swerve in order to avoid a head-on collision. If both execute the same swerving maneuver they will manage to pass each other, but if they choose differing maneuvers they will collide. In the payoff matrix successful passing is represented by a payoff of 10, and a collision by a payoff of 0 and we can see how the payoff to each player are symmetrical.

ZERO-SUM / NON-ZERO-SUM

Games are played over some mutually desired resource, what we are defining as value within that game. For example, countries go to war over territory, businesses compete for market share, creatures for the resources within an ecosystem, political parties for decision making power, athletes for prizes and prestige etc.

In all of these situations, there is some shared conception of what agents value and some interdependence in how that value is distributed out depending on the actions of the agents.

But the question is whether the total value distributed out to all agents remains constant irrespective of their actions or can it grow or decrease depending on their capacity to cooperate.

Constant-sum games are games in which the sum of the players’ payoffs sum to the same number. These games are games of pure competition of the type “my gain is your loss”.

Zero-sum games are a special case of constant-sum games, in which choices by players can neither increase nor decrease the available resources. In zero-sum games, the total benefit to all players in the game, for every combination of strategies, always adds to zero.

One can see this in the game paper, rock, scissors or in most sporting events.

In zero-sum games, the relationship between the agents’ payoffs are negatively correlated, which is called negative interdependence, meaning individuals can only achieve their goal via the failure of another agent and this creates an attractor towards competition, there is no incentive to cooperate and thus these games are called strictly competitive as competition is always the best strategy.

Non-constant games or non-zero sum games are those in which the total value to be distributed can increase or decrease depending on the degree of cooperation between actors.

For example, through the members of a business working together they can create more value than working separately, thus the whole payoff gets bigger. Equally, the total payoff may get smaller through conflict, like in an arms race between two gangs in a city.

In non-zero sum games, the outcome for agents is positively correlated, if one gets more the other will too if one gets less the other will too. With non-zero sum games, we can get positive interdependence between the agents, meaning members of a group come to share common goals and perceive that working together is individually and collectively beneficial, and success depends on the participation of all the members leading to cooperation.

COOPERATIVE / NON-COOPERATIVE

A cooperative game is one in which there can be cooperation between the players and they have the same cost.

So cooperative games are an example of non-zero sum games. This is because in cooperative games, either every player wins or loses.

Cooperation may be achieved through a number of different possibilities. It may be built into the dynamics of the game as would be the case with a positive-sum game where payoffs are positively correlated. In such a case the innate structure of the game creates an attractor towards cooperation because it is both in the interest of the individuals and the whole organization.

A good example of this are the mutually beneficial gains from trade in goods and services between nations. If businesses or countries can find terms of trade in which both parties benefit then specialization and trade can lead to an overall improvement in the economic welfare of both countries, with both sides seeing it as in their interest to cooperate in this organization, because of the extra value that is being generated.

Equally, cooperation may be achieved by external enforcement by some authoritative third party such as governments and contract law. Where we cooperate in a transaction because the third party is ensuring that it is in our interests to do so by creating punishments or rewards.

Likewise, cooperation may be achieved through peer-to-peer interaction and feedback mechanisms as will talk about in future videos.

A non-cooperative game is one where an element of competition exists and there are limited mechanisms for creating institutions for cooperation. This may be because of the inherent nature of the game we are playing. That is to say, it is a zero-sum game which is strictly competitive and thus cooperation will add no value.

Noncooperation may be a function of isolation, lack of communication and interaction with which to build up the trust that enables cooperation.

We see this within modern societies, as these societies have grown in size they have transited from communal cooperative systems based on the frequent interaction of members to requiring formal third parties to ensure cooperation because of the anonymity and lack of interaction between members of large societies.

Lastly, there may simply be a lack of formal institutions to support cooperation between members. An example of this might be what we call a failed state where the government’s authority is insufficiently strong to impose sanctions and thus can not work as the supporting institutional framework for cooperation.

SUMMARY

In this video we have looked at some of the basic features to games, we talked about the two basic forms for representation, that of the normal form in a matrix model and that of the extensive form as a tree graph that unfolds over time.

We talked about the important role of information, where games may be defined as having imperfect or perfect information and how agents may use information to their advantage.

We talked about symmetrical and asymmetrical payoffs in games. We briefly looked at zero sum games and non-zero sum games where the payoffs can get larger given cooperation.

Finally we talked about the distinction between a cooperative and non-cooperative game and some of the factors that create these different types of games which we will be discussing further throughout the course.

February 21, 2019February 19, 2019

Game Theory 1: Game Theory Overview

GAME THEORY OVERVIEW

We live in a world that exhibits extraordinary levels of order and organization on all levels from the smallest molecules, to human social organizations to the entire universe. We might say that it is the job of the enterprise of science to try and understand this extraordinary order and organization that we see in the world around us. And in many ways, we have been very successful in the past few hundred years in making progress in this project. We understand the workings of the atom, the structure of DNA, we understand the origins of the universe, how galaxies form and the precise elliptical orbit of the Earth around the Sun.

But what all these systems that we have been so good at describing and predicting the behavior of have in common is that they are inert. That is, they do not have any degree of autonomous adaptive capacity.

Here we can make a fundamental distinction between those systems that are composed of inert elements and those that are composed of adaptive elements.

Because these inert systems that are studied in physics and chemistry do not have adaptive capacity we can describe them through a single global rule. We can write equations about how elements will react when combined or how the solar system will change over time according to a set of differential equations in a deterministic fashion.

Unfortunately, this approach does not work when dealing with systems that are composed of adaptive elements that are non-deterministic in their behavior.

Adaptation gives the elements in the system the capacity to respond in different ways depending on the local information they receive. And the overall organization that forms is in fact not a product of a global rule, like we might have for a chemical reaction. Instead, the result is a product of how these adaptive agents respond to each other.

With these adaptive systems, the overall makeup of the organization is not necessarily defined by a top-down rule, but may emerge out of how the elements adapt and respond to each other locally.

There is no algebraic or differential equation to describe how international politics works, why families fall apart. or the success of a business within a market. The overall workings of these adaptive systems is an emergent phenomenon of local rules and interdependencies.

GAME THEORY

And it is these systems composed of adaptive agents that are interdependent that game theory tries to understand and model.

A game is a system wherein adaptive agents are interdependent in affecting each other and the overall outcome.

Game theory is the mathematical modeling of such systems.

These adaptive systems are pervasive in our world, from cities and traffic to economies, financial markets, social networks, ecosystems, politics, and business.

The central ingredients of these systems is that of agents and interdependency. Without either of these elements, we don’t have a game.

If the elements did not have agency and the capacity for adaptation they would have no choices and we would have a deterministic system.

Likewise, if they were not interdependent then they would not form some combined organization and we would then study them in isolation in which case likewise we would not have a game.

AGENTS

Games are formed out of the interdependencies between adaptive agents.

So what is an adaptive agent? An agent is any entity that has what we call agency. Agency is the capacity to make choices based upon information and act upon those choices autonomously to affect the state of their environment.

Examples of agents include social agents, such as individual human beings, businesses, governments, etc. They may be biological agents such as bacteria, plants, or mammals. They may also be technologies such as robots or algorithms of various kind.

All adaptive systems regulate some process and they are designed to maintain and develop their structure and functioning. For example, plants process light and other nutrients and their adaptive capacity enables them to alter their state so as to intercept more of those resources. The same is true for bacteria and animals, as well as for a basketball team or a business. They all have some conception of value that represents whatever is the resource that they require, whether that is sunlight, fuel, food, money etc.

This creates what we can call a value system. That is to say, whatever structure or process they are trying to develop forms the basis for their conception of value and they use their agency to act and make choices in the world to improve their status with respect to whatever it is they value.

As we can see this concept of value is highly abstract. And as we will discuss in a future module this value system can be very simple or very complex but it forms the foundations to what we are dealing with when talking about adaptive agents and games.

You can’t model a game without understanding what the agents value and the better you understand what they really value and incorporate it into the model the better the model will be.

Thus agents can also be defined by what we call goal oriented behavior. They have some model as to what they value and they take actions to affect their environment in order to achieve more of whatever is defined as value.

GAMES

In game theory, a game is any context within which adaptive agents interact and in so doing become interdependent.

Interdependence means that the values associated with some property of one element become correlated with those of another. In this context, it means that the goal attainment of one agent becomes correlated with the others.

The value or payoff to one agent in the interaction is associated with that of the others.

This gives us a game. Wherein agents have a value system, they can make choices and take actions that affect others and the outcome of those interactions will have a certain payoff for all the agents involved.

A game then being a very abstract model can be applied to many circumstances of interest to researchers. And it has become a mainstream tool within the social sciences of economics, political science and sociology but also in biology and computer science.

The trade negotiations between two nations can be modeled as a game. The interaction of businesses within a market is a game. The different strategies adopted by creatures in an ecosystem can be seen as a game. The interaction between a seller and buyer as they haggle over the price of an item is a form of game. The provision of public goods and the formation of organizations can be seen as games. Likewise, the routing of internet traffic and the interaction between financial algorithms are games.

To quickly take a simple concrete example of a game, let’s think about the current situation with respect to international politics and climate change. In this game, we have all of the world’s countries and all countries will benefit from a stable climate. But it requires them to cooperate and all pay the price of reducing emissions in order to achieve this.

Although this cooperative outcome would be best for all, it is in fact in the interest of any nation state to defect on their commitments as then they would get the benefit of others reducing their pollution without having to pay the cost of reducing their own emissions.

Because in this game it is in the private interests of each to defect, in the absence of some overall coordination mechanism the best strategy for an agent to adopt given only their own cost-benefit analysis, is to defect and thus all will defect and we get the worst outcome for the overall system.

COOPERATION

This game is called the prisoner’s dilemma and it is the classical example given of a game, because it captures in very simplified terms the core dynamic, between cooperation and competition, that is at the heart of almost all situations of interdependence between adaptive agents.

In the interdependency between agents there comes to form two different levels to the system: the macro-level, wherein they are all combined and have to work cooperatively to achieve an overall successful outcome, and the micro-level, wherein we have individual agents pursuing their own agendas according to their own cost-benefit analysis.

It is precisely because the rules and dynamics that govern the whole and those that govern the parts are not aligned that we get this core constraint between cooperation and competition.

This is what is called the social dilemma and it can be stated very simply as what is rational for the individual is irrational for the whole.

If you do what is rational according to the rules of the macro-level to achieve cooperation then you will be operating in a way that is irrational to the rules of the micro-level and vice versa.

If either of these dimensions to the system was removed then we would not have this core constraint. If the agents were not interdependent within the whole organization then there would be no macro-level dynamic and the set of parts would be simply governed by the rules of the agents locally.

Equally, if each agent always acted in the interests of the whole without interest for their own cost-benefit analysis, then again we could do away with the rules governing the micro-level and we would simply have one set of rules governing the whole thus there would be no core dynamic of interest, things would be very simple and straightforward. The complexity arises out of the interaction between these two different rule sets and trying to resolve it by aligning the interests of the individuals with those of the whole.

SUMMARY

So we have outlined what game theory is, talking about it as the study of situations of interdependence between adaptive agents and how these interdependencies create the core dynamic of cooperation and competition that is of central interest to many. In the coming videos in this section, we will talk about the different elements involved in games and the different types of games we might encounter.

February 20, 2019February 19, 2019

Game Theory Course Introduction

PREFACE

As we watch the news each day, many of us ask ourselves why people can’t cooperate, work together for economic prosperity and security for all, against war? Why can’t we come together against the degradation of our environment?

But in strong contrast to this, the central question in the study of human evolution is why humans are so extraordinary cooperative as compared with many other creatures. In most primate groups, competition is the norm, but humans form vast complex systems of cooperation.

Humans live out their lives in societies, and the outcomes of those social systems and our individual lives is largely a function of the nature of our interaction with others. A central question of interest across the social sciences, economics, and management is this question of how people interact with each other and the structures of cooperation and conflict that emerge out of these.

Of course, social interaction is a very complex phenomenon. We see people form friendships, trading partners, romantic partnerships, business compete in markets, countries go to war, the list of types of interaction between actors is almost endless.

For thousands of years, we have searched for the answers to why humans cooperate or enter into conflict by looking at the nature of the individuals themselves. But there is another way of posing this question, where we look at the structure of the system wherein agents interact, and ask how does the innate structure of that system create the emergent outcomes?

The study of these systems is called game theory. Game theory is the formal study of situations of interdependence between adaptive agents and the dynamics of cooperation and competition that emerge out of this. These agents may be individual people, groups, social organizations, but they may also be biological creatures, they may be technologies.

The concepts of game theory provide a language to formulate, structure, analyze, and understand strategic interactions between agents of all kind.

Since its advent during the mid 20th-century, game theory has become a mainstream tool for researchers in many areas most notably, economics, management studies, psychology, political science, anthropology, computer science and biology. However, the limitations of classical game theory that developed during the mid 20th century are today well known.

Thus, in this course, we will introduce you to the basics of classical game theory while making explicit the limitations of such models. We will build upon this basic understanding by then introducing you to new developments within the field, such as evolutionary game theory and network game theory that try to expand this core framework.

In the first section, we will take an overview of Game Theory. We will introduce you to the models for representing games, the different elements involved in a game and the various factors that affect the nature and structure of a game being played.

In the second section, we look at Non-cooperative Games. Here you will be introduced to the classical tools of game theory used for studying competitive strategic interaction based around the idea of Nash equilibrium.

We will illustrate the dynamics of such interactions and various formal rules for solving non-cooperative games.

In the third section, we turn our attention to the theme of Cooperation.

We start out with a general discourse on the nature of social cooperation before going on to explore these ideas within a number of popular models, such as the social dilemma, tragedy of the commons, and public goods games. Finally, talking about ways for solving social dilemmas through enabling cooperative structures.

The last section of the course deals with how games play out over time as we look at Evolutionary Game Theory. Here we talk about how game theory has been generalized to whole populations of agents interacting over time through an evolutionary process, to create a constantly changing dynamic as structures of cooperation rise and fall.

Finally, in this section we will talk about the new area of Network Game Theory, that helps to model how games take place within some context that can be understood as a network of interdependencies.

This book is a gentle introduction to game theory and it should be accessible to all. Unlike a more traditional course in game theory, the aim of this book will not be on the formalities of classical game theory and solving for Nash equilibrium, but instead using this modeling framework as a tool for reasoning about the real world dynamics of cooperation and competition.