● Decision Support Systems · Lecture 3

When everyone is deciding

In Lectures 1–2 a single agent faced an indifferent environment. Now the environment is other rational agents, each chasing their own goal — and your best move depends on theirs. Welcome to game theory.

Prof. Andrea Campagner~28 minthis is where your exam example lives

🎯 Why this lecture matters most for the exam

Your sample exam (the Esempi sheet) asks you to define a Nash equilibrium and apply it to the Alice & Bob number game. That whole question is built from this lecture. We'll teach every idea in the professor's order, then dismantle that exact question in the Exam Lab. Use 🌙 for dark mode.

1the setup

What is a game?

Same machinery as decision theory — but we replace the faceless environment with a roomful of self-interested, rational agents.

🎙 the one change

Basically we refine the environment: instead of an abstract environment, we have a collection of self-interested and rational agents.

Definition — a game

A game is a set of agents O = {o₁,…,oₘ} where each agent oᵢ has a set of actions Aᵢ (its pure strategies), plus a utility function that gives every agent a payoff for each combination of actions.

U : A₁ × … × Aₘ → ℝᵐU(ā)ᵢ = the utility agent oᵢ gets when everyone plays the action profile ā

Agents needn't commit to one action. A strategy lets them randomise:

Pure vs mixed strategy

A pure strategy = commit to one action. A (mixed) strategy sᵢ = a probability distribution over Aᵢ — flip a weighted coin over your actions. A strategy profile s̄ = (s₁,…,sₘ) picks one strategy per agent.

Writing it down: normal form

Just like decision matrices, we lay a (2-player) game in a table — its normal form. Rows = Agent 1's actions, columns = Agent 2's actions, and each cell holds both payoffs: Agent 1, Agent 2.

iWhat normal form hides

Normal form flattens the structure: in chess one "action" would be an entire game of moves. When that structure matters you'd use an extensive representation instead — but for this lecture, the table is enough.


2four games to know

A little zoo of games

Three famous 2×2 games cover the whole spectrum from pure cooperation to pure competition.

Driving game — pure coordination

1 ↓ / 2 →LeftRight
Left1, 1−1, −1
Right−1, −11, 1

Two drivers approach each other; they're both happy if they pick the same side. A pure coordination game: every agent gets the same payoff (U(ā)ᵢ = U(ā)ⱼ), so there's no reason to compete — coordination helps everyone.

Matching Pennies — zero-sum

1 ↓ / 2 →HeadsTails
Heads1, −1−1, 1
Tails−1, 11, −1

Player 1 wants to match, Player 2 wants to mismatch. A zero-sum game: two agents, and in every cell the payoffs sum to zero — one wins exactly what the other loses. Purely competitive. Remember this game — your exam question is a 99-action version of it.

🎙 zero-sum is everywhere

Chess, checkers — pretty much all table games are zero-sum games. And all sports are typically zero-sum: one team wins, the other loses.

Prisoner's Dilemma — a bit of both

1 ↓ / 2 →CooperateDefect
Cooperate−1, −1−4, 0
Defect0, −4−3, −3

Two suspects are interrogated separately. "Cooperate" = confess with your partner; "Defect" = blame them. The most realistic kind of game — it mixes cooperation and competition. (The numbers are arbitrary; only their order matters.) We'll return to its unsettling solution in §4.

◇ Check yourself
In every cell of a game the two payoffs add up to 0. What kind of game is it?
Payoffs summing to zero in every cell = a zero-sum (purely competitive) game. Coordination games have equal payoffs; the prisoner's dilemma is neither.

3solution concepts · part 1

Analysing a game: Pareto

In decision theory we hunted for the optimal action. In a game there's a twist.

🎙 the twist that changes everything

The optimal strategy of an agent depends also on what the other agents do.

Because of that, there's no single "best." Instead we study solution concepts — sets of outcomes that are interesting in some defined sense. The first takes an outside observer's view.

Pareto dominance & optimality

Profile Pareto dominates if it's at least as good for every agent and strictly better for at least one. is Pareto optimal if nothing Pareto-dominates it. (It's exactly decision theory's dominance, lifted to many agents.)

🎙 who was Pareto?

Here "Pareto" is the name of a person — an Italian mathematician and social scientist; the name of this relationship derives from his.

!Why it's weak (just like dominance)

In a zero-sum game every profile is Pareto optimal — one agent's gain is the other's loss, so no profile beats another for both. So Pareto optimality often can't single anything out. We need a concept that takes the agent's own view.


4solution concepts · part 2 · the big one

Nash equilibrium

The most important solution concept in all of game theory — and the one your exam asks you to define.

Start from one agent's point of view. Suppose you knew everyone else's strategies (write them s̄₋ᵢ). Then your problem collapses to ordinary decision-under-risk: just pick what maximises your utility.

Best response

Given the others' fixed strategies s̄₋ᵢ, a best response is a strategy that maximises your own payoff: sᵢ* ∈ argmaxsᵢ U(sᵢ, s̄₋ᵢ)ᵢ.

Nash equilibrium

A strategy profile is a Nash equilibrium when every agent is simultaneously playing a best response to everyone else. Strict if each best response is the unique best; weak if some agent is tied between strategies.

🎙 why it's called stability

If I select the best response to the others, then I have no reason for changing my decision — as long as the other agents do not change theirs. No incentive to deviate unilaterally.

See it live 🎮

Below is a 2×2 game. Pick a classic, or edit any payoff. Each agent's best responses light up in their colour (P1 blue in each column, P2 magenta in each row); a cell where both are best-responding is a pure Nash equilibrium.

🎮 Game Explorer

A pure Nash equilibrium = a cell highlighted in both colours at once.

P1 ↓ / P2 →LeftRight
Left
,NE
,NE
Right
,NE
,NE
P1 best response P2 best response Nash equilibrium

Two examples worth memorising

Driving game (try it above): the two Nash equilibria — (Left,Left) and (Right,Right) — exactly coincide with the Pareto-optimal outcomes. Everyone agreeing on a side is stable and good.

!The Prisoner's Dilemma — the famous tragedy

Its unique Nash equilibrium is (Defect, Defect) → payoff (−3, −3). But that's Pareto-dominated by (Cooperate, Cooperate)(−1, −1)! Rational self-interest drives both to an outcome that's worse for everyone. (Switch the Explorer to "Prisoner's Dilemma" and watch only one cell light up green.)

◇ Check yourself
A strategy profile is a Nash equilibrium when…
Nash = mutual best responses: nobody can do better by unilaterally deviating. Maximising the total isn't required (see the prisoner's dilemma); "better for at least one" describes Pareto, not Nash.

5existence & mixed strategies

Do they even exist?

Four natural questions: do Nash equilibria always exist, how many are there, why are they interesting, and how do we compute them?

Nash's Theorem (existence)

In any game with finitely many agents, each with finitely many actions, at least one Nash equilibrium exists. (The proof uses Brouwer's fixed-point theorem — you won't be asked to prove it.)

But wait — try Matching Pennies in the Explorer: no cell lights up green. There's no Nash equilibrium in pure strategies! How does that square with the theorem?

The answer: go mixed

The theorem promises an equilibrium — just not necessarily a pure one. Matching Pennies has a unique mixed equilibrium: both players play (½ Heads, ½ Tails). Randomising 50/50 makes you unpredictable, so the opponent can't exploit you — neither can do better by deviating.

s̄ = ( (½ Heads, ½ Tails), (½ Heads, ½ Tails) )the unique Nash equilibrium of Matching Pennies

How many?

Not unique in general. The slides note that once you have two equilibria and , you can interpolate between them:

p·s̄ + (1−p)·t̄   is an equilibrium for every p ∈ [0, 1]→ two equilibria imply infinitely many

6meaning & computation

Why care, and can we compute it?

Beyond "stability," Nash equilibria have a beautiful interpretation in two-player zero-sum games — and that's also where they're easy to compute.

Recall maximin from Lecture 1 — the pessimist who maximises their worst case. Lift it to games:

sᵢmaximin = argmaxsᵢ  mins₋ᵢ  U(sᵢ, s₋ᵢ)ᵢplay as if the others will do their worst to you
The Minimax Theorem (von Neumann)

In any finite two-player zero-sum game, Nash equilibria coincide with the minimax = maximin strategies. Each player's maximin value equals their minimax value — that common number is the value of the game. So a Nash equilibrium is what you get when both players play as cautious pessimists.

Computation

Case · tap+

Zero-sum → easy

The minimax theorem turns it into a linear program. So Nash equilibria of zero-sum games are computable efficiently — the problem is in P.

Case · tap+

General → hard

For general games it becomes a non-linear program. Finding a Nash equilibrium is believed intractable: it's PPAD-complete.

!Exam-ready one-liner

Zero-sum: Nash equilibria reduce to linear programming → in P (efficient). General games: believed not efficiently solvable → PPAD-complete. In practice you use solvers that work well on real instances (see Shoham & Leyton-Brown).

◇ Check yourself
Computing a Nash equilibrium is efficient (in P) for which games?
The minimax theorem turns two-player zero-sum games into a linear program → P. General games are believed intractable (PPAD-complete) — existence (Nash's theorem) doesn't imply easy computation.

7a friendlier equilibrium

Correlated equilibria

A broader notion that's both more cooperative and easier to compute than Nash.

The idea

Add a trusted coordination mechanism — a shared random signal with joint distribution π — and let each agent's strategy σᵢ depend on the private signal it sees. It's a correlated equilibrium if no agent can do better by ignoring its recommendation. Every Nash equilibrium is a correlated equilibrium, but not the reverse.

Battle of the Sexes: a couple prefer being together but disagree on where — Ballet or Fight.

1 ↓ / 2 →BalletFight
Ballet2, 10, 0
Fight0, 01, 2

Its mixed Nash equilibrium pays each only 2/3. But add a fair coin both can see: "Heads → both Fight, Tails → both Ballet." Now they always end up together, for an expected payoff of (2 + 1)/2 = 1.5 each — far better than 2/3. The shared coin let them coordinate.

And it's cheap to compute

Unlike Nash, correlated equilibria can be found via linear programming. Questions that are hard for Nash (uniqueness, Pareto-optimality, guaranteed payoff) are all in P for correlated equilibria.


study like the exam

The Exam Lab

Lecture 3 is the heart of the DSS exam. The first question below is your actual sample exam — fully worked.

📋 The answer skeleton (and the rules)

① Define precisely in your own words · ② Apply to his scenario · ③ Decide & motivate. Written exam mandatory (need ≥ 18); optional essay for up to +4; no theorem proofs required — state & use them.

★ Your sample exam · Nash equilibrium

Define, in your own words but as precisely as possible, the notion of a Nash equilibrium. Then: Alice and Bob each bet on a natural number between 1 and 99. Alice wins if the two numbers coincide; Bob wins if they differ. Alice claims there is no Nash equilibrium; Bob claims Alice is wrong. Who is correct? Motivate.

① DEFINEA Nash equilibrium is a strategy profile — one strategy (pure or mixed) per player — in which each player's strategy is a best response to the others'. Equivalently: no player can increase their utility by unilaterally changing their strategy.
② APPLYThis is a finite two-player game (2 players, 99 actions each) — essentially Matching Pennies scaled to 99: Alice wants to match, Bob to mismatch (zero-sum). Two facts:
• By Nash's theorem (finite players + finite actions) → at least one equilibrium exists. • There is no pure equilibrium: for any fixed pair of numbers the loser can always deviate — if they coincide Bob switches to differ; if they differ Alice switches to match. No pure profile is stable.
③ DECIDEBob is correct that a Nash equilibrium exists. Alice is only partially right: there is no pure equilibrium, but the unique equilibrium is in mixed strategies — each player picks a number uniformly at random from 1…99. Motivate: if Alice is uniform, every Bob choice is equally (un)likely to match, so he can't improve by deviating — and symmetrically for Alice. Neither has an incentive to change → it's an equilibrium.
Question · Game types & Pareto

Define a zero-sum game and a pure coordination game. Classify Matching Pennies and the Driving game. What does Pareto optimality tell you in a zero-sum game, and why? Motivate.

① DEFINEA zero-sum game has two agents and, in every action profile, the payoffs sum to 0 (purely competitive). A pure coordination game has all agents receiving the same payoff in every profile (purely cooperative).
② APPLYMatching Pennies is zero-sum (each cell sums to 0). The Driving game is a pure coordination game (both get 1, or both get −1).
③ MOTIVATEIn a zero-sum game every strategy profile is Pareto optimal: any gain for one agent is an exact loss for the other, so no profile is better for both. Hence Pareto optimality is uninformative there — much like dominance in decision theory.
Question · The dilemma

Define best response and Nash equilibrium (strict vs weak). For the Prisoner's Dilemma — (C,C)=−1,−1; (C,D)=−4,0; (D,C)=0,−4; (D,D)=−3,−3 — find the Nash equilibrium and explain why it is paradoxical. Motivate.

① DEFINEA best response to the others' fixed strategies maximises your own utility. A Nash equilibrium is a profile where every player plays a best response; strict if each is the unique best response, weak if some player is tied.
② APPLYWhatever the opponent does, Defect beats Cooperate (0 > −1 and −3 > −4). So Defect is a dominant best response for both → the unique Nash equilibrium is (Defect, Defect) = (−3, −3).
③ MOTIVATEIt is paradoxical because (D,D) is Pareto-dominated by (Cooperate, Cooperate) = (−1, −1): both players would be better off cooperating, yet rational self-interest pushes each to defect. Stability (no incentive to deviate) does not imply efficiency.
Question · Existence & computation

State Nash's existence theorem and the Minimax theorem. What is the computational complexity of finding a Nash equilibrium in (a) two-player zero-sum games and (b) general games? Discuss.

① STATENash's theorem: every game with finitely many agents and finitely many actions has at least one Nash equilibrium (possibly mixed). Minimax theorem: in any finite two-player zero-sum game, Nash equilibria coincide with the maximin = minimax strategies, whose common value is the value of the game.
② COMPLEXITY(a) Zero-sum: the minimax theorem reduces it to linear programming → solvable efficiently, in P. (b) General games: a non-linear problem, believed intractable — PPAD-complete.
③ DISCUSSSo existence (guaranteed by Nash) does not imply efficient computation: only the zero-sum case is known to be easy. Practical solvers handle real instances despite the worst-case hardness.
Question · Correlated equilibria

Define a correlated equilibrium and explain how, in the Battle of the Sexes, a shared fair coin beats the mixed-Nash payoff. How do Nash and correlated equilibria relate? Motivate.

① DEFINEA correlated equilibrium adds a shared coordination mechanism (a joint distribution π over signals) plus strategies σ that map each agent's signal to an action, such that no agent can do better by deviating from its recommended action.
② APPLYBattle of the Sexes' mixed Nash pays each 2/3. With a fair coin both observe — "Heads → both Fight, Tails → both Ballet" — they always coordinate, for expected payoff (2+1)/2 = 1.5 each, well above 2/3.
③ MOTIVATEEvery Nash equilibrium is a correlated equilibrium, but not vice versa — the extra shared signal enlarges what's achievable. Bonus: correlated equilibria are computable by linear programming (in P), unlike Nash in general.
🗣 Say these out loud (cover the page)

• Define a Nash equilibrium in one precise sentence.
• Why does Matching Pennies have no pure equilibrium — and what is its mixed one?
• Prisoner's dilemma: what's the equilibrium, and why is it a "tragedy"?
• Zero-sum vs coordination vs prisoner's dilemma — one line each.
• Zero-sum vs general games: which is in P, which is PPAD-complete?
• How does a correlated equilibrium let players beat their mixed-Nash payoff?