Winning a “Beauty Contest,” Or How My Professor Gambled $250 Teaching a Lesson in Game Theory

Using game theory is like playing high stakes poker. You can predict the right moves, but you can still get burned by surprises. Game theorists often gamble more than just academic integrity and pride. In one very memorable lecture, my college professor staked $250 to teach a lesson about crowd behavior.

The lecture began innocently enough. We were going to play a simple game. Here are its rules:

1. Everyone secretly submits a whole number from 0 to 20.

2. All entries will be collected, and the guesses will be averaged together.

3. The winning number will be chosen as two-thirds of the average, rounded to the closest number. For instance, if the average of all entries was 3, then the winning number would be chosen as 2. Or if the average was 4, the winning number would be 3 (rounded from 2.6666…).

4. Entries closest to the winning number get a prize of meeting with the professor over a $5 smoothie. (In the academic version of the game, multiple winners split the prize, but my professor was being generous).

Before you read on, I would like you to seriously consider what number you might pick.

Imagine you are sitting in a Stanford lecture hall and actually playing this game. You seek the glory of outsmarting 49 other Stanford kids, and you really want to meet with the professor since you find game theory fascinating. You have 10 seconds to decide before ballots are collected. Which number would you pick?

Please write down your answer on a piece of paper before reading on.

Some Guiding Logic

The game is called a “p-beauty contest.” The “p” refers to the proportion the average is multiplied by–in this case, p is two-thirds. If you’re wondering, the game has the same flavor for any value of p less than 1. Why is it called a beauty contest? It’s because the game is the numbers-analog to a beauty contest developed by John Maynard Keynes.

Here is the beauty contest that Keynes pondered. Imagine that a newspaper runs a contest to determine the prettiest face in town. Readers can vote for the prettiest face, and the face with the most votes will be the winner. Readers voting for the prettiest face will be entered in a raffle for a big prize.

How does the game play out? Keynes wanted to point out the group dynamics. The naive strategy would be to pick the face you found to be the most attractive. A better would be to picking the face that you think other people will find attractive.

The number “beauty contest” has the same kind of logic. You don’t pick a number you like. You pick a number that’s closest to two-thirds of the average of everyone else. The twist of both games is that your guess affects the average outcome. And each person is trying to outsmart everyone else.

Given the subtlety of the game, my professor was banking on paying out to only a few winners. Although it was mathematically possible for each of us to win, and he was taking that risk. In fact, he knew that if we were all rational, we would all win. He would have to pay out a $5 smoothie to 50 students–that is, he made a $250 gamble playing this game.

Why was he so confident? Let’s explore the solution to the game and see why it’s hard to be rational.

Numbers You Shouldn’t Pick

Even though it’s not possible to know what other people are guessing, this game has a solution. If everyone acts rationally, there are only two possible winning numbers. It takes some crafty thinking, but it is really based on two principles I think you will accept.

Principle 1: Don’t Play Stupid Strategies (Eliminate Dominated Strategies)

The first principle is that players should avoid writing down numbers that could never win. That sounds logical enough, but it’s not always the case. We all can agree that writing a number that could never win is just a dumb, stupid strategy. You are picking an option that’s inferior to something else, and hence is known as a dominated strategy. (For an example, see this article about a dominated strategy in real life).

Are there any dominated strategies in the beauty contest?

To start answering that question, we need to figure out which numbers will never win. A natural question is what is the highest winning number? You would never want to pick a number larger than that, unless you want to lose.

You know that the highest number anyone can pick is the number 20. If every single person picked 20, then the average would be 20. The winning number would be two-thirds of 20, which is 13 when rounded.

Should you ever find yourself submitting 20?

The answer is no–there is always a better choice, say the number 19. The only time 20 wins is precisely when everyone else picks it and everyone shares the prize. In that case, you would be better off writing 19 to win the prize unshared. Plus, by writing 19, you can possibly win in other cases, like when everyone picks 19. You are always better off writing 19 than 20. The guess of 20 is dominated–it’s dumb.

You should never choose 20. And your rational opponents should be thinking the same way. So here’s the big result: you can reason that no player ever chooses 20.

Principle 2: Trim the Game, and Apply Principle 1 Again (Iterate the Elimination Process)

By principle 1, no player will ever choose 20. Therefore, you can essentially remove 20 as a choice. The game trims to a smaller beauty contest in which everyone is picking a number between 0 and 19. The smaller game has survived one round of principle 1.

Now, repeat! Ask yourself: in the reduced game, are there any dominated strategies?

Now 19 takes the role of 20 from the last analysis. Since 19 is the highest possible average, it will never be a good idea to guess it. Applying principle 1, you can reason that 18 is always a better choice than 19. Thus, 19 is dominated and should be eliminated as a choice for every player.

The game is now trimmed to picking numbers from 0 to 18. This is the result of two iterations of principle 1.

There’s no reason to stop now. You can iterate principle 1 to successively eliminate choices of 18, 17, 16, and so on. The only numbers remaining will be 0 and 1. (This requires 19 iterations of principle 1.)

There is a name for this thought process. It’s aptly named, but a mouthful: iterated elimination of dominated strategies (IEDS). The idea is to eliminate bad moves, trim the game, and iterate the process to find the surviving moves.

These remaining strategies are considered to be rationalizable moves, that is, moves that can possibly win.

Here’s the schematic for the IEDS:

iterated_elimination_dominated_strategies.jpg

The Equilibriums

The only reasonable choices are to pick the numbers 0 and 1. Is either a better choice? This is unfortunately where IEDS cannot give insight.

It’s possible to have 0 as a winning number–imagine all players picked 0. (The winning number will be 0).

It is also possible to have 1 as a winning number–imagine all players picked 1. (The winning number is 2/3, which rounds to 1.)

The answer will depend on what people think others will be guessing. Both equilibriums–all 0 and all 1–are achievable.

Back to the Classroom

None of us in the class had this deep understanding of IEDS. We were just learning game theory–it was actually our third lecture. If you’ve read one or two of my articles, then you know more about game theory than what we knew when we faced the game. My professor was pretty sure our guesses would be all over the place.

But Stanford kids can be crafty. One student used some sharp thinking and realized that coordination would help; he asked if we could talk to each other. The professor, still feeling we were novices, confidently replied with a smile, “Sure. Go ahead.” We only had 10 seconds to write down our answers anyway.

Before the professor could change his mind, the student quickly shouted to all of us, “If we all write down 0, we all win.”

It was remarkable. He figured out the equilibrium and told us what to do! He couldn’t be tricking us because the math was clear: if we all picked 0, we would all have winning numbers.

My professor’s face seemed to drop. That’s $250 on the line. (He never let future years talk).

How Smart Are Stanford Kids?

The professor was relieved after he tallied the votes. He told us that admirably most of us wrote down the number 0 (I was among those who did). But there were larger answers too, ranging from 1 to 10.

Someone actually wrote down 10! And this was after being told the answer.

After all was said and done, the winning number turned out to be 2, and the prize was awarded to three students. Thanks to our irrationality, my professor only paid out $15.

It was even better. My professor grilled the students who wrote down larger numbers. They all squirmed, as he was physically intimidating, and explained reasons like “it was my lucky number” or “I don’t know. I wasn’t really thinking.”

The Practical Lesson

What is going on? This is a group of smart students that was told the answer to the game.

The example illustrates a flaw of IEDS. It can get you reasonable answers if you think players are reasoning out further and further in nested logic. We don’t have infinite rationality, only bounded rationality.

The practical answer to what you should write depends on the book answer plus your subjective beliefs about what other people do. It’s the combination of book smarts plus social smarts that matters.

The people who wrote down the winning numbers told the class they suspected some people would deviate for irrational reasons. And they were rewarded for not confusing theory and practice.

Additional reading

The stock market is another example of a beauty contest–prices depend on fundamentals and investor psychology. For one example, check out this article on FDA fast-tracking and the stock market.



Share this post:

| More

Previous post:

Next post:



  • http://bizop.ca michael webster

    Presh, although everyone write “0″ or “1″ are the obvious, to a strategically minded crowd, points, I doubt that these solutions are the result of eliminating dominated solutions.

    Suppose that you were right, and the people had a pretty good understanding of how to eliminate dominated solutions – not perfect, just pretty good.

    In larger crowds, we might expect a few strategically challenged to upset the apple cart.

    But, if you are right, then in a two person beauty game, we should get the players figuring out the solution.

    Now, consider the traveler’s dilemma game. Two people are offered between $180 and $300, depending on what number they choose. The lower number is the winning number, and gets paid its bid, N + P, and the loser, or high bid, gets N – P.

    Simple elimination would reduce the players to bidding $180, and indeed this does happen if P is high enough. But with low P’s, or penalties, you get a range of bids between 295 -300.

    Two interesting papers to follow up on this and similar problems are:

    Ariel Rubenstein’s Instinctive and Cognitive Reasoning, here:
    http://arielrubinstein.tau.ac.il/papers/Response.pdf

    Goeree and Holt’s Ten Little Treasures of Game Theory, here:
    http://people.virginia.edu/~cah2k/treasure.pdf

  • http://bizop.ca michael webster

    Actually, I should modify the above post.

    In the p-beauty game, any number could be a focal point or Schellng point. For example, if we all picked 10, then we would all be winners.

    There is no rational incentive for anyone to pick 2/3 of 10 because this is a game with many winners.

    So if any number will do, how do we get the group to focus?

    One person yell. “How many people want a smoothie?” Presumably all the hands go up.

    Now yell, “Are we number one, are we number one … Then write that number down, now.”

    In your example, the people who picked 2 were likely no more rational than the people who picked their favourite number. But they did win, and winning unfortunately counts for more than being right.

  • http://www.mawsoft.com/blog RohoMech

    Question, can I use IEDS to talk my professor out of giving a pop-quiz in a pre-defined time period?

  • http://bizop.ca michael webster

    @RohoMech;

    Sure you can, but you might be surprised by the result!

  • http://www.mindyourdecisions.com/blog Presh Talwalkar

    Michael Webster: Per your first comment, I think we’re on the same page. The theory of IEDS says 0 or 1 are rational, but in practice that’s not what people do. And it’s because we have different beliefs and focal points. And those papers are very entertaining and useful–thanks for the links.

    Per the second comment, it is better to win unshared. Imagine if the Nobel paid out the same money to every winner, there would still be more glory to win alone. Plus, in the real game the grand prize is shared, like the Nobel prize money, so you do want to win alone.

    There is a small semantic difference. What’s “rational” in real life is different from “rational” in theory land. This means what people do can be seen as rational, even if it diverges from the rational model. The theory says 0 or 1 is the result of deleting strategies, but in practice we should know it will not be observed. People think differently, play different games, and use heuristics.

    And that’s a distinction I try to emphasize of late–don’t confuse theory with practice!

    RohoMech Haha. You’re referring to my article on surprize quizzes:

    http://mindyourdecisions.com/blog/2008/01/22/game-theory-tuesdays-i-want-to-surprise-you/

  • http://bizop.ca michael webster

    @ Presh

    1. Actually I don’t think 0 or 1 are the only undominated choices. I am in the minority on this. But consider three players choosing between 1 and 10. Assume they each choose 6, the V(6,6,6) for each player is $. And while it is true that, for example, V(4,6,6) is $ for 1 and 0 for 2 and 3, that doesn’t give 1 any reason to switch from 6 to 4: he is indifferent.

    2. The conditions of the beauty game are not the same as the Nobel Prize: each party can win $.

    3. The better way to frame your thesis, something akin to Church’s Thesis is this: all reasonable behaviour is the maximizing behaviour.
    We don’t have “rational’ doing double duty. I actually subscribe to this thesis: but I have to then generalize on the existent models of maximization.

  • http://www.mindyourdecisions.com/blog Presh Talwalkar

    Michael Webster: Hmm…I still think this a matter of definitions for the most part, so I’ll give one last clarification, after which you can respond and have the last word ;)

    1. There are two things going on here. In the formal p-beauty contest, the prize is split –imagine it was $5 shared among winners–so it is better to lower your bid. Even in the classroom game I played, one email I got pointed out the time with the professor is split.

    Now for the other point. Yes, 0 and 1 are not “strictly” undominated as higher numbers can win. But the higher numbers are never a best response–they are “weakly” dominated. The theory does not predict what people do, as the papers you linked to point out.

    2. As stated in 1, the formal p-beauty and Nobel have prizes split.

    3. Here’s my own take, as I do think rationality is serving double duty. I’ll use a math example to illustrate.

    Circles are defined as all the points of equal distance from a given point (the center). We can find its area, circumference, etc.

    Do circles actually exist? No. Man-made circular objects are approximations, but the theory is close enough to reality that we ignore the error.

    Rationality has a precise definition in economic theory. It’s a prescription for behavior with certain mathematical properties.

    Does the theory hold true? No. Man-made maximizing behavior is not well approximated by the theory of rationality.

    “Circles” and “Rationality” serve double duty of theory/reality. It’s harmless to confuse that for circles but I think very dangerous for rationality.

    Hope that clarifies.

    Thanks for the interesting discussion. You keep me on my toes :)

  • http://bizop.ca michael webster

    I will follow up on the last point.

    There are a number of maximizing models which are different: the number of interesting but different utility models. My own PhD showed how to construct preference relations from choice functions that many thought showed irrational or non maximizing behaviour. And I remain convinced that all reasonable behaviour can be modeled as maximizing, as long as we are suitably catholic about our choice functions.

    Some behaviour is just unreasonable, but much behaviour which seems to be not maximizing probably is with the correct derivation of a preference relation from the underlying choice.

    Or so I contend.

    Yes, I agree that there are two beauty games. But, the remarks were directed only to the game your professor was playing which you note in 4. did not require a unique winner.

  • http://bizop.ca michael webster

    Let me make more explicit my last point in the context of a similar game, the Traveller’s dilemma – slightly modified.

    The game is this. We each pick a number between 180 and 300. Low number wins his guess plus penalty p. High number wins low number’s guess minus penalty p.

    Not a lot of work to show that 180 is the result of the IED algo.

    But many people, in around 75% in Rubinstein’s study pick in the range of 295 -300.

    There are two explanations for this. The first, the stupid explanation. People are just too dumb to work out a long “if I do, he will do, so I will do …” claims and it is just plain luck that they gave up unravelling the tree. There is an accidental coordination somewhere in the interval [295 -300] because people gave up thinking in frustration. They both reasoned back from 300, gave up the problem as too hard, and randomly choose a number between 295 and 300.

    The other explanation is this. There is a scale or geography effect. From up high the value of the outcomes in the interval 295 -300 look roughly the same, while down lower at a birds eye view we can make or distinguish between better or worse options. (It is known that this scale effect is not permitted by the VM calculus. And this scale effect would violated what is called in choice theory property beta. Beta is necessary if the derived preference relation, the normalization of the choice function, is to have an indifference relation that is transitive. I am unaware of a general theory of preferences that accounts for scale effects.)

    There is probably something to both stories – but the second story casts serious doubt on the IED algo as being a necessary condition of prescriptive rationality.

  • Mahesh

    I am reading this post wondering how will it impact elections if all we had to do was just go to a web site and vote. In the current voting scheme, people who are indifferent just don’t go due to the effort involved. Now given these people can vote, how will they skew elections? Maybe we ought to be skeptical of electronic voting machines and stick with the secret ballot. :)

  • http://www.mindyourdecisions.com/blog Presh Talwalkar

    Mahesh: Actually, some evidence shows that convenient voting, if private, has less turnout than the current voting scheme!

    The answer is on page 2 of this is very interesting article:

    http://www.nytimes.com/2005/11/06/magazine/06freak.html

    Let’s get internet voting set up right away…

  • Ankit

    I know that this is too late, but the p-beauty problem got me thinking.

    If p is, say, 100, and I pick 300, I am guaranteed a pay-off of 200. Shouldn’t I pick 300 instead of 180?

  • http://www.mindyourdecisions.com/blog/ Presh Talwalkar

    Ankit:
    I think you’re referring to the Traveler’s dilemma…Iteratively you can deduce the lower you pick the better. If P=100, and your partner picks 300, then you want to undercut to 299 and end up with 399 (N+P).

    The point is that while 300 is guaranteed a payoff of 200, you can do even better by picking a lower number.

  • halisha hunter

    i just want to know , are beauty contest a good idea ?

  • Scott

    I think the big question is:

    What answer did you submit?

  • http://www.mindyourdecisions.com/blog/ Presh Talwalkar

    Scott:
    I was one of the people who wrote 0. At the time I was annoyed that other people didn’t also cooperate. Now, I just accept it and makes sure that my analysis accounts for some random irrationality.

  • http://bizop.ca michael webster

    Al Roth has some more academic background on this game.
    http://marketdesigner.blogspot.com/2009/09/nagels-guessingbeauty-contest-game.html

  • Pingback: Game theory in Numb3rs: hide and seek - Mind Your Decisions

  • scooter

    Perhaps the students who did not choose 0 did so just to annoy the “rational” students. I know I would have!

  • Pingback: Patterns In The Void › Game Theory & Anarchism II: How information can smash the state

  • Pingback: Math puzzle: chances of meeting up with a friend (and game theory puzzle extension) - Mind Your Decisions

  • Shashank

    I believe between 1 and 0, 0 would be the more rational choice and should be the dominating strategy.

    The group size in this example was 50, assuming all rational members who all are using game theory, the member can only pick between 0 and 1.
    Assume everyone picks 0, in this case everyone is the winner.
    In case everyone picks 1, average would be 1, 2/3rd of which would be rounded to 1 again, hence as expected, 1 should win.
    However, consider that some students picked 0 and others picked 1.
    Now when only 2 unbiased choices are possible, knowledge of probability dictates that with a large enough group the set would be approximately equally divided.
    Hence, assume for a moment, that a 25 pick 1 and 25 pick 0. This would mean that the average would be 25/50 = 0.5. The 2/3 of 0.5 would be would be less than 0.5 which would be rounded to 0. Hence 0 would be the winning bet.
    Even if there are more than 25 people who pick 1 as long as the 2/3rd of the average remains below .5, the number would round off to 0.
    Going backwards, as long as the average remains below 0.5*3/2 (i.e. 0.75) all picking 0 would win. Hence in a group of 50, until 38 people pick 1, those picking 0 would win. (this is applicable to all groups with more than 3 students)

    Hence anyone with the knowledge of probability along with game theory should pick 0.

  • john

    I agree with Shashank.

    But I’m afraid micheal webster doesn’t seem to make sense. It’s true that in the situation that Presh gives there is no incentive to pick 2/3 of 10 if it is known that all others will pick 10, but that isn’t known. If any one drops below 10, then all those who stay lose. So, if you don’t know what every one else is picking, there is rational incentive to drop by 2/3.

    Anything above 13 is dominated by 13 ( it only takes 7 iterations to get to 0, not 20). To decide to pick a number under 13 you don’t need to make any assumption about the other participants. To pick 0 you need to know that all participants are “rational”, and that all participants know that everyone involved is “rational” (the quotes are because it must be assumed that the desire to win the prize is rational, and any contrary motivation is not – it’s not rational to dislike smoothies or to want to avoid time with the professor).

    As for the traveler’s dilemma, if I understand correctly, it is poorly described. 2 people aren’t offered between $180 and $300, but rather they get to pick any value between $180 and $300, and they are offered their bid + or – some predefined P. + to the low bid and – to the high bid.

    No where does it mention what happens if the bids are equal, but I’m guessing that each would get their bid with no gain or loss.

    This is not a 0 sum game, it is a positive sum game with the sum being anywhere from $360 to $600. I’d rather share $600 over $360. With a P of $300, this is like the steal of share game (although now there are 120 levels of attempting to steal – and if you both steal, you don’t go away with nothing, you just each have $120 less).

    If P is less than $60, it makes absolutely no sense to chose $180 as the maximum you could get is $240, and you can garantee yourself at least $240 by choosing $300 (so $300 dominates $180). So, if you thought that $180 was dominant by iteration, you stopped the iteration too soon. And it is no surprise that people are picking between $295 and $300.

    Has anyone averaged the winnings of those bidding high versus those bidding $180? That is the test of “rationality” where social aspects are considered.

    If both are rational, and both know they are rational, and they will have future dealings with one another, they will both choose $300. Rational people know that cooperation benefits all in the long run, and it is hard to get cooperation if you are known to stab people in the back.

    In your modified traveler’s dilemma, there is more incentive to go to $180. But a rational person who trusts his partner will still choose $300. However, it will take a smaller amount of doubt in the oponent to switch to $180 unless p is quite small.

    As for a persons values and desires affecting what is rational (I think that is what you are trying to say with your “choice function”), I wholly agree.

  • john

    To Michael Webster, a small appology.

    When it comes to the game in this article, I still think you make no sense. However, I now see that eliminating dominated strategies is not necessarily rational.

    For the traveler’s dilemma with a choice between $180 and $300 and a p=$60, choosing 299 does not completely dominate choosing 300. In fact, for most choices (119 out of 121 possible choices) of the oponent, 300 will be a dollar better than 299. For the other 2 choices 299 is better by $59. But 300 does completely dominate 180.

    For the modified version of the dilemma, 299 does dominate 300. And by iterating one down per step, 180 does become the supposed rational strategy (when one knows the opponent is also rational).

    For the steal or split game, you don’t need to resort to iteration. There are only 2 choices, and stealing always dominates splitting (stealing will either double the winnings, or have no effect), so stealing seems the obvious rational strategy.

    The problem with eliminating the dominated strategies in these 2 cases is that you end up with the following situation. 2 “rational” people playing one another will both lose, while 2 “irrational” people will both win. That seems like a strange definition of “rational”.

  • Pingback: Why your crazy girlfriend always gets what she wants: battle of the sexes game theory - Mind Your Decisions





Previous post:

Next post:

Other posts you may enjoy reading:

Random Posts