Conditional Probability

In my last entry on probability theory, I promised to have a more detailed look at conditional probability. We need this for solving the disease test problem and for solving the Monty Hall problem mathematically.

P(A | B) reads "the probability of A given B", and this is referred to as conditional probability. The formula for calculating this probability is

P(A | B) = P(A AND B) / P(B)

Why?

We're trying to figure out the probability that the event A is also true, given that B is true. Sometimes A may be true even though B is not, but we're not interested in those instances.

Independent events

Now, sometimes the probability of A is independent of B. For example, suppose we flip two coins [A, B], and each event A and B is true if the corresponding coin comes up heads. If B is true (that is, coin B comes up heads), the probability of A is still 50% (1). This means that

P(A | B) = P(A)

where P(A) is the a-priori probability: the conditional probability is unchanged by the information we gained from flipping coin B.
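
If you don't want to take my word for it, here's a tiny Python simulation sketch (my own illustration; the function name is made up) that flips the two coins many times and estimates P(A | B):

import random

def coin_conditional(trials=100_000):
    # Flip two fair coins repeatedly and estimate P(A | B),
    # where A and B are the events "the coin comes up heads".
    b_heads = 0
    both_heads = 0
    for _ in range(trials):
        a = random.random() < 0.5
        b = random.random() < 0.5
        if b:
            b_heads += 1
            if a:
                both_heads += 1
    return both_heads / b_heads

print(coin_conditional())  # should print something close to 0.5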

Dependent events

But what if

P(A | B) != P(A)

("!=" reads "does not equal") In this case, A is dependent on B. What this means is that, as we gain information about B, the probability of A changes from the a-priori probability. In this case, we need to consider all the cases when B is true:

P(A AND B) + P(NOT A AND B) = P(B)

Those are all of the instances where B is true. Now suppose we know that B is true. Out of all the instances where B is true [P(B)], some are instances where A is also true [P(A AND B)]:

P(A | B) = P(A AND B) / P(B)

An example

Let's say we have drawn three cards from a deck: an ace, a king and a queen [A, K, Q]. We shuffle those three cards and draw two of them, trying to draw the ace. Let's say that the first card is not an A. What's the probability that the second one is?

First, let's define two events:

Card1 is the event that the first card is an A
Card2 is the event that the second card is an A

P(Card1) = 1/3
P(NOT Card1) = 2/3
P(Card2 AND NOT Card1) = 1/3

The last probability is easy to see from an a-priori standpoint - the probability that any one of the drawn cards will be the A is 1/3. And if the second card is the A, the first one automatically isn't, so the probability that the second card is the A and the first one is not is also 1/3.

P(Card2 | NOT Card1) = P(Card2 AND NOT Card1) / P(NOT Card1)
P(Card2 | NOT Card1) = (1/3) / (2/3) = 1/2

More intuitively, this can be illustrated as follows:

                 Card 2
               A      K      Q
Card 1   A     0     1/6    1/6
         K    1/6     0     1/6
         Q    1/6    1/6     0

As we can see in this matrix, there are six possible combinations of two cards. [AA, KK, QQ] are not possible, since there's only one card of each rank. Each possible combination has a 1/6 probability of occurring. [AK, AQ] are the possible combinations where the first card is an A. In the matrix, we can find the probabilities stated earlier:

P(Card1) = 2 * 1/6 = 1/3
P(NOT Card1) = 4 * 1/6 = 2/3
P(Card2 AND NOT Card1) = 2 * 1/6 = 1/3

Asking what P(Card2 | NOT Card1) is, is the same as asking "how big a fraction of the times that we don't pick an A as our first card do we pick an A as our second card?". We can easily see in the matrix that there are 4 cases (the K and Q rows) where we don't pick an A as our first card. In two of those cases our second card is an A. 2/4 = 1/2. But also, (2*1/6) / (4*1/6) = 1/2.

(2*1/6) / (4*1/6) = P(Card2 AND NOT Card1) / P(NOT Card1)

Stated in words, in 2 out of 4 cases when the first card is not an A (out of a total of 6 possible cases, which includes draws where the first card is an A), the second card is an A. So, knowing that the first card is not an A, we can narrow the situation down to those 4 cases, giving us a probability of 2/4 = 1/2.
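
If you'd rather let a computer do the counting than trust my matrix, here's a minimal Python simulation sketch (my own illustration; the function name is made up) that estimates the same conditional probability:

import random

def estimate_conditional(trials=100_000):
    # Shuffle [A, K, Q] and draw the top two cards. Count the draws where
    # the first card is not the ace (NOT Card1), and among those, how
    # often the second card is the ace (Card2).
    not_ace_first = 0
    ace_second = 0
    for _ in range(trials):
        cards = ['A', 'K', 'Q']
        random.shuffle(cards)
        if cards[0] != 'A':
            not_ace_first += 1
            if cards[1] == 'A':
                ace_second += 1
    # P(Card2 | NOT Card1) = P(Card2 AND NOT Card1) / P(NOT Card1)
    return ace_second / not_ace_first

print(estimate_conditional())  # should print something close to 0.5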

I hope this entry has helped your understanding of how conditional probability works. It's not very formal, and it's not very extensive, but hopefully it's quite intuitive and at least free of too big gaps in its logic. However, it's late now, and I kind of just threw this one out there, because I haven't posted anything for a while.

__________
Notes:
(1) If you think otherwise, you're subject to the gambler's fallacy, which we'll have a closer look at in a future post.

Soccer Penalty Kicks Article Flawed?

In my last post on this subject, I made a quick reference to an article on a mathematical examination of soccer penalty kicks. In this article, Tim Harford gives a brief survey of the findings of a paper by Ignacio Palacios-Huerta of Brown University. The paper (pdf) is quite an interesting read, and I really recommend it to anyone with some knowledge of statistics and game theory. However, I do believe I've detected a flaw in it. Before proceeding any further, though, I should include all the standard disclaimers, including, but not limited to, the fact that I'm in no way an authority or an expert in this area, and that there is a chance that I've misunderstood things. I have all due respect for Mr Palacios-Huerta as a scientist and for Mr Harford as a writer, and I'm merely a layman myself.

Anyways. After writing my first entry on the article by Tim Harford, I got to thinking. Quoting from Tim Harford's article:
Professionals such as the French superstar Zinédine Zidane and Italy's goalkeeper Gianluigi Buffon are apparently superb economists: Their strategies are absolutely unpredictable, and, as the theory demands, they are equally successful no matter what they do, indicating that they have found the perfect balance among the different options. These geniuses do not just think with their feet.
At first, this seemed to be a good indication that Zidane and Buffon are indeed playing optimal strategies. But what hit me after writing my first entry is that their playing optimal strategies doesn't make them indifferent between their own strategy choices. That is, their playing optimally doesn't make them succeed equally often no matter what they do. It does, however, make their opponents indifferent between their strategy choices.

The optimal strategy is about making your opponent indifferent between his strategy choices. Recall, from my previous post on this subject, how the indifference equation for each player included the strategy choices of the other player, but not his own. This relationship works both ways: Your playing optimally doesn't make you indifferent, and your indifference is not an indication that you're playing optimally.

So the Harford article is wrong. The fact that Zinédine Zidane and Gianluigi Buffon seem to be indifferent does not indicate that they play optimal strategies. It does, however, indicate that their opponents, on an aggregate level, are playing optimally.

Now, is this Tim Harford's or Ignacio Palacios-Huerta's mistake? In order to find out, I read the original paper by Mr Palacios-Huerta.

In the paper, Palacios-Huerta starts by formulating a hypothesis saying that professional players are indeed playing a minimax strategy. In order to test this hypothesis, he examines a sample of 1417 penalty kicks. I have no objections to his examination of all the players on an aggregate level. However, when testing the hypothesis for individual players, he seems to be looking at each individual player's strategy choices and their corresponding outcomes. Using Pearson statistics and p-values based on those figures, the hypothesis is rejected for five players.

On an aggregate level, we can look at the overall figures for both sides of the game. According to the hypothesis, both goalies and kickers should have equal success rates, no matter their choices. This can be tested, and the hypothesis rejected, with the tools used by Palacios-Huerta. But when testing the hypothesis for an individual player, we should look at that player's aggregated opponents' success rates for their strategy choices, which, it seems to me, is not what he's done.

So what hypothesis should we reject when the individual figures used by Palacios-Huerta don't give a good enough match with the hypothesis? Well, not the hypothesis that that particular player is playing minimax, but rather the hypothesis that his opponents, on an aggregate level, play minimax. This is not, in itself, an uninteresting hypothesis to examine, but, as far as I can see, it's not the one intended by Palacios-Huerta.

Unfortunately, the tables provided in the paper don't allow for the data to be rearranged so that we can perform this test on our own. There is no information on the strategy choices of the opponents of each individual player and their corresponding outcomes, so we can't examine the hypothesis that a specific individual player plays optimally, without accessing the underlying data.

So, what do we know about Zinédine Zidane and Gianluigi Buffon? Not much, but it seems they've been playing against superb economists.
__________
Notes:
Again, let me remind you of the disclaimers. I'm really a layman, and I may very well be wrong. Either all wrong or just in my interpretation of the paper.

External links in this post:
World Cup Game Theory - What economics tells us about penalty kicks by Tim Harford. The quoted article in Slate Magazine.
Ignacio Palacios-Huerta at the Brown University website
Professionals Play Minimax by Ignacio Palacios-Huerta of Brown University (pdf format)

Other resources:
Tim Harford - The Undercover Economist

The Monty Hall Problem, Part 4

Hopefully, we all agree that we should switch when faced with the problem given in the basic formulation of the Monty Hall problem. However, in the alternative formulation given in my first entry, there's one major difference. In the original game, the host was obliged to reveal a second door after watching you pick one. In my version of the game, I hadn't made any such commitment. So, why would I give you a chance to change your mind? Possibly out of generosity, sure, but most probably because I knew you had made the right choice and wanted you to switch to an empty cup.

Put in game-theory terms, switching is a dominated strategy. Your strategy choices are to switch when given the opportunity or to never switch (switch / stay). I have more strategy choices than you do. This is a full payoff matrix of the game for all possible strategy choices, with your choices represented as columns and my choices as rows. The outcome values are the probabilities of you winning the bill.

           Switch   Stay
No-No       1/3     1/3
No-Yes       1      1/3
Yes-No       0      1/3
Yes-Yes     2/3     1/3

My strategy choice "Yes-No", for example, means that I offer you an opportunity to switch if you choose the right cup initially, but I don't offer you that opportunity if you choose the wrong one. So the first Yes or No refers to whether I offer you that opportunity when you choose the right cup, and the second one to the case when you choose the wrong one.
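
To check the matrix, here's a small Python sketch (my own illustration, not part of the game itself) that enumerates my four strategies against your two and computes your probability of winning the bill in each case:

from fractions import Fraction

def win_probability(offer_right, offer_wrong, player_switches):
    # Your initial pick is right with probability 1/3 and wrong with 2/3.
    p = Fraction(0)
    # Picked the right cup: you win unless you're offered a switch and take it.
    if not (offer_right and player_switches):
        p += Fraction(1, 3)
    # Picked the wrong cup: the lifted cup is the other empty one,
    # so switching wins and staying loses.
    if offer_wrong and player_switches:
        p += Fraction(2, 3)
    return p

for offer_right in (False, True):
    for offer_wrong in (False, True):
        row = ('Yes' if offer_right else 'No') + '-' + ('Yes' if offer_wrong else 'No')
        switch = win_probability(offer_right, offer_wrong, True)
        stay = win_probability(offer_right, offer_wrong, False)
        print(row, 'Switch:', switch, 'Stay:', stay)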

Notice that all of my strategy choices but "Yes-No" (offering the opportunity to switch only when you've picked the right cup) are dominated. This means that they can never lead to a better result for me than "Yes-No", only a worse one, depending on your strategy choice. So there is no reason for me to choose any of those strategies. Removing those strategy choices, we get a much simpler payoff matrix:

           Switch   Stay
Yes-No       0      1/3

It should now be obvious that switching is a dominated strategy. So in a game-theory sense, switching is a bad strategy. However, game theory isn't everything. Maybe you have a "read" on me, making you believe that I want you to have the bill. Maybe you think that I intended to always give you the switching opportunity as in the original Monty Hall problem. So there may be reasons to deviate from game-theory optimal play. But lacking such guidance, you're probably better off resorting to game theory, in this case guaranteeing you a 1/3 chance to win the prize.

Quiz: The Probability of Having a Disease

A new super-resistant virus has emerged in Farawayistan. The Ministry of Health estimates that 1 out of 1000 people is infected. After immense efforts, a very accurate test has been developed. If the test subject is infected, the test will come out positive 99.9% of the time. But if the subject is not infected, 1 in 1000 times the test will yield a false positive result.

A random person is tested, and the test comes out positive. What's the probability that he's infected?

Please, answer in the comments.

The Maths of Soccer Penalty Kicks

In a very interesting article, titled World Cup Game Theory - What economics tells us about penalty kicks, Financial Times columnist Tim Harford gives a simple and intuitive introduction to applying game theory to soccer penalty kicks. I really recommend reading his article. (After writing this entry, I detected a flaw in the article. More on that here.)

In this entry, I'll examine the game theory of penalty kicks in some more detail, and we'll arrive at an actual formula for each player of the game. Due to the mathematical nature of this examination, there will be some math that may look dense and off-putting, but it looks more complicated than it actually is. As an attempt to make it easier to follow, I've used some color coding. Maybe this serves only to make it look messier. Please tell me what you think in the comments.

Now, let's get started.

A penalty kick can be reduced to a simple grid game. There are two players: the penalty kicker, who chooses which direction to shoot, and the goalie, who chooses which way to throw himself.

As Tim Harford points out, there is not enough time for the goalie to see which way the ball is going and then choose to go in that direction to intercept it. He must guess, risking going in the completely wrong direction. So the goalie's choice of direction is independent of the shooter's choice (1).

Most shooters have a stronger and a weaker side, and should tend to favor their stronger side. But if they always aim at the stronger side, the goalie can exploit this by always going in that direction. The shooter can then counter-exploit this by shooting at the other side, where he will almost certainly score even though it's his weaker side, since the goalie will be going the other way. The goalie then reacts to this, and we have a never-ending loop of exploitations and counter-exploitations. To solve this problem, we need to find a game-theory optimal solution that offers an equilibrium to the game.

Game-theory optimal play

If both players play game-theory optimally, neither one can improve his chances by altering his strategy. If he could, his strategy wouldn't have been optimal. In the same fashion, the optimal strategy is unexploitable, since the opponent can't gain an additional edge by deviating from his own optimal strategy. Remember how the shooter could elect to always shoot to his weaker side to exploit a strategy where the goalie always goes to the shooter's stronger side? That means that the goalie's strategy wasn't optimal.

Now, if neither player can gain an edge by altering their strategy, then, by definition, they're indifferent between their choices. If they weren't indifferent, one choice would be better than the other, and the player would gain an edge by opting for that choice. So, in order to find the game-theory optimal strategies, we should look for indifference points.

Strategy values

This game can easily be summarized in a grid as follows:

         GS      GW
SS      50%     95%
SW      80%     30%

where the rows represent the shooter's strategy choices and the columns represent the goalie's. Each cell corresponds to a combination of the choices of both players, and the figure is the corresponding chance of a goal. GS means that the goalie throws himself in the shooter's strong direction, and GW that he opts for the shooter's weak side. SS and SW denote the corresponding shooting strategies.

Now, those figures are just made up for the purpose of illustration. I don't claim that they're realistic in any way. Notice though, that I've taken into account the chance that the shooter misses the goal even if the goalie goes the wrong way, and that there is a bigger risk for this when he shoots to his weaker side. Notice, also, that the chance of a goal is greater if he opts for the stronger side and the goalie goes the right way, than if he opts for the weaker side and the goalie goes that way.

So, we have 2 strategies for each player: SS and SW for the shooter, and GS and GW for the goalie. We have 4 strategy pairs, SSGS, SSGW, SWGS and SWGW, with corresponding outcome values. (2)

Calculating the goalie's strategy

If the shooter chooses the strategy SS a fraction S of the time, he will choose the strategy SW the remaining 1-S of the time (3). Similarly, if the goalie chooses the strategy GS a fraction G of the time, he will choose the strategy GW 1-G of the time. We should now solve for S and G, which are the strategy variables for the two players.

The expected value for the shooter of strategy SS is:

E(SS) = G * SSGS + (1 - G) * SSGW

In plain English, this means that the shooter will obtain an outcome value of SSGS (50% in our example) the fraction G of the time when the goalie chooses the strategy GS, and an outcome value of SSGW (95% in our example) the fraction 1-G of the time when the goalie chooses the strategy GW.

Similarly,

E(SW) = G * SWGS + (1 - G) * SWGW

Now, here comes the magic. The shooter is indifferent when E(SS) = E(SW), as explained above. To find this point, we'll insert the two equations above into that equation:

E(SS) = E(SW)
G * SSGS + (1 - G) * SSGW = G * SWGS + (1 - G) * SWGW
G * SSGS + SSGW - G * SSGW = G * SWGS + SWGW - G * SWGW
G * SSGS + G * SWGW - G * SSGW - G * SWGS = SWGW - SSGW
G * (SSGS + SWGW - SSGW - SWGS) = SWGW - SSGW
G = (SWGW - SSGW) / (SSGS + SWGW - SSGW - SWGS)

And that's the formula for the indifference point for the shooter. When the goalie chooses his actions according to this formula, the shooter will be indifferent to his strategy choices. Plugging our example figures into the formula:

G = (SWGW - SSGW) / (SSGS + SWGW - SSGW - SWGS)
G = (0.3 - 0.95) / (0.5 + 0.3 - 0.95 - 0.8)
G ~ 0.684

The goalie should go for the shooter's stronger side about 68.4% of the time.

Calculating the shooter's strategy

Similarly, we calculate the expected values of the goalie's strategy choices:

E(GS) = S * SSGS + (1 - S) * SWGS = S * SSGS - S * SWGS + SWGS
E(GW) = S * SSGW + (1 - S) * SWGW = S * SSGW - S * SWGW + SWGW

Notice that high outcome values are bad for the goalie, as that means a large probability of a goal. But that doesn't matter to us. We don't care about the exact values of the strategies, as long as they're equal. Now for the indifference equation:

E(GS) = E(GW)
S * SSGS - S * SWGS + SWGS = S * SSGW - S * SWGW + SWGW
S * (SSGS + SWGW - SSGW - SWGS) = SWGW - SWGS
S = (SWGW - SWGS) / (SSGS + SWGW - SSGW - SWGS)

Again, plugging in the example figures:

S = (SWGW - SWGS) / (SSGS + SWGW - SSGW - SWGS)
S = (0.3 - 0.8) / (0.5 + 0.3 - 0.95 - 0.8)
S ~ 0.526
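
If you'd rather not do the algebra by hand, here's a small Python sketch of the two formulas (the function name is my own invention), plugging in the example figures:

def indifference_frequencies(ssgs, ssgw, swgs, swgw):
    # G: the fraction of the time the goalie should go to the shooter's strong side.
    g = (swgw - ssgw) / (ssgs + swgw - ssgw - swgs)
    # S: the fraction of the time the shooter should aim at his strong side.
    s = (swgw - swgs) / (ssgs + swgw - ssgw - swgs)
    return g, s

g, s = indifference_frequencies(ssgs=0.50, ssgw=0.95, swgs=0.80, swgw=0.30)
print(round(g, 3), round(s, 3))  # prints 0.684 0.526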

Conclusion

The shooter should opt for his stronger side only 52.6% of the time, while the goalie goes that way 68.4% of the time. Both players are then indifferent between their choices. This means that either one of them could choose any action and still obtain exactly the same result. But biasing towards one choice opens up for the opponent to exploit that bias, and that's why they should stick to the frequencies prescribed by the solution (4). They can't do better by changing, but they can do worse, if their opponent catches on.

Next entry on this topic: Soccer Penalty Kicks Article Flawed?
__________

Notes:
(1) Actually, some goalies have developed an ability to anticipate, based on the movements of the shooter prior to the impact with the ball, in which direction the shooter will aim. This gives the goalie more time to decide which way to go. On the other hand, some shooters have developed a counter-technique of bluffing about which way they're going to aim, so this boils down to a game of its own. For simplicity, we'll assume that there are no prior indications as to which way the shooter will aim.
(2) Here, we consider the probability of a goal, for each strategy pair, an outcome. Even if the eventual, actual outcome of the kick is still uncertain, the probability of a goal, given a specific strategy pair, serves as a value of that strategy pair for each player.
(3) As explained in the entry Probability Theory for Dummies.
(4) If the opponent isn't playing the optimal strategy, however, one might consider making exploitative adjustments to the optimal strategy. Bear in mind, though, that this opens up for counter-exploitation, and should thus only be done if one doesn't expect the opponent to catch on.

External links in this post:
World Cup Game Theory - What economics tells us about penalty kicks by Tim Harford

The Monty Hall Problem, Part 3

Previous entries in this series:
Part 1 | Part 2

In this post, we'll examine the Monty Hall problem intuitively. First, we'll start with the problem in its basic formulation:

You're on a game show. Here are the rules of the game: There are three doors, and behind one of them there's a brand new car. Behind the two other doors, there are goats. You'll choose one of the doors, and subsequently, the game show host, who knows where the car is, will open one of the other doors, revealing a goat. Then you are given a choice: Will you stay with your initial choice or will you switch to the other unopened door?

Many initially answer that it doesn't matter. They reason that, as there are only two doors left, one of which hides the car and one of which does not, the chance that any one of them hides the car is 1/2. Sensible as this may seem, it's incorrect. In fact, the probability that the other door hides the car is 2/3, so you should switch.

How can this be? Of course, when we first picked a door, there was a 1/3 chance that we picked the right one. But when the game show host opens another door, doesn't that additional information change the situation? In fact, it doesn't. We gain no additional information from his opening a door, since he'd open a door to reveal a goat whether or not we picked the right door to start with. We knew from the beginning that the probability was 1/3, and we haven't gained any information to change things, so we're still looking at a 1/3 probability.

Here's another way of looking at it: 1/3 of the time, you'll pick the right door to start with. Staying grants you the car, and switching gives nothing. But 2/3 of the time, you'll pick the wrong door, and the host will open the other door that also hides a goat. Those times, staying gives nothing, and switching grants you the car. So switching gives you the car 2/3 of the time.
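
For the skeptics, here's a quick Python simulation sketch (my own illustration; the function name is made up) that plays the game many times with each strategy:

import random

def monty_hall(switch, trials=100_000):
    wins = 0
    for _ in range(trials):
        car = random.randrange(3)
        pick = random.randrange(3)
        # The host, who knows where the car is, opens a door that is
        # neither the picked door nor the one hiding the car.
        opened = next(d for d in range(3) if d != pick and d != car)
        if switch:
            pick = next(d for d in range(3) if d != pick and d != opened)
        if pick == car:
            wins += 1
    return wins / trials

print(monty_hall(switch=True))   # should print something close to 2/3
print(monty_hall(switch=False))  # should print something close to 1/3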

In the next part of this series, we'll have a look at the mathematics of this game, but for now, let's leave it at this. As a side note, if the host didn't know where the car was, but still opened a door at random and just happened to reveal a goat, then the chances would, indeed, be 1/2 and you'd be indifferent to switching.

In the alternative formulation that I gave in my first entry, I said that the correct decision is to stay with your initial choice. The formulation was slightly different, with only one significant difference. But what? I've had a few friends asking me about this on MSN, and one of them actually came up with the correct answer by himself. Can anyone figure out what that difference is, and why it changes things so drastically? Please comment.

Probability Theory for Dummies

Before we start examining the Monty Hall problem in any kind of detail, let's look at some probability notation and theory. This entry is intended for those readers who aren't already altogether familiar with basic probability theory. Those who are may read this as a review, or skip reading today's post.

I believe that about 50% of my readers are going to read this specific entry (1). The others may not be too interested in this particular topic, they may be deterred by the mathematical notation (don't let that deter you, though), or they may already be very familiar with the topics in this entry. However, taking a random reader, the probability that he'll read this entry, given that my assumption is correct, is 50%. Put another way:

P(Read) = 0.5

"P" is the notation for probability. "Read" is the event that he reads the entry, and 0.5 is the decimal form of the probability of that event. In English, P(Read) = 0.5 reads "the probability of the event that he reads this entry equals 50%".

Conditional probability (2)

However, I feel that at least 75% of my math-inclined readers probably are familiar with those topics and, thus, will skip reading this entry. So,

P(NOT Read | Math-inclined) = 0.75

"P" and "Read" are the same as above."NOT" is an operator stating that the "Read" event is not true. "|" is the the operator of conditional probability, and it reads "given". "Math-inclined" is the event that the chosen reader is one of those math-inclined readers. So this whole statement reads "The probability of the event that he does not read this entry, given the event that he's math-inclined, equals 75%".

The NOT operator

The chance that an event takes place, plus the chance that it does NOT, always adds up to 100%. Put differently, in 100% of cases, either the event takes place or it doesn't. So,

P(A) + P(NOT A) = 1.

With some very basic algebra, this can be turned into

P(NOT A) = 1 - P(A)

The probability that the math-inclined reader will read this entry equals 1 minus the probability that he will NOT read it, so

P(Read | Math-inclined) = 1 - P(NOT Read | Math-inclined) = 1 - 0.75 = 0.25

Joint probability

I believe that about 75% of my readers are math-inclined, so

P(Math-inclined AND Read) = 0.75 * 0.25 = 0.1875

There's an 18.75% chance that a randomly chosen reader is math-inclined but still reads this entry. "AND" is the operator of joint probability. This means that, for the statement "Math inclined AND Read" to be true, both events have to be true. The reason why we multiply the probabilities of both events can be illustrated as follows:

P(Math-inclined) = 0.75 = 3/4
P(Read | Math-inclined) = 0.25 = 1/4

as stated above. Out of 16 randomly picked readers, on average 12 will be math-inclined (3/4 * 16). Out of those 12 math-inclined readers, 3 will read this entry (1/4 * 12). So out of the 16 initially chosen readers, 3 are math-inclined AND will read this entry. 3 out of 16 is 18.75% (3/16 = 0.1875). This is consistent with multiplying the probabilities of both required events: 3/4 * 1/4 = 3/16.

Put differently, in 75% of cases the randomly chosen reader is math-inclined. In 25% of those cases, he'll still read this entry. As we all know, a certain percentage of a quantity equals that quantity multiplied by the decimal form of the percentage. So, 25% of 75% is 0.25 * 75%, which equals 18.75% (0.1875).
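
For those who like to see the arithmetic done by a machine, here's a tiny Python check (my own illustration) of that multiplication:

from fractions import Fraction

p_math = Fraction(3, 4)               # P(Math-inclined)
p_read_given_math = Fraction(1, 4)    # P(Read | Math-inclined)
p_joint = p_math * p_read_given_math  # P(Math-inclined AND Read)
print(p_joint, float(p_joint))        # prints 3/16 0.1875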

The OR operator and the Sieve principle

P(Math-inclined OR NOT Math-inclined) = 0.75 + 0.25 = 1

Put this way, this one seems pretty obvious. The probability that a reader is either math-inclined or not is obviously 1. But it may not seem equally obvious if we put it in more abstract terms. Formally,

P(A OR B) = P(A) + P(B) - P(A AND B)

Why is this? Well, let's resort to an intuitive explanation. Let's say we have six two-letter combinations, for example [AB, AC, AD, BC, BD, CD]. If we choose one of those combinations at random,

P(A) = 3/6 (the probability of the event that the letter "A" is represented in the combination)
P(B) = 3/6 (the probability of the event that the letter "B" is represented in the combination)

What's the probability that either A or B is represented in the combination? Well, from a quick look, we can see that it's 5/6, since only 1 in 6 combinations includes neither A nor B. But how do we arrive at this mathematically?

Adding the number of combinations where A is represented to the number of combinations where B is represented seems like a good first step. But this leaves us with 6/6, which is clearly wrong. This is because, in this way, we've double-counted the instance where both A and B are represented. So, having counted that instance twice, we need to subtract it once:

P(A OR B) = P(A) + P(B) - P(A AND B) = 3/6 + 3/6 - 1/6 = 5/6

This is called the Sieve principle, or the inclusion-exclusion principle. With more events, such as P(A OR B OR C) it gets a little bit trickier, but let's not bother with that right now.
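
If you want to verify the count rather than trust my quick look, here's a small Python snippet (my own illustration) that enumerates the six combinations:

combos = ['AB', 'AC', 'AD', 'BC', 'BD', 'CD']
has_a = [c for c in combos if 'A' in c]
has_b = [c for c in combos if 'B' in c]
has_both = [c for c in combos if 'A' in c and 'B' in c]
has_either = [c for c in combos if 'A' in c or 'B' in c]

# The Sieve principle: |A| + |B| - |A AND B| should equal |A OR B|.
print(len(has_a) + len(has_b) - len(has_both), '=', len(has_either))  # prints 5 = 5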

Returning to

P(Math-inclined OR NOT Math-inclined) = 0.75 + 0.25 = 1

didn't I forget to subtract the double-counted instances? No. Double counting isn't possible in this case, since one reader can't be both math-inclined and non-math-inclined. So

P(Math-inclined AND NOT Math-inclined) = 0

and so, actually,

P(Math-inclined OR NOT Math-inclined) = 0.75 + 0.25 - 0 = 1

but we can just skip that step.

That's a very brief look at probability theory. We'll expand on it later on, and we'll use it when examining the Monty Hall problem further. But first, we'll have a more intuitive look at the Monty Hall problem tomorrow.

Next entry on this topic: Conditional Probability

__________
Notes:
(1) This, and all of the other figures concerning the inclinations and tendencies of my readers, are completely made up. They are probably not even nearly accurate, and I use them solely for illustration purposes.
(2) In a future entry, we'll have a more detailed look at conditional probability

A Real-Life Application of the Bluff

Many years ago, being a new student in a new city, working for students' associations was a fast lane into student life. I took on many responsibilities, and a frequent problem was that of getting people to work. It's hard to convince people to work for sub-minimum wages when they have exams coming up. Needless to say, after I had ended my responsibilities at the students' associations, I felt for my poor successors trying to call in workers during exam periods. I didn't have much time to work, but I wanted to help. So, when they called me, asking me to work for them, I used to tell them to keep looking for someone else first, but to call me back if they couldn't find anyone, promising to think about it if they did indeed call me back.

Pretty soon, they learned that I'd never say no, once they called me back. So they started calling me first, and I'd tell them to look for someone else first. Then, they'd just wait for a couple of hours and then call me back, knowing that I'd never say no. My strategy was highly exploitable, so to say. Of course, I could have just quit playing the game, that is, quit working at all. But I wanted to help out. So instead, I introduced the bluff to my strategy.

Sometimes, when they called me and I knew for a fact that I couldn't work that particular day, I'd still tell them to look for someone else and then call me back if they couldn't find anyone else, promising to think about it if they did. When they called me back, I'd say, "no, sorry, I can't". That way, they couldn't rely solely on exploiting me anymore, and they had to actually start calling other people when I told them to.

Efficient-Market Hypothesis Yielded $300 Million Donation

The University of Chicago Graduate School of Business is being renamed the University of Chicago Booth School of Business. The reason is a $300 million donation from David G. Booth, the founder and chief executive of Dimensional Fund Advisors. Booth, who's been highly successful in his business, attributes much of his success to the university. His business is largely grounded in the efficient-market hypothesis, which was, to a large extent, developed at the U. of Chicago Graduate School of Business.

Basically, the efficient-market hypothesis states that prices of traded assets reflect all available information. Thus, no piece of information can be used to gain an edge on the market, since that information is already accounted for in the price. No trader can outperform the market, other than by luck.

There are some obvious problems with this hypothesis, and I'll use this piece of news as a starting point for a series of entries on them.
__________
Resources:
The New York Times on the donation

The Existence of God is 4 to 1?

Bookmaker Paddy Power are currently keeping a book on the existence of God. The odds are, according to the book, 4 to 1, and the odds of Russell Brand, English comedian, actor, author, columnist and television presenter, being God are, apparently, 500 to 1. On their website, Paddy Power state that "scientific proof must emerge by 31st Dec 2009, to confirm his omnipresence in order for bets to be deemed winners".

It's not a coincidence that this bet emerges at this particular point in time. Recently, a fundraising campaign started for putting adverts saying "There's probably no God" on London buses. Professor Richard Dawkins, famous atheist and bestselling author of The God Delusion, supports the campaign by matching donations up to £5,500.

Also, recent developments in the Large Hadron Collider project seem to have given people hopes of obtaining proof of the existence of God. The odds of the bet have fluctuated in correlation with events in the collider project, perhaps not too surprisingly, since the scientists of the project are hoping to find the so-called God particle. However, proving the existence of the God particle would not equal proving the existence of God, and therefore anyone taking the Paddy Power bet in the hopes of winning on such an event is mistaken.

So, without the hope of finding proof through the collider, we're back where we started in the ancient hunt for proof of God's existence. And I bet the odds against any proof emerging within the next year are significantly larger than 4 to 1, no matter how likely the existence of God is deemed to be.
__________
External links in this post:
Paddy Power's website
Donation page for the Atheist Bus Campaign.
Other references:
English news site Telegraph.co.uk on the bet

A Quick Briefing on the Poker Bluff

In poker, there are good hands, bad hands and hands in between. On the last betting round (1), obviously, you'd like to bet with your good hands, hoping for your opponent to call with a lower hand, thus making you money. However, you can't bet all of your hands, so you will check some of your hands. But if you bet only with your best hands, your opponent can adjust to this, and call only with hands slightly better than your worst betting hand. It wouldn't make sense for him to call with hands lower than your worst betting hand, since that would lose him money.

But if he wouldn't call with a hand lower than your worst betting hand, then why would you bet it in the first place? It doesn't win you any money when he has a lower hand, but it costs you money when he has a better hand. So, you wouldn't bet it. This narrowing of your betting range would lead your opponent to narrow his calling range as well, so that you'd have another worst betting hand that isn't worth betting, either. And so it goes on, until you don't bet any hand but the very best hand possible. To put it in game-theory terms, there is no equilibrium where you bet only a top range of your hands.

In order to find an equilibrium, you need to introduce the bluff to your strategy. By betting a top range of your hands, and a bottom range of hands, that is, some of your absolutely worst hands, you'll make it worthwhile for your opponent to call with a hand lower than your worst value-betting hand (value betting being betting with one of the top range hands, as opposed to bluff betting, which you do with the bottom range hands). It follows that betting with that very worst value-betting hand will be worthwhile to you, since now it makes you money when your opponent calls with a lower hand.

But do we really want to make it worthwhile for our opponent to call? That sounds beneficial to him, doesn't it? Well, if we bluff with the optimal frequency, that is, if our value-betting range stands in the right proportion to our bluffing range, he will break even on calling with those hands that can only beat a bluff. In game-theory terms, he's indifferent to calling. But even though he could choose to fold instead of calling, he must call with a proper fraction of those hands, in order not to open up for us to exploit his folding tendency by bluffing more.
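
As a rough illustration of that indifference, here's a small Python sketch for a simplified single-bet situation (the pot and bet sizes, and the function names, are my own additions for illustration, not from any poker text): the caller's expected value of calling with a hand that only beats a bluff is zero exactly when the bluffing fraction is right.

from fractions import Fraction

def caller_ev(pot, bet, bluff_fraction):
    # EV of calling with a hand that only beats a bluff: win the pot
    # plus the bet against a bluff, lose the bet against a value bet.
    return bluff_fraction * (pot + bet) - (1 - bluff_fraction) * bet

def breakeven_bluff_fraction(pot, bet):
    # The fraction of bluffs in the betting range that makes caller_ev zero.
    return Fraction(bet, pot + 2 * bet)

pot, bet = 100, 100  # made-up sizes
f = breakeven_bluff_fraction(pot, bet)
print(f)                       # prints 1/3 -- a third of the bets are bluffs here
print(caller_ev(pot, bet, f))  # prints 0 -- the caller is indifferent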

There, that's a quick briefing on the poker bluff.
__________
Notes:
(1) On earlier betting rounds, when the lower hand sometimes has a chance of winning, due to drawing possibilities, things are a bit different. But this is outside of the scope of this entry.

Responsible Gaming in the Swedish Monopoly

Recently, the Swedish government-owned gaming company Svenska Spel, which enjoys a statutory monopoly on most forms of gambling services, including lotteries, online poker and casinos, was awarded the World Lottery Association's (WLA) Award for Responsible Gaming Excellence. The Swedish government has commissioned Svenska Spel to arrange lotteries and other gambling games in a responsible way. The rationale for this mission and the monopoly status is that it is deemed to have a positive effect on public health. By providing a more responsible service than foreign companies, Svenska Spel is meant to keep Swedish gamblers from using those foreign services.

However, many people suspect that the real rationale is purely fiscal - that the Swedish government wants to take a piece of the juicy gambling cake. Also, the responsibility seems, in many ways, to be only reluctantly introduced into the activities of the company, and only to the smallest extent possible in order to fulfill the requirements from the government.

On their website, for instance, Svenska Spel claim to refrain from advertising "hazardous" games. Poker, for example, is clearly considered such a hazardous game, but still, they advertise it. Not only do they target existing poker players with ads for upcoming tournaments and events, but they also engage in external marketing.

A peculiar instance that I saw today, harmless as it may seem, relates to the Lotto game. In the Lotto, a series of 7 numbers ranging from 1 to 35 is drawn. Players choose their own strings of numbers prior to the draw, and the number of matches between each player's string and the drawn string is counted. 7 correct numbers gives the grand prize, often making the winners millionaires. Fewer correct numbers give smaller prizes, and usually, fewer than 4 correct numbers gives no return.

On their website, statistics on the draws can be found. From their website:
In the following statistics, you can see which numbers have been drawn the most and the least frequently in the last half year. The statistics are updated continuously. [...]

Isn't statistics fun? Remember, though, that the choice of numbers is always random. Good luck!
(My translation)

Now, why is this a problem?

Svenska Spel responsibly added a note on the randomness of the draw. You cannot, by choosing specific numbers, improve your chances of hitting the jackpot. But this is not the implication of the text at large, quite the contrary. Posting those statistics appeals to the gambler's fallacy and the Texas sharpshooter fallacy (1), encouraging people to vainly tinker with their numbers in order to better their chances to win. Encouraging irrational gambling and giving gamblers false expectations is not my idea of responsible conduct.

Interestingly, those numbers can be put to constructive use by a gambler free of those fallacies. If we use them contrary to the way that a gambler subject to the fallacies would, we can exploit those who use them based on the fallacies. That is, we choose numbers that are less likely to be chosen by other, irrational players. Our winning chances won't increase, but on our occasional wins, we'll have fewer other winners to share the prize pool with.

So, if there's a good use for the numbers, then there's no problem? Wrong. The good use is only possible to the extent that other players are tricked by the numbers.

For the sake of fairness, let's note that Svenska Spel does, at least, take some measures for responsible gaming. They do, if not all they can, at least probably more than most other gaming companies. So the award is probably fair, and I don't mean to contest that. On the contrary, I congratulate Svenska Spel on their award.
__________
Notes:
(1) In a future entry, there will be more on how the fallacies mentioned relate to the Lotto and those statistics.
__________
External links in this post:
Website of the Swedish government-owned gaming company Svenska Spel

Other references:
Press release on World Lottery Association's website

The Monty Hall Problem, Continued

I suppose one comment on my very first blog entry is to be considered good. To recap, I gave a new wording to the famous Monty Hall problem. Three cups were placed upside down on a bar desk, with a $100 bill under one of them. You were told to choose one of the cups, and if that particular cup contained the $100 bill, the bill would be yours. I lifted one of the other cups after watching you choose one, revealing that there was no bill under that one. You were offered an opportunity to change your mind and go for the other unrevealed cup instead of the one you initially chose. The question is: Should you switch or should you stay with your initial choice?

Anonymous commented as follows:
You should switch. The probability that you picked the right cup first is 1/3, so its 2/3 that the bill is in one of the other cups. Knowing that the cup is not in one of them doesnt change that. If you switch the probability is 2/3.
This may seem counter-intuitive to many readers. But from a probabilistic point of view, it seems to make sense. This is the answer that I expected from most readers who are already familiar with the Monty Hall problem, since this is the correct answer to the problem in its basic formulation. However, remember that I hinted that there is a twist to this one? There is, and this answer is, in fact, wrong. You should stay with your initial choice.

Now, I expect to be ridiculed by some of my readers. Please, shoot. I'm very confident that my solution is correct, and I will explain why in a later entry. In the meantime, can anyone come up with an explanation? Why should you stay? Or, if you don't agree with me, why am I wrong?

Who is the Economatheek?

I suppose you've had some trouble interpreting my nom de plume, The Economatheek. It's simple, though. It's a made-up word composed of three parts: economics, mathematics and geek. It says, if not all, at least some of it.

Being a math-inclined law-and-business student among the polar bears and huskies of Sweden, I spend the long winters studying a wide range of sources, connected only by the fact that they're not actually on the curriculum. I've developed an interest in the shortcomings of mainstream economic theory and the fallacies that affect much of our thinking. I find the inherent uncertainty that permeates every aspect of social and economic life, but which is widely assumed away in mainstream economic theory, quite intriguing, and I feel that this is where focus needs to be set.

Besides this, I also harbour a great interest in game theory, probability theory, statistics and many more topics. There will be plenty of all of those topics here as time goes by.

In order not to scare new visitors off, let's start out slow:

The Monty Hall problem, popularized by columnist and author Marilyn vos Savant, proves an intricate problem to most on their first acquaintance with it. Expecting at least some of my visitors to have already read, solved and understood the problem in its basic formulation, I shall say that there is a twist to this one. So stay with me.

It's Friday night, and you're standing by the bar in the local pub. From the shadows, a skinny, bearded man emerges, introducing himself as The Economatheek. Holding up a $100 bill, he asks the bartender for three coffee cups, and after receiving the cups, he asks you to look away. The bartender, whom you trust as an honest man, serves as a witness as The Economatheek puts the three cups upside down with the $100 bill under one of them.

You are now told to choose one of the cups. If the bill is, indeed, to be found under that particular cup, it will be yours. You choose one of the cups, but just as you're about to lift it, the skinny man tells you to wait. He then lifts one of the other cups, revealing that there is no bill under that particular cup. Thereafter, he asks you if you still want to go for the cup that you initially chose, or if you want to switch to the other one that has not yet been lifted.

So, what is your choice, and why? Please, share your thoughts in the comments.

Also, please excuse my English. Being brought up by polar bears, English was not the first language I learned. I will not take offence at remarks on my language, rather the opposite. After all, life as a geek is all about learning.

The next entry on this topic: The Monty Hall Problem, Continued
__________
External links in this post:
http://www.marilynvossavant.com/
Official website of columnist and author Marilyn vos Savant.