
Once upon a time, everyone in my English class were talking about preparing for exam, and they came up with this topic: when you got stuck on a series of multiple choice problems, and you decide to pick answers at random, is there any strategy that increase your chance of getting each problem right? The class discussed eagerly. Some says they simply choose an answer randomly for each problem, and some argues they have a higher chance compared to formal getting each problem correct by consistently selecting the same answer choice. As a person who interest in math, and since probability has something to do with math, I wondered if one of the two is an advantageous strategy. It turns out that the two ways gives an examinee same chance of getting a problem right, assuming each answer choice is equally likely being picked by the examinee, and is equally likely being the only correct answer.
For simplicity, we will assume an examinee only picks one answer choice, and only one of the answer choices is the correct answer. Let integer n be the number of answers in one multiple choice problem, where n ≥ 2. We consider n ≥ 2 because a multiple choice problem with less than 2 answer choices is illogical. Let a1, a2, …, an represent each answer choice, respectively. Let s represent the answer chosen by an examinee, and c represent the correct answer.
Notice that all possible outcome of s and c are equally likely to appear, because each answer choice is equally likely being picked by the examinee, and each answers choice is equally likely being the correct answer. Let A be the event that ai is picked by the examinee, where i is an arbitrary integer in range [1, n]. Since each answer choice is equally likely being picked by the examinee, the probability that A happen, represented by P(A), is 1 / n. Similarly, let B be the event that aj is the correct answer, where j is an arbitrary integer in range [1, n]. Since each answer choice is equally likely being the correct answer, the probability that B happen, represented by P(B), is also 1 / n. The probability that both A and B, that the examinee chose ai and the correct answer is aj, happen is P(A) × P(B) = 1 / n × 1 / n = 1 / n2. This is important, because it means all possible outcomes of the answer being chosen and the correct answer appear with the same probability, and that the probability that the examinee getting a problem right can be calculated by dividing the number of possible outcome of s and c that makes the examinee get the problem right by the number of possible outcome of s and c.
Since the examinee can pick any one of the n answer choices, and the correct answer is one of the n answer choices, the number of possible outcomes of s and c is n × n = n2. Moreover, the examinee gets the problem correct when s = c, which means the answer picked by the examinee is the same as the correct answer. Since there are n possible answers, there are n outcomes of s and c that satisfies s = c. Therefore the probability that the examinee get a problem right with this method is n / n2 = 1 / n.
When using the second method, which choice should be consistently pick is arbitrary: an examinee can pick any of the answer choice they wish. Suppose the examinee picked ai, which means s = ai. The examinee gets the problem right when s = ai = c, which means the answer chosen by the examinee is the same as the correct answer. Since any one of the n answer choices is equally likely be the correct answer, the probability that c = ai is 1 / n. Therefore the probability that the examinee get a problem right with this method is also 1 / n. Comparing the result of the two methods, we know that none of the two methods is advantageous compared to the other: they give an examinee the same probability of getting the problem right.
One may suspect that the probability of getting a problem right using the first method is actually 1 / n2. Consider answer choice ai. Since each of the n answer choice is equally likely being picked by the examinee, the probability that the examinee picked ai is 1 / n. Since each of the n answer choice is equally likely being the correct answer, the probability that ai is the correct answer is 1 / n. The probability that ai is being picked by the examinee and is the correct answer at the same time is 1 / n × 1 / n = 1 / n2. Therefore the examinee get the problem right at a probability of 1 / n2.
This idea is flawed because it considers only one outcome of s and c that satisfy s = c, while there is a total of n outcomes satisfying s = c. It is true that the probability of one specific answer choice being chosen by the examinee and being the correct answer is 1 / n2, but remember that examinee gets the problem right as long as s = c. Since there are n answer choices that s can take value of and c can take value of, there are n outcomes of s and c that satisfy s = c, where each outcome associate with different answer choice being s, and each outcome occur with probability of 1 / n2. The probability that one of these n outcomes occur is n × 1 / n2 = n / n2 = 1 / n. Which means the examinee can get the problem right with a probability of 1 / n.
As a result, both methods gives an examinee probability of 1 / n of getting a multiple choice problem of n ≥ 2 answer choices right, with the assumption that every answer choice is equally like be picked by the examinee, and equally likely be the correct answer, and that examinee picks only one answer, and only one answer choice is the correct answer. There is no advantage of using one over another. Nevertheless, as the probability of hitting correctness with a randomly picked answer is not that high that one may rely on to pass a class. Therefore it is adviced to study hard and eliminate answer choice that sound wrong or find the correct on based on an educated guess.