Chebyshev's Theorem Calculator

Q: How to use Chebyshev's theorem?

Chebyshev's theorem is used when you need to estimate data spread without knowing the distribution shape. It provides a minimum proportion of data within a specified number of standard deviations from the mean.

Q: Does Chebyshev’s theorem apply to all samples?

Yes , the Chebyshev’s theorem can be applied to all possible datasets . It explains the minimum proportion of the measurements within 1, 2, or more standard deviations of the mean.

Q: What is the Chebyshev’s theorem valid for?

Chebyshev’s theorem is valid for any probability distribution , regardless of its shape. It provides a general bound on the proportion of data within a specified number of standard deviations from the mean. This theorem is useful for understanding data spread when the exact distribution is unknown or when the data does not follow a specific distribution pattern .

Creators

Maciej Kowalski, PhD candidate

Maciej KowalskiPhD candidate

Research Gate

Maciej holds a PhD in Theoretical Mathematics from the Polish Academy of Sciences. He loves patterns and structure, be it in datasets, equations, or everyday life. He is curious and meticulous, and balances his analytical side with a passion for fantasy, CrossFit, good cinema, and a diverse array of games. See full profile

Check our editorial policy

Reviewers

Bogna Szyk

Bogna is the chief operating officer at Omni Calculator, where she helps keep things running smoothly and ideas moving forward. With a background in civil engineering and a knack for organizing chaos, she brings structure and strategy to everything she does. After hours, you’ll likely find her dancing zouk or crafting the next twist in a D&D campaign. See full profile

Check our editorial policy

and Jack Bowater

Welcome, traveler, to the Chebyshev's theorem calculator, which makes use of the so-called Chebyshev's inequality to compute the probability that we will diverge from what we expect to obtain. Do not be discouraged, young padawan, by this mathematical formalism combined with a vague description of our problem. We promise you that by the end of your visit to our calculator, you will know Chebyshev's rule inside out!

🙋 If you want to learn more about the mathematical concept of probability, make sue to take a look at Omni's probability calculator.

Chebyshev's theorem formula

A quick glance at our Chebyshev's theorem calculator, and you will see two formulas similar to each other. That is because we can find both of them in the literature (or whatever it is that brought you into our loving arms today), depending on which suits us best. Let's start by breaking down the notation used therein:

P denotes the probability of an event (the event specified inside the brackets after the symbol).
X is a random variable that describes our random experiment. E.g., it can denote the number of sixes we get in ten dice rolls and the probability associated with each possible result.
E(X) is the expected value of our event (as described informally in the section below). If you haven't yet encountered this notion, visit our expected value calculator
σ² (read: sigma squared) denotes the variance of our event. This parameter describes how much the result of our event can differ if we repeat the experiment several times and how likely it is to happen. Note that it is traditionally given as a square of some value, σ, which can (and does) sometimes appear in calculations without the square.
k is the boundary of the result, i.e., how far we want our result to diverge from the expected.

In general, the first (respectively: the second) formula should be read as follows: the probability of the event X diverging from the expected value E(X) by at least k (respectively: by at least k × σ) is at most σ² / k² (respectively: 1 / k²).

Chebyshev's rule

Pafnuty Chebyshev was a Russian mathematician whose work can tell us a lot about the universe. "The universe? But I've come here to count some formulas!" Well, admittedly, it's not the time and place to think about how fast the universe expands, but maybe it is the perfect opportunity to see that random events are not that random after all? In fact, many random processes in nature can be approximated by the so-called normal probability distribution, which has an elegant structure and useful properties. If you want to, you may later check the normal distribution calculator to appreciate its beauty, but as for now, let's not dive into the mathematical mumbo-jumbo but rather try to understand the idea of our fellow mathematician.

Consider the all-time favorite probability problem – tossing a coin. If we flip a coin a hundred times, how many heads will we get? If we do it once, the probability of getting heads is 0.5 because both results have the same likelihood of happening. So if we toss a coin twice, it makes sense to expect one heads, right? Of course, you can never be sure to obtain just such a result, but in general, if half of the time we should get heads, then, in the end, we should get half as many heads as there were tosses.

🔎 If you're interested in probability problems connected to coins, you should definitely check our coin toss probability calculator and a bit more advanced coin toss streak calculator.

You might not realize it yet, but we've just informally defined the so-called expected value of an event. It is usually denoted by E(X), where X is the random variable associated with that event. What Chebyshev's inequality states is that obtaining a result that is far from that expected value is not likely. Indeed, if we think about getting, say, at most twenty-five heads in the hundred tosses, this doesn't seem too likely. And if we get lower to twenty, fifteen, or ten, then the probability should fall along with the number. In other words, we would have to be really unlucky for that to happen. To be specific, we can use Chebyshev's theorem formula to see that the "really unlucky" translates to at most a 4% probability for twenty-five heads and as low as 1.56% for ten.

How to use Chebyshev's inequality?

Let's see our Chebyshev's theorem calculator in action and try to follow its line of thinking. Say you wanted to impress a pretty friend at a party and told her that you're a famous magician, Chebyshev the Magnificent. Unfortunately, they seem skeptical and ask for a magic trick. Try to play it cool and make the mathematics work in your favor.

Take a deck of cards and say that you'll be pulling out one card, checking its color, and putting it back into the deck. You will repeat this 20 times and state that there will be at least ten clubs among the twenty cards you pull. Quite an impressive magic trick, wouldn't you say? Okay, let's see the calculations behind this.

Our random variable X here denotes the number of cards of clubs that we pull out. Each time we take a card, there's a 1 / 4 probability of it being clubs (since there are four suits and all have the same number of cards).
Since we repeat the experiment twenty times and each time we have a probability of success of one in four, the expected value (i.e., the expected number of clubs) is 20 × 1/4 = 5.
We want to pull at least ten clubs, so we want to diverge from the expected value by 10 - 5 = 5. This is our bound k as denoted in Chebyshev's theorem formula. (Note that our calculations will take into account all possibilities diverging by at least five from the expected value. It will, therefore, also include obtaining zero clubs.)
In our experiment, on each try, there are only two possibilities: a clubs card or a not-clubs card. The big-headed mathematicians call such events binomial probability events What this means to us is that the variance, σ², is then equal to

σ² = No of tries × Prob of success × Prob of failure = 20 × 1/4 × 3/4 = 3.75.
According to the first formula in the Chebyshev's theorem calculator, the probability of our event is, therefore, at most:

probability ≤ σ² / k² = 3.75 / 5² = 0.15.

This means that we have a probability of at most 15% for our magic trick to work... Hmm, maybe let's reconsider this. Maybe let's say that it was a mis-slip, and you meant a great mathematician, not a magician. That's just as attractive, right?

Let us see another example, by evaluating the minimum proportions of the mean to the standard deviations using Chebyshev’s rule, if k=8.

In this case, we need to adjust our probability to:

probability ≤ 1 - 1/k²

once we are talking about the opposite event mapped by our initial formula. Therefore, by including our data we have:

probability ≤ 1 - 1 / 8² = 0.98437

Thus, we can conclude that a minimum of 98.437% of the data points are within 8 standard deviations from the mean.

Finding the standard deviations from the mean with the Chebyshev's inequality

Suppose that a company wants to understand the minimum proportion of employees' ages within $2.5$ standard deviations from the mean. The mean age of the employees is $35\,\, \mathrm{years}$ , and the standard deviation is $5\,\, \mathrm{years}$ . We can determine it by using Chebyshev's theorem.

First, let us identify the values: $\mathbb{E}(X) = 35$ as the mean value, $\sigma = 5$ as the standard deviation, and $k = 2.5$ as the number of standard deviations.

To calculate the minimum proportion, we use the formula:

\footnotesize \mathbb{P}(|X - \mathbb{E}(X)| < k\sigma) \geq 1 - \frac{1}{k^2}

Then, by substituting our data we have:

\footnotesize \begin{split} \mathbb{P}(|X - 35| < 12.5) & \geq 1 - \frac{1}{(2.5)^2} \\ & \geq 1 - 0.16\\ & \geq 0.84 \end{split}

The final result means that at least $84\%$ of the data (employees' ages in this case) lies within $2.5$ standard deviations from the mean.

Other uses and disadvantages

Let's say that you've made a bet that your favorite basketball team will score between 60 and 80 points in the upcoming match. Everyone expects them to score around 70, so all you want is that they don't score 10 points more or less than that expected value. If we want to calculate the risk of you losing your bet, then this sounds suspiciously similar to what we've done above, but this time we don't want to go too far from some value. Fortunately, this is the exact opposite event, and so our formula will still work after a few adjustments:

P(|X - E(X)| < k) = 1 - P(|X-E(X)| ≥ k) ≥ 1 - (σ² / k²).

Since we don't want to get away from the expected value by more than 10, we want our k to be equal to 11 (since 10 is still okay for us). Now, σ² is, in general, a very tricky parameter to count. For the sake of our example, let's say that it is equal to 20. This means that your team has had its ups and downs in the last few matches, and getting 50 points or 90 points is not out of the question. This allows us to count our probability from the above formula:

P(|X - E(X)| < 11) ≥ 1 - (σ² / k²) = 1- (20 / 11²) ≈ 0.83,

which translates to a probability of at least 83%. It seems that your bet was a good one!

Note, however, that both the above and our original formula have a significant disadvantage. Namely, if in the original one, the fraction on the right-hand side is greater than one, then all Chebyshev's rule gives us is that our probability is at most 1, or, if we translate this to percentages, it is at most 100%. It basically means that our event will happen or will not. Well, that certainly helps...

FAQs

How to use Chebyshev's theorem?

Chebyshev's theorem is used when you need to estimate data spread without knowing the distribution shape. It provides a minimum proportion of data within a specified number of standard deviations from the mean.

Does Chebyshev’s theorem apply to all samples?

Yes, the Chebyshev’s theorem can be applied to all possible datasets. It explains the minimum proportion of the measurements within 1, 2, or more standard deviations of the mean.

What is the Chebyshev’s theorem valid for?

Chebyshev’s theorem is valid for any probability distribution, regardless of its shape. It provides a general bound on the proportion of data within a specified number of standard deviations from the mean. This theorem is useful for understanding data spread when the exact distribution is unknown or when the data does not follow a specific distribution pattern.