Maths Formulae Statistics Probability

Probability – Basic Concepts and Rules

Explore basic probability formulas, definitions, and examples. Learn how to calculate outcomes and events.
🔑

Definition of Probability

Probability measures the likelihood of an event occurring and is a fundamental concept in statistics and real-world decision-making. It is represented as a number between 0 and 1, where 0 means an impossible event and 1 means a certain event.

Probability is the mathematical framework for quantifying uncertainty and measuring the likelihood of events occurring. It provides the foundation for statistical inference, decision making under uncertainty, and modeling random phenomena in science, engineering, and everyday life.

SymbolDescription
P(A)Probability of Event A - Likelihood between 0 and 1
SSample Space - Set of all possible outcomes
AᶜComplement of A - Event A does not occur
A ∩ BIntersection - Both events A and B occur
A ∪ BUnion - Either event A or B (or both) occurs
P(A|B)Conditional Probability - Probability of A given B
XRandom Variable - Function from outcomes to numbers
E[X]Expected Value - Average value of random variable
Var(X)Variance - Measure of spread around the mean
fₓ(x)Probability Density Function - Continuous probability model
Fₓ(x)Cumulative Distribution Function - P(X ≤ x)
Empty Set - Impossible event with probability 0
🔢

Key Probability Formulas

\[ P(A) = \frac{\text{Number of favorable outcomes}}{\text{Total number of possible outcomes}} = \frac{|A|}{|S|} \]
Basic Probability
\[ P(A^c) = 1 - P(A) \]
Complement Rule
\[ P(A \cup B) = P(A) + P(B) - P(A \cap B) \]
Addition Rule
\[ P(A|B) = \frac{P(A \cap B)}{P(B)} \quad \text{where } P(B) > 0 \]
Conditional Probability
\[ P(A \cap B) = P(A) \cdot P(B) \]
Multiplication Rule (Independent Events)
\[ P(A \cap B) = P(A|B) \cdot P(B) \]
Multiplication Rule (Dependent Events)
\[ P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)} \]
Bayes' Theorem
\[ P(B) = \sum_{i} P(B|A_i) \cdot P(A_i) \]
Law of Total Probability
🎨

Visualizing Probability

S (sample space) A P(A) = n(A) / n(S) = 5 / 10 = 0.5 0 0.5 1
Probability P(A) = n(A)/n(S): ratio of favourable outcomes in event A to total outcomes in sample space S

Probability concepts are often visualized using a Venn diagram. A large rectangle represents the entire sample space (S), containing all possible outcomes. Circles inside the rectangle represent events. For two events, A and B, the area where the circles overlap represents their intersection (A ∩ B), where both events occur. The total area covered by both circles represents their union (A ∪ B), where at least one of the events occurs.

⚙️

Key Properties of Probability

Axiomatic Foundation

Probability is built upon three simple but powerful axioms proposed by Kolmogorov: non-negativity (probability is never negative), normalization (the probability of the entire sample space is 1), and countable additivity (for mutually exclusive events, the probability of their union is the sum of their individual probabilities).

Conditional Structure

Probability theory provides a formal way to update our beliefs in light of new evidence through the concept of conditional probability. This is the foundation of Bayesian reasoning, learning from data, and many machine learning algorithms.

Distribution Framework

Random variables and their associated probability distributions (like the Normal, Binomial, or Poisson distributions) allow us to model and analyze random phenomena in a structured way. They describe the probabilities of all possible outcomes of a random process.

Universal Applicability

The principles of probability are not confined to a single field. They are universally applicable to model uncertainty in any domain, from quantum physics and genetics to finance, weather forecasting, and artificial intelligence.

🔬

Proof of the Addition Rule

We want to prove the addition rule for two events A and B: P(A ∪ B) = P(A) + P(B) - P(A ∩ B).

1. The union A ∪ B can be expressed as the union of two disjoint (mutually exclusive) sets: the event A and the event 'B but not A', which is written as B ∩ Aᶜ.

\[ A \cup B = A \cup (B \cap A^c) \]

2. Since A and (B ∩ Aᶜ) are disjoint, by the third axiom of probability (additivity), the probability of their union is the sum of their probabilities.

\[ P(A \cup B) = P(A) + P(B \cap A^c) \]

3. We can also express event B as the union of two disjoint sets: 'B and A' (A ∩ B) and 'B but not A' (B ∩ Aᶜ).

\[ B = (A \cap B) \cup (B \cap A^c) \]

4. Applying the additivity axiom again, we get:

\[ P(B) = P(A \cap B) + P(B \cap A^c) \]

5. Rearranging this gives an expression for P(B ∩ Aᶜ):

\[ P(B \cap A^c) = P(B) - P(A \cap B) \]

6. Finally, substitute this expression back into the equation from step 2 to get the addition rule.

\[ P(A \cup B) = P(A) + P(B) - P(A \cap B) \]
🧮

Worked Example

From a standard 52-card deck, what is the probability of drawing a single card that is either a King or a Heart?
  1. Define the events. Let A be the event of drawing a King, and B be the event of drawing a Heart.
  2. Calculate the probability of event A. There are 4 Kings in a 52-card deck. So, P(A) = 4/52.
  3. Calculate the probability of event B. There are 13 Hearts in a 52-card deck. So, P(B) = 13/52.
  4. Identify the intersection of A and B. The event (A ∩ B) is drawing a card that is both a King and a Heart, which is the King of Hearts. There is only one such card. So, P(A ∩ B) = 1/52.
  5. Apply the Addition Rule for non-mutually exclusive events: P(A ∪ B) = P(A) + P(B) - P(A ∩ B).
  6. Substitute the probabilities: P(A ∪ B) = 4/52 + 13/52 - 1/52.
  7. Calculate the final probability: P(A ∪ B) = 16/52, which simplifies to 4/13.
The probability of drawing a King or a Heart is 4/13.
🧮

Try It

🏭

Applications of Probability

Medical Diagnosis & Treatment

Probability is used to assess the effectiveness of treatments, the accuracy of diagnostic tests (e.g., using Bayes' theorem to determine the probability of a disease given a positive test), and to stratify patient risk for various conditions.

Finance & Investment

Financial models use probability to quantify market risk, price options and derivatives (e.g., the Black-Scholes model), optimize investment portfolios, and calculate insurance premiums based on the likelihood of events like accidents or natural disasters.

Machine Learning & AI

Probability is the backbone of many AI systems. Probabilistic models like Bayesian networks are used for prediction and classification, and probability theory provides a framework for quantifying the uncertainty in a model's predictions.

Weather & Climate Science

Weather forecasts are inherently probabilistic, giving the 'chance of rain' rather than a deterministic prediction. Climate models use probability to project future environmental scenarios and assess the likelihood of extreme weather events.

🌍

Real-World Examples

A factory produces widgets with a 5% defect rate. If a quality control inspector randomly selects 2 widgets, what is the probability that both are not defective? Assume the selections are independent.
  1. Determine the probability of a single widget being not defective. This is the complement of being defective: P(Not Defective) = 1 - 0.05 = 0.95.
  2. Let A be the event that the first widget is not defective, and B be the event that the second widget is not defective. P(A) = 0.95 and P(B) = 0.95.
  3. Since the events are independent, the probability of both occurring is the product of their individual probabilities: P(A ∩ B) = P(A) × P(B).
  4. Calculate the result: 0.95 × 0.95 = 0.9025.
The probability that both widgets are not defective is 0.9025, or 90.25%.
In a city, 60% of people use public transit (T) and 50% own a bicycle (B). 30% of people use public transit and own a bicycle. What is the probability that a randomly selected person uses public transit or owns a bicycle?
  1. Identify the given probabilities: P(T) = 0.60, P(B) = 0.50, and the intersection P(T ∩ B) = 0.30.
  2. Use the general addition rule to find the probability of the union of the two events: P(T ∪ B) = P(T) + P(B) - P(T ∩ B).
  3. Substitute the values into the formula: P(T ∪ B) = 0.60 + 0.50 - 0.30.
  4. Calculate the result: 1.10 - 0.30 = 0.80.
The probability that a randomly selected person uses public transit or owns a bicycle is 0.80, or 80%.
🏙️

Real-World Scenarios

5% 15% 80% Claim Probability P(claim)=0.05 → set premium
Insurance Pricing
Actuaries use claim probability to set premiums — if P(claim) = 0.05 and average claim = $10,000, the expected loss per policy is $500, the minimum premium.
☁️ Rain Forecast P(rain) = 70%
Weather Forecasting
A "70% chance of rain" is an empirical probability — based on historical data showing that in similar atmospheric conditions, it rained on 7 out of 10 occasions.
A Card Probability P(ace) = 4/52 = 1/13 ≈ 7.7%
Card Games
Probability theory originated in gambling. P(drawing an ace) = 4/52. Blackjack card counters use running probability estimates to gain a statistical edge.

Insurance Risk Assessment

Actuaries use historical data and probability models to calculate the likelihood of events such as car accidents, house fires, or health issues. These probabilities are fundamental to setting insurance premiums that are fair to both the customer and the company, ensuring that the pool of premiums can cover the expected claims.

Spam Email Filtering

Your email service uses a form of Bayesian probability to decide if an incoming message is spam. The filter analyzes the words and features of the email and calculates the probability that it is spam given those features, comparing it to the known probabilities of words appearing in spam and non-spam emails.

Sports Analytics

Analysts use probability to model sports outcomes. For example, they might calculate a basketball player's probability of making a free throw based on past performance, or a team's probability of winning a game based on player statistics, location, and other factors. This helps teams make strategic decisions and provides betting odds.

📚

Types & Classifications

Probability can be interpreted in several ways, each suited to different scenarios:

  • Classical Probability: Assumes all outcomes in a sample space are equally likely. It is calculated a priori (before any experiment) as the ratio of favorable outcomes to total outcomes. Example: The probability of rolling a 4 on a fair die is 1/6.
  • Empirical (or Frequentist) Probability: Defines probability as the long-run relative frequency of an event over many repeated trials. It is determined a posteriori (after the experiment). Example: If a coin is flipped 1000 times and lands on heads 505 times, the empirical probability of heads is 505/1000.
  • Subjective Probability: Represents a degree of belief or confidence in a proposition, based on personal judgment, experience, and available evidence. It is often used for unique events where repeated trials are not possible. Example: An expert's assessment of the probability that a new startup will succeed.
Distribution TypeDescriptionExample
BernoulliA single trial with two possible outcomes (success/failure).Flipping a single coin.
BinomialThe number of successes in a fixed number of independent Bernoulli trials.Number of heads in 10 coin flips.
PoissonThe number of events occurring in a fixed interval of time or space.Number of emails received in an hour.
NormalA continuous distribution (bell curve) that models many natural phenomena.Heights or blood pressure of a population.
⚠️

Common Mistakes

⚠️ The Gambler's Fallacy: A common mistake is believing that past independent events can influence future outcomes. For example, after a coin lands on tails five times in a row, the probability of it landing on heads on the next flip is still 0.5. The coin has no memory.
⚠️ Confusing Independence with Mutual Exclusivity: Independent events can occur together (P(A ∩ B) = P(A)P(B)), while mutually exclusive events cannot (P(A ∩ B) = 0). For example, drawing a King and drawing a Heart from a deck are independent but not mutually exclusive (King of Hearts). Drawing a King and a Queen are mutually exclusive.
💡 Ignoring the Base Rate: When evaluating conditional probability, such as medical test results, people often focus on the test's accuracy while ignoring the initial (base rate) probability of the condition. A positive result from a highly accurate test for a very rare disease may still mean it's more likely you don't have the disease. This is where Bayes' Theorem is crucial.
🚀

Study Strategy

1 📚 Build a Strong Foundation
  • Review the formal definition of probability as the ratio of favorable outcomes to the total number of outcomes.
  • Distinguish between sample space, events (simple and compound), and outcomes using the provided examples.
  • Master the key properties, especially that probability values must be between 0 and 1 inclusive.
  • Clearly define the difference between mutually exclusive events and independent events before moving to formulas.
2 🧠 Internalize the Core Formulas
  • Write out the Addition Rule, P(A or B) = P(A) + P(B) - P(A and B), and practice reciting it.
  • Memorize the special case for mutually exclusive events where P(A and B) = 0.
  • Learn the Multiplication Rule for independent events, P(A and B) = P(A) * P(B), and its conditions.
  • Drill the Complement Rule, P(A') = 1 - P(A), as a shortcut for finding the probability of an event not occurring.
3 ✍️ Apply with Guided Practice
  • Follow the 'Worked Example' step-by-step, explaining the logic for each calculation aloud.
  • Use the 'Visualizing Probability' section to draw a Venn diagram or tree diagram for each practice problem.
  • Solve the example problems again without looking at the solutions, then compare your work to identify gaps.
  • Analyze the 'Proof of the Addition Rule' to understand why subtracting the intersection is necessary for non-mutually exclusive events.
4 🌍 Connect to Real-World Scenarios
  • Translate the 'Real-World Scenarios' from words into probability notation before attempting to solve them.
  • For each 'Application of Probability' (e.g., finance, medicine), create one new, simple problem.
  • Review the 'Common Mistakes' section after solving a set of problems to check for conceptual errors.
  • Explain how you would calculate the probability in one of the 'Real-World Examples' to a friend or family member.
Master probability by building from core concepts to real-world application, turning uncertainty into predictable insight.

Frequently Asked Questions

×

×