Maths Formulae Statistics Poisson Distribution

Poisson Distribution – Counting Rare Events

Understand the Poisson distribution used for modeling count data and rare events in fixed intervals.
🔑

Definition of Poisson Distribution

The Poisson Distribution is a discrete probability distribution that models the probability of a given number of events occurring in a fixed interval of time or space. It is used when the events happen independently and at a constant average rate (λ). It's often called the distribution of "rare events" because it's ideal for modeling occurrences like arrivals, failures, or accidents.

SymbolDescription
\[ \lambda \]Rate Parameter - The average number of events per interval.
\[ k \]Event Count - The actual number of events observed for which probability is calculated.
\[ e \]Euler's Number - A mathematical constant approximately equal to 2.71828.
\[ k! \]k Factorial - The product of all positive integers up to k (e.g., 4! = 4 × 3 × 2 × 1 = 24).
\[ P(X = k) \]Probability Mass Function - The probability that the random variable X is exactly equal to k.
\[ E[X] \]Expected Value - The mean or average outcome of the distribution.
📊

Key Formulas

\[ P(X = k) = \frac{\lambda^k e^{-\lambda}}{k!} \]
Probability Mass Function (PMF)
\[ E[X] = \lambda \]
Mean (Expected Value)
\[ \text{Var}(X) = \lambda \]
Variance
\[ F(k) = P(X \leq k) = \sum_{i=0}^{k} \frac{\lambda^i e^{-\lambda}}{i!} \]
Cumulative Distribution Function (CDF)
\[ \lambda_t = \lambda \times t \]
Time and Space Scaling
📈

Distribution Shape

0 1 2 3 4 5 6 7 8 9 10 ↑ Peak at k=λ=4 P(k) = e^(−λ) λᵏ / k! (λ=4)
Poisson Distribution: probability of k events in a fixed interval — peak at k = λ (the mean count)

The Poisson distribution is represented by a bar chart, as it is a discrete distribution. The shape of the chart depends on the value of the rate parameter λ.

  • For small values of λ (e.g., λ < 5), the distribution is highly right-skewed, with the peak of the probability mass near zero.
  • As λ increases, the distribution becomes more symmetric and bell-shaped, resembling a normal distribution. The peak of the distribution is centered around the value of λ.
⚙️

Properties of the Poisson Distribution

PropertyDescription
Mean equals VarianceA unique and defining characteristic: \[ E[X] = \text{Var}(X) = \lambda \]
Additive PropertyThe sum of two independent Poisson random variables is also a Poisson random variable. If \[ X_1 \sim \text{Poisson}(\lambda_1) \] and \[ X_2 \sim \text{Poisson}(\lambda_2) \], then \[ X_1 + X_2 \sim \text{Poisson}(\lambda_1 + \lambda_2) \].
Discrete NatureThe random variable can only take non-negative integer values (0, 1, 2, ...).
SkewnessThe distribution is always right-skewed, but the skewness decreases as λ increases. Skewness is \[ 1/\sqrt{\lambda} \].

Poisson Process Conditions: For a process to be modeled by a Poisson distribution, four conditions must be met:

  1. Events occur independently of one another.
  2. The average rate of events (λ) is constant over the interval.
  3. Events cannot occur simultaneously.
  4. The probability of an event occurring is proportional to the length of the interval.
🔬

Derivation (Poisson as a Limit of Binomial)

The Poisson distribution can be derived as a limiting case of the Binomial distribution, where the number of trials n is very large and the probability of success p is very small.

1. Start with the Binomial PMF, where we define the average rate λ = np, which implies p = λ/n.

\[ P(X=k) = \binom{n}{k} p^k (1-p)^{n-k} = \frac{n!}{k!(n-k)!} (\frac{\lambda}{n})^k (1-\frac{\lambda}{n})^{n-k} \]

2. Now, we take the limit as n → ∞ and analyze each part of the formula.

\[ \lim_{n\to\infty} \frac{n!}{ (n-k)! n^k } = \lim_{n\to\infty} \frac{n(n-1)...(n-k+1)}{n^k} = 1 \]
\[ \lim_{n\to\infty} (1 - \frac{\lambda}{n})^n = e^{-\lambda} \]
\[ \lim_{n\to\infty} (1 - \frac{\lambda}{n})^{-k} = 1 \]

3. Combining these limits, the Binomial formula converges to the Poisson PMF.

\[ \lim_{n\to\infty} P(X=k) = \frac{\lambda^k}{k!} \times 1 \times e^{-\lambda} \times 1 = \frac{\lambda^k e^{-\lambda}}{k!} \]
🧮

Worked Example

Given a Poisson distribution with an average rate λ = 3, calculate the probability of observing exactly k = 4 events.
  1. State the Poisson probability mass function: P(X = k) = (λ^k * e^-λ) / k!
  2. Substitute the given values λ = 3 and k = 4 into the formula: P(X = 4) = (3^4 * e^-3) / 4!
  3. Calculate the components: 3^4 = 81, e^-3 ≈ 0.0498, and 4! = 4 × 3 × 2 × 1 = 24.
  4. Combine the components in the formula: P(X = 4) = (81 * 0.0498) / 24 ≈ 4.0338 / 24.
  5. Calculate the final probability: P(X = 4) ≈ 0.168
The probability of observing exactly 4 events is approximately 0.168 or 16.8%.
🧮

Try It

🚀

Applications

Healthcare & Emergency Services: Used to model patient arrivals at an emergency room, the number of calls for an ambulance, or the occurrence of a rare disease. This helps in resource planning and staffing.

Telecommunications & Network Traffic: Models the number of phone calls arriving at a call center per hour or the number of data packets arriving at a network router per second. Essential for designing systems that can handle expected loads.

Manufacturing & Quality Control: Used to count the number of defects or flaws in a product, such as bubbles in a sheet of glass or errors per page of a book. This helps in monitoring and improving production processes.

Finance and Insurance: Models the number of insurance claims (e.g., car accidents) filed per month or the number of stock market crashes in a decade, aiding in risk assessment and premium calculation.

🌍

Real-World Examples

A coffee shop observes an average of 10 customers arriving per hour. What is the probability that exactly 3 customers will arrive in a 15-minute interval?
  1. First, scale the rate parameter λ to the new interval. The hourly rate is 10 customers. The interval is 15 minutes, which is 15/60 = 0.25 hours.
  2. New rate λ = 10 customers/hour × 0.25 hours = 2.5 customers.
  3. Identify the number of events k = 3.
  4. Apply the Poisson formula: P(X = 3) = (2.5^3 * e^-2.5) / 3!
  5. Calculate the components: 2.5^3 = 15.625, e^-2.5 ≈ 0.0821, 3! = 6.
  6. Calculate the probability: P(X = 3) = (15.625 * 0.0821) / 6 ≈ 0.2138.
The probability of exactly 3 customers arriving in a 15-minute interval is approximately 0.2138 or 21.38%.
A city's fire department reports an average of 1.2 fires per day. What is the probability that there will be more than 2 fires in a single day?
  1. The rate is λ = 1.2 fires per day.
  2. We need to find P(X > 2). This is equal to 1 - P(X ≤ 2), which is 1 - [P(X=0) + P(X=1) + P(X=2)].
  3. Calculate P(X=0): (1.2^0 * e^-1.2) / 0! ≈ 0.3012
  4. Calculate P(X=1): (1.2^1 * e^-1.2) / 1! ≈ 0.3614
  5. Calculate P(X=2): (1.2^2 * e^-1.2) / 2! ≈ 0.2169
  6. Sum these probabilities: P(X ≤ 2) ≈ 0.3012 + 0.3614 + 0.2169 = 0.8795.
  7. Calculate P(X > 2) = 1 - 0.8795 = 0.1205.
The probability of more than 2 fires in a day is approximately 0.1205 or 12.05%.
🏞️

Real-World Scenarios

0 1 2 3 4 5 6 7 8 9 Calls per Minute (λ=4) P(k) = e^(−λ)λᵏ/k!
Call Centre Arrivals
If a helpdesk averages λ=4 calls/minute, the Poisson distribution gives the probability of exactly k calls arriving in any given minute.
! Road Accidents / Month λ=2.3 accidents/month P(0)=10%, P(≥3)=32%
Accident Modelling
Road safety analysts model monthly accident counts as Poisson(λ). If λ=2.3, they can compute P(no accidents) = e^(−2.3) ≈ 10% to budget emergency services.
1 2 3 4 5 6 7 8 9 10 Website Hits / Second λ=6 hits/s → server capacity
Web Server Capacity
Page requests arrive at a server according to a Poisson process. Knowing λ lets DevOps teams provision servers to handle P(k>capacity) below a threshold.

Wildlife Biology: Ecologists use the Poisson distribution to model the spatial distribution of plants or animal nests in a large area. By dividing the area into a grid, they can count the number of occurrences per quadrat to determine if the distribution is random, clustered, or uniform.

Astronomy: When observing deep space with a telescope, astronomers count the number of photons arriving from a distant star in a given time interval. This photon count often follows a Poisson distribution, which is crucial for analyzing the brightness and variability of celestial objects.

Web Server Management: System administrators model the number of requests hitting a web server per second. This helps them provision enough server capacity to handle peak traffic and ensure a low probability of the system being overloaded.

📂

Types and Classifications

The Poisson distribution is a fundamental member of the family of discrete probability distributions. It doesn't have "types" in the same way a geometric shape does, but it is a key component in more complex models.

The Poisson Process: The distribution arises from a stochastic process called the Poisson process. This process describes events occurring continuously and independently at a constant average rate. The number of events in any interval of time or space in a Poisson process follows a Poisson distribution.

Compound Poisson Distribution: This is a variation where the events themselves have random sizes. For example, if the number of insurance claims follows a Poisson distribution, and the size of each claim is also a random variable, the total claim amount follows a compound Poisson distribution.

⚠️

Common Mistakes

⚠️ Forgetting to scale the rate parameter (λ). If a problem gives a rate per hour but asks for a probability over 30 minutes, you must adjust λ accordingly (e.g., divide the hourly rate by 2). The interval for λ must match the interval of the question.
⚠️ Applying Poisson when events are not independent. For example, modeling the number of contagious disease cases is not a good fit, because one case makes another more likely. The Poisson model requires that the occurrence of one event does not influence the probability of another.
💡 Confusing P(X > k) with P(X ≥ k). For discrete distributions, these are different. P(X > k) = 1 - P(X ≤ k), whereas P(X ≥ k) = 1 - P(X ≤ k-1). Be careful with inequalities.
🚀

Study Strategy

1 🧠 Grasp the Core Concepts
  • Focus on the key condition: events occurring independently within a fixed interval of time or space at a constant average rate.
  • Clearly distinguish between λ (lambda), the mean number of events, and k, the specific number of events you are calculating the probability for.
  • Review the core assumptions: events are independent, the rate is constant, and two events cannot occur at the exact same instant.
  • Internalize how Poisson is a special case of the Binomial distribution for a large number of trials and a small probability of success.
2 📝 Commit Formulas to Memory
  • Write out the Probability Mass Function (PMF) P(X=k) = (λ^k * e^-λ) / k! ten times until it becomes second nature.
  • Memorize that for a Poisson distribution, both the mean (μ or E[X]) and the variance (σ² or Var(X)) are equal to λ.
  • Practice locating the e^x and factorial (!) functions on your calculator to ensure swift and accurate computations.
  • Drill the components: λ is the rate, k is the count, e is Euler's number (~2.718), and P(X=k) is the resulting probability.
3 ✍️ Solve Guided Problems
  • Follow the provided 'Worked Example' step-by-step, then attempt to solve it independently without looking at the solution.
  • Practice calculating probabilities for different conditions: exactly k events (P(X=k)), at most k events (P(X≤k)), and at least k events (P(X≥k)).
  • Actively check your work against the 'Common Mistakes' section, especially regarding time/space interval consistency for λ.
  • Change the λ or k values in the worked examples and recalculate to observe how the probability distribution shifts.
4 🌍 Tackle Real-World Scenarios
  • For each 'Real-World Example', practice identifying the 'interval' (e.g., per hour, per km) and extracting the correct λ and k values.
  • Create your own problem based on the 'Applications' list, such as modeling customer arrivals at a store or typing errors on a page.
  • Use the formula to answer predictive questions, like 'What is the probability of receiving more than 5 emails in the next hour?'.
  • Explain why the Poisson distribution is a better fit for a given scenario compared to the Binomial or Normal distributions.
Master the Poisson distribution by systematically building from core concepts to confident real-world application.

Frequently Asked Questions

×

×