The Poisson Distribution is a discrete probability distribution that models the probability of a given number of events occurring in a fixed interval of time or space. It is used when the events happen independently and at a constant average rate (λ). It's often called the distribution of "rare events" because it's ideal for modeling occurrences like arrivals, failures, or accidents.
| Symbol | Description |
|---|---|
| \[ \lambda \] | Rate Parameter - The average number of events per interval. |
| \[ k \] | Event Count - The actual number of events observed for which probability is calculated. |
| \[ e \] | Euler's Number - A mathematical constant approximately equal to 2.71828. |
| \[ k! \] | k Factorial - The product of all positive integers up to k (e.g., 4! = 4 × 3 × 2 × 1 = 24). |
| \[ P(X = k) \] | Probability Mass Function - The probability that the random variable X is exactly equal to k. |
| \[ E[X] \] | Expected Value - The mean or average outcome of the distribution. |
The Poisson distribution is represented by a bar chart, as it is a discrete distribution. The shape of the chart depends on the value of the rate parameter λ.
| Property | Description |
|---|---|
| Mean equals Variance | A unique and defining characteristic: \[ E[X] = \text{Var}(X) = \lambda \] |
| Additive Property | The sum of two independent Poisson random variables is also a Poisson random variable. If \[ X_1 \sim \text{Poisson}(\lambda_1) \] and \[ X_2 \sim \text{Poisson}(\lambda_2) \], then \[ X_1 + X_2 \sim \text{Poisson}(\lambda_1 + \lambda_2) \]. |
| Discrete Nature | The random variable can only take non-negative integer values (0, 1, 2, ...). |
| Skewness | The distribution is always right-skewed, but the skewness decreases as λ increases. Skewness is \[ 1/\sqrt{\lambda} \]. |
Poisson Process Conditions: For a process to be modeled by a Poisson distribution, four conditions must be met:
The Poisson distribution can be derived as a limiting case of the Binomial distribution, where the number of trials n is very large and the probability of success p is very small.
1. Start with the Binomial PMF, where we define the average rate λ = np, which implies p = λ/n.
2. Now, we take the limit as n → ∞ and analyze each part of the formula.
3. Combining these limits, the Binomial formula converges to the Poisson PMF.
Healthcare & Emergency Services: Used to model patient arrivals at an emergency room, the number of calls for an ambulance, or the occurrence of a rare disease. This helps in resource planning and staffing.
Telecommunications & Network Traffic: Models the number of phone calls arriving at a call center per hour or the number of data packets arriving at a network router per second. Essential for designing systems that can handle expected loads.
Manufacturing & Quality Control: Used to count the number of defects or flaws in a product, such as bubbles in a sheet of glass or errors per page of a book. This helps in monitoring and improving production processes.
Finance and Insurance: Models the number of insurance claims (e.g., car accidents) filed per month or the number of stock market crashes in a decade, aiding in risk assessment and premium calculation.
Wildlife Biology: Ecologists use the Poisson distribution to model the spatial distribution of plants or animal nests in a large area. By dividing the area into a grid, they can count the number of occurrences per quadrat to determine if the distribution is random, clustered, or uniform.
Astronomy: When observing deep space with a telescope, astronomers count the number of photons arriving from a distant star in a given time interval. This photon count often follows a Poisson distribution, which is crucial for analyzing the brightness and variability of celestial objects.
Web Server Management: System administrators model the number of requests hitting a web server per second. This helps them provision enough server capacity to handle peak traffic and ensure a low probability of the system being overloaded.
The Poisson distribution is a fundamental member of the family of discrete probability distributions. It doesn't have "types" in the same way a geometric shape does, but it is a key component in more complex models.
The Poisson Process: The distribution arises from a stochastic process called the Poisson process. This process describes events occurring continuously and independently at a constant average rate. The number of events in any interval of time or space in a Poisson process follows a Poisson distribution.
Compound Poisson Distribution: This is a variation where the events themselves have random sizes. For example, if the number of insurance claims follows a Poisson distribution, and the size of each claim is also a random variable, the total claim amount follows a compound Poisson distribution.
Forgetting to scale the rate parameter (λ). If a problem gives a rate per hour but asks for a probability over 30 minutes, you must adjust λ accordingly (e.g., divide the hourly rate by 2). The interval for λ must match the interval of the question.
Applying Poisson when events are not independent. For example, modeling the number of contagious disease cases is not a good fit, because one case makes another more likely. The Poisson model requires that the occurrence of one event does not influence the probability of another.
Confusing P(X > k) with P(X ≥ k). For discrete distributions, these are different. P(X > k) = 1 - P(X ≤ k), whereas P(X ≥ k) = 1 - P(X ≤ k-1). Be careful with inequalities.