(Series Part 1) Gamma Distribution Deep Dive: The Foundation of Related Distributions
Table of Contents
- Introduction
- Related Distributions
- Examples
- Chi-Squared Distribution
- Weibull Distribution
- Conclusion
Introduction
The Gamma distribution is a continuous probability distribution characterized by two parameters: the shape parameter () and the rate parameter (). It serves as a foundational distribution from which several other important distributions can be derived through specific parameter substitutions.
Probability Density Function (PDF) of Gamma distribution
The probability density function (PDF) of the Gamma distribution is given by:
where is:
has the following properties that will be used to derive the related distributions:
-
Recursive Relation:
-
Special Case for Positive Integers (Used in Erlang Distribution):
-
Value at (Used in Chi-Squared Distribution):
Related Distributions
Erlang Distribution
The Erlang distribution is a special case of the Gamma distribution where the shape parameter () is a positive integer.
-
PDF:
-
Parameters:
- Shape parameter
- Rate parameter
-
Substitution:
- Set
Chi-Squared Distribution
The Chi-squared distribution is another special case of the Gamma distribution, widely used in hypothesis testing and confidence interval estimation.
-
PDF:
-
Parameters:
- Degrees of freedom
-
Substitution:
- Set
- Set
Exponential Distribution
The Exponential distribution is a special case of the Gamma distribution with the shape parameter set to 1.
-
PDF:
-
Parameters:
- Rate parameter
-
Substitution:
- Set
Weibull Distribution
While not a direct parameter substitution from the Gamma distribution, the Weibull distribution is a separate continuous probability distribution used primarily in reliability engineering and failure analysis.
-
PDF:
-
Parameters:
- Scale parameter
- Shape parameter
Poisson Distribution
The Poisson distribution is a discrete probability distribution expressing the probability of a given number of events occurring in a fixed interval of time or space.
- Probability Mass Function (PMF):
Relationship with Gamma Distribution
In Bayesian statistics, the Poisson and Gamma distributions are connected through the concept of conjugate priors. If the number of events follows a Poisson distribution with an unknown rate parameter , and is assumed to follow a Gamma distribution as a prior, then the posterior distribution of given the observed data remains a Gamma distribution. This conjugate prior relationship simplifies the updating of beliefs about after observing data.
Examples
Gamma Distribution
Waiting Time for Multiple Events
Suppose the time between events follows an Exponential distribution with rate . The time until the -th event occurs follows a Gamma distribution with parameters (shape) and (rate).
Erlang Distribution
Telephone Call Center
The time until the -th call arrives at a call center can be modeled using the Erlang distribution with shape parameter and rate parameter .
Poisson Distribution
Number of Emails Received
The number of emails a person receives in an hour can be modeled by a Poisson distribution with parameter representing the average number of emails per hour.
Derivation
The Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time or space. It can be derived as a limit of the Binomial distribution under specific conditions.
The Binomial distribution models the number of successes in independent trials, each with probability of success:
Limit Conditions:
- Let
- Let
- Such that remains constant
Using the approximations for large and small :
and
Substituting these into the Binomial PMF:
This is the Poisson PMF, demonstrating that under the conditions of a large number of trials and a small probability of success per trial, the Binomial distribution converges to the Poisson distribution with parameter .
Relationship Between Poisson and Gamma Distributions
As mentioned earlier, in Bayesian statistics, the Poisson and Gamma distributions form a conjugate pair. This means that when the rate parameter of a Poisson distribution is assigned a Gamma prior, the posterior distribution of after observing data remains a Gamma distribution. This property facilitates the updating of beliefs about in a computationally efficient manner.
Derivation: Gamma as a Conjugate Prior for Poisson
Assume the number of events follows a Poisson distribution with rate parameter :
Suppose the prior distribution of is a Gamma distribution with parameters (shape) and (rate):
Posterior Distribution:
Using Bayes’ theorem, the posterior distribution of given is:
Substituting the expressions:
Simplifying (constants can be absorbed into the proportionality):
This is the kernel of a Gamma distribution with updated parameters:
- Shape parameter:
- Rate parameter:
Therefore, the posterior distribution is:
Interpretation:
After observing events, the updated belief about the rate parameter is captured by a Gamma distribution with increased shape and rate parameters. This conjugate relationship simplifies Bayesian updating, as the posterior remains in the same family as the prior.
Chi-Squared Distribution
The Chi-squared distribution is a special case of the Gamma distribution and is widely used in statistical hypothesis testing and confidence interval estimation.
Goodness-of-Fit Test
In a Chi-squared goodness-of-fit test, the test statistic follows a Chi-squared distribution with degrees of freedom equal to the number of categories minus one. This test assesses whether observed frequencies differ from expected frequencies under a specific hypothesis.
Weibull Distribution
The Weibull distribution is a continuous probability distribution used extensively in reliability engineering and failure analysis. It is characterized by its scale parameter () and shape parameter ().
Time to Failure of a Mechanical Component
The lifespan of a mechanical component can be modeled using the Weibull distribution. The shape parameter indicates the failure rate behavior:
- : Failure rate decreases over time (infant mortality).
- : Corresponds to the Exponential distribution (constant failure rate).
- : Failure rate increases over time (wear-out failures).
Example:
A component with and hours suggests that the failure rate increases with time, modeling wear-out behavior.