Gamma-Poisson Distribution

Important properties and mathematical definitions

Last updated on Jan 24, 2021 4 min read

ShowHide

code chunks

General Properties
Relation to the Negative Binomial Distribution
$R$ Implementation

The Gamma-Poisson distribution is a statistical distribution for overdispersed count data. It is also known under the name Negative Binomial. Unlike the Negative Binomial which is primarily used for repeated trials and number of success / failures, the Gamma-Poisson is parametrized by the mean $μ$ and the overdispersion $α$ . If $α = 0$ , it reduces to the Poisson distribution.

General Properties

Definition: $Y \sim GammaPoisson (μ, α)$ where $Y$ are non-negative integer (i.e., “counts”).

Probability Mass Function

$f_{GP} (y | μ, α) = \frac{Γ (y + 1 / α)}{Γ (1 / α) Γ (y + 1)} {(\frac{μ^{2} α}{μ + μ^{2} α})}^{y} {(\frac{1}{1 + μ α})}^{1 / α}$

Cumulative Distribution Function

$F_{GP} (y | μ, α) = I (\frac{1}{1 + μ α}; 1 / α, 1 + y)$ where $I (z; a, b)$ is the regularized incomplete beta function.

Momements

Mean

$E [Y] = μ$

Variance

$V ar [Y] = μ + α μ^{2}$

Skewness

$S kew [Y] = \frac{\sqrt{\frac{μ}{1 + μ α}} (1 + 2 μ α)}{μ}$

Kurtosis

$K urtosis [Y] = \frac{1 + 6 μ α + 6 μ^{2} α^{2}}{μ + μ^{2} α} + 3$

Moment Generating Function

$M_{X \sim GP} (t) := E [\exp (t X)] = {(\frac{1}{1 - μ α (e^{t} - 1)})}^{1 / α}$

Characteristic Function

$M_{X \sim GP} (t) := E [\exp (t X)] = {(\frac{1}{1 - μ α (e^{i t} - 1)})}^{1 / α}$

Fisher Information

$I (μ) = E [{(\frac{\partial}{\partial μ} \log f_{GP} (y))}^{2} | μ] = \frac{1}{μ + α μ^{2}}$

Mathematica fails to calculate $I (α)$ .

Useful derivatives

$\frac{\partial}{\partial μ} \log (f_{G P} (y | μ, α) = \frac{y - μ}{μ + α μ^{2}}$

$\frac{\partial}{\partial β_{0}} \log (f_{G P} (y | μ, α) = \frac{y - μ}{1 + α μ},$ where $μ = \exp (β_{0} + offset)$ .

Relation to the Negative Binomial Distribution

The Gamma-Poisson and the Negative Binomial distribution are mathematically identical, they just use different parametrizations. However, this can cause a lot of confusion, especially because the $R$ and Wikipedia again use two different parametrizations of the Negative Binomial:

Wikipedia uses $r$ for the number of trials and $p$ for the number of failures
$R$ uses $size$ for the number of trials and $p^{'}$ for the number of successes, thus $p^{'} = 1 - p$ .

To convert from the Negative Binomial to the Gamma-Poisson parametrization $\begin{aligned} μ & = \frac{p r}{1 - p} & = \frac{(1 - p^{'}) r}{p^{'}} \\ α & = 1 / r & = 1 / size . \end{aligned}$

To convert from the Gamma-Poisson to the Negative Binomial parametrization: $\begin{aligned} p & = \frac{α μ}{1 + α μ} \\ p^{'} & = \frac{1}{1 + α μ} \end{aligned}$ and $r = size = 1 / α .$

$R$ Implementation

dgampoi <- function(x, mean, overdispersion){
  gamma(x + 1/overdispersion)/(gamma(1/overdispersion) * gamma(x + 1)) * 
    ((mean^2 * overdispersion) / (mean + mean^2 * overdispersion))^x *
    (1/(1 + mean * overdispersion))^(1/overdispersion)
}


# Or based on the existing 'dnbinom'
dgampoi2 <- function(x, mean, overdispersion){
  dnbinom(x, mu = mean, size = 1 / overdispersion)
}

library(tidyverse)
cross_df(list(x = seq(0, 200), mu = 100, alpha = c(0, 0.05, 0.2, 2))) %>%
  mutate(dens = dgampoi2(x, mean = mu, overdispersion = alpha)) %>%
  ggplot(aes(x = x-0.5)) +
    geom_step(aes(y = dens, color = as.factor(alpha)), show.legend = FALSE) +
    annotate("text", x = 12, y = 0.03, label = expression(alpha==2), size = 7) +
    annotate("text", x = 30, y = 0.003, label = expression(alpha==0.2), size = 7, hjust = 1) +
    annotate("text", x = 120, y = 0.025, label = expression(alpha==0), size = 7) +
    annotate("text", x = 80, y = 0.015, label = expression(alpha==0.05), size = 7, hjust = 1) +
    cowplot::theme_cowplot(font_size = 20) +
    coord_trans(xlim = c(0, 150), ylim = c(0, 0.04)) + 
    scale_y_continuous(expand = expansion(0)) + 
    scale_x_continuous(expand = expansion(0), breaks = seq(0, 140, by = 20)) +
    labs(title ="Density of Gamma Poisson with µ=100",
         x = "y", y = expression(f[GP](y*"|"*mu==100,alpha)))

cross_df(list(x = seq(0, 30), mu = 10, alpha = c(0, 0.05, 0.2, 2))) %>%
  mutate(dens = dgampoi2(x, mean = mu, overdispersion = alpha)) %>%
  ggplot(aes(x = x-0.5)) +
    geom_step(aes(y = dens, color = as.factor(alpha)), show.legend = FALSE) +
    cowplot::theme_cowplot(font_size = 20) +
    coord_trans(xlim = c(0, 30)) +
    scale_y_continuous(expand = expansion(0)) +
    scale_x_continuous(expand = expansion(0), breaks = seq(0, 30, by = 5)) +
    labs(title ="Density of Gamma Poisson with µ = 10",
         x = "y", y = expression(f[GP](y*"|"*mu==10,alpha)))

Gamma-Poisson Distribution

General Properties

Probability Mass Function

Cumulative Distribution Function

Momements

Mean

Variance

Skewness

Kurtosis

Moment Generating Function

Characteristic Function

Fisher Information

Useful derivatives

Relation to the Negative Binomial Distribution

$R$ Implementation

Constantin Ahlmann-Eltze

Postdoc

Gamma-Poisson Distribution

General Properties

Probability Mass Function

Cumulative Distribution Function

Momements

Mean

Variance

Skewness

Kurtosis

Moment Generating Function

Characteristic Function

Fisher Information

Useful derivatives

Relation to the Negative Binomial Distribution

RR Implementation

Constantin Ahlmann-Eltze

Postdoc

$R$ Implementation