Definition

Overdispersion is the presence of greater observed variance than what would be expected under a given statistical model. It can happen due to heterogeneity or lack of independence between trials, and clustering or grouping in the data.

Overdispersion on Binomial Distribution

Assume that there are clusters, the number of observations of each cluster is . Let the Random Variable following Binomial Distribution be the number of successes out of observation, where is also Random Variable with mean and variance . Also, let be the total number of successes in the all clusters, where . Then, the mean and variance of is defined as Here, is called a dispersion parameter. If , then overdispersion occurred.

The Lexis Ratio is also used for detecting overdispersion where and

Overdispersion on Poisson Distribution

If in Poission Distribution setting, then the overdispersion might occur. Let i.i.d. be the number of occurrences at the -th cluster, and the number of clusters is also a Random Variable. Also, let be the total number of occurrences in clusters, where follows a Poission Distribution and independent to . Then, the mean and variance of is defined as If , then the overdispersion occurs.