Generalized Linear Model (GLM)

Generalized linear models (GLMs) are statistical models that are, as the name suggests, generalizations of the ordinary linear model where

Y = β_{0} + β_{1} x_{1} + \dots + β_{p} x_{p} = Xβ

One of the assumptions of the linear model is that each $y_{i}$ is drawn from a normal distribution, so what we’re actually predicting in the equation above is:

E (Y ∣ X) = Xβ

where $E (Y ∣ X)$ is the mean of the normal distribution.

GLMs allow us to fit models that are linear in their predictors but where the assumptions of an ordinary linear model may not be appropriate, including this assumption of normality but also other assumptions. For instance, linear models assume a linear relationship between predictors and response values across the entire domain of $X$ , but this obviously can lead to implausible predicted responses. Or linear models can lead to predicted probabilities above 1 or below 0, if we try to use them to predict probabilities.

GLMs address this issue by allowing us to assume the dependent variables are drawn from arbitrary distributions (limited to those in the exponential family) rather than only a normal distribution. This works by specifying some link function that transforms the mean of the dependent variable to be a linear function of the predictors.

If we denote this function as $η$ , then we can write a GLM as:

η (Y ∣ X) = Xβ

Sometimes, rather than transforming the dependent variable, we want to transform the independent variables, so we can use the inverse link function instead:

(Y ∣ X) = η^{- 1} (Xβ)

Examples

The link functions for the linear, logistic, and Poisson regression are:

Linear

η (μ) = μ

i.e. the identity function

Logistic

η (μ) = lo g (\frac{μ}{1 - μ})

i.e. the logit (sigmoid) function

Poisson

η (μ) = lo g (μ)

i.e. the log link

Brain

Explorer

Generalized Linear Model (GLM)

Examples

Linear

Logistic

Poisson

Graph View

Table of Contents

Backlinks