count model

The Importance of Including an Exposure Variable in Count Models

November 19th, 2020 by Jeff Meyer

When our outcome variable is the frequency of occurrence of an event, we will typically use a count model to analyze the results. There are numerous count models. A few examples are: Poisson, negative binomial, zero-inflated Poisson and truncated negative binomial.

There are specific requirements for which count model to use. The models are not interchangeable. But regardless of the model we use, there is a very important prerequisite that they all share.

(more…)

11 comments

Count Models: Understanding the Log Link Function

November 12th, 2020 by Jeff Meyer

When we run a statistical model, we are in a sense creating a mathematical equation. The simplest regression model looks like this:

Y_i = β₀ + β₁X+ ε_i

The left side of the equation is the sum of two parts on the right: the fixed component, β₀ + β₁X, and the random component, ε_i.

You’ll also sometimes see the equation written (more…)

2 comments

When to Use Logistic Regression for Percentages and Counts

April 30th, 2018 by Karen Grace-Martin

One important yet difficult skill in statistics is choosing a type model for different data situations. One key consideration is the dependent variable.

For linear models, the dependent variable doesn’t have to be normally distributed, but it does have to be continuous, unbounded, and measured on an interval or ratio scale.

Percentages don’t fit these criteria. Yes, they’re continuous and ratio scale. The issue is the (more…)

9 comments

Poisson or Negative Binomial? Using Count Model Diagnostics to Select a Model

March 19th, 2018 by Jeff Meyer

How do you choose between Poisson and negative binomial models for discrete count outcomes?

One key criterion is the relative value of the variance to the mean after accounting for the effect of the predictors. A previous article discussed the concept of a variance that is larger than the model assumes: overdispersion.

(Underdispersion is also possible, but much less common).

There are two ways to check for overdispersion: (more…)

10 comments

The Problem with Linear Regression for Count Data

February 26th, 2018 by Jeff Meyer

Imagine this scenario:

This year’s flu strain is very vigorous. The number of people checking in at hospitals is rapidly increasing. Hospitals are desperate to know if they have enough beds to handle those who need their help.

You have been asked to analyze a previous year’s hospitalization length of stay by people with the flu who had been admitted to the hospital. The predictors in your data set are age group, gender and race of those admitted. You also have an indicator that signifies whether the hospital was privately or publicly run.

(more…)

1 comment

Understanding Incidence Rate Ratios through the Eyes of a Two-Way Table

December 27th, 2016 by Jeff Meyer

The coefficients of count model regression tables are shown in either logged form or as incidence rate ratios. Trying to explain the coefficients in logged form can be a difficult process.

Incidence rate ratios are much easier to explain. You probably didn’t realize you’ve seen incidence rate ratios before, expressed differently.

Let’s look at an example.

A school district was interested in how many children in their sixth grade classes played on organized sports teams. So they did a count and also noted the gender of the child. The results were put into a table: (more…)

2 comments