Dealing with Discrete Count Variables

Not all numerical dependent variables are created equal! Some are discrete, not continuous:

-- Number of surviving offspring

-- Number of crimes committed in an area

-- Number of days in the hospital

If you apply linear regression, which is designed for continuous dependent variables, to discrete counts, you’re going to run into some BIG issues: impossible negative predicted counts, skewed residuals, and variance that grows with the mean.

Analyzing Count Data:

Poisson, Negative Binomial, and Other Essential Models

My goal is that by the end of the workshop, you’ll know how to spot when you need a count model and how to choose the most appropriate one, implement it, and understand the results. You’ll learn six different types of count models (plus all their varieties) and how to select the one that will work best for your specific data set.

In this six-module live, online workshop, we’ll cover:

-- What counts are and why they don’t work well in linear models

-- When a model is appropriate and how to evaluate a model

-- Working with Offset/Exposure Variables

-- Understanding Incidence Rate Ratios and the log link function

-- Steps to Running a Poisson Model

-- Interpreting Coefficients, Marginal Effects, and Interactions

-- Overdispersion & Negative Binomial Models

-- Model Fit, Assumptions, and Influence

-- Zero Truncated, Hurdle, Zero Inflated Poisson and Negative Binomial Models

Who is this workshop for?

This workshop is suitable for graduate students and research professionals. It’s for you if you:

-- Have tried to do a Poisson or negative binomial regression before, but found it confusing or difficult

-- Have used linear regression or ANOVA and want to take your statistical skills to the next level

-- Know you will need to implement a count model soon

It’s not for you if you:

-- Are a statistics beginner and have never done a linear regression

-- Have an advanced degree in statistics and want theoretical knowledge of count models

How does it work?

All sessions are conducted online via live webinar. You can log in via phone or Internet. You’ll see the instructor’s screen to view the presentation… all from your own home or office.

During each webinar session, the instructor will cover core concepts and leave plenty of time for you to ask your own questions.

As a participant in the Analyzing Count Data workshop, you’ll have access to a participant-only website, your workshop “hub.” That’s where you’ll access all workshop resources and material, including:

-- Real research data sets in SPSS, SAS, Stata, and CSV formats.

-- Syntax and demonstration videos to conduct all the workshop examples in SPSS, R, SAS, and Stata.

-- Video screen-capture recordings of each workshop session, made available within 48 hours after each session so you can review the material at your convenience. If you miss any of the live sessions, you can still participate on your own schedule.

-- Exercises (yes, HOMEWORK!). You really need to practice this stuff and get your hands dirty, so we’re giving you the data to try it on your own. But don’t worry: you won’t be left on your own, stymied by some coding error. You’ll get the code to do the exercises, plus the answers in case you get stuck.

-- A forum to submit written questions between sessions. Got a question as you’re reviewing the video recording or your notes? Just submit it in the workshop forum. Jeff will answer it there if he can, or if it’s something he needs to show you, he’ll answer it in the next Q&A session.

-- Video recordings of all Q&A sessions. You can even submit questions ahead of time; Jeff will answer them in the next Q&A session, and you can watch the recording at your convenience.

-- A list of helpful resources and suggestions for further reading. There’s no required textbook, but there are some good resources that we recommend to support your learning.

-- Bonus videos. We’ve included a few videos from some webinars on relevant topics to help your understanding. Included are:
> A Review of Logarithms for the Data Analyst
> What Happened to R squared?: Assessing Model Fit for Logistic, Multilevel and Other Models that Use Maximum Likelihood
> Binary, Ordinal, and Multinomial Logistic Regression for Categorical Outcomes
> Zero Inflated Models
> Working with Truncated and Censored Data

What's Covered?

Module 1: Understanding Count Models

We’ll start with an understanding of what counts are and why they don’t work well in linear models. We’ll talk in detail about why a count model is necessary and what a log link means.

Then we’ll go through a brief overview of the most important concepts and steps so you have a big-picture understanding. This lays a strong foundation for the rest of the workshop.
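To make the log link concrete, here is a minimal sketch in Python (not one of the workshop’s software packages; it’s used here only for illustration). It assumes a model of the form log(expected count) = b0 + b1*x + log(exposure), with made-up coefficients b0 and b1. Under that assumption, a one-unit increase in x multiplies the expected count by exp(b1), which is the incidence rate ratio.

```python
import math


def expected_count(b0, b1, x, exposure=1.0):
    """Expected count under a log link with an offset:
    log(mu) = b0 + b1 * x + log(exposure)."""
    return exposure * math.exp(b0 + b1 * x)


# Hypothetical coefficients, chosen only for illustration.
b0, b1 = 0.5, 0.3

# Incidence rate ratio for a one-unit increase in x.
irr = math.exp(b1)

mu_at_2 = expected_count(b0, b1, x=2)
mu_at_3 = expected_count(b0, b1, x=3)

print(f"IRR = {irr:.3f}")
print(f"Ratio of expected counts (x=3 vs x=2) = {mu_at_3 / mu_at_2:.3f}")
```

The two printed numbers match: on the log scale the effect of x is additive, so on the count scale it is multiplicative. The exposure term simply scales the expected count, so doubling the exposure doubles the expected count.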

We’ll cover:

-- Discrete vs Continuous Variables
-- The Variety of Count Data
-- Why OLS Linear Regression Doesn’t Work with Count Data
-- Modeling Assumptions for Count Data
-- Modeling Process
-- Offset/Exposure Variable
-- Incidence Rate Ratios

Module 2: The Poisson Model

Now that you’ve had an overview, we’ll dig deep into the simplest count model: the Poisson. We’ll explore the issues, terminology, and modeling that apply to all count models within the context of this simplest of models.

Covered:

-- Important Terminology
-- Poisson Model Assumptions
-- Poisson Distribution
-- Analyzing Data and Model Fit
-- Interpreting the Results
-- Marginal Effects
-- Interactions

Module 3: The Negative Binomial Model

Now that you understand the Poisson model, we expand on it. Unfortunately, the Poisson model often doesn’t fit real data. In this module we introduce the powerhouse of count models: the negative binomial.
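Here is a quick, informal way to see why the Poisson often fails. The Poisson distribution assumes the variance equals the mean, while real counts often have a much larger variance (overdispersion). The sketch below, in Python with made-up counts, compares the two. It’s only a rough, unconditional check on raw data, not the model-based diagnostics you would get from a fitted model.

```python
import statistics

# Hypothetical counts (e.g., events per person), made up for illustration.
counts = [0, 0, 0, 1, 1, 2, 3, 5, 9, 14]

mean = statistics.mean(counts)
variance = statistics.variance(counts)  # sample variance (n - 1 denominator)

# Under a Poisson distribution this ratio should be close to 1.
dispersion_ratio = variance / mean

print(f"mean = {mean:.2f}, variance = {variance:.2f}")
print(f"variance / mean = {dispersion_ratio:.2f}")
```

A ratio well above 1 suggests that a plain Poisson model may understate the variability in the data, which is the situation the negative binomial model is meant to handle.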

Covered:

-- Overdispersion
-- Negative Binomial Distribution
-- Important Terminology
-- When to use Negative Binomial Models
-- Running Models
-- Model Fit
-- Negative Binomial Model Assumptions

Module 4: Model Diagnostics and Truncated Models

Now that we’ve run a model, we need to make sure that it fits the data. In this module you learn how to do that in Poisson and negative binomial models and learn a new type of count model that will account for a common failure of model fit--a complete lack of zeros in the data.

Covered:

-- Predicted Values and Residuals: comparison of models
-- Influential Observations: Cook’s distance
-- Residuals versus Predicted Values
-- Residuals versus Predictors
-- Zero Truncated Models

Module 5: Hurdle and Zero-inflated Models

Lots and lots of zeros in the data. In this module we’ll explore in detail two ways of approaching and dealing with this common situation (think of variables like number of arrests or accidents--most people don’t have any).

Covered:

-- Hurdle Model
-- Zero Inflated Poisson Model
-- Zero Inflated Negative Binomial Model

Module 6: Extension and Review

Now that you understand, in detail, when and how to implement a variety of count models, we’ll do two things.

First, we’ll introduce some lesser-known, but valuable alternative models that are sometimes exactly what you need.

Then we’ll take a data set and go through a model-building example, so you can see, from start to finish, the order in which to do the steps, the information to consider, and which output to interpret.

Hi, I’m Jeff Meyer, your workshop instructor and professional statistical consultant.

I have taught this workshop in the past and have watched students grow in confidence and master count models.

My interests are wide-ranging. I work with linear, binomial, count and mixed models. In addition, my work can include running exploratory and confirmatory models, Structural Equation Models (SEM), latent class analysis and multiple imputation models for missing data.

I understand that being an effective instructor takes more than subject matter knowledge and a logical approach to analyzing data. I truly enjoy working with people, and I care about your success!

Prerequisites

So what kind of background in statistics do you need?

This is an advanced-level course. You should have solid experience running linear models, such as ANOVA or linear regression.

We’re assuming you understand:

-- Interpreting regression coefficients

-- Least-squares estimation

-- P-values

-- Dummy coding

-- Interactions

Familiarity with logistic regression will be helpful, but it’s not necessary.

If you have questions about whether you’re ready for this class, just email us. We’ll give you our honest opinion. We want you to succeed!

