Blog Posts

Link Functions and Errors in Logistic Regression

March 14, 2018

I recently held a free webinar on Binary, Ordinal, and Nominal Logistic Regression as part of our The Craft of Statistical Analysis program. It was a record crowd and we didn’t get through everyone’s questions, so I’m answering some here on the site. They’re grouped by topic, and you will probably get more out of it if […]

Getting Accurate Predicted Counts When There Are No Zeros in the Data

March 12, 2018

We previously examined why a linear regression and negative binomial regression were not viable models for predicting the expected length of stay in the hospital for people with the flu. A linear regression model was not appropriate because our outcome variable, length of stay, was discrete and not continuous. A negative binomial model wasn’t the […]

Member Training: Using Transformations to Improve Your Linear Regression Model

March 5, 2018

Transformations don’t always help, but when they do, they can improve your linear regression model in several ways simultaneously. They can help you better meet the linear regression assumptions of normality and homoscedasticity (i.e., equal variances). They can also help avoid some of the artifacts caused by boundary limits in your dependent variable, and sometimes even remove a difficult-to-interpret interaction. In this webinar, we will review the assumptions of the linear regression model and explain when to consider a transformation of the dependent variable or independent variable.
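As a rough illustration of the normality point, here is a minimal sketch in Python. The data are simulated, not taken from the webinar: the outcome is drawn from a log-normal distribution, an assumption chosen so that a log transformation is known to help. The `skewness` helper is a hypothetical name introduced only for this example.

```python
import math
import random
from statistics import mean, pstdev

def skewness(values):
    """Moment-based skewness: the average cubed z-score (near 0 when symmetric)."""
    m = mean(values)
    s = pstdev(values)
    return mean(((v - m) / s) ** 3 for v in values)

random.seed(42)  # fixed seed so the sketch is reproducible

# Hypothetical right-skewed outcome: log-normal, so log(y) is normal by construction.
y = [random.lognormvariate(0.0, 1.0) for _ in range(2000)]
log_y = [math.log(v) for v in y]

skew_raw = skewness(y)
skew_log = skewness(log_y)

print(f"skewness of y:      {skew_raw:.2f}")
print(f"skewness of log(y): {skew_log:.2f}")
```

The raw outcome is strongly right-skewed, while the logged outcome is close to symmetric. Real data will rarely behave this cleanly, which is why transformations don't always help.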

The Problem with Linear Regression for Count Data

February 26, 2018

Imagine this scenario: This year’s flu strain is very vigorous. The number of people checking in at hospitals is rapidly increasing. Hospitals are desperate to know if they have enough beds to handle those who need their help. You have been asked to analyze a previous year’s hospitalization length of stay by people with the […]

Is there a fix if the data is not normally distributed?

February 19, 2018

In this video I will answer a question from a recent webinar, Random Intercept and Random Slope Models. We are answering questions here because more than 500 people attended the webinar live, so we didn't have time to get through all of them.

What packages allow you to deal with random intercept and random slope models in R?

February 13, 2018

Member Training: Marginal Means, Your New Best Friend

February 5, 2018

Interpreting regression coefficients can be tricky. Especially when there are interactions in the model. Or categorical predictors. (Or worse – both.) But there is a secret weapon that can help you make sense of your regression results: marginal means. They’re not the same as descriptive stats. They aren’t usually included by default in our output. And they sometimes go by the name LS or Least-Square means. And they’re your new best friend. So what are these mysterious, helpful creatures? What do they tell us, really? And how can we use them?
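To see why marginal means are not the same as descriptive stats, here is a minimal sketch in Python. The data and the factor names (`treatment`, `sex`) are made up for illustration. It also assumes a model with both categorical predictors and their interaction, in which case the marginal mean for a group is the equally weighted average of its cell means. Statistical software computes marginal means from the fitted model, so this hand calculation only matches in that setting.

```python
from statistics import mean

# Hypothetical, deliberately unbalanced data: outcome by (treatment, sex) cell.
cells = {
    ("drug", "F"): [10, 12],
    ("drug", "M"): [20, 22, 24, 26],
    ("placebo", "F"): [8, 10, 12, 14],
    ("placebo", "M"): [18, 20],
}

def descriptive_mean(treatment):
    """Raw mean of every observation in a treatment group (larger cells count more)."""
    return mean(v for (t, _), vals in cells.items() if t == treatment for v in vals)

def marginal_mean(treatment):
    """Equally weighted average of the cell means within a treatment group."""
    return mean(mean(vals) for (t, _), vals in cells.items() if t == treatment)

for t in ("drug", "placebo"):
    print(t, "descriptive:", round(descriptive_mean(t), 2), "marginal:", marginal_mean(t))
```

In this made-up example the drug group contains more men, who score higher, so the descriptive means exaggerate the drug versus placebo difference. The marginal means weight the two sexes equally and show a smaller gap.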

Can I Treat 5 Waves of Repeated Measurements as Categorical or Continuous?

January 29, 2018

Question: Can you talk more about categorical and repeated Time? If I have 5 waves at ages 0, 1 year, 3 years, 5 years, and 9 years, would that be categorical or repeated? Does mixed account for different spacing in time? Mixed models can account for different spacing in time and you’re right, it […]

Six Differences Between Repeated Measures ANOVA and Linear Mixed Models

January 22, 2018

As mixed models are becoming more widespread, there is a lot of confusion about when to use these more flexible but complicated models and when to use the much simpler and easier-to-understand repeated measures ANOVA. One thing that makes the decision harder is that sometimes the results from the two models are exactly the same and sometimes they are vastly different. In many ways, repeated measures ANOVA is antiquated: it's never better or more accurate than mixed models. That said, it's a lot simpler. As a general rule, you should use the simplest analysis that gives accurate results and answers the research question. I almost never use repeated measures ANOVA in practice, because it's rare to find an analysis where the flexibility of mixed models isn't an advantage. But such analyses do exist. Here are some guidelines on similarities and differences:

Using Pairwise Comparisons to Help you Interpret Interactions in Linear Regression

January 12, 2018

In a previous post we discussed using marginal means to explain an interaction to a non-statistical audience. The output from a linear regression model can be a bit confusing. This is the model that was shown. In this model, BMI is the outcome variable and there are three predictors:

