Blog Posts

Likert Scale Items as Predictor Variables in Regression

May 22, 2009

I was recently asked about whether it's okay to treat a likert scale as continuous in a regression model. Here's my reply.

Missing Data: Criteria for Choosing an Effective Approach

May 20, 2009

In choosing an approach to missing data, there are a number of things to consider. But you need to keep in mind what you're aiming for before you can even consider which appraoch to take. There are three criteria we're aiming for with any missing data technique:

Five Advantages of Running Repeated Measures ANOVA as a Mixed Model

May 13, 2009

There are two ways to run a repeated measures analysis. The traditional way is to treat it as a multivariate test--each response is considered a separate variable. The other way is to it as a mixed model. While the multivariate approach is easy to run and quite intuitive, there are a number of advantages to running a repeated measures analysis as a mixed model.

Interpreting Interactions: When the F test and the Simple Effects disagree.

May 11, 2009

The way to follow up on a significant two-way interaction between two categorical variables is to check the simple effects. Every so often, however, you have a significant interaction, but no significant simple effects. It is not a logical impossibility. They are testing two different, but related hypotheses.

Why Logistic Regression for Binary Response?

May 5, 2009

Logistic regression models can seem pretty overwhelming to the uninitiated. Why not use a regular regression model? Just turn Y into an indicator variable--Y=1 for success and Y=0 for failure. For some good reasons.

SPSS GLM or Regression? When to use each

April 23, 2009

Regression models are just a subset of the General Linear Model, so you can use GLMs to analyze regressions. It is what I usually use. But in SPSS there are options available in the GLM and Regression procedures that aren't available in the other. How do you decide when to use GLM and when to use Regression?

EM Imputation and Missing Data: Is Mean Imputation Really so Terrible?

April 15, 2009

I’m sure I don’t need to explain to you all the problems that occur as a result of missing data. Anyone who has dealt with missing data—that means everyone who has ever worked with real data—knows about the loss of power and sample size, and the potential bias in your data that comes with listwise […]

Checking Assumptions in ANOVA and Linear Regression Models: The Distribution of Dependent Variables

April 10, 2009

The distributional assumptions for linear regression and ANOVA are for the distribution of Y|X -- that's Y given X. You have to take out the effects of all the Xs before you look at the distribution of Y.

The Distribution of Independent Variables in Regression Models

April 9, 2009

I often hear concern about the non-normal distributions of independent variables in regression models, and I am here to ease your mind. There are NO assumptions in any linear model about the distribution of the independent variables. Yes, you only get meaningful parameter estimates from nominal (unordered categories) or numerical (continuous or discrete) independent variables. […]

Is Multicollinearity the Bogeyman?

April 8, 2009

Multicollinearity occurs when two or more predictor variables in a regression model are redundant. It is a real problem, and it can do terrible things to your results. But it is uncommon, and is often misdiagnosed.

<< Older Entries Newer Entries >>

stat skill-building compass

Find clarity on your statistics journey. Try the new tool Stat Skill-Building Compass: Find Your Starting Point!

new blog post: Member Training: A Guide to Models for Causal Inference

Previous Posts

stat skill-building compass