ANOVA

SPSS GLM or Regression? When to use each

April 23rd, 2009 by

Regression models are just a subset of the General Linear Model, so you can use GLM procedures to run regressions.  It is what I usually use.

But in SPSS there are options available in the GLM and Regression procedures that aren’t available in the other.  How do you decide when to use GLM and when to use Regression?

GLM has these options that Regression doesn’t: (more…)


Checking Assumptions in ANOVA and Linear Regression Models: The Distribution of Dependent Variables

April 10th, 2009 by

Here’s a little reminder for those of you checking assumptions in regression and ANOVA:

The assumptions of normality and homogeneity of variance for linear models are not about Y, the dependent variable.    (If you think I’m either stupid, crazy, or just plain nit-picking, read on.  This distinction really is important). (more…)


Dummy Coding in SPSS GLM–More on Fixed Factors, Covariates, and Reference Groups, Part 2

March 31st, 2009 by

Part 1 outlined one issue in deciding whether to put a categorical predictor variable into Fixed Factors or Covariates in SPSS GLM.  That issue dealt with how SPSS automatically creates dummy variables from any variable in Fixed Factors.

There is another key default to keep in mind. SPSS GLM will automatically create interactions between any and all variables you specify as Fixed Factors.

If you put 5 variables in Fixed Factors, you’ll get a lot of interactions. SPSS will automatically create all 2-way, 3-way, 4-way, and even a 5-way interaction among those 5 variables. (more…)


Why ANOVA and Linear Regression are the Same Analysis

March 11th, 2009 by

Stage 2If your graduate statistical training was anything like mine, you learned ANOVA in one class and Linear Regression in another.  My professors would often say things like “ANOVA is just a special case of Regression,” but give vague answers when pressed.

It was not until I started consulting that I realized how closely related ANOVA and regression are.  They’re not only related, they’re the same thing.  Not a quarter and a nickel–different sides of the same coin.

So here is a very simple example that shows why.  When someone showed me this, a light bulb went on, even though I already knew both ANOVA and multiple linear (more…)


Testing and Dropping Interaction Terms in Regression and ANOVA models

February 26th, 2009 by

In a Regression model, should you drop interaction terms if they’re not significant?

In an ANOVA, adding interaction terms still leaves the main effects as main effects.  That is, as long as the data are balanced, the main effects and the interactions are independent.  The main effect is still telling (more…)


Problems Caused by Categorizing Continuous Variables

February 20th, 2009 by

I just came across this great article by Frank Harrell:  Problems Caused by Categorizing Continuous VariablesStage 2

It’s from the Vanderbilt University biostatistics department, so the examples are all medical, but the points hold for any field.

It goes right along with my recent post, Continuous and Categorical Variables: The Trouble with Median Splits.