The Assumptions of Linear Models: Explicit and Implicit

by Karen Grace-Martin


If you’ve compared two textbooks on linear models, chances are, you’ve seen two different lists of assumptions.

I’ve spent a lot of time trying to get to the bottom of this, and I think it comes down to a few things.

1. There are four assumptions that are explicitly stated along with the model, and some authors stop there.

2. Some authors are writing for introductory classes and, rightfully so, don’t want to confuse students with too many abstract, and sometimes untestable, assumptions.  So they write them in more concrete terms that aren’t incorrect, but aren’t the core assumptions, either.

3. Some authors are writing for very specific fields or research situations, like experiments or survey data analysis.  They state the assumptions in terms specific to that analysis, not the more general forms.  For example, the assumptions of ANOVA are the same as those for regression, although they’re often written in a more specific form.

4. Likewise, sometimes the logical implication of an assumption is more interesting or important to a specific field, or is just generally easier to test.  So rather than writing the assumption itself, the author writes the implication.  Logically, they’re really the same thing.  But they can look totally different, and it can make you look at someone’s list and say “hey, they left something out!”
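
To make point 3 concrete, here is a minimal sketch in Python (with NumPy) of why ANOVA and regression share their assumptions: a one-way ANOVA can be written as a regression on dummy-coded group indicators. The function names, the simulated data, and the group means are all hypothetical, made up for illustration only.

```python
import numpy as np

def anova_f(groups):
    """One-way ANOVA F statistic from between- and within-group sums of squares."""
    all_y = np.concatenate(groups)
    grand_mean = all_y.mean()
    ss_between = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
    ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)
    df_between = len(groups) - 1
    df_within = len(all_y) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)

def regression_f(groups):
    """Overall F statistic from regressing y on dummy-coded group membership."""
    y = np.concatenate(groups)
    labels = np.concatenate([np.full(len(g), i) for i, g in enumerate(groups)])
    # Intercept plus one dummy column per non-reference group.
    dummies = [(labels == i).astype(float) for i in range(1, len(groups))]
    X = np.column_stack([np.ones(len(y))] + dummies)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    ss_resid = (resid ** 2).sum()
    ss_total = ((y - y.mean()) ** 2).sum()
    df_model = X.shape[1] - 1
    df_resid = len(y) - X.shape[1]
    return ((ss_total - ss_resid) / df_model) / (ss_resid / df_resid)

# Hypothetical data: three groups of 20 with made-up means.
rng = np.random.default_rng(1)
groups = [rng.normal(mu, 1.0, size=20) for mu in (0.0, 0.5, 1.0)]
f_anova = anova_f(groups)
f_reg = regression_f(groups)
```

In this sketch the two F statistics agree up to floating-point error, which is what you'd expect if both are the same linear model written two ways.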

So what are they, really?

The Explicit Assumptions

These assumptions are explicitly stated by the model:

  1. The residuals are independent
  2. The residuals are normally distributed
  3. The residuals have a mean of 0 at all values of X
  4. The residuals have constant variance
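
As a sketch, the simple regression model is often written as follows. The symbols here are assumed notation, not taken from the text above. All four explicit assumptions sit in the distributional statement about the error term: independence and a common distribution (iid), normality, a mean of 0 that does not depend on x, and a single constant variance.

```latex
y_i = \beta_0 + \beta_1 x_i + \varepsilon_i,
\qquad
\varepsilon_i \overset{\text{iid}}{\sim} N(0, \sigma^2)
```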

The Implicit Assumptions

These assumptions aren’t stated explicitly, but the specification of the model implies them.  This is the way I’ve summarized them; they can be written with different terminology, of course.

  1. All X are fixed and are measured without error
  2. The model is linear in the parameters
  3. The predictors and response are specified correctly
  4. There is a single source of unmeasured random variance

If there is an assumption you’ve heard not on this list, chances are it is a logical extension of one of these core assumptions.


Learn more about each of the assumptions of linear models (regression and ANOVA), so they make sense, in our new On Demand workshop: Assumptions of Linear Models.

1 comment

IM CHIU

The use of “residuals” in the Explicit Assumptions can be misleading. The linear model makes its major assumptions about the “error” term. The “residuals” are the estimates of the “errors”.
