The Analysis Factor Statwise Newsletter
Volume 1, Issue 1
July, 2008
In This Issue

A Note from Karen

Featured Article: Why ANOVA and Linear Regression are the Same Analysis

Resource of the Month

What's New

About Us

 
Quick Links

Our Website

More About Us

You received this email because you subscribed to The Analysis Factor's mailing list. To change your subscription, see the link at end of this email. If your email is having trouble with the format, click here for a web version.

Please forward this to anyone you know who might benefit. If you received this from a friend, sign up for this ezine now!
A Note from Karen

Dear %$firstname$%,

Karen Grace-MartinIt's officially "mud season" in Ithaca, which means we sun-deprived inhabitants celebrate 40 degree weather and get covered in mud as all the snow melts on ground with dead grass.

It also means it's time to tap the maple trees. A few years ago we moved to a hill outside of town with a nice stand of sugar maples. The first year we tapped 3 trees and quickly became overwhelmed with the quantity of sap we had to boil 95% of the water out of. When I broke my ankle collecting wood for the evaporating fire, my enthusiasm for the project officially died. Still, the promise of spring right around the corner means I can't ignore the maples entirely.

We tried out our new Web Meeting service in February's teleseminar. I have to say, it's pretty cool. Everyone could see my presentation on their own screens and join in the audio either by phone or over their computer's microphone. There's also a great little chat function so you can type in questions. It's pretty fun. You can download the video recording for f.r.e.e from our Learning section of the web site.

So we'll definitely be using the Web Meetings for the upcoming Interpreting Regression Coefficients workshop. There are still openings. Spots are limited, though, so if you're interested, register soon.

And I want you all to know I haven't forgotten your survey responses. I'm still planning to offer a couple more workshops this spring. I've been focusing on getting the first tested, out, and underway, but will announce the others soon.

I hope you enjoy this month's feature article. It explains a simple little concept in linear models that on one ever seems to get to in statistics classes. It's the thing I wished someone had showed me way back when. I hope you find it helpful as well.

Happy analyzing,
Karen

Featured Article: 5 Steps for Calculating Sample Size

If your graduate statistical training was anything like mine, you learned ANOVA in one class and Linear Regression in another. My professors would often say things like "ANOVA is just a special case of Regression," then do a lot of hand waving when pressed to explain.

It was not until I started consulting that I realized how closely related ANOVA and regression are. They're not only related, they're the same thing. Not a quarter and a nickel--different sides of the same coin.

So here is a very simple example that shows why. When someone showed me this, a light bulb went on, even though I already knew both ANOVA and mulitple linear regression quite well (and already had my masters in statistics!). I believe that understanding this little concept has been key to my understanding the general linear model as a whole--its applications are far reaching.

As an example, I use a model with a single categorical independent variable--employment category--with 3 categories: managerial, clerical, and custodial. The dependent variable is Previous Experience in months. (This data set is employment.sav, one of the data sets that comes free with SPSS).

We can run this as either an ANOVA or a regression. In the ANOVA, the categorical variable is effect coded, which means that each category's mean is compared to the grand mean. In the regression, the categorical variable is dummy coded**, which means that each category's intercept is compared to the reference group's intercept. Since the intercept is defined as the mean value when all other predictors = 0, and there are no other predictors, the three intercepts are just means.

In both analyses, Job Category has an F=69.192, with a p < .001. Highly significant.

In the ANOVA, we find the means of the three groups are:

Clerical:     85.039

Custodial: 298.111

Manager:  77.619

In the Regression, we find these coefficients:

Intercept:  77.619

Clerical:      7.420

Custodial: 220.492

The intercept is simply the mean of the reference group, Managers. The coefficients for the other two groups are the differences in the mean between the reference group and the other groups.

You'll notice, for example, that the regression coefficient for Clerical is the difference between the mean for Clerical, 85.039, and the Intercept, or mean for Manager (85.039 - 77.619 = 7.420). The same works for Custodial.

So an ANOVA reports each mean and a p-value that says at least two are significantly different. A regression reports only one mean (as an intercept), and the differences between that one and all other means, but the p-values evaluate those specific comparisons.

It's all the same model, the same information, but presented in different ways. Understand what the model tells you in each way, and you are empowered.

I suggest you try this little exercise with any data set, then add in a second categorical variable, first without, then with an interaction. Go through the means and the regression coefficients and see how they add up.

**The dummy coding creates two 1/0 variables: Clerical = 1 for the clerical category, 0 otherwise; Custodial = 1 for the custodial category, 0 otherwise. Observations in the Managerial category have a 0 value on both of these variables, and this is known as the reference group.

 

Resource of the Month

Statistical Computing at UCLA

If you need to learn how to do something in SPSS, SAS, or Stata or to use a host of specialized statistical packages, head over to the web page of the Statistical Computing office in UCLA's Academic Technology Services. They offer an amazing selection of resources for learning how to analyze data in a number of general and specialized stats packages.

What's New

1. Free Teleseminar on March 25th: Understanding Probability, Odds, and Odds Ratios in Logistic Regression

Do odds and odds-ratios seem a bit, well, strange? Interpreting odds ratios in logistic regression requires thinking on a whole different scale. It's like driving to Canada and having all the speed limit signs in km/hour. Not so different, but hard to translate at first.

In this teleseminar, you will learn the simple relationship between probability and odds and how that translates into interpreting odds ratios for continuous and categorical variables.

Learn more and register at: http://www.analysisfactor.com/learning/teletraining5.html.

2. There are still spaces available in our four-week webinar workshop: Reinforce the Foundation: Interpreting Regression Coefficients. It begins March 26th.

If you've been struggling with (or just not quite sure about) centering, interactions, dummy coding, and the like, this workshop can get you on track.

I invite you to join me in my upcoming workshop "Building the Foundation: Interpreting Regression Coefficients." We'll go over these topics, and more, step-by-step. It will give you the strong foundation in understanding and working with regression that you need to become a confident and qualified statistical analyst. You will then have the capacity to move on to multilevel, logistic, Cox models, and so many more.

I'll be happy to answer any questions and discuss if the workshop is a good fit for you. You can find me at karen@analysisfactor.com or 607.539.3216.

More information and register at:
http://www.analysisfactor.com/learning/IRC/registration.html

About Us

Karen Grace-Martin is the owner and founder of The Analysis Factor. Her philosophy is that statistics, as an applied skill, is learned best within the context of a researcher’s own data. Researchers at every stage of their career therefore need ongoing statistics training and support. The Analysis Factor offers statistical consulting, projects, resources, and learning programs that empower social science researchers to become confident, able, and skilled statistical practitioners.

Karen spent seven years as a statistical consultant in the statistical consulting office at Cornell University. While there, she learned how to be a great statistical advisor—not only with excellent statistical skills, but to understand the pressures and issues researchers are facing, how to give fabulous customer service, and how to communicate technical ideas at a level each client understands. 

You can learn more about Karen Grace-Martin and The Analysis Factor at analysisfactor.com.

Please forward this newsletter to colleagues who you think would find it useful. Your recommendation is how we grow.

If you received this email from a friend or colleague, click here to subscribe to this newsletter.

Need to change your email address? See below for details.

No longer wish to receive this newsletter? See below to cancel.