The Analysis Factor Newsletter

Volume 9, Issue 2
September 2016

A Note from Karen

It's official: Our membership program, previously known as Data Analysis Brown Bag, is now called Statistically Speaking (cue applause).

We find this name better reflects the wide range of statistical support and community the program entails.

The big reveal has been underway for the past month, so you may already have noticed some changes on our website. We hope you like the new name as much as we do.

In response to customer requests, we also just rolled out a new group membership discount. So if you've been waiting for the perfect moment to join, this just might be it. Grab two or more of your closest friends or work buddies to take advantage of our 15% discount on our already affordable memberships. (Head over here for more details.)

This month we bring you an article on removing the constant from a regression model, courtesy of Jeff Meyer. And if this article piques your interest or if you've been thinking of learning Stata, be sure to check out Jeff's upcoming workshop, Introduction to Data Analysis with Stata. Enrollment closes September 29th.

Happy analyzing!
Karen


The Impact of Removing the Constant from a Regression Model: The Categorical Case

by Jeff Meyer, MPA, MBA

In a simple linear regression model, how the constant (aka the intercept) is interpreted depends upon the type of predictor (independent) variable.

If the predictor is categorical and dummy-coded, the constant is the mean value of the outcome variable for the reference category only. If the predictor variable is continuous, the constant equals the predicted value of the outcome variable when the predictor variable equals zero.
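To see the categorical case in action, here is a minimal Python sketch. The weights below are made-up toy numbers, not the data behind this article, and the helper simple_ols is a hypothetical name used only for illustration. It fits a one-predictor least-squares line by hand and compares the constant with the mean of the reference category.

```python
from statistics import mean

# Hypothetical toy data (NOT the article's data set): car weights in pounds
# and a dummy variable, 0 = domestic (reference category), 1 = foreign.
weight = [2800, 3900, 3100, 3500, 1900, 2700, 2300]
foreign = [0, 0, 0, 0, 1, 1, 1]


def simple_ols(x, y):
    """Return (constant, slope) for the least-squares fit y = constant + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


constant, slope = simple_ols(foreign, weight)

# Group means computed directly from the data, for comparison.
domestic_mean = mean(w for w, f in zip(weight, foreign) if f == 0)
foreign_mean = mean(w for w, f in zip(weight, foreign) if f == 1)

print("constant:", round(constant, 2), "domestic mean:", domestic_mean)
print("constant + slope:", round(constant + slope, 2), "foreign mean:", foreign_mean)
```

In this sketch the constant matches the mean of the reference (domestic) group, and the constant plus the dummy's coefficient matches the mean of the other group.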

Removing the Constant When the Predictor Is Categorical

When your predictor variable X is categorical, removing the constant produces results that are logical and easy to interpret. Let's look at an example.

I regressed the weight of an auto on where the auto was manufactured (domestic vs foreign) to produce the following results.

Look at the Coef. column. It tells us the mean weight of a car built domestically is 3,317 pounds. A car built outside of the U.S. weighs 1,001 pounds less, or 2,316 pounds on average.

Most statistical software packages give you the option of removing the constant. This can save you the time of doing the math to determine the average weight of foreign-built cars.

The model below includes the option of removing the constant.

The domestic weight is the same in both outputs. In the output without the constant, the mean weight of a foreign-built car is shown rather than the difference in weight between domestic and foreign-built cars.
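A rough sketch of the same idea in Python follows. It first repeats the arithmetic from the article's coefficients, then fits a no-constant model to made-up toy weights (not the article's data) using one indicator column per category. The function name ols_two_predictors_no_constant is hypothetical, and this is only one way to set up such a model by hand.

```python
# Arithmetic from the article's reported coefficients.
article_constant = 3317          # mean weight of domestic cars, in pounds
article_foreign_coef = -1001     # difference for foreign cars, in pounds
article_foreign_mean = article_constant + article_foreign_coef

# Hypothetical toy data (NOT the article's data set).
weight = [2800, 3900, 3100, 3500, 1900, 2700, 2300]
foreign = [0, 0, 0, 0, 1, 1, 1]
domestic = [1 - f for f in foreign]  # indicator for the other category


def ols_two_predictors_no_constant(x1, x2, y):
    """Solve the 2x2 normal equations for y = b1 * x1 + b2 * x2 (no constant)."""
    s11 = sum(a * a for a in x1)
    s22 = sum(b * b for b in x2)
    s12 = sum(a * b for a, b in zip(x1, x2))
    s1y = sum(a * c for a, c in zip(x1, y))
    s2y = sum(b * c for b, c in zip(x2, y))
    det = s11 * s22 - s12 * s12
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    return b1, b2


b_domestic, b_foreign = ols_two_predictors_no_constant(domestic, foreign, weight)

print("article foreign mean:", article_foreign_mean)
print("toy no-constant coefficients:", b_domestic, b_foreign)
```

With no constant in the model, each coefficient is simply the mean of its own category, so there is no subtraction left to do.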

Notice that the sum of squared errors (Residuals) is identical in both outputs (28,597,399). The t-score for domestically built cars is also identical in both models. The t-score for foreign differs between the models because it tests a different quantity: the difference in mean weight in the model with the constant, and the mean weight of foreign-built cars in the model without it.
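The sketch below illustrates both points on made-up toy weights (not the article's data). Rather than calling a regression routine, it uses the closed-form results for a single two-category predictor: both models predict each car's group mean, and the standard errors follow from the residual variance and the group sizes. All variable names are illustrative.

```python
from math import sqrt

# Hypothetical toy data (NOT the article's data set).
weight = [2800, 3900, 3100, 3500, 1900, 2700, 2300]
foreign = [0, 0, 0, 0, 1, 1, 1]

n = len(weight)
dom = [w for w, f in zip(weight, foreign) if f == 0]
frn = [w for w, f in zip(weight, foreign) if f == 1]
mean_dom = sum(dom) / len(dom)
mean_frn = sum(frn) / len(frn)
diff = mean_frn - mean_dom

# Fitted values under each parameterization.
fitted_with_constant = [mean_dom + diff * f for f in foreign]
fitted_no_constant = [mean_dom * (1 - f) + mean_frn * f for f in foreign]

sse_with = sum((w - p) ** 2 for w, p in zip(weight, fitted_with_constant))
sse_without = sum((w - p) ** 2 for w, p in zip(weight, fitted_no_constant))

# Both models estimate two parameters, so the residual variance is the same.
mse = sse_with / (n - 2)
se_domestic = sqrt(mse / len(dom))
se_foreign_mean = sqrt(mse / len(frn))
se_difference = sqrt(mse * (1 / len(dom) + 1 / len(frn)))

t_domestic = mean_dom / se_domestic              # same in both models
t_foreign_with_constant = diff / se_difference   # tests the difference in means
t_foreign_no_constant = mean_frn / se_foreign_mean  # tests the foreign mean itself

print("SSE:", sse_with, sse_without)
print("t (foreign):", round(t_foreign_with_constant, 2), round(t_foreign_no_constant, 2))
```

The two sums of squared errors agree because the fitted values are identical, while the two t-scores for foreign answer different questions and so differ.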

The Impact on R-squared

The one statistical measurement that is very different between the two models is the R-squared. When including the constant, the R-squared is 0.3514, and when excluding the constant, it is 0.9602. Wow, makes you want to run every linear regression without the constant!

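Don't be fooled. The formula used for calculating R-squared without the constant is incorrect. When the constant is removed, software typically measures the total variation of the outcome around zero rather than around its mean, which inflates the R-squared. The reported value can actually vary from one statistical software package to another. (This article explains the error in the formula, in case you're interested.)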
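Here is a small Python sketch of why the number jumps, again on made-up toy weights rather than the article's data. It assumes the no-constant R-squared is computed from the uncentered total sum of squares (variation around zero), which is a common convention but, as noted, not one every package necessarily follows.

```python
# Hypothetical toy data (NOT the article's data set).
weight = [2800, 3900, 3100, 3500, 1900, 2700, 2300]
foreign = [0, 0, 0, 0, 1, 1, 1]

dom = [w for w, f in zip(weight, foreign) if f == 0]
frn = [w for w, f in zip(weight, foreign) if f == 1]
mean_dom = sum(dom) / len(dom)
mean_frn = sum(frn) / len(frn)
overall_mean = sum(weight) / len(weight)

# Residual sum of squares: identical with or without the constant,
# because both models predict each car's group mean.
sse = sum((w - (mean_frn if f else mean_dom)) ** 2 for w, f in zip(weight, foreign))

# Centered total sum of squares: variation around the overall mean.
tss_centered = sum((w - overall_mean) ** 2 for w in weight)

# Uncentered total sum of squares: variation around zero (assumed here to be
# what a no-constant fit reports against).
tss_uncentered = sum(w ** 2 for w in weight)

r2_centered = 1 - sse / tss_centered
r2_uncentered = 1 - sse / tss_uncentered

print("R-squared, centered:", round(r2_centered, 4))
print("R-squared, uncentered:", round(r2_uncentered, 4))
```

The model fits the data exactly as well in both cases. Only the yardstick changes, and measuring against zero makes almost any model for car weights look impressive.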

If you are using Stata and want output similar to the “no constant” model along with an accurate R-squared value, use the option hascons rather than noconstant.

The impact of removing the constant when the predictor variable is continuous is significantly different. In a follow-up article, we will explore why you should never do that.


References and Further Reading:

How to Interpret the Intercept in 6 Linear Regression Examples

Understanding Interaction Between Dummy Coded Categorical Variables in Linear Regression

Interpreting (Even Tricky) Regression Coefficients – A Quiz

 

This Month's Statistically Speaking Webinar

Cox Regression


Upcoming Workshops

Introduction to Data Analysis with Stata



Who We Are

The Analysis Factor is your go-to source for expert training and mentorship in all things statistics. Our trusty team of top-of-the-line statistics experts is at the helm, ready to help anyone who gets their hands messy with data.

The Analysis Factor is the difference between knowing about statistics and knowing how to use statistics in data analysis.

Statistical analysis is an applied skill. You have to learn how to use statistical tools within the context of your own data. We specialize in teaching just that.

What We Do

At The Analysis Factor, we offer one-on-one consulting services, live and on-demand workshops, and monthly topic webinars complete with Q&A sessions. All with friendly faces and plucky personalities, to boot.

Our valuable resources and learning programs empower researchers to become confident, able, and skilled statistical practitioners.

We aim to make your journey through the real-world application of statistical analysis better, easier, and (dare we say) more fun.

Why We Do It

Karen Grace-Martin, the brilliant brains behind the (ad)venture, spent 7 years as a statistical consultant at Cornell University. While there, she learned that being an excellent statistical advisor is not only about having the goods (i.e., the best statistical skills ever), but about understanding the real pressures and issues that researchers face.

Combine this understanding with fabulous customer service and the rare ability to communicate technical ideas in a way that each client understands and, well, you've got the best darn stats training around.

Here at The Analysis Factor, we're on a mission to make data analysis affordable and accessible for everyone.

Yep, that means you.

Learn more at theanalysisfactor.com.


Think the world needs a little more oomph with their data?

So do we. Forward this newsletter to a friend, colleague, or hey, even your mom. (Little known fact: Moms love what you love.)

And if you got this from a friend (or from your mom), sign up here to get your very own copy each month.

