Effect Size Statistics in Logistic Regression

Effect size statistics are expected by many journal editors these days.

If you’re running an ANOVA, t-test, or linear regression model, it’s pretty straightforward which ones to report.

Things get trickier, though, once you venture into other types of models..  Logistic regression, for example. 

Many of the common effect size statistics, like eta-squared and Cohen’s d, can’t be calculated in a logistic regression model.  So now what do you use?

Types of Effect Size Statistics

First, it’s important to understand what effect size statistics are for and why they’re worth reporting.

This quotation by Joseph Durlak explains it nicely:

“…provide information about the magnitude and direction of the difference between two groups or the relationship between two variables.”

There are two types of effect size statistics–standardized and unstandardized.

Standardized statistics have been stripped of all units of measurement.

Correlation is a nice example.  People like correlation because the strength and direction of any two correlations can be compared, regardless of the units of the variables on which the correlation was measured.

Unstandardized statistics are still measured in the original units of the variables.  So a difference in two means and a regression coefficient are both effect size statistics and both are useful to report.

Most people mean standardized when they say “effect size statistic.”  But both describe the magnitude and direction of the research findings.

Odds Ratios as Effect Size Statistics

If you’re at all familiar with logistic regression, you’re also familiar with odds ratios.  Odds ratios measure how many times bigger the odds of one outcome is for one value of an IV, compared to another value.

For example, let’s say you’re doing a logistic regression for a ecology study on whether or not a wetland in a certain area has been infected with a specific invasive plant.  Predictors include water temperature in degrees Celsius, altitude, and whether the wetland is a fen or a marsh.

If the odds ratio for water temperature is 1.12, that means that for each one-degree Celsius increase in water temperature, the odds of the wetland having the invasive plant species is 1.12 times as big, after controlling for the other predictors.

That odds ratio is an unstandardized effect size statistic.  It tells you the direction and the strength of the relationship between water temperature and the odds that the plant is present.

It’s unstandardized because it’s based on the units of temperature.

I realize no ecologist would do so, but if the water were measured in degrees Fahrenheit, that odds ratio would have a different value.  The direction and the strength of the relationship would be the same, but the statistic would be evaluated on a different scale.

Odds Ratios as Standardized Effect Size Statistics

Surprisingly, I’ve seen odds ratios listed as standardized effect size statistics.

A little digging showed those authors were referring to one of two situations.

The first situation is when the predictor is also binary.  The odds ratio for whether the wetland was a fen or a marsh can be considered standardized, not because we’ve removed any units, but because there never were any.

So if the odds ratio for fen vs. marsh is 2.3, we know the odds of the invasive plant being in a fen is 2.3 times that of a marsh.

The second is when the numerical predictor is standardized.  If we standardize the temperature (aka, convert the scale to Z scores), we’ve removed the units. 

We’ll get the same Z scores from degrees Fahrenheit as degrees Celsius, and the new odds ratio will be in terms of one standard deviation increases in temperature, rather than one degree.

Voila! Units removed.

 

Understanding Probability, Odds, and Odds Ratios in Logistic Regression
Despite the way the terms are used in common English, odds and probability are not interchangeable. Join us to see how they differ, what each one means, and how to tame that tricky beast: Odds Ratios.

Reader Interactions

Comments

  1. Agnieszka says

    Many thanks for this post.
    Now, if I would like to convert OR into d, using the LogOddsRatio formula – which ORs should I use: the standardised ones?

  2. Oliver says

    Hello,

    as this is the first post to this topic that i found on google.
    I found the following paper to this subject:
    Using Dominance Analysis to Determine Predictor Importance in Logistic Regression; Journal of Educational and Behavioral Statistics 34(3):319 -347 · September 2009

  3. Kirsten says

    How can one compute standardized odds ratios for categorical variables in order to compare the magnitude of their effect?

    • azmat says

      Dear
      Did you find the answer, How we could calculate Odd ratios??

      How can one compute standardized odds ratios for categorical variables in order to compare the magnitude of their effect?

      regards
      azmat

      • Karen Grace-Martin says

        Hi Azmat (and Kirsten),

        It doesn’t make sense to use standardized coefficients in any type of regression (not just logistic) for categorical variables. Standardized coefficients are in terms of standard deviations and what does the standard deviation of Yes/No mean?

  4. Joe Hoover says

    Hello,

    I came across this post when looking for a way to estimate effect sizes for logistic regression coefficients. I anticipated finding something obscure, so I was happily surprised to read that odds-ratios estimated with standardized predictors can be interpreted as effect sizes. This makes sense to me; but, I wonder if you are aware of any published precedent for this approach? Thanks!

    Best,
    Joe


Leave a Reply

Your email address will not be published. Required fields are marked *

Please note that, due to the large number of comments submitted, any questions on problems related to a personal study/project will not be answered. We suggest joining Statistically Speaking, where you have access to a private forum and more resources 24/7.