Blog Posts

Assumptions of Linear Models are about Errors, not the Response Variable

March 19, 2024

I recently received a great question in a comment about whether the assumptions of normality, constant variance, and independence in linear models are about the residuals or the response variable. The asker had a situation where Y, the response, was not normally distributed, but the residuals were.

Member Training: Coarsened Exact Matching, an Alternative to Propensity Score Matching

February 29, 2024

The objective for quasi-experimental designs is to establish cause and effect relationships between the dependent and independent variables. However, they have one big challenge in achieving this objective: lack of an established control group.

Beyond R-squared: Assessing the Fit of Regression Models

February 20, 2024

A well-fitting regression model results in predicted values close to the observed data values. The mean model, which uses the mean for every predicted value, generally would be used if there were no useful predictor variables. The fit of a proposed regression model should therefore be better than the fit of the mean model. But […]

Getting Started with Stata Tutorial #4: the Statistics Menu

February 4, 2024

In part 3 of this series, we explored the Stata graphics menu. In this post, let’s look at the Stata Statistics menu. Statistics Menu Let’s use the Statistics menu to see if price varies by car origin (foreign). We are testing whether a continuous variable has a different mean for the two categories of a […]

Member Training: Effective File and Process Management in Statistical Projects

January 31, 2024

Do you ever wish your data analysis project were a little more organized?

Charting a Path to Statistical Confidence and Mastery

January 17, 2024

Tell me if you can relate to this: You love your field of study, you enjoy asking the big questions and discovering answers. But, when it comes to data analysis and statistics you get a little bogged down. You might even feel a bit lost sometimes. And that is hard to admit. Because after all, […]

When the Hessian Matrix Goes Wacky

December 20, 2023

If you have run mixed models much at all, you have undoubtedly been haunted by some version of this very obtuse warning: “The Hessian (or G or D) Matrix is not positive definite. Convergence has stopped.” Or “The Model has not Converged. Parameter Estimates from the last iteration are displayed.” What on earth does that mean?

The Difference Between Crossed and Nested Factors

December 18, 2023

One of those tricky, but necessary, concepts in statistics is the difference between crossed and nested factors. As a reminder, a factor is any categorical independent variable. In experiments, or any randomized designs, these factors are often manipulated. Experimental manipulations (like Treatment vs. Control) are factors. Observational categorical predictors, such as gender, time point, poverty […]

What is Family-wise Error Rate?

December 8, 2023

In statistical practice, there are many situations where best practices are clear. There are many, though, where they aren’t. The granddaddy of these practices is adjusting p-values when you make multiple comparisons. There are good reasons to do it and good reasons not to. It depends on the situation. At the heart of the issue […]

Five Ways to Analyze Ordinal Variables (Some Better than Others)

December 3, 2023

There are not a lot of statistical methods designed just to analyze ordinal variables. But that doesn’t mean that you’re stuck with few options. There are more than you’d think. Some are better than others, but it depends on the situation and research questions. Here are five options when your dependent variable is ordinal.

<< Older Entries Newer Entries >>

stat skill-building compass

Find clarity on your statistics journey. Try the new tool Stat Skill-Building Compass: Find Your Starting Point!

Previous Posts

stat skill-building compass