OptinMon

Member Training: Using Open Data in Research: Opportunities and Challenges

October 1st, 2020 by

Open data, particularly government open data is a rich source of information that can be helpful to researchers in almost every field, but what is open data? How do we find what we’re looking for? What are some of the challenges with using data directly from city, county, state, and federal government agencies?

(more…)


What It Really Means to Remove an Interaction From a Model

September 17th, 2020 by

When you’re model building, a key decision is which interaction terms to include. And which interactions to remove.Stage 2

As a general rule, the default in regression is to leave them out. Add interactions only with a solid reason. It would seem like data fishing to simply add in all possible interactions.

And yet, that’s a common practice in most ANOVA models: put in all possible interactions and only take them out if there’s a solid reason. Even many software procedures default to creating interactions among categorical predictors.

(more…)


How Big of a Sample Size do you need for Factor Analysis?

August 21st, 2020 by

Most of the time when we plan a sample size for a data set, it’s based on obtaining reasonable statistical power for a key analysis of that data set. These power calculations figure out how big a sample you need so that a certain width of a confidence interval or p-value will coincide with a scientifically meaningful effect size.

But that’s not the only issue in sample size, and not every statistical analysis uses p-values.

(more…)


Effect Size Statistics: How to Calculate the Odds Ratio from a Chi-Square Cross-tabulation Table

August 12th, 2020 by

Lest you believe that odds ratios are merely the domain of logistic regression, I’m here to tell you it’s not true.

One of the simplest ways to calculate an odds ratio is from a cross tabulation table.

We usually analyze these tables with a categorical statistical test. There are a few options, depending on the sample size and the design, but common ones are Chi-Square test of independence or homogeneity, or a Fisher’s exact test.

(more…)


Member Training: Explaining Logistic Regression Results to Non-Researchers

August 1st, 2020 by

Interpreting the results of logistic regression can be tricky, even for people who are familiar with performing different kinds of statistical analyses. How do we then share these results with non-researchers in a way that makes sense?

(more…)


Why Adding Values on a Scale Can Lead to Measurement Error

July 22nd, 2020 by

Whenever you use a multi-item scale to measure a construct, a key step is to create a score for each subject in the data set.

This score is an estimate of the value of the latent construct (factor) the scale is measuring for each subject.  In fact, calculating this score is the final step of running a Confirmatory Factor Analysis.

(more…)