Data Preparation

Recoding a Variable from a Survey Question to Use in a Statistical Model

March 18th, 2019 by

Survey questions are often structured without regard for ease of use within a statistical model.Stage 2

Take for example a survey done by the Centers for Disease Control (CDC) regarding child births in the U.S. One of the variables in the data set is “interval since last pregnancy”. Here is a histogram of the results.

(more…)


Member Training: Determining Levels of Measurement: What Lies Beneath the Surface

March 4th, 2019 by

You probably learned about the four levels of measurement in your very first statistics class: nominal, ordinal, interval, and ratio.

Knowing the level of measurement of a variable is crucial when working out how to analyze the variable. Failing to correctly match the statistical method to a variable’s level of measurement leads either to nonsense or to misleading results.

But the simple framework of the four levels is too simplistic in most real-world data analysis situations.

(more…)


Rescaling Sets of Variables to Be on the Same Scale

December 11th, 2018 by

by Christos Giannoulis, PhD

Attributes are often measured using multiple variables with different upper and lower limits. For example, we may have five measures of political orientation, each with a different range of values.

Each variable is measured in a different way. The measures have a different number of categories and the low and high scores on each measure are different.

(more…)


Member Training: A Data Analyst’s Guide to Methods and Tools for Reproducible Research

November 1st, 2017 by

Have you ever experienced befuddlement when you dust off a data analysis that you ran six months ago? 

Ever gritted your teeth when your collaborator invalidates all your hard work by telling you that the data set you were working on had “a few minor changes”?

Or panicked when someone running a big meta-analysis asks you to share your data?

If any of these experiences rings true to you, then you need to adopt the philosophy of reproducible research.

(more…)


Tricks for Using Word to Make Statistical Syntax Easier

March 13th, 2017 by

We’ve talked a lot around here about the reasons to use syntax — not only menus — in your statistical analyses.

Regardless of which software you use, the syntax file is pretty much always a text file. This is true for R, SPSS, SAS, Stata — just about all of them.

This is important because it means you can use an unlikely tool to help you code: Microsoft Word.

I know what you’re thinking. Word? Really?

Yep, it’s true. Essentially it’s because Word has much better Search-and-Replace options than your stat software’s editor.

Here are a couple features of Word’s search-and-replace that I use to help me code faster:

(more…)


Member Training: Statistical Rules of Thumb: Essential Practices or Urban Myths?

March 1st, 2017 by

There are many rules of thumb in statistical analysis that make decision making and understanding results much easier.

Have you ever stopped to wonder where these rules came from, let alone if there is any scientific basis for them? Is there logic behind these rules, or is it propagation of urban legends?

In this webinar, we’ll explore and question the origins, justifications, and some of the most common rules of thumb in statistical analysis, like:

(more…)