OptinMon 15 - Principal Component Analysis

Factor Analysis: Choosing Your Variables

November 8th, 2013 by

by Maike Rahn, PhD

In previous posts in this series, we discussed factors and factor loadings and rotations. In this post, I would like to address another important detail for a successful factor analysis, the type of variables that you include in your analysis.

What type of variable?

Ideally, factor analysis is conducted with continuous variables that are normally distributed since factor analysis is based on a correlation matrix.

However, you will undoubtedly find many factor analyses that include ordinal variables, particularly Likert scale items.

While technically, Likert items don’t meet the assumptions of Factor Analysis, at least in some situations the  results have been found to be quite reasonable.  For example,  Lubke & Muthen, (2004) found that Confirmatory Factor Analysis on a single homogenous group worked, as long as items have at least seven values.

Some researchers include variables with fewer than seven values into their factor analysis.  Sometimes this cannot be avoided, if you are using an already published scale.

Last, there is an interesting discussion about including binary variables in a factor analysis in the Sage Publications booklet “Factor analysis. Statistical methods and practical issues” (Kim and Mueller, 1978; page 75).

Correct coding of variables

It is important to prepare your variables in advance. For example, if you anticipate finding a socioeconomic factor, create your ordinal variable occupation with levels from lowest to highest to make sure that you have a positive factor loading with your factor.

Occupational categories Levels of occupation variable
Nurse’s Aid 1
Administrative assistant 2
Nurse 3
Nurse manager 4
Physician 5
Department chair 6
Director 7

 

The reason for this preparation is that you will wind up with factor solutions that are easily interpretable, because variables that are coded in the same direction as the factor will always have a positive factor loading. On the other hand, variables that have an inverse association with the factor will always have a negative factor loading.

Kim, Jae-On and Mueller, Charles W (1978) Factor analysis. Statistical methods and practical issues. Series: Quantitative Applications in the Social Sciences. Sage Publications: Beverly Hills, CA.


Three Tips for Principal Component Analysis

June 14th, 2013 by

Principal Component Analysis (PCA) is a handy statistical tool to always have available in your data analysis tool belt.

It’s a data reduction technique, which means it’s a way of capturing the variance in many variables in a smaller, easier-to-work-with set of variables.

There are many, many details involved, though, so here are a few things to remember as you run your PCA.

1. The goal of PCA is to summarize the correlations among a set of observed variables with a smaller set of linear (more…)


Factor Analysis: A Short Introduction, Part 6–Common Problems

March 8th, 2013 by

by Maike Rahn, PhD

In the previous blogs I wrote about the basics of running a factor analysis. Real-life factor analysis can become complicated. Here are some of the more common problems researchers encounter and some possible solutions:

Solution: lower the cut-offs of your factor loadings, provided that lower factor loadings are expected and accepted in your field.

Solution 1: you were not able to validate the factor structure in your sample; your analysis with this sample did not work out.

Solution 2: your factor analysis has just become exploratory. Something is going on with your sample that is different from the samples used in other studies. Find out what it is.

Solution: the good news is that you found the hypothesized factors. The bad news is (more…)


Factor Analysis: A Short Introduction, Part 5–Dropping unimportant variables from your analysis

February 8th, 2013 by

by Maike Rahn, PhD

When are factor loadings not strong enough?

Once you run a factor analysis and think you have some usable results, it’s time to eliminate variables that are not “strong” enough. They are usually the ones with low factor loadings, although additional criteria should be considered before taking out a variable.

As a rule of thumb, your variable should have a rotated factor loading of at least |0.4| (meaning ≥ +.4 or ≤ –.4) onto one of the factors in order to be considered important. (more…)


Factor Analysis: A Short Introduction, Part 4–How many factors should I find?

November 16th, 2012 by

by Maike Rahn, PhD

One of the hardest things to determine when conducting a factor analysis is how many factors to settle on. Statistical programs provide a number of criteria to help with the selection.

Eigenvalue > 1

Programs usually have a default cut-off for the number of generated factors, such as all factors with an eigenvalue of ≥1.

This is because a factor with an eigenvalue of 1 accounts for as much variance as a single variable, and the logic is that only factors that explain at least the same amount of variance as a single variable is worth keeping.

But often a cut-off of 1 results in more factors than the user bargained for or (more…)


Factor Analysis: A Short Introduction, Part 3-The Difference Between Confirmatory and Exploratory Factor Analysis

November 2nd, 2012 by

by Maike Rahn, PhD

An important question that the consultants at The Analysis Factor are frequently asked is:

What is the difference between a confirmatory and an exploratory factor analysis?

A confirmatory factor analysis assumes that you enter the factor analysis with a firm idea about the number of factors you will encounter, and about which variables will most likely load onto each factor.

Your expectations are usually based on published findings of a factor analysis.

An example is a fatigue scale that has previously been validated. You would like to make sure that the variables in your sample load onto the factors the same way they did in the original research.

In other words, you have very clear expectations about what you will find in your own sample. This means that (more…)