Think of CFA as a process for testing what you already think you know. CFA is an integral part of structural equation modeling (SEM) and path analysis. The hypothesized factors should always be validated with CFA in a measurement model prior to incorporating them into a path or structural model. Because… garbage in, garbage out. CFA is also a useful tool in checking the reliability of a measurement tool with a new population of subjects, or to further refine an instrument which is already in use.

## In Factor Analysis, How Do We Decide Whether to Have Rotated or Unrotated Factors?

Question: How do we decide whether to have rotated or unrotated factors?

Answer: Great question. Of course, the answer depends on your situation. When you retain only one factor in a solution, then rotation is irrelevant. In fact, most software won’t even print out rotated coefficients and they’re pretty meaningless in that situation. But if you retain two or more factors, you need to rotate. Unrotated factors are pretty difficult to interpret in that situation.

## Can You Use Principal Component Analysis with a Training Set Test Set Model?

Question: Can you use Principal Component Analysis with a Training Set Test Set Model?

Answer: Yes and no. Principal Component Analysis specifically could be used with a training and test data set, but it doesn’t make as much sense as doing so for Factor Analysis. That’s because PCA is really just about creating an index variable from a set of correlated predictors.

## Can We Use PCA for Reducing Both Predictors and Response Variables?

Question: Can we use PCA for reducing both predictors and response variables? In fact, there were a few related but separate questions about using and interpreting the resulting component scores, so I’ll answer them together here.

## The Fundamental Difference Between Principal Component Analysis and Factor Analysis

One of the many confusing issues in statistics is the confusion between Principal Component Analysis (PCA) and Factor Analysis (FA). They are very similar in many ways, so it’s not hard to see why they’re so often confused. They appear to be different varieties of the same analysis rather than two different methods. Yet there is a fundamental difference between them that has huge effects on how to use them.

## In Principal Component Analysis, Can Loadings Be Negative?

Question: In Principal Component Analysis, can loadings be both positive and negative?

Answer: Yes. Recall that in PCA, we are creating one index variable (or a few) from a set of variables. You can think of this index variable as a weighted average of the original variables. The loadings are the weights. The goal of the PCA is to come up with optimal weights. “Optimal” means we’re capturing as much information in the original variables as possible, based on the correlations among those variables.

## A Huge Improvement to How We Teach Statistical Software in Workshops

We’re changing how we teach our statistics workshops to support more software options. Each module will feature a live webinar lecture (along with all the supplementary material — code, exercises, Q&As, etc.). This lecture used to include all the statistical concepts, the steps to implement, and a demonstration in one or more software packages. Now we’re splitting those things up so that you have easier access to the software support you need.

## Analyzing Zero-Truncated Count Data: Length of Stay in the ICU for Flu Victims

Let’s imagine you have been asked to determine the factors that will help a hospital determine the length of stay in the intensive care unit (ICU) once a patient is admitted. The hospital tells you that once the patient is admitted to the ICU, he or she has a day count of one. As soon as they spend 24 hours plus 1 minute, they have stayed an additional day. Clearly this is count data. There are no fractions, only whole numbers.

## January 2017 Webinar: Communicating Statistical Results to Non-Statisticians

One of the biggest challenges that data analysts face is communicating statistical results to our clients, advisors, and colleagues who don’t have a statistics background. Unfortunately, the way that we learn statistics is not usually the best way to communicate our work to others, and many of us are left on our own to navigate what is arguably the most important part of our work.

## Two-Way Tables and Count Models: Expected and Predicted Counts

In a previous article, we discussed how incidence rate ratios calculated in a Poisson regression can be determined from a two-way table of categorical variables. Statistical software can also calculate the expected (or predicted) count for each group. Below is the actual and expected count of the number of boys and girls participating and not participating in organized sports.