Why We Needed a Random Sample of 6 numbers between 1 and 10000
As you may have read in one of our recent newsletters, this month The Analysis Factor hit two milestones:
- 10,000 subscribers to our mailing list
- 6 years in business.
We’re quite happy about both, and seriously grateful to all members of our community.
So to celebrate and to say thanks, we decided to do a giveaway to 6 randomly-chosen newsletter subscribers.
I just sent emails to the 6 winners this morning.
How We Randomly Generated 6 Equally Likely Values out of 10000 Using R Commander (more…)
Not too long ago, a client asked for help with using Spotlight Analysis to interpret an interaction in a regression model.
Spotlight Analysis? I had never heard of it.
As it turns out, it’s a (snazzy) new name for an old way of interpreting an interaction between a continuous and a categorical grouping variable in a regression model. (more…)
A central concept in statistics is the level of measurement of a variable. It’s so important to everything you do with data that it’s usually taught within the first week in every intro stats class.
But even something so fundamental can be tricky once you start working with real data. (more…)
In our last article, we learned about model fit in Generalized Linear Models on binary data using the glm() command. We continue with the same glm on the mtcars data set (regressing the vs variable on the weight and engine displacement).
Now we want to plot our model, along with the observed data.
Although we ran a model with multiple predictors, it can help interpretation to plot the predicted probability that vs=1 against each predictor separately. So first we fit a glm for only (more…)
In the last article, we saw how to create a simple Generalized Linear Model on binary data using the glm() command. We continue with the same glm on the mtcars data set (more…)
Ordinary Least Squares regression provides linear models of continuous variables. However, much data of interest to statisticians and researchers are not continuous and so other methods must be used to create useful predictive models.
The glm() command is designed to perform generalized linear models (regressions) on binary outcome data, count data, probability data, proportion data and many other data types.
In this blog post, we explore the use of R’s glm() command on one such data type. Let’s take a look at a simple example where we model binary data.
(more…)