Blog Posts

Generalized Linear Models in R, Part 2: Understanding Model Fit in Logistic Regression Output

June 24, 2014

In the last article, we saw how to create a simple Generalized Linear Model on binary data using the glm() command. We continue with the same glm on the mtcars data set.

Generalized Linear Models in R, Part 1: Calculating Predicted Probability in Binary Logistic Regression

June 18, 2014

Ordinary Least Squares regression provides linear models of continuous variables. However, much of the data of interest to statisticians and researchers is not continuous, so other methods must be used to create useful predictive models. The glm() command is designed to fit generalized linear models (regressions) on binary outcome data, count data, probability data, proportion data, and many other data types. In this blog post, we explore the use of R’s glm() command on one such data type. Let’s take a look at a simple example where we model binary data.
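As a rough illustration of the kind of binary model described above, here is a minimal sketch using the mtcars data set that ships with R. The choice of vs (engine shape, coded 0/1) as the outcome and wt (weight in 1000 lbs) as the predictor is an assumption made for this example and may not match the variables used in the full article.

```r
# Minimal sketch: logistic regression on a binary outcome with glm().
# Assumption: vs (0/1) as outcome and wt as predictor are illustrative choices.
fit <- glm(vs ~ wt, data = mtcars, family = binomial)

# Coefficients are on the log-odds (logit) scale
print(coef(fit))

# Predicted probability that vs == 1 for a hypothetical car with wt = 3
p_hat <- predict(fit, newdata = data.frame(wt = 3), type = "response")
print(p_hat)
```

Setting family = binomial is what makes glm() fit a logistic regression rather than an ordinary linear model, and type = "response" returns a probability instead of a log-odds value.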

Member Training: Multiple Comparisons

June 1, 2014

Whenever you run multiple statistical tests on the same set of data, you run into the problem of the Familywise Error Rate. What this means is that the true probability of a Type I error somewhere in the family of tests you’re running is actually higher than the alpha = .05 you’re using for any given test. This is a complicated and controversial issue in statistics: even statisticians argue about whether it’s a problem, when it’s a problem, and what to do about it.
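To make the inflation concrete, the sketch below computes the familywise error rate under the simplifying assumption that the tests are independent, in which case it equals 1 - (1 - alpha)^m for m tests. The p-values passed to p.adjust() are hypothetical and only show one common correction (Bonferroni); this is not necessarily the approach covered in the training.

```r
# Familywise error rate for m independent tests, each run at alpha = .05.
# Assumption: independence between tests; real test families are often correlated.
alpha <- 0.05
m <- c(1, 5, 10, 20)
fwer <- 1 - (1 - alpha)^m
print(data.frame(tests = m, fwer = round(fwer, 3)))

# One common correction: Bonferroni-adjusted p-values (hypothetical inputs)
p_raw <- c(0.01, 0.02, 0.04)
p_adj <- p.adjust(p_raw, method = "bonferroni")
print(p_adj)
```

With a single test the familywise rate equals alpha, and it climbs steadily as more tests are added to the family.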

SPSS Procedures for Logistic Regression

May 15, 2014

Logistic Regression can be used only for binary dependent variables. It can be invoked using the menu choices at right or through the LOGISTIC REGRESSION syntax command. The dependent variable must have only two values. If you specify a variable with more than two values, you'll get an error. One big advantage of this procedure is that it allows you to build successive models by entering a group of predictors at a time.

Member Training: Cluster Analysis–Hierarchical and KMeans

May 7, 2014

Cluster analysis classifies individuals into two or more unknown groups based on a set of numerical variables. It is related to, but distinct from, a few other multivariate techniques including Discriminant Function Analysis…

What’s in a Name? Moderation and Interaction, Independent and Predictor Variables

April 14, 2014

When we talk about moderation, though, there is a specific role to X and Z. One is assigned as the Independent Variable and the other as the Moderator. The Independent Variable is an independent variable based on the third implication listed above: its effect is of primary interest.

R Is Not So Hard! A Tutorial, Part 14: Pie Charts

March 27, 2014

In Part 14, let’s see how to create pie charts in R. Let’s create a simple pie chart using the pie() command. As always, we set up a vector of numbers and then we plot them.
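A minimal sketch of that workflow might look like the following. The numbers and category labels are hypothetical and chosen only for illustration.

```r
# Set up a vector of numbers (hypothetical values) and name the categories
slices <- c(12, 30, 26, 16)
names(slices) <- c("A", "B", "C", "D")

# Draw the pie chart; labels default to the names, shown explicitly here
pie(slices, labels = names(slices), main = "Simple pie chart")

# Percentage share of each slice, which can be handy for labelling
share <- round(100 * slices / sum(slices), 1)
print(share)
```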

R Programming Video: 15 Tips for The Beginner

March 25, 2014

One of our instructors, David Lillis, recently gave a talk to the Wellington R Users Group highlighting 15 tips for beginners using the R statistical programming language. Below is a video recording of his presentation…

Analysis of Complex Sample Surveys Made Simple

March 19, 2014

Complex Surveys use a sampling technique other than a simple random sample. Terms you may have heard in this area include cluster sampling, stratified sampling, oversampling, two-stage sampling, and primary sampling unit. Complex Samples require statistical methods that take the exact sampling design into account to ensure accurate results. This webinar, by guest presenter Dr. […]

R Is Not So Hard! A Tutorial, Part 13: Box Plots

March 17, 2014

In Part 13, let’s see how to create box plots in R. Let’s create a simple box plot using the boxplot() command, which is easy to use. First, we set up a vector of numbers and then we plot them. Boxplots can be created for individual variables or for variables by group.
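Both cases can be sketched briefly. The vector x holds hypothetical values, and the grouped example uses the built-in mtcars data set as an assumption for illustration; the full tutorial may use different data.

```r
# Box plot of a single variable (hypothetical values)
x <- c(2, 4, 4, 5, 7, 9, 10, 12, 21)
boxplot(x, main = "Single variable", ylab = "Value")

# Box plots by group using the formula interface: mpg split by cylinder count.
# Assumption: mtcars is used here only as a convenient built-in example.
bp <- boxplot(mpg ~ cyl, data = mtcars,
              xlab = "Cylinders", ylab = "Miles per gallon",
              main = "mpg by number of cylinders")

# boxplot() invisibly returns the statistics it plotted, including group sizes
print(bp$n)
```

The formula mpg ~ cyl reads as "mpg grouped by cyl", and the returned list is useful when you want the plotted summary statistics without reading them off the chart.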
