by Jeff Meyer
Like many people with graduate degrees, I have used a number of statistical software packages over the years.
Through work and school I have used EViews, SAS, SPSS, R and Stata.
Some were more difficult to use than others, but if you used them often enough you became proficient enough to take on the task at hand (though some packages required greater use of George Carlin’s seven dirty words).
There was always one caveat that determined which package I used: which one my professor or my employer insisted that I use. But that all changed when I decided to start up my own consulting business.
EViews was quickly eliminated from the selection process due to its narrow focus. SAS came fully loaded, but I had to be fully loaded with cash in order to afford it (btw, I’m not).
So the final contenders were R, SPSS and Stata (wow, three choices, just like the TV show “House Hunters International”).
Why not R
R had a lot to offer. I remembered a friend’s favorite saying, “If it’s free, it’s for me”.
My biggest concern with R was remembering how to use it. With the potential for infrequent use I might end up spending too much time relearning how to use it.
Why not SPSS
I had used SPSS in graduate school and had the student version on my desktop at home. I had read that SPSS was the choice of most social scientists.
Graphs (clients love graphs) were easy to create, copy and paste into a Word document. If I forgot the codes I could always revert to the menus. So I decided to use SPSS.
To begin learning the things they don’t teach you in graduate school but you had better know if you don’t want to make a fool of yourself, I started taking Karen’s workshops.
The important topic of “bootstrapping” came up and I opened up SPSS to begin practicing.
I believe I used five of George Carlin’s words when I discovered that it was not included in the student version of SPSS. I opened up Stata (which I had on my desktop as well) and found the option (along with plenty of documentation on how to use it).
I then took Karen’s workshop on “Missing Data” and quickly ran into problems recreating her results because they required an add-on module. I contacted SPSS and was told the missing data module was an additional $1,500.
Once again I opened up Stata and found that everything that I needed was included in the version of Stata that I owned.
Once I settled on using Stata as my primary statistical software package I realized how much it has to offer besides being less expensive.
Like SPSS, Stata allows you to write code or use menus to perform your analysis.
Stata has two primary menu tabs: Graphics and Statistics. Within “Statistics” there are twenty-one sub-tabs and numerous tabs within those tabs. Within “Graphics” there are twenty-one tabs as well (you think the people at Stata like to play blackjack?).
I admit, this can be a bit daunting and time-consuming if you are trying to find a specific function. But wait, there’s more.
In the command box you can type “help” and what you are looking for. For example, if I want to run a logistic regression I simply type “help logistic regression” and documentation for running a logistic regression opens.
Within the help guide is “Menu” which gives the path you take to use the menu method of running the command. In this case it is: Statistics > Binary outcomes > Logistic regression.
The key point here is that I don’t waste time trying to find what I’m looking for. After I run the command through the menus, the code shows up in the “Review” box. I can then copy and paste the command into a “do-file”.
A “do-file” is the text document that allows you to submit more than one command to Stata at once.
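As a rough illustration, a small do-file for the logistic regression example might look like the sketch below. It assumes the auto.dta example data set that ships with Stata, and the choice of outcome and predictors (foreign, mpg, weight) is purely illustrative, not taken from any real project.

```stata
* Minimal do-file sketch. auto.dta ships with Stata; the model is illustrative only.
sysuse auto, clear

* Open the help file for the logistic command
help logistic

* Command-line equivalent of Statistics > Binary outcomes > Logistic regression
logistic foreign mpg weight
```

Highlighting all of the lines in the do-file editor and executing them submits every command to Stata in one step.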
Stata allows you to have more than one do-file opened at a time. This is a big plus because it makes it easy to copy and paste from other project do-files into the current do-file. Using do-files is significantly quicker than using the menus if you have created template do-files, especially for creating graphs.
There are so many options for creating a graph. It takes less than a minute to copy from a template and paste the commands into your current project.
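A graph template can be as short as the sketch below. The data set (auto.dta, which ships with Stata), the titles and the output file name are all placeholders chosen for illustration; the point is that only a few pieces need to change from project to project.

```stata
* Hypothetical graph template: swap in your own variables, titles and file name
sysuse auto, clear

* Scatter plot with custom titles; /// continues a command onto the next line
twoway scatter mpg weight, ///
    title("Mileage versus weight") ///
    xtitle("Weight (lbs.)") ytitle("Mileage (mpg)")

* Save the graph to disk so it can be placed in a report
graph export "mpg_weight.png", replace
```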
Stata is extremely efficient at running repetitive analyses when you incorporate macros and loops in a do-file. This sounds like it may be difficult, but it’s not. This will be discussed in a future blog post.
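For a first taste, here is a minimal sketch of a local macro combined with a foreach loop. It again assumes the auto.dta example data set, and the macro name (predictors) and the list of variables are made up for illustration.

```stata
* Sketch only: run the same model once for each predictor in a list
sysuse auto, clear

* A local macro holding the (illustrative) list of predictors
local predictors mpg weight length

* Loop over the list; `x' expands to the current variable name
foreach x of local predictors {
    display "Predictor: `x'"
    logistic foreign `x'
}
```

Adding or removing a variable from the macro changes how many models are run without touching the loop itself.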
How well is Stata supported by StataCorp? On average Stata sends out updated files every two months with new features and/or fixes for reported glitches. The reference guide for Stata 13 is 281 pages filled with examples and links to the data sets used in the examples.
The professional community also provides incredible support. Stata allows third-party commands (also known as modules) to be imported into the software. The website http://ideas.repec.org/s/boc/bocode.html is a warehouse for hundreds of third-party commands which have been tested before being made public. Running a search for “logistic” returned 128 results.
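Third-party commands can also be found and installed from inside Stata. The sketch below assumes an internet connection and uses the ssc command, which installs packages from the Statistical Software Components archive (to my understanding, the same collection that the page above lists). The package estout is just one example of a user-written module.

```stata
* Search official and user-written resources for a keyword
search logistic, all

* Read the description of a user-written package before installing it
ssc describe estout

* Install the package from the SSC archive
ssc install estout
```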
The bottom line is Stata will run every analysis that the other major statistical packages can, if not more.
It is an efficiently organized program that is easy to learn. Third-party professionals are continuously offering new functions. Stata adds new features without charging a “new” version fee.
All this, plus the added bonus that it’s reasonably priced and has no add-on charges.
Jeff Meyer is a statistical consultant, instructor and writer for The Analysis Factor. Learn more about Jeff…