<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Analysis Factor</title>
	<atom:link href="http://www.theanalysisfactor.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.theanalysisfactor.com</link>
	<description>Statistical Consulting, Resources, and Statistics Workshops for Researchers in Psychology, Sociology, and other Social and Biological Sciences</description>
	<lastBuildDate>Tue, 31 Jan 2012 21:12:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Understanding Probability, Odds, and Odds Ratios in Logistic Regression</title>
		<link>http://www.theanalysisfactor.com/understanding-probability-odds/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=understanding-probability-odds</link>
		<comments>http://www.theanalysisfactor.com/understanding-probability-odds/#comments</comments>
		<pubDate>Tue, 31 Jan 2012 17:57:51 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[Statistical Analysis Webinars]]></category>
		<category><![CDATA[Upcoming]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=2312</guid>
		<description><![CDATA[Odds ratios are the bane of many data analysts. Interpreting them can be like learning a whole new language. This teleseminar will go over an example to show how to interpret the odds ratios in binary logistic regression. You will learn: how probability and odds both measure the same thing on different scales the meaning [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Odds ratios are the bane of many data analysts. Interpreting them can be like learning a whole new language. This teleseminar will go over an example to show how to interpret the odds ratios in binary logistic regression. You will learn:</p>
<ul>
<li>how probability and odds both measure the same thing on different scales</li>
<li>the meaning of odds</li>
<li>how to interpret an odds ratio for continuous and categorical predictors in logistic regression</li>
</ul>
<p><strong>Date:</strong> April 4, 2012</p>
<p><strong>Time:</strong> 1pm Eastern Time (12pm Central, 11am Mountain, 10am Pacific)</p>
<p><strong>Where:</strong> Anywhere you have a fast internet connection</p>
<p><strong>Length of Program:</strong> An Hour</p>
<p><strong>Cost:</strong> Always FREE</p>
<p><strong>Space is limited.</strong></p>
[upcoming-webinar-optin]
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/understanding-probability-odds/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>When Can Count Data be Considered Continuous?</title>
		<link>http://www.theanalysisfactor.com/count-data-considered-continuous/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=count-data-considered-continuous</link>
		<comments>http://www.theanalysisfactor.com/count-data-considered-continuous/#comments</comments>
		<pubDate>Fri, 13 Jan 2012 19:19:48 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[Poisson and Negative Binomial Regression Models]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=2295</guid>
		<description><![CDATA[Q: How high does the count scale have to be before you can consider it continuous?

I suspect you're getting at the same issue as in the last question. It's certainly true that when you get into very large numbers, many of the issues with count variables aren't issues anymore.]]></description>
			<content:encoded><![CDATA[<p></p><p>Last month I did a <a href="http://www.theanalysisfactor.com/poisson-and-negative-binomial-regression/">webinar on Poisson and negative binomial models for count data</a>. With a few hundred participants, we ran out of time to get through all the questions, so I&#8217;m answering some of them here on the blog.</p>
<p>This set of questions are all related to when it&#8217;s appropriate to treat count data as continuous and run the more familiar and simpler linear model.</p>
<p><strong>Q: Do you have any guidelines or rules of thumb as far as how many discrete values an outcome variable can take on before it makes more sense to just treat it as continuous?</strong></p>
<p>The issue usually isn&#8217;t a matter of how many values there are.  I see what you mean in that a discrete scale that goes from <span id="more-2295"></span>0 to 8 for example feels more discrete because there are only nine possible values, compared to a discrete scale that goes from 0 to 200.  201 values just feels more continuous.</p>
<p>But that&#8217;s not really the issue in most <a href="http://www.theanalysisfactor.com/poisson-regression-analysis-for-count-data/">count models</a>. The issue with <a href="http://www.theanalysisfactor.com/6-types-of-dependent-variables-that-will-never-meet-the-glm-normality-assumption/">count variables is that they bounded at zero</a>. This wreaks havoc on the <a href="http://www.theanalysisfactor.com/assumptions-of-linear-modelsassumptions-of-linear-models/">assumptions of a linear model</a>, which require continuous data.</p>
<p>If none of your data are near zero, it would be less of an issue.  Treating that count variable as continuous would give you predicted values that are non-integers, but perhaps that&#8217;s not a big issue in your particular data set.</p>
<p><strong>Q: How high does the count scale have to be before you can consider it continuous?</strong></p>
<p>I suspect you&#8217;re getting at the same issue as in the last question. It&#8217;s certainly true that when you get into very large numbers, many of the issues with count variables aren&#8217;t issues anymore.</p>
<p>For example, most incomes are not measured using decimals, just whole numbers. You could consider them a count of the number of dollars. Likewise, demographic variables like the number of children vaccinated in a state over the course of the year are truly counts, but the smallest values are likely to be in the hundreds of thousands, or even millions.</p>
<p>As long as there are no data along the bound of zero, and you don&#8217;t mind predicted values that include decimals, there&#8217;s no problem treating it as continuous.</p>
<p><strong>Q: For count data distributing not skewed, but in a symmetric/or even normal shape, are poisson and NB still the best choice?</strong></p>
<p>Sometimes.  The Poisson distribution is only skewed when the mean is very small. When the mean gets up to only 10, the distribution will become symmetric and bell shaped.</p>
<p>Depending on the effects of the predictors, and actual range of the data, i.e. whether there are actual 0 pounds or not, you may get identical results from running a linear model compare to a Poisson or negative binomial model.</p>
<p>If you do run a linear model, it will be possible to get predictive values below zero, and you need to consider whether that&#8217;s problematic in your situation. If the point of your model is prediction, it may be more of an issue.</p>
<p><strong>Q: If count data can be normalized by log transformation, will you recommend using poisson or linear regression?</strong></p>
<p>It&#8217;s never wrong to run a Poisson model, so what you&#8217;re asking is if the increased accuracy is worth the trouble of running the more complicated model.  There are certainly cases where running a linear model simplifies things a lot and still gives you the same results.  (You just won&#8217;t know you have the same results unless you run both).</p>
<p>When the mean count is very small, and zero is the most common value in the data set, it will be impossible to normalize using a log transformation.  It just won&#8217;t work. The mode will always be at the lowest value.  In that situation, you have no choice.</p>
<p>However, if the mean count is a little bit larger, zero may not be the most common value. When the mode is not the lower bound, it will be possible to use a log transformation to normalize the data. In fact, not too many years ago, when Poisson and negative binomial models were not readily available in software, textbooks did suggest this approach.  You may still have some on your shelf that do so.</p>
<p>It&#8217;s not necessarily a bad approach.  You may very well get the exact same results. If so, and if, for example, you are writing a report for an audience with out the statistical sophistication to understand the Poisson model, it may be a better choice.</p>
<p>However, it&#8217;s not exactly the same thing. A Poisson model uses a <a href="http://www.theanalysisfactor.com/interpreting-regression-coefficients-in-models-other-than-ordinary-linear-regression/">log link function</a>, which applies the log to the mean&#8211;not each individual data point. So it&#8217;s harder to back transform coefficients when you&#8217;ve got a log transformation.  So if there are not major advantages to running a linear model, you are usually better off with a more sophisticated and more accurate Poisson model, or one of its derivatives.</p>
<p><em>If you&#8217;d like to learn more about the different models available for Count data, you can download a recording of the webinar: <strong><a href="”"> Poisson and Negative Binomial Regression for Count Data.</a></strong> It’s free.</em></p>
<p><a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script><br />
<!-- AddThis Button END --></p>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/count-data-considered-continuous/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The Difference Between Eta Squared and Partial Eta Squared</title>
		<link>http://www.theanalysisfactor.com/partial-eta-squared/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=partial-eta-squared</link>
		<comments>http://www.theanalysisfactor.com/partial-eta-squared/#comments</comments>
		<pubDate>Fri, 16 Dec 2011 18:02:13 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[SPSS]]></category>
		<category><![CDATA[Eta Squared]]></category>
		<category><![CDATA[Partial Eta Squared]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=1250</guid>
		<description><![CDATA[For ANOVAs, two of the most popular are Eta-squared and partial Eta-squared.  In one way ANOVAs, they come out the same, but in more complicated models, their values, and their meanings differ.him]]></description>
			<content:encoded><![CDATA[<p></p><p>It seems every editor and her brother these days wants to see standardized <a href="http://www.theanalysisfactor.com/effect-size/">effect size statistics</a> reported in journal articles.</p>
<p>For ANOVAs, two of the most popular are <a href="http://www.theanalysisfactor.com/effect-size/">Eta-squared</a> and partial Eta-squared.  In one way ANOVAs, they come out the same, but in more complicated models, their values, and their meanings differ.</p>
<p>SPSS only reports partial Eta-squared, and in earlier versions of the software it was (unfortunately) labeled Eta-squared.  More recent versions have fixed the label, but still don&#8217;t offer Eta-squared as an option.</p>
<p>Luckily Eta-squared is very simple to calculate yourself based on the sums of squares in your ANOVA table. I&#8217;ve written another blog post with all the formulas. You can <span id="more-1250"></span><a href="http://www.theanalysisfactor.com/calculate-effect-size/">check it out here</a>.</p>
<p>But if you&#8217;re still wondering about the details of the differences between partial Eta-squared and Eta-squared and which one you ought to be using, I recommend reading this article:</p>
<p>Levine, T.R. &amp; Hullett, C.R. (2002). <a href="https://www.msu.edu/~levinet/eta%20squared%20hcr.pdf" target="_blank">Eta Squared, Partial Eta Squared and the Misreporting of Effect Size in Communication Research</a>.  Human Communication Research, 28, 612-625.</p>
<p><!-- AddThis Button BEGIN --><br />
<a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script><br />
<!-- AddThis Button END --></p>
<form style="border: 1px solid #ccc; padding: 3px; text-align: center;" action="http://www.feedburner.com/fb/a/emailverify" method="post">Like this post?<br />
Enter your email address to have posts delivered:<br />
<input style="width: 140px;" type="text" name="email" />
<input type="submit" value="Subscribe" /></form>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/partial-eta-squared/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Should You Always Center a Predictor on the Mean?</title>
		<link>http://www.theanalysisfactor.com/center-on-the-mean/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=center-on-the-mean</link>
		<comments>http://www.theanalysisfactor.com/center-on-the-mean/#comments</comments>
		<pubDate>Fri, 02 Dec 2011 14:47:08 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[Linear Regression]]></category>
		<category><![CDATA[Other regression models]]></category>
		<category><![CDATA[Centering]]></category>
		<category><![CDATA[Intercept]]></category>
		<category><![CDATA[predictor variables]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=2212</guid>
		<description><![CDATA[One problem is that the mean age at which infants utter their first word may differ from one sample to another. This means you're not always evaluating that mean that the exact same age.  It's not comparable across samples.

So another option is to choose a meaningful value of age that is within the values in the data set. One example may be at 12 months.]]></description>
			<content:encoded><![CDATA[<p></p><p>Centering <a href="http://www.theanalysisfactor.com/the-many-names-of-independent-variables/">predictor variables</a> is one of those simple but extremely useful practices that is easily overlooked.</p>
<p>It&#8217;s almost too simple.</p>
<p><a href="http://www.theanalysisfactor.com/when-not-to-center-a-predictor-variable-in-regression/">Centering</a> simply means subtracting a constant from every value of a variable.  What it does is redefine the 0 point for that predictor to be whatever value you subtracted.  It shifts the scale over, but retains the units.</p>
<p>The effect is that the slope between that predictor and the response variable doesn&#8217;t <span id="more-2212"></span>change at all.  But the interpretation of the intercept does.</p>
<p>The<a href="http://www.theanalysisfactor.com/interpreting-the-intercept-in-a-regression-model/"> intercept</a> is just the mean of the response when all predictors = 0.  So when 0 is out of the range of data, that value  is meaningless.  But when you center X so that a value within the dataset becomes 0, the intercept becomes the mean of Y at the value you centered on.</p>
<p>What&#8217;s the point?  Who cares about <a href="http://www.theanalysisfactor.com/interpreting-the-intercept-in-a-regression-model/">interpreting the intercept</a>?</p>
<p>It&#8217;s true.  In many models, you&#8217;re not really interested in the intercept.  In those models, there isn&#8217;t really a point, so don&#8217;t worry about it.</p>
<p><em>But</em>, and there&#8217;s always a but, in many models interpreting the intercept becomes really, really important.  So whether and where you center becomes important too.</p>
<p>A few examples include models with a dummy-coded predictor, models with a polynomial (curvature) term, and random slope models.</p>
<p>Let&#8217;s look more closely at one of these examples.</p>
<p>In models with a <a href="http://www.theanalysisfactor.com/making-dummy-codes-easy-to-keep-track-of/">dummy-coded</a> predictor, the intercept is the mean of Y for the <a href="http://www.theanalysisfactor.com/when-dummy-codes-are-backwards/">reference category</a>—the category numbered 0.  If there’s also a continuous predictor in the model, X2, that intercept is the mean of Y for the reference category only when X2=0.</p>
<p>If 0 is a meaningful value for X2 and within the data set, then there’s no reason to center.  But if neither is true, centering will help you interpret the intercept.</p>
<p>For example, let’s say you’re doing a study on language development in infants.  X1, the dummy-coded categorical predictor, is whether the child is bilingual (X1=1) or monolingual (X1=0).  X2 is the age in months when the child spoke their first word, and Y is the number of words in their vocabulary for their primary language at 24 months.</p>
<p>If we don’t center X2, the intercept in this model will be the mean number of words in the vocabulary of monolingual children who uttered their first word at birth (X2=0).</p>
<p>And since infants never speak at birth, it&#8217;s meaningless.</p>
<p>A better approach is to center age at some value that is actually in the range of the data. One option, often a good one, is to use the mean age of first spoken word of all children in the data set.</p>
<p>This would make the intercept the mean number of words in the vocabulary of monolingual children for those children who uttered their first word at the mean age that all children uttered their first word.</p>
<p>One problem is that the mean age at which infants utter their first word may differ from one sample to another. This means you&#8217;re not always evaluating that mean that the exact same age.  It&#8217;s not comparable across samples.</p>
<p>So another option is to choose a meaningful value of age that is within the values in the data set. One example may be at 12 months.</p>
<p>Under this option the interpretation of the intercept is the mean number of words in the vocabulary of monolingual children for those children who uttered their first word at 12 months.</p>
<p>The exact value you center on doesn&#8217;t matter as long it&#8217;s meaningful, holds the same meaning across samples,  and within the range of data.  You may find that choosing the lowest value or the highest value of age is the best option. It&#8217;s up to you to decide the age at which it’s most meaningful to interpret the intercept.</p>
<p><!-- AddThis Button BEGIN --><br />
<a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script><br />
<!-- AddThis Button END --></p>
<p>If you want more information on using and interpreting regression coefficients for interactions, dummy variables, and centered predictors, get the recording from my webinar: <a href="../learning/teletraining4.html">Interpreting Regression Coefficients: A Walk Through Output</a>. It&#8217;s free.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/center-on-the-mean/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Great statistical resources: StatTrek</title>
		<link>http://www.theanalysisfactor.com/stattrek/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=stattrek</link>
		<comments>http://www.theanalysisfactor.com/stattrek/#comments</comments>
		<pubDate>Fri, 28 Oct 2011 16:14:26 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[Statistical Resources]]></category>
		<category><![CDATA[Binomial distribution]]></category>
		<category><![CDATA[probability]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=2144</guid>
		<description><![CDATA[I got to pull out my old theoretical stats book to make sure I still remembered the pdf of a binomial distribution (I almost remembered it).

I calculated it out, but wanted to double-check that it was correct, and found this fabulous website for calculating binomial probabilities:]]></description>
			<content:encoded><![CDATA[<p></p><p>As someone who focuses on data analysis, I don&#8217;t often have to calculate probabilities, but I always love it when I do.  It brings me back to the basics I learned in stats classes.</p>
<p>A client asked what is the probability of choosing exactly 3 outliers in a sample of 30, if 3.5% of the population is outliers.</p>
<p><span id="more-2144"></span>I got to pull out my old theoretical stats book to make sure I still remembered the pdf of a binomial distribution (I almost remembered it).</p>
<p>I calculated it out, but wanted to double-check that it was correct, and found this fabulous website for calculating binomial probabilities:</p>
<h2 style="padding-left: 30px;"><a href="http://stattrek.com/Tables/Binomial.aspx">StatTrek</a></h2>
<p>(I have to admit, the geek in me loves the name as well).</p>
<p>So if you ever need to calculate a binomial, or other probability, check it out.</p>
<p><!-- AddThis Button BEGIN --><br />
<a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script><br />
<!-- AddThis Button END --></p>
<form style="border: 1px solid #ccc; padding: 3px; text-align: center;" action="http://www.feedburner.com/fb/a/emailverify" method="post">Like this post?<br />
Enter your email address to have posts delivered:<br />
<input style="width: 140px;" type="text" name="email" />
<input type="submit" value="Subscribe" /></form>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/stattrek/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Interpreting Interactions Between Two Effect-Coded Categorical Predictors</title>
		<link>http://www.theanalysisfactor.com/interactions-effect-coded-predictors/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=interactions-effect-coded-predictors</link>
		<comments>http://www.theanalysisfactor.com/interactions-effect-coded-predictors/#comments</comments>
		<pubDate>Fri, 21 Oct 2011 15:37:22 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[Linear Regression]]></category>
		<category><![CDATA[Effect coding]]></category>
		<category><![CDATA[Interactions]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=2050</guid>
		<description><![CDATA[I recently received this great question: Question: Hi Karen,  ive purchased a lot of your material and read a lot of your pdf documents w.r.t. regression and interaction terms.  Its, now, my general understanding that interaction for two or more categorical variables is best done with effects coding, and interactions  cont v. categorical variables is [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>I recently received this great question:</p>
<p>Question:</p>
<blockquote><p>Hi Karen,  ive purchased a lot of your material and read a lot of your pdf documents w.r.t. regression and interaction terms.  Its, now, my general understanding that interaction for two or more categorical variables is best done with effects coding, and interactions  cont v. categorical variables is usually handled via dummy coding.  Further, i may mess this up a little but hopefully you&#8217;ll get my point and more importantly my question, i understand that</p>
<p>1)  given a fitted line Y = b0 + b1 x1 + b2 x2 + b3 x1*x2, the interpretation for b3 is the diff of the effect of x1 on Y, when x2 changes one unit, if x1 and x2 are cont.  ( also interpretation can be reversed in terms of x1 and x2).<span id="more-2050"></span></p>
<p>2) given a fitted line Y = b0 + b1 x1 + b2 x2 + b3 x1*x2, the interpretation for b3 is it affects Y by changing the slope (whereas b2 affects the intercept), if x1 is cont. and x2 is cat 0,1,  namely dummy coded.  Thus, b1 is not a main effect for x1.</p>
<p>But what was not clear to me is what happens when x1 and x2 are effect coded cat variables, let&#8217;s say (-1,1) and the fitted line is Y = b0 + b1 x1 + b2 x2 + b3 x1*x2, the interpretation for b3  is  ????  this was only hinted at in the outlines .. I&#8217;d like to know how to interpret the b3 in this case, and I&#8217;d like to know why in this case of interaction, b1 is still considered a main effect for say x1?    Thanks for any clarifications!</p>
<p>Any help would be greatly appreciated,</p>
<p>J</p></blockquote>
<p>Answer:</p>
<blockquote><p>Hi J,</p>
<p>Great question.  First, yes, everything you state in points 1 and 2 are correct. I would add, though, that the <a href="http://www.theanalysisfactor.com/why-anova-and-linear-regression-are-the-same-analysis/">effect-coded</a> coefficients are not all that easy to interpret by themselves.  They do give you p-values for true main effects, and that&#8217;s useful, but you usually need to interpret using multiple comparison tests of the means.</p>
<p>To answer your questions:</p>
<p>It all comes down to what is a 0 point in dummy and effect coding.</p>
<p><strong>In <a href="http://www.theanalysisfactor.com/when-dummy-codes-are-backwards/">dummy coding</a></strong>, we&#8217;ve set the value of 0 to one of the categories, so:</p>
<p>1. the coefficients reflect actual cell mean differences, and have meaningful interpretations as such</p>
<p>2. when there is an interaction, the value of b1, eg., is the effect of X1 when X2 = 0.  Since X2=0 for one category of X2, b1 is not a main effect (an overall effect of X1 across all values of X2). It&#8217;s a marginal effect&#8211;an effect of X1 at a single value of X2.</p>
<p><strong>In <a href="http://www.theanalysisfactor.com/the-general-linear-model-analysis-of-covariance-and-how-anova-and-linear-regression-really-are-the-same-model-wearing-different-clothes/">effect coding</a></strong>, we&#8217;ve set the value of 0 to being <em>in between</em> the two categories, so:</p>
<p>1. the coefficients are differences between cell means and grand means, and do not have particularly meaningful interpretations.  So you can use the p-value to tell that there is a significant interaction, but the only way to interpret that interaction in a meaningful way is to actually look at the means.  That&#8217;s why we usually just look at F statistics and means in ANOVA, which uses effect coding.  The coefficients themselves aren&#8217;t very interpretable.</p>
<p>2. when there is an <a href="http://www.theanalysisfactor.com/clarifications-on-interpreting-interactions-in-regression/">interaction</a>, the value of b1, eg., is the effect of X1 when X2 = 0.  Since X2 = 1 at the mean of the two categories of X2, b1 is a main effect.  It&#8217;s the effect of X1 at the mean value of X2.</p>
<p>Below, I&#8217;ve inserted an example.  I&#8217;ve displayed the means of each group and the coefficients when the variables are dummy and effect coded.</p>
<p>In dummy coding, the <a href="http://www.theanalysisfactor.com/interpreting-the-intercept-in-a-regression-model/">intercept</a> is one of the four means&#8211;when both Poverty and Gender=0.  The various coefficients are differences from that mean.  Grab a calculator and see if you can figure out how to get each mean.</p>
<p>In effect coding, the intercept is the <a href="http://www.theanalysisfactor.com/why-anova-and-linear-regression-are-the-same-analysis/">grand mean</a>.  That point right in the middle of the graph.  The coefficients are differences from that point to the points marked with X on the graph.  Once again, see if you can figure out what each coefficient is telling you.</p>
<p><a href="http://www.theanalysisfactor.com/wp-content/uploads/2011/10/dummy-effect-coding-tables2.png"><img title="dummy-effect-coding-tables" src="http://www.theanalysisfactor.com/wp-content/uploads/2011/10/dummy-effect-coding-tables2.png" alt="" width="475" height="725" /></a></p>
<p><a href="http://www.theanalysisfactor.com/wp-content/uploads/2011/10/dummy-effect-coding-graph1.png"><img title="dummy-effect-coding-graph" src="http://www.theanalysisfactor.com/wp-content/uploads/2011/10/dummy-effect-coding-graph1.png" alt="" width="529" height="466" /></a></p></blockquote>
<p><!-- AddThis Button BEGIN --><br />
<a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script></p>
<p>If you want more information on using and interpreting parameter estimates in regression using SPSS, get the recording from my webinar: <a href="../learning/teletraining4.html">Interpreting Regression Coefficients: A Walk Through Output</a>.  It&#8217;s free.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/interactions-effect-coded-predictors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Changes at The Analysis Factor</title>
		<link>http://www.theanalysisfactor.com/changes-at-the-analysis-factor/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=changes-at-the-analysis-factor</link>
		<comments>http://www.theanalysisfactor.com/changes-at-the-analysis-factor/#comments</comments>
		<pubDate>Mon, 10 Oct 2011 17:49:46 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=2030</guid>
		<description><![CDATA[But more importantly, we've made a big effort to simplify the website and make it easier to get around.  As we've grown a lot over the past 3+ years, little bits were added here and there, and it was starting to get a bit jumbled.

So we've streamlined.  Here are some highlights of the website changes.]]></description>
			<content:encoded><![CDATA[<p></p><p>You&#8217;ve probably noticed it looks a little different around here.</p>
<p>The look of the site has changed, clearly.  It&#8217;s a lot less blue.</p>
<p>But more importantly, we&#8217;ve made a big effort to simplify the website and make it easier to get around.  As we&#8217;ve grown a lot over the past 3+ years, little bits were added here and there, and it was starting to get a bit jumbled.</p>
<p>So we&#8217;ve streamlined.  Here are some highlights of the website changes.</p>
<h2>Website Updates</h2>
<p>1. <strong>We&#8217;ve combined the web site and the blog</strong>.  The tips, articles, and resources on our blog are some of our most popular pages, but a link to &#8220;blog&#8221; doesn&#8217;t always capture that.  So we&#8217;ve made those articles front and center.<span id="more-2030"></span></p>
<p>2. W<strong>e&#8217;ve made a lot of our free offerings easier to get</strong>.  You used to have to sign up separately for each webinar recording.  Now you can sign up just once, and <a href="http://www.theanalysisfactor.com/webinars/recordings/">get access to all our recordings</a>.</p>
<p>3. <strong>We&#8217;ve just plain cleaned up the navigation</strong>.  Drop-down menus, a sidebar with popular pages and functions, and fewer duplicate pages should make everything much easier to find.  We&#8217;re still working on the Workshop section, but hope to have that updated soon.</p>
<h2>Service Updates</h2>
<p>We&#8217;re also making some changes to our consulting services that will help us grow and still be able to give you great service and continue to offer new workshops, webinars, and resources:</p>
<p>1. <strong>We&#8217;re simplifying how we administer consulting</strong>.  We used to ask for a deposit, then do monthly billing.  Now we&#8217;re just keeping it simple.  You can pre-purchase <a href="http://www.theanalysisfactor.com/statistical-consulting-services/hourly-consulting-service/">one consultation or four</a> (at a discount).  Just pick what you need.  Not sure what you need?  Start small, and you can always add on.  If your organization requires an exception, just contact us.</p>
<p>2.<strong> I am offering consulting only</strong>.  I realize when a lot of people say consulting, they mean doing projects as a contractor.  I have offered both services, and always made a distinction between projects and consulting&#8211;guiding you through analyses you&#8217;re doing yourself by helping you choose appropriate analyses, explaining how to conduct them, and helping with interpretation.</p>
<p>Although I love getting my hands dirty with data, my true love, (and I honestly believe my true gift and the best way to serve you) is through understanding where you&#8217;re stuck, asking the right questions, and getting you unstuck by explaining what you&#8217;re not understanding.</p>
<p>There are a lot of good data analysts, but not a lot of good statistical consultants.  So I am going to limit myself to what I do best, and personally offer only consulting services.  <a href="http://www.theanalysisfactor.com/statistical-consulting-services/">Learn more about my consulting services here</a>.</p>
<p>3. BUT, I also realize there is great need for many researchers to hand off their data analysis.  So I have <strong>partnered with two expert freelance data analysts</strong> so you&#8217;re not left without the data analysis project services.</p>
<p>So with pride, I introduce Dr. Jane Yank and Dr. Maike Rahn, who will be available to run your analyses for you.  Jane and Maike have different backgrounds in their areas of research, statistical expertise, and the statistical software they use.  This will greatly expand the types of analyses and software we&#8217;re able to assist with.  Learn more about Jane and Maike and how to work with them on our <a href="http://www.theanalysisfactor.com/statistical-consulting-services/statistical-project-services/">Statistical Project Services page</a>.<br />
<!-- AddThis Button BEGIN --><br />
<a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script><br />
<!-- AddThis Button END --></p>
<p>&nbsp;</p>
<form style="border: 1px solid #ccc; padding: 3px; text-align: center;" action="http://www.feedburner.com/fb/a/emailverify" method="post">Like this post?<br />
Enter your email address to have posts delivered:</p>
<input style="width: 140px;" type="text" name="email" />
<input type="submit" value="Subscribe" /></form>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/changes-at-the-analysis-factor/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The Repeated and Random Statements in Mixed Models for Repeated Measures</title>
		<link>http://www.theanalysisfactor.com/repeated-and-random-2/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=repeated-and-random-2</link>
		<comments>http://www.theanalysisfactor.com/repeated-and-random-2/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 14:35:03 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[Mixed and Multilevel Models]]></category>
		<category><![CDATA[Marginal Model]]></category>
		<category><![CDATA[mixed model]]></category>
		<category><![CDATA[Proc Mixed]]></category>
		<category><![CDATA[Random Statement]]></category>
		<category><![CDATA[Repeated Statement]]></category>
		<category><![CDATA[SPSS Mixed]]></category>

		<guid isPermaLink="false">http://www.analysisfactor.com/statchat/?p=1280</guid>
		<description><![CDATA[Here's one example of the flexibility of mixed models, and its resulting potential for confusion and error.

In repeated measures and longitudinal studies, the observations are clustered within a subject.  That means the observations, and their residuals, are not independent.  They're correlated.  There are two ways to deal with this correlation.]]></description>
			<content:encoded><![CDATA[<p></p><p style="padding-left: 30px;">&#8220;Because mixed models are more complex and more flexible than the general linear model, the potential for confusion and errors is higher.&#8221;</p>
<p style="padding-left: 30px;">- Hamer &amp; Simpson (2005)</p>
<p>Linear Mixed Models, as implemented in SAS&#8217;s Proc Mixed, SPSS Mixed, <a href="http://www.theanalysisfactor.com/r-code-random-intercept-and-slope/">R&#8217;s LMER</a>, and Stata&#8217;s xtmixed, are an extension of the general linear model.  They use more sophisticated techniques for estimation of parameters (means, variances, regression coefficients, and standard errors), and as the quotation says, are much more flexible.</p>
<p>Here&#8217;s one example of the flexibility of mixed models, and its resulting potential for confusion and error.<span id="more-1994"></span></p>
<p>In repeated measures and longitudinal studies, the observations are clustered within a subject.  That means the observations, and their residuals, are not independent.  They&#8217;re correlated.  There are two ways to deal with this correlation.</p>
<h2>The Marginal Model</h2>
<p>One is to alter the <a href="http://www.theanalysisfactor.com/covariance-matrices/">covariance structure</a> of the residuals.  What this means is that instead of assuming that all observations are independent, as you do in a linear model, you assume the residuals from a single subject are related.  Their covariances are non-zero.  So you have to estimate the covariances among all the residuals from a single subject.</p>
<p>This approach is called a <a href="http://www.theanalysisfactor.com/repeated-measures-approaches/">Marginal</a> or Population Averaged approach.  It&#8217;s not truly a mixed model, although you can use Mixed procedures to run them.  You get these models in SAS Proc Mixed and SPSS Mixed by using a repeated statement instead of a random statement.</p>
<h2>The Mixed Model</h2>
<p>The other way to deal with non-independence of a subject&#8217;s residuals is to leave the residuals alone, but actually alter the model by controlling for subject.  When you control for subject as a factor in the model, you literally redefine what a residual is.  Instead of being the distance between a data point and the average for everyone, it&#8217;s the distance between a data point and <em>the mean for that subject</em>.</p>
<p>You could, theoretically, include Subject as a<a href="http://www.theanalysisfactor.com/specifying-fixed-and-random-factors-in-mixed-models/"> fixed factor</a>, but that usually uses up most of the degrees of freedom.  If instead, you treat Subject as a <a href="http://www.theanalysisfactor.com/specifying-fixed-and-random-factors-in-mixed-models/">random factor</a>, you are still controlling for Subject, you&#8217;re still able to redefine the residuals and deal with non-independence, while using up only a few degrees of freedom.</p>
<p>Because the model now contains both fixed and random effects, it is now officially a <a href="http://www.theanalysisfactor.com/advantages-of-repeated-measures-anova-as-a-mixed-model/">Mixed Model</a>.  You get these models in <a href="http://www.theanalysisfactor.com/mixed-up-mixed-models/">SAS Proc Mixed</a> and SPSS Mixed by using a random statement.</p>
<h2>Putting them together</h2>
<p>Most of the time, controlling for Subject is enough to deal with all the non-independence of the residuals for each subject.</p>
<p>But every once in a while it&#8217;s not. If there is extra non-independence (or even non-constant variance) among the residuals, you can still estimate those non-zero covariances by adding a Repeated statement.</p>
<p>It&#8217;s fine to include a Repeated statement right along with a Random statement, and is sometimes necessary to have a good fitting model.  The repeated statement still controls the covariance structure of the residuals for a single subject.  It&#8217;s just that now those residuals have been redefined as the distance between each point and the subject&#8217;s mean.  In the Marginal model, they&#8217;re not.  They still represent the distance between each point and the overall mean.</p>
<p><!-- AddThis Button BEGIN --><br />
<a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script><br />
<!-- AddThis Button END --></p>
<p style="padding-left: 30px;">If you want to learn more about mixed models, check out the recording of my Random Intercept and Random Slope Models webinar.  These two models are the basic building blocks of all mixed models.</p>
<p style="padding-left: 30px;"><a title="Random Intercept and Random Slope Models Webinar" href="http://www.theanalysisfactor.com/random-intercept-and-random-slope-models-webinar/" target="_self"><strong>Get it all here</strong></a>.  It’s free.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/repeated-and-random-2/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Khan Acadamy: When you need to relearn math concepts in order to learn statistics</title>
		<link>http://www.theanalysisfactor.com/khan-acadamy-when-you-need-to-relearn-math-concepts-in-order-to-learn-statistics/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=khan-acadamy-when-you-need-to-relearn-math-concepts-in-order-to-learn-statistics</link>
		<comments>http://www.theanalysisfactor.com/khan-acadamy-when-you-need-to-relearn-math-concepts-in-order-to-learn-statistics/#comments</comments>
		<pubDate>Wed, 28 Sep 2011 14:17:59 +0000</pubDate>
		<dc:creator>Karen Grace-Martin</dc:creator>
				<category><![CDATA[Statistical Resources]]></category>

		<guid isPermaLink="false">http://www.analysisfactor.com/statchat/?p=1576</guid>
		<description><![CDATA[Has this ever happened to you? You need to implement a statistical technique you haven&#8217;t encountered before.  Maybe someone told you you need SEM or linear mixed models or logistic regression, for example. Luckily, you&#8217;ve found a pretty good book on the topic and are cruising along learning it, when it refers to some mathematical [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Has this ever happened to you?</p>
<p>You need to implement a statistical technique you haven&#8217;t encountered before.  Maybe someone told you you need SEM or linear mixed models or logistic regression, for example.</p>
<p>Luckily, you&#8217;ve found a pretty good book on the topic and are cruising along learning it, when it refers to some mathematical concept that you don&#8217;t understand.  Like the rank of a matrix.  Or a logarithm.</p>
<p>Maybe you never took matrix algebra.  Maybe you know you understood this many years ago.  But after 10 (or many more) years of not using linear algebra on a regular basis, and you just don&#8217;t remember exactly what a rank means.</p>
<p>Sure you can do a web search, but it&#8217;s still a chore to find a good source that <em>explains</em> what the heck it means.</p>
<p>I just came across this fabulous tutorial website: <a href="http://www.khanacademy.org/" target="_blank">Khan Academy</a>.</p>
<p>It has video tutorials on many topics in math.</p>
<p>I was particularly pleased to find it because I recently wrote about how complicated issues in statistics, like a <a href="http://www.theanalysisfactor.com/covariance-matrices/">Covariance Matrix</a>, are not so intimidating if you understand some linear algebra and can remember what covariance means.</p>
<p>Khan Academy has tutorials on both <a href="http://www.khanacademy.org/#linear-algebra">linear algebra</a> (a lot of them) and basic <a href="http://www.khanacademy.org/#probability">probability</a> and <a href="http://www.khanacademy.org/#statistics" target="_blank">statistics</a>.  So even if the concept you don&#8217;t quite remember is a statistical one&#8211;like covariance&#8211;you can get those as well.</p>
<p><!-- AddThis Button BEGIN --><br />
<a class="addthis_button" href="http://addthis.com/bookmark.php?v=250&amp;pub=kgracemartin"><img style="border: 0;" src="http://s7.addthis.com/static/btn/v2/lg-share-en.gif" alt="Bookmark and Share" width="125" height="16" /></a><script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pub=kgracemartin"></script><br />
<!-- AddThis Button END --></p>
<form style="border: 1px solid #ccc; padding: 3px; text-align: center;" action="http://www.feedburner.com/fb/a/emailverify" method="post">Like this post?<br />
Enter your email address to have posts delivered:<br />
<input style="width: 140px;" type="text" name="email" />
<input type="submit" value="Subscribe" /></form>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/khan-acadamy-when-you-need-to-relearn-math-concepts-in-order-to-learn-statistics/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Interpreting Linear Regression Coefficients: A Walk through Output</title>
		<link>http://www.theanalysisfactor.com/interpreting-linear-regression-parameters-a-walk-through-output/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=interpreting-linear-regression-parameters-a-walk-through-output</link>
		<comments>http://www.theanalysisfactor.com/interpreting-linear-regression-parameters-a-walk-through-output/#comments</comments>
		<pubDate>Mon, 26 Sep 2011 21:20:13 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[Recordings]]></category>
		<category><![CDATA[Statistical Analysis Webinars]]></category>

		<guid isPermaLink="false">http://www.theanalysisfactor.com/?p=1878</guid>
		<description><![CDATA[In this webinar we’re doing something a little different &#8211; rather than give you an overivew of a topic, we will interpret together the regression coefficients table from a real data set. This data set is from the dissertation of a client I worked with a few years ago.  She has graciously allowed us to [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>In this webinar we’re doing something a little different &#8211; rather than give you an overivew of a topic, we will interpret together the regression coefficients table from a real data set.</p>
<p>This data set is from the dissertation of a client I worked with a few years ago.  She has graciously allowed us to use her data as an example.  It&#8217;s a great example because it includes many of the coefficients that can be really tricky</p>
<ul>
<li>a categorical dummy-coded variable</li>
<li>an interaction between the categorical and a continuous variable</li>
<li>centering</li>
</ul>
<p><strong>Date:</strong> October 19, 2011</p>
<p><strong>Time:</strong> 12pm Eastern Time (11am Central, 10am Mountain, 9am Pacific)</p>
<p><strong>Where:</strong> Anywhere you have a fast internet connection</p>
<p><strong>Length of Program:</strong> An Hour</p>
<p><strong>Cost:</strong> Always FREE</p>
<p><strong>Space is limited.</strong></p>
<p>&nbsp;</p>
<h3><a href="http://www.theanalysisfactor.com/webinars/registration/">Sign up for our next webinar here</a>.</h3>
]]></content:encoded>
			<wfw:commentRss>http://www.theanalysisfactor.com/interpreting-linear-regression-parameters-a-walk-through-output/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

