<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	
	xmlns:georss="http://www.georss.org/georss"
	xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"
	>

<channel>
	<title>regression Archives - rweber.net</title>
	<atom:link href="https://www.rweber.net/tag/regression/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.rweber.net/tag/regression/</link>
	<description>trying to be a mile wide AND a mile deep</description>
	<lastBuildDate>Fri, 04 Dec 2015 01:05:49 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>
<site xmlns="com-wordpress:feed-additions:1">37896774</site>	<item>
		<title>Hazards of correlation and regression</title>
		<link>https://www.rweber.net/mathematics/statistics/hazards-correlation-regression/</link>
					<comments>https://www.rweber.net/mathematics/statistics/hazards-correlation-regression/#respond</comments>
		
		<dc:creator><![CDATA[Rebecca]]></dc:creator>
		<pubDate>Thu, 24 Apr 2014 12:00:17 +0000</pubDate>
				<category><![CDATA[statistics]]></category>
		<category><![CDATA[regression]]></category>
		<guid isPermaLink="false">http://www.rweber.net/?p=6036</guid>

					<description><![CDATA[<p>1. Drawing causation conclusions Ski and snowboard sales tend to rise and fall together, but sales of one don&#8217;t lead to sales of the other &#8211; they are both tied to an outside factor. Throughout grade school, mathematical skills correlate positively to height. Height doesn&#8217;t make you good at math, but older children are on [&#8230;]</p>
<p>The post <a href="https://www.rweber.net/mathematics/statistics/hazards-correlation-regression/">Hazards of correlation and regression</a> appeared first on <a href="https://www.rweber.net">rweber.net</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>1. Drawing causation conclusions<br />
Ski and snowboard sales tend to rise and fall together, but sales of one don&#8217;t lead to sales of the other &#8211; they are both tied to an outside factor. Throughout grade school, mathematical skills correlate positively to height. Height doesn&#8217;t make you good at math, but older children are on average both taller and better at math than younger children.</p>
<p>2. Ecological correlation<br />
I.e., averaging subsets of data and then correlating the averages (such as comparing population traits after averaging by nation). This can overstate correlation because outliers have been averaged away before the correlation happens.</p>
<p>3. Regression fallacy<br />
If you take data on a population twice and the correlation between the two data points is not 1, individuals with exceptionally high measurements in round 1 will tend to decrease in round 2 and those with exceptionally low measurements in round 1 will tend to increase (&#8220;regression toward the mean&#8221;). The fallacy is in attributing this to anything other than plain mathematics. One way to think about it is that if you are in the top ranks in the first measurement and you don&#8217;t have the same result on the second, there&#8217;s a lot more room to move down than up.</p>
<p>4. Application problems<br />
Applying a technique to a data set that doesn&#8217;t meet the criteria for that technique to be applicable will at worst completely lie to you about the data. Choosing the wrong variables to compare might not give you any mathematical errors, but it could mask what&#8217;s &#8220;really going on.&#8221; For example, perimeter and area of rectangles are strongly correlated, but the real story is about each of those relating to length and width.</p>
<p>The post <a href="https://www.rweber.net/mathematics/statistics/hazards-correlation-regression/">Hazards of correlation and regression</a> appeared first on <a href="https://www.rweber.net">rweber.net</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.rweber.net/mathematics/statistics/hazards-correlation-regression/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">6036</post-id>	</item>
		<item>
		<title>Regression Question</title>
		<link>https://www.rweber.net/mathematics/statistics/regression-question/</link>
					<comments>https://www.rweber.net/mathematics/statistics/regression-question/#respond</comments>
		
		<dc:creator><![CDATA[Rebecca]]></dc:creator>
		<pubDate>Mon, 25 Nov 2013 13:00:09 +0000</pubDate>
				<category><![CDATA[statistics]]></category>
		<category><![CDATA[regression]]></category>
		<guid isPermaLink="false">http://www.rweber.net/?p=434</guid>

					<description><![CDATA[<p>For a statistics exam, I wrote a question about the regression effect in application that I quite like. Here it is, with the answer this time. Q. A lab assistant in charge of measuring how long it takes for rats to run a maze predicts that usually a rat will take less time on its [&#8230;]</p>
<p>The post <a href="https://www.rweber.net/mathematics/statistics/regression-question/">Regression Question</a> appeared first on <a href="https://www.rweber.net">rweber.net</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>For a statistics exam, I wrote a question about the regression effect in application that I quite like. Here it is, with the answer this time.</p>
<p>Q. A lab assistant in charge of measuring how long it takes for rats to run a maze predicts that usually a rat will take less time on its second time through. A colleague of his, however, notes that the rats ought to regress to the mean maze running time, on average. Are these predictions compatible? Why or why not?</p>
<p><span id="more-434"></span>A. They are compatible. The lab assistant is predicting overall improvement, and his colleague is predicting the regression effect (if somewhat loosely stated). The key to their agreement is that we would expect the rats to (on average) have less extreme second maze running times <b>relative to the second run&#8217;s mean and standard deviation</b> than their first run time was, <b>relative to the first run&#8217;s mean and standard deviation</b>. If the second maze run has a lower mean time we could see general improvement of times without losing the regression effect (which of course is a statistical fact unless |r|=1, which seems unlikely). If the first run is assigned to the x-axis and the second run to the y-axis, visualize the graph of individual rat runtimes lying completely below the line x=y.</p>
<p>Note there is <b>no</b> regression fallacy here. Neither person is proposing an explanation for anything, just a guess at what the numbers will look like.</p>
<p>The post <a href="https://www.rweber.net/mathematics/statistics/regression-question/">Regression Question</a> appeared first on <a href="https://www.rweber.net">rweber.net</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.rweber.net/mathematics/statistics/regression-question/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">434</post-id>	</item>
	</channel>
</rss>
