<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.7" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Voodoo Correlations in Social Neuroscience</title>
	<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/</link>
	<description>Self-Experimentation, Scientific Method, the Shangri-La Diet, etc.</description>
	<pubDate>Tue, 16 Mar 2010 07:17:04 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.7</generator>

	<item>
		<title>by: seth</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-377443</link>
		<pubDate>Fri, 01 Jan 2010 14:13:34 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-377443</guid>
					<description>Justin, because I'm in China I can't get YouTube.</description>
		<content:encoded><![CDATA[<p>Justin, because I&#8217;m in China I can&#8217;t get YouTube.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Justin</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-377400</link>
		<pubDate>Fri, 01 Jan 2010 12:14:28 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-377400</guid>
					<description>Hi Seth, 
Any chance of some constructive feedback on this video on the above study?
[youtube=http://www.youtube.com/watch?v=nMZvpVwfObE]
Regards

Justin</description>
		<content:encoded><![CDATA[<p>Hi Seth,<br />
Any chance of some constructive feedback on this video on the above study?<br />
[youtube=http://www.youtube.com/watch?v=nMZvpVwfObE]<br />
Regards</p>
<p>Justin
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: &#8216;Voodoo Correlations in Social Neuroscience&#8217; &#171; The Amazing World of Psychiatry: A Psychiatry Blog</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-266121</link>
		<pubDate>Sat, 31 Jan 2009 01:08:54 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-266121</guid>
					<description>[...] Seths Blog article [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] Seths Blog article [&#8230;]
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: seth</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265809</link>
		<pubDate>Thu, 29 Jan 2009 16:56:06 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265809</guid>
					<description>Thanks, Matt.</description>
		<content:encoded><![CDATA[<p>Thanks, Matt.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Matt</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265793</link>
		<pubDate>Thu, 29 Jan 2009 16:03:42 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265793</guid>
					<description>So, we combine a significance threshold (i.e. p-values less than .005 or .001) with an extent threshold (i.e. there have to be at least 10 contiguous voxels that all have p-values less than the significance threshold).  This is a standard procedure used throughout cognitive neuroscience for the past 15 years.</description>
		<content:encoded><![CDATA[<p>So, we combine a significance threshold (i.e. p-values less than .005 or .001) with an extent threshold (i.e. there have to be at least 10 contiguous voxels that all have p-values less than the significance threshold).  This is a standard procedure used throughout cognitive neuroscience for the past 15 years.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Matt</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265780</link>
		<pubDate>Thu, 29 Jan 2009 15:04:59 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265780</guid>
					<description>Sorry for the typos (yes its the correlations that are inflated, but that's not what the test is testing - its testing for reliable non-zero relationships).  Your Jordan analogy doesn't quite apply.  That would assume that we are making claims about the average correlation in the brain but only reporting on a subset of voxels (and pretending they are all the voxels).  We aren't making claims about how the brain as a whole or on average relates to personality - rather we are looking for which regions do correlate reliably and then providing descriptive statistics for those that do.

To get the Jordan analogy right, the question would be "Are there certain days of the week when Jordan shoots a higher percentage than others?".  We'd have him shoot 100 free throws each day of the week for say 10 weeks.  So we'd have 1000 data points for each of the seven days of the week.  We wouldn't care at all what his average across all days was, just how each day compares to each other.  If his averages were 30% on mondays, 90% on fridays and 60% on all other days, we would say something interesting is happening on mondays and fridays, report that test and the descriptives that go along with it (e.g. 90%).  Now if we reported that Jordan shoots 90% on average because we claimed that fridays were the only days were looking at, we'd be in trouble, but nobody does that.  Our question isn't the average, but rather, when is there something different from average going on.</description>
		<content:encoded><![CDATA[<p>Sorry for the typos (yes its the correlations that are inflated, but that&#8217;s not what the test is testing - its testing for reliable non-zero relationships).  Your Jordan analogy doesn&#8217;t quite apply.  That would assume that we are making claims about the average correlation in the brain but only reporting on a subset of voxels (and pretending they are all the voxels).  We aren&#8217;t making claims about how the brain as a whole or on average relates to personality - rather we are looking for which regions do correlate reliably and then providing descriptive statistics for those that do.</p>
<p>To get the Jordan analogy right, the question would be &#8220;Are there certain days of the week when Jordan shoots a higher percentage than others?&#8221;.  We&#8217;d have him shoot 100 free throws each day of the week for say 10 weeks.  So we&#8217;d have 1000 data points for each of the seven days of the week.  We wouldn&#8217;t care at all what his average across all days was, just how each day compares to each other.  If his averages were 30% on mondays, 90% on fridays and 60% on all other days, we would say something interesting is happening on mondays and fridays, report that test and the descriptives that go along with it (e.g. 90%).  Now if we reported that Jordan shoots 90% on average because we claimed that fridays were the only days were looking at, we&#8217;d be in trouble, but nobody does that.  Our question isn&#8217;t the average, but rather, when is there something different from average going on.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: john</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265748</link>
		<pubDate>Thu, 29 Jan 2009 12:40:50 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265748</guid>
					<description>Seth,

"Let’s say I select a subset of free throws where Michael Jordan missed. Then I compute his free throw percentage over only those free throws. It is 0%. To report that 0% as if it means something is  . . . well, call it what you want. As far as I can tell, that is basically what you did."

I may have missed something here, but isn't that what Matt is claiming that Vul has done? Isn't one of the strong arguements in the Lieberman paper that Ed Vul simply hand picked results from papers that would show the effect he wanted to show and ignored the others that didn't? In fact when Matt puts all the data in to the analysis there is no bias in the correlation coeffecient at all.</description>
		<content:encoded><![CDATA[<p>Seth,</p>
<p>&#8220;Let’s say I select a subset of free throws where Michael Jordan missed. Then I compute his free throw percentage over only those free throws. It is 0%. To report that 0% as if it means something is  . . . well, call it what you want. As far as I can tell, that is basically what you did.&#8221;</p>
<p>I may have missed something here, but isn&#8217;t that what Matt is claiming that Vul has done? Isn&#8217;t one of the strong arguements in the Lieberman paper that Ed Vul simply hand picked results from papers that would show the effect he wanted to show and ignored the others that didn&#8217;t? In fact when Matt puts all the data in to the analysis there is no bias in the correlation coeffecient at all.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: seth</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265697</link>
		<pubDate>Thu, 29 Jan 2009 08:47:24 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265697</guid>
					<description>Matt, could you post again the last part of your comment? It was cut off.

What you call "selection bias" --  computing a correlation using only voxels selected by looking at the data and thereby inflating the correlation -- doesn't inflate "tests" it inflates correlations.

Nor are "voxels" inflated (by "voxel" I guess you mean the correlation computed for just one voxel), it is the correlation computed over many voxels that is inflated. That's when you got into trouble -- by computing a number that might be grossly inflated.

Let's say I select a subset of free throw attempts where Michael Jordan missed. Then I compute his free throw percentage over only those free throws. It is 0%. To report that 0% as if it means something is  . . . well, call it what you want. As far as I can tell, that is basically what you did.</description>
		<content:encoded><![CDATA[<p>Matt, could you post again the last part of your comment? It was cut off.</p>
<p>What you call &#8220;selection bias&#8221; &#8212;  computing a correlation using only voxels selected by looking at the data and thereby inflating the correlation &#8212; doesn&#8217;t inflate &#8220;tests&#8221; it inflates correlations.</p>
<p>Nor are &#8220;voxels&#8221; inflated (by &#8220;voxel&#8221; I guess you mean the correlation computed for just one voxel), it is the correlation computed over many voxels that is inflated. That&#8217;s when you got into trouble &#8212; by computing a number that might be grossly inflated.</p>
<p>Let&#8217;s say I select a subset of free throw attempts where Michael Jordan missed. Then I compute his free throw percentage over only those free throws. It is 0%. To report that 0% as if it means something is  . . . well, call it what you want. As far as I can tell, that is basically what you did.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Matt</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265661</link>
		<pubDate>Thu, 29 Jan 2009 05:49:43 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265661</guid>
					<description>Actually, what I am denying is that there was any post-hoc selection of a subset of the data.  Running multiple comparisons leads to inflated effect sizes but that has nothing to do with "post-hoc selection".  And actually, the main point of their article was not that there is inflation but rather that this inflation is so great that the results should be considered worthless and likely spurious and also that the methods used to obtain the results are invalid and therefore the results themselves are invalid.  These tests are run in order to identify regions where there are reliably non-zero correlations and they are a perfectly valid way of doing so.  To report descriptive statistics is entirely valid as well.  Since we seem to be talking past each other, let's consider one last example.  Let's say I run my 40,000 independent tests on my 40,000 voxels.  You would admit at this point there has been no "selection bias" inflating these tests, correct?  You might have some large effects due to sampling fluxuations, but our Figure 1 shows that with normal fMRI sample sizes and appropriate correction for multiple comparisons, this is relatively rare (Vul's simulation was done assuming 10 subjects which is not representative of fMRI studies).  Let's further assume that I submit my paper to the journal with a 200 page table that lists the p-value (along with descriptive statistics) for every voxel in the brain.  Still no selection bias inflating these tests, correct?  If you sorted this table by p-value we'd still be ok, right?  Now the editor comes along and says "we can't have a 200 page table" so cut off everything with a p-value worse than ___ and add a note to indicate that all other voxels had p-values above that threshold.  The voxels that remained would be no more inflated after this editorial decision than before - its just a matter of convention for displaying data.  This is what we all do and there is no "non-independence error" as Vul claims.</description>
		<content:encoded><![CDATA[<p>Actually, what I am denying is that there was any post-hoc selection of a subset of the data.  Running multiple comparisons leads to inflated effect sizes but that has nothing to do with &#8220;post-hoc selection&#8221;.  And actually, the main point of their article was not that there is inflation but rather that this inflation is so great that the results should be considered worthless and likely spurious and also that the methods used to obtain the results are invalid and therefore the results themselves are invalid.  These tests are run in order to identify regions where there are reliably non-zero correlations and they are a perfectly valid way of doing so.  To report descriptive statistics is entirely valid as well.  Since we seem to be talking past each other, let&#8217;s consider one last example.  Let&#8217;s say I run my 40,000 independent tests on my 40,000 voxels.  You would admit at this point there has been no &#8220;selection bias&#8221; inflating these tests, correct?  You might have some large effects due to sampling fluxuations, but our Figure 1 shows that with normal fMRI sample sizes and appropriate correction for multiple comparisons, this is relatively rare (Vul&#8217;s simulation was done assuming 10 subjects which is not representative of fMRI studies).  Let&#8217;s further assume that I submit my paper to the journal with a 200 page table that lists the p-value (along with descriptive statistics) for every voxel in the brain.  Still no selection bias inflating these tests, correct?  If you sorted this table by p-value we&#8217;d still be ok, right?  Now the editor comes along and says &#8220;we can&#8217;t have a 200 page table&#8221; so cut off everything with a p-value worse than ___ and add a note to indicate that all other voxels had p-values above that threshold.  The voxels that remained would be no more inflated after this editorial decision than before - its just a matter of convention for displaying data.  This is what we all do and there is no &#8220;non-independence error&#8221; as Vul claims.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: seth</title>
		<link>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265657</link>
		<pubDate>Thu, 29 Jan 2009 05:22:33 +0000</pubDate>
		<guid>http://www.blog.sethroberts.net/2008/12/28/voodoo-correlations-in-social-neuroscience/#comment-265657</guid>
					<description>Gee, Matt, you're still not denying that post-hoc selection of a subset inflates the correlations. Which -- correct me if I'm wrong -- was the main point of Vul et al. Along with the point that this inflation was not made clear in the published papers.
I'm still curious: What correction for multiple tests did your research group use?</description>
		<content:encoded><![CDATA[<p>Gee, Matt, you&#8217;re still not denying that post-hoc selection of a subset inflates the correlations. Which &#8212; correct me if I&#8217;m wrong &#8212; was the main point of Vul et al. Along with the point that this inflation was not made clear in the published papers.<br />
I&#8217;m still curious: What correction for multiple tests did your research group use?
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
