<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Forget your fancy data science, try overkill analytics</title>
	<atom:link href="http://gigaom.com/2012/09/21/forget-your-fancy-data-science-try-overkill-analytics/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com/2012/09/21/forget-your-fancy-data-science-try-overkill-analytics/</link>
	<description></description>
	<lastBuildDate>Mon, 20 May 2013 06:48:05 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Theodore Van Rooy</title>
		<link>http://gigaom.com/2012/09/21/forget-your-fancy-data-science-try-overkill-analytics/#comment-1024505</link>
		<dc:creator><![CDATA[Theodore Van Rooy]]></dc:creator>
		<pubDate>Fri, 28 Sep 2012 16:43:28 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=565355#comment-1024505</guid>
		<description><![CDATA[The downside:  overkill-analytics often leads to overfit-analytics.  As Carter himself points out:  &quot;as long as you account for the effects of overprocessing data&quot;.

The more computation power you throw at the problem the more likely it is that your model  happens (by chance) to work well ...  even measures like cross fold validation will not prevent this overfitting.    

Not saying that it doesn&#039;t work as an approach (it&#039;s my favorite approach too) but that you should at least put some thought into the problem before you throw massive computational power at it.]]></description>
		<content:encoded><![CDATA[<p>The downside:  overkill-analytics often leads to overfit-analytics.  As Carter himself points out:  &#8220;as long as you account for the effects of overprocessing data&#8221;.</p>
<p>The more computation power you throw at the problem the more likely it is that your model  happens (by chance) to work well &#8230;  even measures like cross fold validation will not prevent this overfitting.    </p>
<p>Not saying that it doesn&#8217;t work as an approach (it&#8217;s my favorite approach too) but that you should at least put some thought into the problem before you throw massive computational power at it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mikeklawrence</title>
		<link>http://gigaom.com/2012/09/21/forget-your-fancy-data-science-try-overkill-analytics/#comment-1020432</link>
		<dc:creator><![CDATA[mikeklawrence]]></dc:creator>
		<pubDate>Wed, 26 Sep 2012 22:15:28 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=565355#comment-1020432</guid>
		<description><![CDATA[Peter Norvig, Director of Research at Google, has a similar philosophy aimed at the problem of natural language processing. His talk (and paper) &quot;the unreasonable effectiveness of data&quot; is worth watching (it&#039;s on youtube). The premise is that problems where complicated models were once needed, can now be solved by simple algorithms and lots of data.]]></description>
		<content:encoded><![CDATA[<p>Peter Norvig, Director of Research at Google, has a similar philosophy aimed at the problem of natural language processing. His talk (and paper) &#8220;the unreasonable effectiveness of data&#8221; is worth watching (it&#8217;s on youtube). The premise is that problems where complicated models were once needed, can now be solved by simple algorithms and lots of data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mikeklawrence</title>
		<link>http://gigaom.com/2012/09/21/forget-your-fancy-data-science-try-overkill-analytics/#comment-1020427</link>
		<dc:creator><![CDATA[mikeklawrence]]></dc:creator>
		<pubDate>Wed, 26 Sep 2012 22:12:57 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=565355#comment-1020427</guid>
		<description><![CDATA[Peter Norvig, Director of Research at Google, has a similar philosophy (aimed at natural language processing) that&#039;s captured in a recent paper (and talk) called &quot;The Unreasonable Effectiveness of Data&quot;. In a nutshell, what others have tried to achieve with complicated models can be done using simple algorithms, and reams and reams of data. The talk can be found on youtube and is worth watching.]]></description>
		<content:encoded><![CDATA[<p>Peter Norvig, Director of Research at Google, has a similar philosophy (aimed at natural language processing) that&#8217;s captured in a recent paper (and talk) called &#8220;The Unreasonable Effectiveness of Data&#8221;. In a nutshell, what others have tried to achieve with complicated models can be done using simple algorithms, and reams and reams of data. The talk can be found on youtube and is worth watching.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jody Wynnyk</title>
		<link>http://gigaom.com/2012/09/21/forget-your-fancy-data-science-try-overkill-analytics/#comment-1017421</link>
		<dc:creator><![CDATA[Jody Wynnyk]]></dc:creator>
		<pubDate>Tue, 25 Sep 2012 16:43:01 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=565355#comment-1017421</guid>
		<description><![CDATA[Great article - one of my challenges as a data analyst is producing reports that the business &#039;gets&#039;.  They often think they want to see more information, but I find the more information they have, the more useless it becomes.   This streamlined approach captures everything, but draws the focus where it needs to be.  I&#039;ll be integrating this technique into my reporting!]]></description>
		<content:encoded><![CDATA[<p>Great article &#8211; one of my challenges as a data analyst is producing reports that the business &#8216;gets&#8217;.  They often think they want to see more information, but I find the more information they have, the more useless it becomes.   This streamlined approach captures everything, but draws the focus where it needs to be.  I&#8217;ll be integrating this technique into my reporting!</p>
]]></content:encoded>
	</item>
</channel>
</rss>