<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Want to ditch your data scientists? Here are 7 startups that can help</title>
	<atom:link href="http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/</link>
	<description></description>
	<lastBuildDate>Wed, 22 May 2013 17:30:46 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Marc Smith</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-870856</link>
		<dc:creator><![CDATA[Marc Smith]]></dc:creator>
		<pubDate>Mon, 30 Jul 2012 15:53:38 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-870856</guid>
		<description><![CDATA[Data science will likely follow the path of &quot;desktop publishing&quot; - taking a high skilled task and reducing it to an easy to use tool that gets most people&#039;s work done reasonably well.  The Social Media Research Foundation (http://www.smrfoundation.org) has been developing the NodeXL project that performs many &quot;data scientist&quot; tasks with a few clicks.  Designed for people who are usually making pie charts and line charts, NodeXL supports the collection, analysis, visualization and publication of social media networks.  See: http://nodexl.codeplex.com

Regards,
Marc]]></description>
		<content:encoded><![CDATA[<p>Data science will likely follow the path of &#8220;desktop publishing&#8221; &#8211; taking a high skilled task and reducing it to an easy to use tool that gets most people&#8217;s work done reasonably well.  The Social Media Research Foundation (<a href="http://www.smrfoundation.org" rel="nofollow">http://www.smrfoundation.org</a>) has been developing the NodeXL project that performs many &#8220;data scientist&#8221; tasks with a few clicks.  Designed for people who are usually making pie charts and line charts, NodeXL supports the collection, analysis, visualization and publication of social media networks.  See: <a href="http://nodexl.codeplex.com" rel="nofollow">http://nodexl.codeplex.com</a></p>
<p>Regards,<br />
Marc</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Ardire</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-870486</link>
		<dc:creator><![CDATA[Steve Ardire]]></dc:creator>
		<pubDate>Sun, 29 Jul 2012 20:10:52 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-870486</guid>
		<description><![CDATA[And here&#039;s another way to ditch your data scientists....

Use a semantic data model ( RDF ) that encodes contextual data relationships enabling machines and knowledge workers to discover meaning for sharing and reuse across applications and platforms without data relationships having to be predefined like for SQL, NoSQL, Hadoop ( ergo need for data scientists ).

So if you want to know the semtech startups that have their act together than ping me ;)]]></description>
		<content:encoded><![CDATA[<p>And here&#8217;s another way to ditch your data scientists&#8230;.</p>
<p>Use a semantic data model ( RDF ) that encodes contextual data relationships enabling machines and knowledge workers to discover meaning for sharing and reuse across applications and platforms without data relationships having to be predefined like for SQL, NoSQL, Hadoop ( ergo need for data scientists ).</p>
<p>So if you want to know the semtech startups that have their act together than ping me ;)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: charlene</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-862859</link>
		<dc:creator><![CDATA[charlene]]></dc:creator>
		<pubDate>Wed, 11 Jul 2012 07:52:29 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-862859</guid>
		<description><![CDATA[It&#039;s critical to have intuitive analysis tools that business users can use to make better decisions: companies employing data-driven decision making outperform other companies. Data scientists do have an invaluable role in bringing together data acumen, analysis expertise and business understanding. There aren&#039;t enough data scientists. Metamarkets is providing data science-as-a-service, a SaaS solution enabling data scientists to extend their reach and provide intuitive analytics to business users within their organizations.]]></description>
		<content:encoded><![CDATA[<p>It&#8217;s critical to have intuitive analysis tools that business users can use to make better decisions: companies employing data-driven decision making outperform other companies. Data scientists do have an invaluable role in bringing together data acumen, analysis expertise and business understanding. There aren&#8217;t enough data scientists. Metamarkets is providing data science-as-a-service, a SaaS solution enabling data scientists to extend their reach and provide intuitive analytics to business users within their organizations.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: H.M.</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-862751</link>
		<dc:creator><![CDATA[H.M.]]></dc:creator>
		<pubDate>Wed, 11 Jul 2012 01:02:07 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-862751</guid>
		<description><![CDATA[Stacey, great insight. It is worth mentioning the HPCC Systems open source offering which provides a single platform that is easy to install, manage and code too. Their built-in analytics libraries for Machine Learning and integration tools with Pentaho for great BI capabilities make it easy for users who do not hold a PhD degree or carry a title like &quot;Data Scientist&quot; to easily analyze Big Data. I believe HPCC is better than Hadoop and commercial offerings, it has a real-time data analytics and delivery engine (Roxie) and runs on the Amazon cloud like a charm through the One Click portal. For more info visit: hpccsystems.com]]></description>
		<content:encoded><![CDATA[<p>Stacey, great insight. It is worth mentioning the HPCC Systems open source offering which provides a single platform that is easy to install, manage and code too. Their built-in analytics libraries for Machine Learning and integration tools with Pentaho for great BI capabilities make it easy for users who do not hold a PhD degree or carry a title like &#8220;Data Scientist&#8221; to easily analyze Big Data. I believe HPCC is better than Hadoop and commercial offerings, it has a real-time data analytics and delivery engine (Roxie) and runs on the Amazon cloud like a charm through the One Click portal. For more info visit: hpccsystems.com</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: anon</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-862138</link>
		<dc:creator><![CDATA[anon]]></dc:creator>
		<pubDate>Mon, 09 Jul 2012 14:44:35 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-862138</guid>
		<description><![CDATA[BLUF: a typically provocative title that doesn&#039;t accurately reflect the issues at hand.

From the article:

Ford’s data chief joined many other top executives who are bemoaning the lack of simple tools to solve big data problems — namely the fact that running Hadoop clusters or performing analytics is still a job that requires a specialist.

Assumption #1: it&#039;s possible to &quot;dumb down&quot; the analytics tools to the point that a non-specialist can effectively replace a specialist.

My response: the basic tools of carpentry (hammer, nail gun, etc.) are easy enough to use, but nobody would want me to build a house for them. Such tasks are better left in the hands of a specialist, one who is skilled in understanding the problem space, formulating a solution, and applying the tools to get the job done.

To use another analogy, easy-to-use software like TurboTax hasn&#039;t put all CPAs out of work.

I mean, honestly, is the author really suggesting that we roll-back the historical drive towards division of labor, or is she just bemoaning the fact that it&#039;s hard to find &amp; hire good people who are capable of doing the advanced analytical work he needs? General-purpose computer programming isn&#039;t something that a non-specialist can perform, but good programmers aren&#039;t as hard to find as good data scientists (a relatively new title that serves as an umbrella term for a very broad set of specialties).

If we want to [sic] big data revolution to scale, then we need to make it as easy as Netscape made the web surfing experience.

Assumption #2: performing advanced analytics on big data is somehow comparable in any way to browsing the web for photos of LOLcats.

My response: not every problem lends itself to being solved by an &quot;easy button.&quot; Easy-to-use tools are great, but you still need to be a skilled practitioner (i.e., a specialist) in order to use them effectively.

=====

It seems that author of this article somewhat misinterpreted what Ford&#039;s data chief actually said:

I’d love us to move toward [...] where data specialists — but not computer scientists — could go in and interrogate it and look for correlations that might not have been able to look at before. That’s a beautiful future state, but we’re not there yet.

To me, what he&#039;s is saying is that he wants to be able to have data specialists (who are not computer scientists by training) to be able to do the work currently being performed by data scientists, who are largely computer scientists with an aptitude for analyzing big data.

This already exists in some ways within the business intelligence (BI) community. Really good BI tools exist to enable analysts (who are not computer scientists) to analyze business operations and add value to organizations.

The question is whether it is possible to build general-purpose data analytics tools capable of analyzing any kind of data as well as BI tools analyze business-related data. I suppose it&#039;s possible, but a more likely scenario involves lots of different specialized tools focused on different problem domains (e.g., Orion for IC applications, RedOwl for corporate communications), since each domain has different kinds of data problems to solve.

The tools outlined in the article could be useful, especially for a non-computer scientist &quot;data specialist&quot; to perform some initial exploration and analysis of data (cf. TurboTax), but there&#039;ll still be a need for someone (i.e., a traditional data scientist) to build the custom tools necessary for more advanced analytics.]]></description>
		<content:encoded><![CDATA[<p>BLUF: a typically provocative title that doesn&#8217;t accurately reflect the issues at hand.</p>
<p>From the article:</p>
<p>Ford’s data chief joined many other top executives who are bemoaning the lack of simple tools to solve big data problems — namely the fact that running Hadoop clusters or performing analytics is still a job that requires a specialist.</p>
<p>Assumption #1: it&#8217;s possible to &#8220;dumb down&#8221; the analytics tools to the point that a non-specialist can effectively replace a specialist.</p>
<p>My response: the basic tools of carpentry (hammer, nail gun, etc.) are easy enough to use, but nobody would want me to build a house for them. Such tasks are better left in the hands of a specialist, one who is skilled in understanding the problem space, formulating a solution, and applying the tools to get the job done.</p>
<p>To use another analogy, easy-to-use software like TurboTax hasn&#8217;t put all CPAs out of work.</p>
<p>I mean, honestly, is the author really suggesting that we roll-back the historical drive towards division of labor, or is she just bemoaning the fact that it&#8217;s hard to find &amp; hire good people who are capable of doing the advanced analytical work he needs? General-purpose computer programming isn&#8217;t something that a non-specialist can perform, but good programmers aren&#8217;t as hard to find as good data scientists (a relatively new title that serves as an umbrella term for a very broad set of specialties).</p>
<p>If we want to [sic] big data revolution to scale, then we need to make it as easy as Netscape made the web surfing experience.</p>
<p>Assumption #2: performing advanced analytics on big data is somehow comparable in any way to browsing the web for photos of LOLcats.</p>
<p>My response: not every problem lends itself to being solved by an &#8220;easy button.&#8221; Easy-to-use tools are great, but you still need to be a skilled practitioner (i.e., a specialist) in order to use them effectively.</p>
<p>=====</p>
<p>It seems that author of this article somewhat misinterpreted what Ford&#8217;s data chief actually said:</p>
<p>I’d love us to move toward [...] where data specialists — but not computer scientists — could go in and interrogate it and look for correlations that might not have been able to look at before. That’s a beautiful future state, but we’re not there yet.</p>
<p>To me, what he&#8217;s is saying is that he wants to be able to have data specialists (who are not computer scientists by training) to be able to do the work currently being performed by data scientists, who are largely computer scientists with an aptitude for analyzing big data.</p>
<p>This already exists in some ways within the business intelligence (BI) community. Really good BI tools exist to enable analysts (who are not computer scientists) to analyze business operations and add value to organizations.</p>
<p>The question is whether it is possible to build general-purpose data analytics tools capable of analyzing any kind of data as well as BI tools analyze business-related data. I suppose it&#8217;s possible, but a more likely scenario involves lots of different specialized tools focused on different problem domains (e.g., Orion for IC applications, RedOwl for corporate communications), since each domain has different kinds of data problems to solve.</p>
<p>The tools outlined in the article could be useful, especially for a non-computer scientist &#8220;data specialist&#8221; to perform some initial exploration and analysis of data (cf. TurboTax), but there&#8217;ll still be a need for someone (i.e., a traditional data scientist) to build the custom tools necessary for more advanced analytics.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Morgan Goeller</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-861983</link>
		<dc:creator><![CDATA[Morgan Goeller]]></dc:creator>
		<pubDate>Mon, 09 Jul 2012 02:52:31 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-861983</guid>
		<description><![CDATA[Don&#039;t forget Cetas (http://www.cetas.net), just acquired by VMware.  Very cool technology that takes a huge amount of the heavy lifting out of data science and can be deployed either on-site or in the cloud.]]></description>
		<content:encoded><![CDATA[<p>Don&#8217;t forget Cetas (<a href="http://www.cetas.net" rel="nofollow">http://www.cetas.net</a>), just acquired by VMware.  Very cool technology that takes a huge amount of the heavy lifting out of data science and can be deployed either on-site or in the cloud.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nitin Borwankar</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-861460</link>
		<dc:creator><![CDATA[Nitin Borwankar]]></dc:creator>
		<pubDate>Sat, 07 Jul 2012 07:41:07 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-861460</guid>
		<description><![CDATA[There are a number of fundamental problems with trying to route around the problem of data scientists shortage

a) a lot of this data is confidential and it is not going to leave the LAN.  The more valuable it is the more this applies.  Conversely if a company doesn&#039;t care about uploading the data to an unknown third party - how valuable can this data be?  Bottom line this &quot;outsourced data science&quot; is a solution for the long tail.  But the money is at the other end of the power law curve ie the enterprise which is not uploading its data &quot;family jewels&quot; anytime soon.

b) It takes a lot of bandwidth to upload any meaningful amount of data - AWS allows you to send in hard drives in the mail which they will then upload locally.  This should tell you that the bottleneck is not lack of data scientists, its the thin pipe between the ground and the cloud.

c) We are at a structural inflection point - we need to integrate data driven thinking into everything we do and we can&#039;t delegate this. There is no silver bullet here.
There is a chasm and only companies that are in it fo rthe marathon rather than the sprint will survive.
Good luck to the sprinters.

Did I already say there was no short cut?]]></description>
		<content:encoded><![CDATA[<p>There are a number of fundamental problems with trying to route around the problem of data scientists shortage</p>
<p>a) a lot of this data is confidential and it is not going to leave the LAN.  The more valuable it is the more this applies.  Conversely if a company doesn&#8217;t care about uploading the data to an unknown third party &#8211; how valuable can this data be?  Bottom line this &#8220;outsourced data science&#8221; is a solution for the long tail.  But the money is at the other end of the power law curve ie the enterprise which is not uploading its data &#8220;family jewels&#8221; anytime soon.</p>
<p>b) It takes a lot of bandwidth to upload any meaningful amount of data &#8211; AWS allows you to send in hard drives in the mail which they will then upload locally.  This should tell you that the bottleneck is not lack of data scientists, its the thin pipe between the ground and the cloud.</p>
<p>c) We are at a structural inflection point &#8211; we need to integrate data driven thinking into everything we do and we can&#8217;t delegate this. There is no silver bullet here.<br />
There is a chasm and only companies that are in it fo rthe marathon rather than the sprint will survive.<br />
Good luck to the sprinters.</p>
<p>Did I already say there was no short cut?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Justin Donaldson</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-861354</link>
		<dc:creator><![CDATA[Justin Donaldson]]></dc:creator>
		<pubDate>Fri, 06 Jul 2012 21:36:17 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-861354</guid>
		<description><![CDATA[Hi Stacey, thanks for the story!  Can we please get a link to www.bigml.com on the relevant paragraph?]]></description>
		<content:encoded><![CDATA[<p>Hi Stacey, thanks for the story!  Can we please get a link to <a href="http://www.bigml.com" rel="nofollow">http://www.bigml.com</a> on the relevant paragraph?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Slusar</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-861230</link>
		<dc:creator><![CDATA[Mark Slusar]]></dc:creator>
		<pubDate>Fri, 06 Jul 2012 16:03:42 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-861230</guid>
		<description><![CDATA[Don&#039;t &quot;ditch your data scientists&quot; give them tougher problems to solve and pull ahead of the pack.]]></description>
		<content:encoded><![CDATA[<p>Don&#8217;t &#8220;ditch your data scientists&#8221; give them tougher problems to solve and pull ahead of the pack.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anon</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comment-860954</link>
		<dc:creator><![CDATA[Anon]]></dc:creator>
		<pubDate>Thu, 05 Jul 2012 22:34:29 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=539784#comment-860954</guid>
		<description><![CDATA[What is Hadoop???]]></description>
		<content:encoded><![CDATA[<p>What is Hadoop???</p>
]]></content:encoded>
	</item>
</channel>
</rss>
