<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; Cloudera</title>
	<atom:link href="http://gigaom.com/tag/cloudera/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Thu, 20 Jun 2013 06:44:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; Cloudera</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Cloudera names new CEO; Mike Olson now chairman and chief strategy officer</title>
		<link>http://gigaom.com/2013/06/18/cloudera-names-new-ceo-mike-olson-now-chairman-and-chief-strategy-officer/</link>
		<comments>http://gigaom.com/2013/06/18/cloudera-names-new-ceo-mike-olson-now-chairman-and-chief-strategy-officer/#comments</comments>
		<pubDate>Tue, 18 Jun 2013 19:17:46 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Amr Awadallah]]></category>
		<category><![CDATA[Christophe Bisciglia]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Jeff Hammerbacher]]></category>
		<category><![CDATA[Mike Olson]]></category>
		<category><![CDATA[Tom Reilly]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=658682</guid>
		<description><![CDATA[Cloudera CEO Mike Olson is now chief strategy officer and chairman of the board, while former Arcsight CEO Tom Reilly will take over the Hadoop pioneer's leadership role.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=658682&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Cloudera co-founder Mike Olson is no longer the company&#8217;s CEO, according to <a href="http://blog.cloudera.com/blog/2013/06/welcome-tom/">a Tuesday morning blog post</a> from Olson. Former Arcsight CEO Tom Reilly is taking over the company&#8217;s leadership role, while Olson is transitioning to a position as chief strategy officer and chairman of the board.</p>
<p>Here&#8217;s Olson&#8217;s explanation from his post:</p>
<blockquote id="quote-we%e2%80%99re-buildi"><p>We’re building a company that we expect to stay independent and to lead the market for many years to come. That will demand relentless focus on operational excellence <em>and</em> on the technology and solutions that our customers need. The combination is, at our present scale, two jobs, not one.</p>
<p>Tom led Arcsight through its IPO to industry leadership in mission-critical, bet-the-business security. He knows the enterprise. He’s a been-there, done-that executive. He’s run very successful enterprise software companies at global scale.</p>
<p>Tom’s appointment as CEO allows me to concentrate my time with key constituents: Our product and engineering teams, our partners, our sales force, our customer-facing technical teams and – most importantly – our customers themselves. He and I have spent considerable time together over the last several months, building trust in both directions and planning for a smooth transition. Cloudera is a better company, today and for the long term, with the two of us working together.</p></blockquote>
<p>It&#8217;s kind of ironic that Olson is stepping down, considering he was always considered the seasoned enterprise executive of a founding team custom-built to grow into the company that has Cloudera now become. It was the first company to make a living selling a commercial version of the Apache Hadoop big data platform, and it&#8217;s still the biggest. Cloudera has <a href="http://gigaom.com/2012/12/06/cloudera-snares-big-65m-more-to-boost-international-enterprise-growth/">raised $141 million in venture capital</a> since launching in 2009, is <a href="http://gigaom.com/2012/11/09/a-few-stats-rumors-and-stories-on-on-hadoops-rapid-growth/">doing about $100 million in annual revenue</a> and has well more than 300 employees.</p>
<p>Before <a href="http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4-nodes-to-the-future-of-data/">founding Cloudera along with Amr Awadallah, Jeff Hammerbacher and Christophe Bisciglia</a>, Olson was CEO of Sleepycat Software. Sleepycat was the creator and primary developer of the BerkeleyDB database technology, and Oracle bought the company in 2006. However, Olson told me during an interview on Tuesday afternoon, Cloudera is &#8220;vastly larger&#8221; than Sleepycat on every metric &#8212; including headcount and revenue.</p>
<p>That scale of operations, Olson, said, has necessarily forced him to put more energy into company management and short-term oversight rather than working with customers and on long-term product strategy. With Reilly on board, Cloudera has someone who&#8217;s managed companies at this scale and has the operational aspects down pat.</p>
<p>If he has any hard feelings about the change in leadership, Olson isn&#8217;t letting on publicly. &#8221;We believe we&#8217;re building one for the ages here,&#8221; he said. &#8220;&#8230; [Tom and I] got to know each other very well, obviously, otherwise I&#8217;d never be able to hand my baby over.&#8221;</p>
<p><strong><em>Update: </em></strong><em>This post was updated at 3:05 p.m. to reflect comments from Mike Olson.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=658682&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=634082"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=634082" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=658682+cloudera-names-new-ceo-mike-olson-now-chairman-and-chief-strategy-officer&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=658682+cloudera-names-new-ceo-mike-olson-now-chairman-and-chief-strategy-officer&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2010/12/9-companies-that-pushed-the-infrastructure-discussion-in-2010/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=658682+cloudera-names-new-ceo-mike-olson-now-chairman-and-chief-strategy-officer&utm_content=dharrisstructure">9 Companies that Pushed the Infrastructure Discussion in 2010</a></li><li><a href="http://pro.gigaom.com/2009/06/why-the-hoopla-about-hadoop/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=658682+cloudera-names-new-ceo-mike-olson-now-chairman-and-chief-strategy-officer&utm_content=dharrisstructure">Why the Hoopla About Hadoop?</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/06/18/cloudera-names-new-ceo-mike-olson-now-chairman-and-chief-strategy-officer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" medium="image">
			<media:title type="html">Structure Data 2012: Michael Olson – CEO, Cloudera</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloudera adds search to Hadoop distro and says it&#8217;s just getting started</title>
		<link>http://gigaom.com/2013/06/04/cloudera-adds-search-to-hadoop-distro-and-says-its-just-getting-started/</link>
		<comments>http://gigaom.com/2013/06/04/cloudera-adds-search-to-hadoop-distro-and-says-its-just-getting-started/#comments</comments>
		<pubDate>Tue, 04 Jun 2013 17:01:41 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Apache Solr]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[enterprise-search]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[SOLR]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=653888</guid>
		<description><![CDATA[Cloudera's new search feature, based on the Apache Solr project, is the latest move by the company to expand the utility of its Hadoop distribution. It's also far from the last.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=653888&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>According to Cloudera CEO Mike Olson, his company has &#8220;decades&#8221; in front of it in which to enhance its Hadoop platform to become the go-to place for data storage and analysis. At a Tuesday event in San Francisco, Cloudera announced the latest feature meant to further that strategy &#8212; full-text search. It comes just weeks after the company&#8217;s Impala interactive SQL query engine <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">became publicly available</a>.</p>
<p>The general idea behind adding search (something competitor MapR <a href="http://gigaom.com/2013/05/01/mapr-releases-m7-its-commercial-hbase-distro/">actually did in May</a>), is to let people without deep technical skills find the information they need within a Hadoop cluster in a way that&#8217;s familiar to them. &#8220;You don&#8217;t even have to understand what SQL is. You can just type words into a box,&#8221; Olson said during a recent phone call, comparing Cloudera&#8217;s search to the process of finding information online or within your Gmail history.</p>
<div id="attachment_603561" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg"><img  alt="Structure Data 2012: Michael Olson – CEO, Cloudera" src="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=300&#038;h=200" width="300" height="200" class="size-medium wp-image-603561" /></a><p class="wp-caption-text">Cloudera CEO Mike Olson at Structure: Data 2012<br />(c) 2012 Pinar Ozger pinar@pinarozger.com</p></div>
<p>&#8220;Think about it,&#8221; he added. &#8220;You got a petabyte of data, you can&#8217;t use folders anymore.&#8221;</p>
<p>Even though search is easier than SQL, though, it seems pretty obvious that hourly workers and front-desk staff probably won&#8217;t be rooting around in Hadoop searching for data (although that&#8217;s possible in theory if the right application was in place).</p>
<p>However, a couple of examples from the search feature&#8217;s private beta users (it&#8217;s now available in public beta and will be generally available in the third quarter) help illustrate what Olson is talking about and how it might apply in corporate settings. Agri-business giant Monsanto is using search to help index &#8212; and later find information from &#8212; its collections of images that track plant characteristics through their lifecycle, a process that used to require lots of manual work within a database not designed to handle images and metadata. Health care customer Exlorys is using Cloudera&#8217;s search tool to consolidate and index its server logs so it can track down IT issues more easily and maintain SLAs for its applications.</p>
<p>Discussing MapR&#8217;s new search feature in April, VP of Marketing Jack Norris suggested a use case wherein users might use MapReduce to cluster a group of customers and then use search to drill down further into their behavior.</p>
<p>Cloudera&#8217;s search is powered by the <a href="http://lucene.apache.org/solr/">Apache Solr project</a>, which happens to be based on the Apache Lucene project that Cloudera Chief Architect Doug Cutting founded before <a href="http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4-nodes-to-the-future-of-data/">he founded Hadoop</a>. Exact features of Cloudera Search, as well as a quote from private beta user Dell, are <a href="http://www.marketwire.com/press-release/cloudera-democratizes-apache-hadoop-enterprise-end-users-with-open-source-interactive-1798110.htm">available in the product&#8217;s press release</a>. MapR&#8217;s search is powered by <a href="http://www.lucidworks.com">LucidWorks</a>, a commercial search platform based on the Solr and Lucene projects.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/06/search.jpg"><img  alt="search" src="http://gigaom2.files.wordpress.com/2013/06/search.jpg?w=708&#038;h=390" width="708" height="390" class="aligncenter size-large wp-image-654152" /></a></p>
<p>Olson said Cloudera is dedicated to continually improving the capabilities of Solr now that it&#8217;s officially part of the company&#8217;s Hadoop distribution. When asked about predictive and semantic search like consumers now experience with Google and Microsoft Bing, he pointed to a feature called <a href="http://www.cloudera.com/content/cloudera/en/products/cloudera-navigator.html">Navigator</a> &#8212; which keeps track of who touched pieces of data, what systems it passed through, what types of queries people run on it and various other attributes &#8212; as the possible foundation for such features in an enterprise environment. He&#8217;s not sure exactly what that might look like in practice, but, Olson added, &#8220;I think there&#8217;s lots of opportunity for advancement there.&#8221;</p>
<h2 id="one-platform-to-rule-them-all">One platform to rule them all?</h2>
<p>The bigger picture here, though, is the encroachment of the open source Hadoop technology &#8212; whether sold by Cloudera, Hortonworks, MapR or whomever &#8212; into the lucrative data management and analytics space once (and still) dominated by vendors selling expensive software and big-iron systems. For now, Olson said, technologies like Cloudera Impala and the new search feature will be less functional than their legacy counterparts (Teradata for data warehousing and Autonomy for enterprise search, for example), but that could change over time.</p>
<p>&#8220;We have decades of life in front of this company in order to enhance [our platform],&#8221; Olson said.</p>
<p>Further, inertia can kick in as companies place more data into Hadoop, making it less appealing to move that data onto another system if the work can just as easily be done within Hadoop. When it comes to how many workloads and how much money Hadoop could ultimately steal from legacy vendors, Olson &#8212; <a href="http://gigaom.com/2013/05/29/why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end/">like his peers at other Hadoop vendors</a> &#8212; is hedging for the time being. &#8220;I don&#8217;t want to make audacious and unsupportable claims,&#8221; he said. &#8220;&#8230; We can all make up numbers.&#8221;</p>
<p>However, he did take some credit fore Teradata&#8217;s recent lackluster quarter(s), stating that even though Cloudera customers aren&#8217;t ripping out their legacy systems, they&#8217;re also not really investing more money into them. &#8220;It is true to say folks are looking at what they&#8217;re running on Teradata and rationalizing those decisions,&#8221; Olson said. &#8220;&#8230; [They're trying to] concentrate first-class spend on a first-class workload.&#8221;</p>
<p>For more on how the SQL-on-Hadoop community, specifically, intends to take on the legacy vendors, check out this panel from our Structure: Data conference in March.</p>
<span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/neo6TE41I8I?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=653888&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=326789"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=326789" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=653888+cloudera-adds-search-to-hadoop-distro-and-says-its-just-getting-started&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/11/unlocking-big-datas-potential-with-search/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=653888+cloudera-adds-search-to-hadoop-distro-and-says-its-just-getting-started&utm_content=dharrisstructure">How search can unlock the power of big data</a></li><li><a href="http://pro.gigaom.com/2010/10/the-case-for-open-source-search-in-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=653888+cloudera-adds-search-to-hadoop-distro-and-says-its-just-getting-started&utm_content=dharrisstructure">The Case for Open Source Search in the Enterprise</a></li><li><a href="http://pro.gigaom.com/2010/07/the-incredible-growing-commercial-hadoop-market/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=653888+cloudera-adds-search-to-hadoop-distro-and-says-its-just-getting-started&utm_content=dharrisstructure">The Incredible, Growing, Commercial Hadoop Market</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/06/04/cloudera-adds-search-to-hadoop-distro-and-says-its-just-getting-started/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" medium="image">
			<media:title type="html">Structure Data 2012: Michael Olson – CEO, Cloudera</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=300" medium="image">
			<media:title type="html">Structure Data 2012: Michael Olson – CEO, Cloudera</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/06/search.jpg?w=708" medium="image">
			<media:title type="html">search</media:title>
		</media:content>
	</item>
		<item>
		<title>Why Hortonworks is riding a faster Hive to the bitter end</title>
		<link>http://gigaom.com/2013/05/29/why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end/</link>
		<comments>http://gigaom.com/2013/05/29/why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end/#comments</comments>
		<pubDate>Wed, 29 May 2013 23:14:51 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[ebay]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[hive]]></category>
		<category><![CDATA[Hortonworks]]></category>
		<category><![CDATA[SQL on Hadoop]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=650170</guid>
		<description><![CDATA[While the rest of the Hadoop world is trying to distance itself from Hive with new interactive engines, Hortonworks is trying to make it faster. It might actually be a sound strategy.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=650170&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Hortonworks isn’t about to get off the Apache Hadoop elephant just because everyone around it is now trying to <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">ride impalas</a>. The company released version 1.3 of its Hortonworks Data Platform on Wednesday, a major aspect of which is an improved iteration of <a href="http://hive.apache.org/">Apache Hive</a> that the company claims runs 50 times faster the previous version. Over the next year or so, Hortonworks expects to improve the speed of Hive by 100x its previous limits — this while its competitors are all but leaving Hive in the dust in favor of newer, faster analytic systems.</p>
<p>If you’re unfamiliar with Hive, it’s a project that Facebook developed in 2008 to make Hadoop function more like a traditional enterprise data warehouse. Hive stores data inside the Hadoop Distributed File System in structured format, and then allows users to query it using a language very similar to SQL. Until very recently, Hive has been the de facto method for querying (in a traditional sense) data stored in Hadoop, and it has proven immensely popular as more companies have begun tackling their big data woes with Hadoop.</p>
<h2 id="hive-wasnt-built-for-speed">Hive wasn’t built for speed</h2>
<p>However, as more companies got used to Hadoop, they also began to notice its shortcomings. One of them is around MapReduce, a powerful but not-exactly-speedy method of processing data that requires running the job across every node in the cluster in order to find the right data. Although the Hive interface is that of a SQL query, it relies on on MapReduce as the processing engine.</p>
<p>(For more on how Hadoop and its flavor of MapReduce came to be, read <a href="http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4-nodes-to-the-future-of-data/">this post on the history of Hadoop</a>. To see me speak with Google Fellow and MapReduce creator Jeff Dean about how far Google has moved from a MapReduce-centric computing model, <a href="http://event.gigaom.com/structure/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=650170+why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end&amp;utm_content=dharrisstructure">come to Structure next month</a>.)</p>
<p>Users wanted faster, more-interactive query processing on top of Hadoop, similar to what they had grown accustomed to with data warehouse systems such as Teradata, Greenplum and Netezza. Hadoop vendors such as Cloudera (with Impala), MapR (with <a href="http://gigaom.com/2012/08/17/for-fast-interactive-hadoop-queries-drill-may-be-the-answer/">Drill</a>), IBM (with <a href="http://gigaom.com/2013/05/06/look-ibm-is-doing-sql-on-hadoop-too/">Big SQL</a>) — as well as <a href="http://gigaom.com/2013/02/21/sql-is-whats-next-for-hadoop-heres-whos-doing-it/">a spate of startups</a> — have obliged with their own new technologies that in various ways blend the familiarity of SQL with the scalability of Hadoop. EMC Greenplum, now Pivotal, has <a href="http://gigaom.com/2013/02/25/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya/">transplanted its existing database system</a> inside of Hadoop.</p>
<p>Even <a href="http://www.qubole.com/">Qubole,</a> a cloud-based startup from Hive creators Ashish Thusoo and Joydeep Sen Sarma, is <a href="http://gigaom.com/2013/04/23/hadoop-startup-qubole-raises-7m-for-hive-as-a-service/">keeping an eye on how projects such as Impala and Shark</a> (from <a href="http://gigaom.com/2013/04/17/welcome-to-berkeley-where-hadoop-isnt-nearly-fast-enough/">the University of California, Berkeley’s AMPLab</a>) might factor into its plans.</p>
<h2 id="giving-hive-a-better-stinger">Giving Hive a better “Stinger”</h2>
<p>Hortonworks, the Yahoo spinoff dedicated to driving the Apache Hadoop bus, is sticking with Hive. But is has a plan, and a point.</p>
<p>Essentially, VP of Products Bob Page told me during a recent briefing, “It just makes more sense from our view to have everything done in one place.” He means that Hive is already the method by which most people are already comfortable using SQL to access Hadoop data, so there’s no use rocking the boat by adding yet another technology into the mix. Hortonworks will just make Hive faster to the point (100x) where it’s at least in the ballpark of what these entirely new systems are capable of doing, but where users still use the same tools for interactive and batch queries.</p>
<p>It has in place a three-phase plan, under the <a href="http://hortonworks.com/stinger/">“Stinger” codename</a>, in order to make this happen. The first phase, now available as part of the Hive 0.11 release, is a new set of analytic functions and a columnar file format that Page says has resulted in a 50x performance increase over the previous version. The next phase is to move <del>YARN</del> Hive off of MapReduce and onto a still-under-development processing framework called <a href="http://wiki.apache.org/incubator/TezProposal">Tez</a>.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/stinger.png"><img alt="stinger" src="http://gigaom2.files.wordpress.com/2013/05/stinger.png?w=708"   class="aligncenter size-full wp-image-650283"></a>“You’ll see phase two come to bear later this year,” Page said, once <a href="http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html">YARN</a> — a new resource manager that lets Hadoop clusters run multiple processing engines simultaneously — is ready for production.</p>
<p>The third phase is a whole new vector query engine for Hive and new tools for intelligent query planning. Page didn’t have a target date in mind for that phase, except to note that “we’re not talking about a five-year cycle.”</p>
<h2 id="sql-isnt-the-end-game-for-hado">SQL isn’t the end game for Hadoop</h2>
<p>It would be easy to dismiss Page’s and Hortonworks’ optimism about Stinger as a sweet lemons type of rationalization — the company was founded around Apache Hadoop and can’t really go about developing entirely new products outside that foundation — but they also appear to have their eyes focused on a future where SQL isn’t too big a differentiator.</p>
<p>SQL is the way folks used to data for the last 30 years can see how Hadoop fits in their environment, Page said, but the compelling thing about Hadoop “is it really unlocks a new way about how one thinks about storing and processing data.” Once YARN is ready to go, he added, there will be new avenues of innovation in areas like graph analysis and stream processing.</p>
<p>Page comes from a place of credibility when he talks about this evolution in thinking. Before coming to Hortonworks in March, he was vice president of analytics platform and delivery at eBay, <a href="http://gigaom.com/2012/01/31/under-the-covers-of-ebays-big-data-operation/">a company that knows its way around big data</a>. When people get all their data in one place, they want to do more things with it, he explained. The thinking becomes less about using Hadoop to lower cost and more about “How do I use Hadoop to increase my top line?”.</p>
<p>Besides, Page noted (echoing the sentiment of just about everybody else in the Hadoop space, <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">including Cloudera CEO Mike Olson</a>), even as companies turn Hadoop into their primary data store, it’s difficult to see Hadoop ever entirely replacing high-value relational data warehouse systems like Teradata. One could argue, then, that there’s no real purpose in trying too hard to match those systems in terms of capabilities.</p>
<p>At eBay, he said, they ran an in-depth analysis to see if it was economically or technologically feasible to collapse its big data workloads onto a single system. eBay has dozens of petabytes stored in Hadoop and <a href="http://gigaom.com/2013/03/27/why-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen/">possibly more within various Teradata appliances</a>. The result: “We just couldn’t find a way in which we could justify collapsing everything we do into one system.”</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-486163p1.html">Shutterstock user vblinov</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=650170&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=176956"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=176956" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=650170+why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=650170+why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=650170+why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end&utm_content=dharrisstructure">Sector RoadMap: SQL-on-Hadoop platforms in 2013</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=650170+why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end&utm_content=dharrisstructure">2012: The Hadoop infrastructure market booms</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/29/why-hortonworks-is-riding-a-faster-hive-to-the-bitter-end/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_57327433.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_57327433.jpg?w=150" medium="image">
			<media:title type="html">wasps nest</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/stinger.png" medium="image">
			<media:title type="html">stinger</media:title>
		</media:content>
	</item>
		<item>
		<title>WibiData gets $15M to help it become the Hadoop application company</title>
		<link>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/</link>
		<comments>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/#comments</comments>
		<pubDate>Thu, 23 May 2013 11:31:17 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[OPower]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[WibiData]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=648663</guid>
		<description><![CDATA[Startup WibiData has raised another $15 million and wants to turn the lessons it has learned in the field into generic software that can let anyone build predictive applications on Hadoop.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648663&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.wibidata.com/">WibiData</a> &#8212; the big data startup from Cloudera Co-founder Christophe Bisciglia and Aaron Kimball &#8212; doesn&#8217;t have <em>overly</em> big plans. It only wants to become one of the first, if not the first, company selling off-the-shelf software that lets other companies build valuable, customer-facing applications on Hadoop. On Thursday, WibiData announced $15 million in Series B funding from Canaan Partners, as well as existing investors NEA and Google Chairman Eric Schmidt, to help make the goal a reality. </p>
<p>Kidding aside, that&#8217;s actually quite an ambitious goal in a Hadoop market that&#8217;s big and growing, but that&#8217;s exemplified by expensive consulting arrangements and purpose-built applications. Even more so for companies that want to do something other than transforming unstructured data into structured data (often called ETL) or run back-office analytics jobs. In fact, WibiData has spent the last 18 months doing just this type of deal, and Bisciglia says every single customer has already engaged with one of the big three Hadoop vendors (Cloudera, Hortonworks and MapR). </p>
<p>Home energy-management startup <a href="http://gigaom.com/2012/11/19/opower-the-big-data-energy-player-to-beat/">Opower</a> is a good example of this process. It&#8217;s actually one of Cloudera&#8217;s banner customers, but &#8220;when they wanted to take [their software-as-a-service tool] beyond batch analysis and ETL workloads,&#8221; Bisciglia said, Opower came to WibiData. So whereas the Opower service was originally focused on nightly data analysis comparing users&#8217; energy usage against that of other users, it&#8217;s now working on dynamic recommendations for users and letting them engage with the application in new ways.</p>
<div id="attachment_648685" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg"><img  alt="The WibiData architecture" src="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg?w=300&#038;h=224" width="300" height="224" class="size-medium wp-image-648685" /></a><p class="wp-caption-text">The WibiData architecture</p></div>
<p>During these engagements, WibiData <a href="http://gigaom.com/2012/03/22/wibidata-structure-data-2012/">has been building up its core technology</a> for connecting those brawny back-office Hadoop environments to predictive customer-facing applications &#8211; a collection of HBase, data-formatting tools and machine learning algorithms that the company <a href="http://gigaom.com/2012/11/14/wibidata-open-sources-kiji-to-make-hbase-more-useful/">has been slowly open-sourcing under the Kiji banner</a>. It has also been learning the similarities among the applications it&#8217;s building for customers in the same field, figuring out what&#8217;s repeatable. What does any given company in the retail space, for example, need to get started on <a href="http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/">its own recommendation engine</a>? </p>
<p>And now, Bisciglia says, WibiData is going to double down on building application software based on what it has learned. The first two industries it targets will likely be financial services and retail, two areas where the company has seen a lot of traction. He envisions the finished product including some pre-defined schema for formatting data and some pre-built predictive models, both broadly applicable across that industry rather than specific to a single user. </p>
<p>There will also be different interfaces that allow different types of users (e.g., data scientists, systems engineers and business users) to interact with the data in the ways they need to. </p>
<p>Time will tell if WibiData can actually accomplish its goal of turning Hadoop into a collection of somewhat specialized software packages, but someone has to. Even industry heavyweights like Cloudera see the need, but their hands are full just getting Hadoop integrated into existing environments and getting those early uses up and running. As Cloudera CEO Mike Olson <a href="http://gigaom.com/2012/03/21/cloudera-structure-data-2012/">said at Structure: Data in 2012</a> to anyone ambitious enough to tackle the Hadoop-application gap, &#8220;Call me, I’ll connect you with funding. The money is out there.&#8221; </p>
<p>If you want to hear more about the need for Hadoop applications, check out this panel from Structure: Data 2013, where I speak with WibiData&#8217;s Omer Trajman, Continuuity&#8217;s Jonathan Gray and Pivotal&#8217;s Muddu Sudhakar. <span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/z7BhGEQX9BQ?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648663&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=413596"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=413596" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/07/cloud-and-data-second-quarter-2012-analysis-and-outlook-2/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Takeaways from the second quarter in cloud and data</a></li><li><a href="http://pro.gigaom.com/2011/12/why-the-big-data-startup-boom-will-likely-be-short-lived/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Why the big data startup boom will likely be short-lived</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/wibi-founders.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/wibi-founders.png?w=150" medium="image">
			<media:title type="html">wibi founders</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg?w=300" medium="image">
			<media:title type="html">The WibiData architecture</media:title>
		</media:content>
	</item>
		<item>
		<title>With Impala now GA, Cloudera&#8217;s CEO sizes up the SQL-on-Hadoop market</title>
		<link>http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/</link>
		<comments>http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/#comments</comments>
		<pubDate>Tue, 30 Apr 2013 13:00:40 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[hive]]></category>
		<category><![CDATA[Hortonworks]]></category>
		<category><![CDATA[Impala]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[SQL on Hadoop]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=640777</guid>
		<description><![CDATA[Cloudera's Impala engine for interactive SQL queries on Hadoop data is now generally available, and CEO Mike Olson gives his lay of the competitive landscape.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=640777&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>There is no shortage of confidence in the Hadoop space, and market leader Cloudera bolstered its own on Tuesday with the general availability of its Impala SQL query engine for Hadoop. And if CEO Mike Olson&#8217;s comments are any indication, we&#8217;re in for a long ride of competitive jockeying and oneupmanship as Cloudera and its peers go all Microsoft or Google and create myriad new data-processing engines to turn their Hadoop distributions into bona fide platforms.</p>
<p>Launched as a private beta in May 2012 and <a href="http://gigaom.com/2012/10/24/cloudera-makes-sql-a-first-class-citizen-in-hadoop/">made public in October</a>, Impala is Cloudera&#8217;s attempt to address the growing demand for interactive SQL analytics on Hadoop data. It&#8217;s essentially a massively parallel database designed to share the same storage platform and metadata as Hadoop MapReduce, only it is its own separate processing engine.</p>
<div id="attachment_640848" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/04/impala-arch-new.jpg"><img  alt="How Impala fits in" src="http://gigaom2.files.wordpress.com/2013/04/impala-arch-new.jpg?w=300&#038;h=257" width="300" height="257" class="size-medium wp-image-640848" /></a><p class="wp-caption-text">How Impala fits in</p></div>
<p>Impala actually uses the same &#8220;nearly ANSI&#8221; version of SQL as does current standard bearer Hive, but that technology (created by Facebook in 2009 as a data warehouse layer for Hadoop) doesn&#8217;t run nearly fast enough to sate many users&#8217; desire for interactive analytics. This is because Hive transforms SQL queries into MapReduce jobs, meaning every one is processed against the entire corpus of data in the Hadoop Distributed File System.</p>
<h2 id="sizing-up-the-competition">Sizing up the competition</h2>
<p>Only Cloudera isn&#8217;t the first to have the idea, <a href="http://gigaom.com/2013/02/21/sql-is-whats-next-for-hadoop-heres-whos-doing-it/">nor is it alone in trying to sell interactive SQL on Hadoop</a>. The idea was <a href="http://gigaom.com/2011/10/21/hadapt-raises-9-5m-for-hadoop-data-warehouse/">first commercialized by Boston-based startup Hadapt</a> in 2011, and is now being pushed by numerous startups and larger Hadoop players. Among them: Pivotal (formerly EMC) Greenplum, MapR (with <a href="http://gigaom.com/2012/08/17/for-fast-interactive-hadoop-queries-drill-may-be-the-answer/">Drill</a>), Hortonworks (with <a href="http://hortonworks.com/blog/100x-faster-hive/">Stinger</a>), Drawn to Scale, Splice Machine, Jethro Data and Citus Data.</p>
<div id="attachment_640858" class="wp-caption aligncenter" style="width: 600px"><a href="http://gigaom2.files.wordpress.com/2013/04/had_graphic2-scaled.jpg"><img  alt="Hadapt's architecture" src="http://gigaom2.files.wordpress.com/2013/04/had_graphic2-scaled.jpg?w=708"   class="size-full wp-image-640858" /></a><p class="wp-caption-text">Hadapt&#8217;s architecture</p></div>
<p>But Cloudera is arguably the biggest name pushing SQL on Hadoop, and CEO Mike Olson thinks Impala stands out for several reasons &#8212; not the least of which is that it exists as a product. &#8220;Nobody else is shipping production-grade SQL query support on Hadoop,&#8221; he told me during a recent call. &#8220;At least not in open source.&#8221; He seems content to let the startups do their things, instead focusing his attention on Cloudera&#8217;s big three Hadoop-distribution competitors in Pivotal, MapR and Hortonworks. Greenplum and Pivotal SVP Scott Yara <a href="http://gigaom.com/2013/02/25/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya/">was full of confidence &#8212; and R&amp;D budget</a>&#8211; when the company announced the Pivotal HD distribution and HAWQ technology in February, but Olson claims the approach requires a siloed DBMS within HDFS and is a &#8220;rearguard defensive strategy&#8221; to protect the company&#8217;s sunk costs in its database technology.</p>
<div id="attachment_615210" class="wp-caption aligncenter" style="width: 718px"><a href="http://gigaom2.files.wordpress.com/2013/02/hawq1.jpg"><img  alt="The Pivotal HD and Hawq architecture" src="http://gigaom2.files.wordpress.com/2013/02/hawq1.jpg?w=708&#038;h=387" width="708" height="387" class="size-large wp-image-615210" /></a><p class="wp-caption-text">The Pivotal HD and Hawq architecture</p></div>
<p>As for Hortonworks, Olson questions the wisdom of its Stinger initiative to boost Hive&#8217;s speed, noting that &#8220;Hive never got good while it was running standalone on MapReduce.&#8221; Hortonworks also <a href="http://gigaom.com/2013/04/15/teradata-to-connect-hadoop-and-data-warehouses-roll-out-new-appliance/">partners with vendors such as Teradata</a> to let their platforms access Hadoop data in its native format, but those approaches still require sending data over the network. &#8220;It&#8217;s not the way you would build it if you woke up in the 2000s and were building this anew,&#8221; Olson said.</p>
<div id="attachment_640854" class="wp-caption aligncenter" style="width: 718px"><a href="http://gigaom2.files.wordpress.com/2013/04/stingerroad.png"><img  alt="The Stinger roadmap" src="http://gigaom2.files.wordpress.com/2013/04/stingerroad.png?w=708&#038;h=558" width="708" height="558" class="size-large wp-image-640854" /></a><p class="wp-caption-text">The Stinger roadmap</p></div>
<p>Olson acknowledged that the MapR-led Apache Drill project is cut from the same cloth as Impala (that is, being a Google Dremel clone designed specifically for Hadoop), but &#8220;the difference is we&#8217;re shipping code.&#8221; Being generally available and ready for production workloads means Cloudera can lock down users and market share before many even have a chance to experiment with Drill. He all but dismissed questions over the readiness of Impala, spurred by rumblings in the Hadoop space that Cloudera rushed it into public beta in order to get on the scoreboard against more fully baked offerings.</p>
<p>&#8220;I don&#8217;t feel we&#8217;re under the gun competitively to pull it out of beta because no one else has product in the market,&#8221; Olson said. &#8220;I have no problems &#8230; calling this GA quality.&#8221; He did, however, acknowledge that Impala is shipping with a &#8220;minium viable feature set&#8221; that the company has plans to build on in the near future. Impala Senior Product Manager Justin Erickson noted a few issues of concern, including around the number of concurrent users Impala can support, but said they have been addressed during the beta period.</p>
<h2 id="one-piece-of-a-larger-platform">One piece of a larger platform</h2>
<p>Really, though, the whole point of Impala and its competitors is to turn Hadoop from a tool for batch analytics and mass storage <a href="http://gigaom.com/2013/03/07/5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking/">into a platform that can handle nearly all of companies&#8217; data-processing needs</a>. In that regard, it appears we&#8217;re just getting started. Cloudera, MapR, Pivotal Greenplum and Hortonworks are already pushing their own products and projects, and Olson said &#8220;it&#8217;s absolutely our intent&#8221; to enhance Cloudera&#8217;s platform with even more open-source products &#8212; perhaps even more database technologies <a href="http://gigaom.com/2013/04/22/how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream/">a la HBase</a> &#8212; that will let users do more stuff with more types of data. Over time, this strategy could result in Hadoop displacing the current breed of databases and data warehouses and becoming the single data store atop of which users run whatever applications they so desire. For now, though, especially when it comes to Impala and the data warehouse incumbents, Olson is taking a measured approach. &#8220;The likelihood that we&#8217;re going to knock them off in the near term,&#8221; he said, &#8220;&#8230; it would be a tough fight to win.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=640777&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=388518"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=388518" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=640777+with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=640777+with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market&utm_content=dharrisstructure">Sector RoadMap: SQL-on-Hadoop platforms in 2013</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=640777+with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=640777+with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market&utm_content=dharrisstructure">Cloud and data first-quarter 2013: analysis and outlook</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" medium="image">
			<media:title type="html">Structure Data 2012: Michael Olson – CEO, Cloudera</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/impala-arch-new.jpg?w=300" medium="image">
			<media:title type="html">How Impala fits in</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/had_graphic2-scaled.jpg" medium="image">
			<media:title type="html">Hadapt&#039;s architecture</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/02/hawq1.jpg?w=708" medium="image">
			<media:title type="html">The Pivotal HD and Hawq architecture</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/stingerroad.png?w=708" medium="image">
			<media:title type="html">The Stinger roadmap</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud and data first-quarter 2013: analysis and outlook</title>
		<link>http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/</link>
		<comments>http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/#comments</comments>
		<pubDate>Tue, 09 Apr 2013 06:55:36 +0000</pubDate>
		<dc:creator><a href="http://pro.gigaom.com/members/davidlinthicum/" rel="author">David S. Linthicum</a></dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Amazon cloud computing]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[amazon-elastic-compute-cloud]]></category>
		<category><![CDATA[Amazon.com]]></category>
		<category><![CDATA[apache-hadoop]]></category>
		<category><![CDATA[apple inc.]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[Azure Services Platform]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[central-intelligence-agency]]></category>
		<category><![CDATA[Centralized computing]]></category>
		<category><![CDATA[CIA]]></category>
		<category><![CDATA[Cisco]]></category>
		<category><![CDATA[Cisco Systems]]></category>
		<category><![CDATA[Client/Server]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[cloud computing services]]></category>
		<category><![CDATA[Cloud computing taxes]]></category>
		<category><![CDATA[Cloud Storage]]></category>
		<category><![CDATA[cloud storage services]]></category>
		<category><![CDATA[cloud technology]]></category>
		<category><![CDATA[cloud-applications]]></category>
		<category><![CDATA[cloud-based storage services]]></category>
		<category><![CDATA[cloud-infrastructure]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[CloudMe]]></category>
		<category><![CDATA[computing]]></category>
		<category><![CDATA[consumer-oriented cloud storage services]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[data processing store]]></category>
		<category><![CDATA[Data Synchronization]]></category>
		<category><![CDATA[database management systems]]></category>
		<category><![CDATA[database technology]]></category>
		<category><![CDATA[DataDirect Networks]]></category>
		<category><![CDATA[Datameer]]></category>
		<category><![CDATA[Dropbox]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[file hosting]]></category>
		<category><![CDATA[File system-sharing services]]></category>
		<category><![CDATA[firewall]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hewlett-Packard]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[icloud]]></category>
		<category><![CDATA[Idaho State Tax Commission]]></category>
		<category><![CDATA[Income taxes]]></category>
		<category><![CDATA[Intel]]></category>
		<category><![CDATA[iPad]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[Joyent]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[Macquarie Capital]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[massively parallel processing]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[microsoft-windows]]></category>
		<category><![CDATA[mobile device]]></category>
		<category><![CDATA[MongoDB]]></category>
		<category><![CDATA[Nimbula]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[ObjectRocket]]></category>
		<category><![CDATA[Online backup services]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[OpenStack]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[oracle-corporation]]></category>
		<category><![CDATA[oracle-database]]></category>
		<category><![CDATA[parallel processing]]></category>
		<category><![CDATA[private clouds]]></category>
		<category><![CDATA[Public Clouds]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[Relational database]]></category>
		<category><![CDATA[relational database management systems]]></category>
		<category><![CDATA[saleseforce-com]]></category>
		<category><![CDATA[Salesforce.com]]></category>
		<category><![CDATA[SAN]]></category>
		<category><![CDATA[smartphone]]></category>
		<category><![CDATA[smartphones]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[software delivery]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[Star Analytics]]></category>
		<category><![CDATA[storage-area-network]]></category>
		<category><![CDATA[Tablet computer]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[U.S. government]]></category>

		<guid isPermaLink="false">http://pro.gigaom.com/?post_type=go-report&#038;p=173124/</guid>
		<description><![CDATA[Cloud computing is finally starting to add value to business, as those in charge of cloud within enterprises are moving from talking to doing. That much was very evident in the first quarter of 2013.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648537&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Cloud computing is finally starting to add value to business, as those in charge of cloud within enterprises are moving from talking to doing. That much was very evident in the first quarter of 2013.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648537&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=316195"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=316195" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648537+cloud-and-data-first-quarter-2013-analysis-and-outlook&utm_content=gigaedit">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/04/infrastructure-q1-iaas-comes-down-to-earth-big-data-takes-flight/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648537+cloud-and-data-first-quarter-2013-analysis-and-outlook&utm_content=gigaedit">Infrastructure Q1: IaaS Comes Down to Earth; Big Data Takes Flight</a></li><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648537+cloud-and-data-first-quarter-2013-analysis-and-outlook&utm_content=gigaedit">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2011/07/infrastructure-q2-big-data-and-paas-gain-more-momentum/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648537+cloud-and-data-first-quarter-2013-analysis-and-outlook&utm_content=gigaedit">Infrastructure Q2: Big data and PaaS gain more momentum</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://pro.gigaom.com/files/2009/04/gigaompromasterimagecloud.jpg?w=150" />
		<media:content url="http://pro.gigaom.com/files/2009/04/gigaompromasterimagecloud.jpg?w=150" medium="image">
			<media:title type="html">gigaompromasterimagecloud</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/4f3860069d181dbeeb398304f5940a9e?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigaedit</media:title>
		</media:content>
	</item>
		<item>
		<title>Ignition raises $150M fund, opens Silicon Valley office, to back enterprise IT</title>
		<link>http://gigaom.com/2013/04/03/ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it/</link>
		<comments>http://gigaom.com/2013/04/03/ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it/#comments</comments>
		<pubDate>Wed, 03 Apr 2013 13:00:38 +0000</pubDate>
		<dc:creator>Barb Darrow</dc:creator>
				<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[enterprise IT]]></category>
		<category><![CDATA[Frank Artale]]></category>
		<category><![CDATA[Ignition Partners]]></category>
		<category><![CDATA[John Connors]]></category>
		<category><![CDATA[Nick Sturiale]]></category>
		<category><![CDATA[Paul Maritz]]></category>
		<category><![CDATA[splunk]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=626900</guid>
		<description><![CDATA[Enterprise IT is a segment that has been underserved, says Ignition Partners' Frank Artale, so Ignition launched a new fund to attack that opportunity.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=626900&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>More evidence that <a href="http://gigaom.com/2013/03/30/welcome-to-the-golden-age-of-enterprise-it-and-get-used-to-it-itll-be-here-for-a-while/">boring enterprise IT is not so boring anymore</a>: <a href="http://www.ignitionpartners.com/">Ignition Partners</a> has launched (and already closed) a new $150 million fund focused on technologies that businesses will buy and implement.</p>
<p><a href="http://gigaom.com/2013/01/18/wanted-an-amazon-enterprise-challenge/shutterstock_71910823/" rel="attachment wp-att-602411"><img alt="enterprise IT" src="http://gigaom2.files.wordpress.com/2013/01/shutterstock_71910823.jpg?w=300&#038;h=225" width="300" height="225" class="alignleft size-medium wp-image-602411"></a>The Bellevue, Wash.-based early-stage VC firm will also open an office in Palo Alto, Calif. to better attack these opportunities, said Frank Artale, general partner who will run this new fund, informally dubbed Ignition V. The company brought on Nick Sturiale, a new partner, to run that office.</p>
<p>“We think that businesses and people who work in businesses have been largely underserved for the past 15 years,”  Artale said in a recent interview.</p>
<p>The goal of the dual offices is to promote cross-pollination and collaboration. ”We want to do real social networking here — not just Facebook stuff,” Artale added. “Palo Alto and the Bay Area are super important as great entrepreneurial engines — Cisco, Oracle and other companies down there spit out great entrepreneurs.”</p>
<h2 id="goal-apps-that-combine-consume">Goal: Apps that combine consumer ease of use with enterprise utility</h2>
<p>Ignition has some credibility in the enterprise. Several team members – including Artale, John Connors, and Cameron Myhrvold — are former Microsoft executives. And previous investments include Cloudera, Splunk, Zenprise, DocuSign, Opscode, Parse and Bromium.</p>
<p>New enterprise applications have to work well and look good on laptops and PCs, but also on tablets and phones as the consumerization of IT trend continues, he said.</p>
<p>Artale which described the new fund as “slightly oversubscribed”  took three months to fund. Investors include new and existing university endowments, pension funds and investment companies. Ignition V is smaller than the previous fund, which weighed in at $400 million but will also focus more — eschewing investments in telecom and consumer internet companies, Artale said.</p>
<p>The notion that enterprise IT is back as a hot category is cropping up all over. New vendors — large and small — are building consumer-grade products but for business use. Pivotal Initiative chief <a href="http://gigaom.com/2013/03/19/the-world-is-ready-for-the-consumer-grade-enterprise/">Paul Maritz spoke in depth about this</a> at the recent Structure: Data conference in New York and the topic will doubtless crop up again at <a href="http://event.gigaom.com/structure/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=626900+ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it&amp;utm_content=gigabarb">Structure</a> in San Francisco in June.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=626900&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=431675"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=431675" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=626900+ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it&utm_content=gigabarb">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/12/big-data-2013-key-trends-and-companies-to-watch/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=626900+ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it&utm_content=gigabarb">Big data 2013: key trends and companies to watch</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=626900+ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it&utm_content=gigabarb">2012: The Hadoop infrastructure market booms</a></li><li><a href="http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=626900+ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it&utm_content=gigabarb">How to manage big data without breaking the bank</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/03/ignition-raises-150m-fund-opens-silicon-valley-office-to-back-enterprise-it/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/04/frank-artale-april-2013-gigaom1.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/04/frank-artale-april-2013-gigaom1.jpg?w=150" medium="image">
			<media:title type="html">Frank Artale</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/4af03439988d64f816da72496325cb73?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigabarb</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/shutterstock_71910823.jpg?w=300" medium="image">
			<media:title type="html">enterprise IT</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloudera scores analytics-as-a-service deal with Germany&#8217;s T-Systems</title>
		<link>http://gigaom.com/2013/03/21/cloudera-scores-analytics-as-a-service-deal-with-germanys-t-systems/</link>
		<comments>http://gigaom.com/2013/03/21/cloudera-scores-analytics-as-a-service-deal-with-germanys-t-systems/#comments</comments>
		<pubDate>Thu, 21 Mar 2013 10:12:13 +0000</pubDate>
		<dc:creator>David Meyer</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[t-systems]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=622707</guid>
		<description><![CDATA[The strategic partnership will see Cloudera's enterprise Hadoop distribution, along with its Impala real-time query engine, running on top of T-Systems' extensive cloud infrastructure in Europe and beyond.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622707&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Cloudera&#8217;s Hadoop implementation just got a big boost through a strategic partnership with Deutsche Telekom&#8217;s T-Systems, one of Europe&#8217;s biggest IT services companies. This would appear to be one of the first results of Cloudera&#8217;s <a href="http://gigaom.com/2012/12/06/cloudera-snares-big-65m-more-to-boost-international-enterprise-growth/">freshly-funded international enterprise push</a>.</p>
<p>The two companies are touting analytics-as-a-service using <a href="http://www.cloudera.com/content/cloudera/en/products/cloudera-enterprise-core/cloudera-enterprise-RTQ.html">Cloudera Enterprise RTQ</a>, featuring the <a href="http://gigaom.com/2012/10/24/cloudera-makes-sql-a-first-class-citizen-in-hadoop/">Impala SQL query engine</a>, on top of T-Systems&#8217; existing cloud computing infrastructure. The package is available immediately for T-Systems&#8217; European customers, while those outside Europe will get access in due course.</p>
<p>&#8220;Our customers don&#8217;t want to have to worry about the hardware and software for big data,&#8221; claimed T-Systems BI and big data chief Christian Wirth in a statement. &#8220;They don&#8217;t want technology, just a reliable service. We can offer precisely this &#8212; which is what makes our new offer with Cloudera so special.&#8221;</p>
<p>T-Systems counts big names such as Volkswagen and Royal Dutch Shell among its customers, so this is a significant deal for Cloudera. Cloudera&#8217;s is the go-to Hadoop distribution right now, but its position may not be unassailable: EMC&#8217;s GreenPlum division recently <a href="http://gigaom.com/2013/02/26/cloudera-who-intel-announces-its-own-hadoop-distribution/">revamped its distribution</a> by fusing it with its own analytics database, and even <a href="http://gigaom.com/2013/02/26/cloudera-who-intel-announces-its-own-hadoop-distribution/">Intel now has a distribution out there</a>.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622707&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=224934"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=224934" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622707+cloudera-scores-analytics-as-a-service-deal-with-germanys-t-systems&utm_content=superglaze">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622707+cloudera-scores-analytics-as-a-service-deal-with-germanys-t-systems&utm_content=superglaze">Cloud and data first-quarter 2013: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622707+cloudera-scores-analytics-as-a-service-deal-with-germanys-t-systems&utm_content=superglaze">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622707+cloudera-scores-analytics-as-a-service-deal-with-germanys-t-systems&utm_content=superglaze">A near-term outlook for big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/21/cloudera-scores-analytics-as-a-service-deal-with-germanys-t-systems/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=150" medium="image">
			<media:title type="html">Structure Data 2012: Michael Olson – CEO, Cloudera</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6599daccfd7e897e68744fe0065e5a2e?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">superglaze</media:title>
		</media:content>
	</item>
		<item>
		<title>Big data is still hard, but it gets better</title>
		<link>http://gigaom.com/2013/03/20/big-data-is-still-hard-but-it-gets-better/</link>
		<comments>http://gigaom.com/2013/03/20/big-data-is-still-hard-but-it-gets-better/#comments</comments>
		<pubDate>Wed, 20 Mar 2013 19:35:03 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[dj patil]]></category>
		<category><![CDATA[Greylock]]></category>
		<category><![CDATA[Jeff Hammerbacher]]></category>
		<category><![CDATA[Quid]]></category>
		<category><![CDATA[Structure Data 2013]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=622484</guid>
		<description><![CDATA[When it comes to using big data, there are still bottlenecks. Many of these are around the tools that people use to try to make sense of massive amounts of information.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622484&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>What’s standing between your staff and big data analysis? That was the existential question posed of DJ Patil and Jeff Hammerbacher at the <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=622484+big-data-is-still-hard-but-it-gets-better&amp;utm_content=shigginbotham">GigaOM Structure:Data</a> event today in New York. The two had different takes on how easy it was to give people the power to use data, with Hammerbacher, who is the co-founder of Cloudera, saying that it’s pretty simple today.</p>
<p>He did say that today many aspects of the input and ingress of data will end up being automated, much like systems administrators responsible for running the data center have seen many of their tasks automated.</p>
<p>Patil, who is now a data scientist in residence at Greylock Partners, was a bit more focused on end users. He shared his visit to a nonprofit called DoSomething.org earlier today, and said that people there had plenty of curiosity and a desire to play with data and ask questions, but they didn’t always know what to ask to get the insights they seemed to want. “We need another layer to help those people figure out what they want to ask,” he said.</p>
<p>From Patil’s perspective we need tools that will help us tell stories with data and let people play with it in ways that can help people come to new conclusions or see new relationships. “This is less of a machine learning problem than a ‘Can I try a bunch of things with the data?’ kind of problem,” said Patil.</p>
<p>And for those who are still intimidated by playing around with big data Patil has this to say, “Most people doing sophisticated analysis they don’t really know what they are doing.”</p>
<p>Check out the rest of our Structure:Data 2013 coverage here, and a video embed of the session follows below:</p>
<p><iframe src="http://new.livestream.com/accounts/74987/events/1927733/videos/14313139/player?autoPlay=false&amp;height=360&amp;mute=false&amp;width=640" width="640" height="360" frameborder="0" scrolling="no"></iframe><br>
A transcription of the video follows on the next page</p>
<p><a href="http://gigaom.com/2013/03/20/big-data-is-still-hard-but-it-gets-better/2/">Go to page 2 (of 2) on GigaOM .</a></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622484&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=953515"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=953515" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622484+big-data-is-still-hard-but-it-gets-better&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622484+big-data-is-still-hard-but-it-gets-better&utm_content=shigginbotham">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/08/big-data-is-real-but-only-when-you-ask-the-right-question/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622484+big-data-is-still-hard-but-it-gets-better&utm_content=shigginbotham">Big data is real, but only when you ask the right questions</a></li><li><a href="http://pro.gigaom.com/2012/05/pervasive-software-retools-for-cloud-big-data-will-it-be-heard/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622484+big-data-is-still-hard-but-it-gets-better&utm_content=shigginbotham">Pervasive Software retools for cloud, big data: will it be heard?</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/20/big-data-is-still-hard-but-it-gets-better/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/03/6qyvdm-kms8y7dikvq2vjo6702u-lksoid-tmvkdmau.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/03/6qyvdm-kms8y7dikvq2vjo6702u-lksoid-tmvkdmau.jpeg?w=150" medium="image">
			<media:title type="html">DJ Patil Greylock Ventures Jeff Hammerbacher Cloudera  Structure Data 2013</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>
	</item>
		<item>
		<title>Sector RoadMap: SQL-on-Hadoop platforms in 2013</title>
		<link>http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/</link>
		<comments>http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/#comments</comments>
		<pubDate>Wed, 20 Mar 2013 12:00:16 +0000</pubDate>
		<dc:creator><a href="http://pro.gigaom.com/members/josephturian/" rel="author">Joseph Turian</a></dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[apache-hive]]></category>
		<category><![CDATA[aster]]></category>
		<category><![CDATA[Aster Big Analytics Appliance]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[BigInsights]]></category>
		<category><![CDATA[Citus Data]]></category>
		<category><![CDATA[CitusDB]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Clustrix]]></category>
		<category><![CDATA[Concurrent]]></category>
		<category><![CDATA[Database theory]]></category>
		<category><![CDATA[Dremel]]></category>
		<category><![CDATA[Drill]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Hadapt]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hadoop Distributed File System]]></category>
		<category><![CDATA[HAWQ]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[HCatalog]]></category>
		<category><![CDATA[HDFS]]></category>
		<category><![CDATA[hive]]></category>
		<category><![CDATA[Hortonworks]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Impala]]></category>
		<category><![CDATA[JethroData]]></category>
		<category><![CDATA[karmasphere]]></category>
		<category><![CDATA[Lingual]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[mapreduce]]></category>
		<category><![CDATA[MemSQL]]></category>
		<category><![CDATA[microstrategy]]></category>
		<category><![CDATA[MongoDB]]></category>
		<category><![CDATA[MPP]]></category>
		<category><![CDATA[NewSQL]]></category>
		<category><![CDATA[Optiq]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[parallel computing]]></category>
		<category><![CDATA[pig]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[PostGIS]]></category>
		<category><![CDATA[PostgreSQL]]></category>
		<category><![CDATA[PostreSQL]]></category>
		<category><![CDATA[RainStor]]></category>
		<category><![CDATA[Salesforce.com]]></category>
		<category><![CDATA[SAP]]></category>
		<category><![CDATA[SAP HANA]]></category>
		<category><![CDATA[Splice Machine]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[SQL 92]]></category>
		<category><![CDATA[SQL-H]]></category>
		<category><![CDATA[SQLStream]]></category>
		<category><![CDATA[Stinger]]></category>
		<category><![CDATA[Stringer]]></category>
		<category><![CDATA[tableau]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[VoltDB]]></category>
		<category><![CDATA[zookeeper]]></category>

		<guid isPermaLink="false">http://pro.gigaom.com/?post_type=go-report&#038;p=171512/</guid>
		<description><![CDATA[Today’s most successful companies are the ones with the ability to capture and analyze all data available to them. Enter SQL-on-Hadoop solutions, which increase the accessibility of Hadoop and allow organizations to reuse their investment learning in SQL. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648564&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Today’s most successful companies are the ones with the ability to capture and analyze all data available to them. Enter SQL-on-Hadoop solutions, which increase the accessibility of Hadoop and allow organizations to reuse their investment learning in SQL. </p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648564&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=739492"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=739492" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648564+sql-on-hadoop-roadmap-2013&utm_content=gigaedit">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648564+sql-on-hadoop-roadmap-2013&utm_content=gigaedit">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648564+sql-on-hadoop-roadmap-2013&utm_content=gigaedit">2012: The Hadoop infrastructure market booms</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648564+sql-on-hadoop-roadmap-2013&utm_content=gigaedit">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/04/elephant.jpg?w=150" />
		<media:content url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/04/elephant.jpg?w=150" medium="image">
			<media:title type="html">elephant</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/4f3860069d181dbeeb398304f5940a9e?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigaedit</media:title>
		</media:content>
	</item>
	</channel>
</rss>