<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; Hadoop</title>
	<atom:link href="http://gigaom.com/tag/hadoop/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Sun, 26 May 2013 05:11:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; Hadoop</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>WibiData gets $15M to help it become the Hadoop application company</title>
		<link>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/</link>
		<comments>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/#comments</comments>
		<pubDate>Thu, 23 May 2013 11:31:17 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[OPower]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[WibiData]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=648663</guid>
		<description><![CDATA[Startup WibiData has raised another $15 million and wants to turn the lessons it has learned in the field into generic software that can let anyone build predictive applications on Hadoop.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648663&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.wibidata.com/">WibiData</a> &#8212; the big data startup from Cloudera Co-founder Christophe Bisciglia and Aaron Kimball &#8212; doesn&#8217;t have <em>overly</em> big plans. It only wants to become one of the first, if not the first, company selling off-the-shelf software that lets other companies build valuable, customer-facing applications on Hadoop. On Thursday, WibiData announced $15 million in Series B funding from Canaan Partners, as well as existing investors NEA and Google Chairman Eric Schmidt, to help make the goal a reality. </p>
<p>Kidding aside, that&#8217;s actually quite an ambitious goal in a Hadoop market that&#8217;s big and growing, but that&#8217;s exemplified by expensive consulting arrangements and purpose-built applications. Even more so for companies that want to do something other than transforming unstructured data into structured data (often called ETL) or run back-office analytics jobs. In fact, WibiData has spent the last 18 months doing just this type of deal, and Bisciglia says every single customer has already engaged with one of the big three Hadoop vendors (Cloudera, Hortonworks and MapR). </p>
<p>Home energy-management startup <a href="http://gigaom.com/2012/11/19/opower-the-big-data-energy-player-to-beat/">Opower</a> is a good example of this process. It&#8217;s actually one of Cloudera&#8217;s banner customers, but &#8220;when they wanted to take [their software-as-a-service tool] beyond batch analysis and ETL workloads,&#8221; Bisciglia said, Opower came to WibiData. So whereas the Opower service was originally focused on nightly data analysis comparing users&#8217; energy usage against that of other users, it&#8217;s now working on dynamic recommendations for users and letting them engage with the application in new ways.</p>
<div id="attachment_648685" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg"><img  alt="The WibiData architecture" src="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg?w=300&#038;h=224" width="300" height="224" class="size-medium wp-image-648685" /></a><p class="wp-caption-text">The WibiData architecture</p></div>
<p>During these engagements, WibiData <a href="http://gigaom.com/2012/03/22/wibidata-structure-data-2012/">has been building up its core technology</a> for connecting those brawny back-office Hadoop environments to predictive customer-facing applications &#8211; a collection of HBase, data-formatting tools and machine learning algorithms that the company <a href="http://gigaom.com/2012/11/14/wibidata-open-sources-kiji-to-make-hbase-more-useful/">has been slowly open-sourcing under the Kiji banner</a>. It has also been learning the similarities among the applications it&#8217;s building for customers in the same field, figuring out what&#8217;s repeatable. What does any given company in the retail space, for example, need to get started on <a href="http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/">its own recommendation engine</a>? </p>
<p>And now, Bisciglia says, WibiData is going to double down on building application software based on what it has learned. The first two industries it targets will likely be financial services and retail, two areas where the company has seen a lot of traction. He envisions the finished product including some pre-defined schema for formatting data and some pre-built predictive models, both broadly applicable across that industry rather than specific to a single user. </p>
<p>There will also be different interfaces that allow different types of users (e.g., data scientists, systems engineers and business users) to interact with the data in the ways they need to. </p>
<p>Time will tell if WibiData can actually accomplish its goal of turning Hadoop into a collection of somewhat specialized software packages, but someone has to. Even industry heavyweights like Cloudera see the need, but their hands are full just getting Hadoop integrated into existing environments and getting those early uses up and running. As Cloudera CEO Mike Olson <a href="http://gigaom.com/2012/03/21/cloudera-structure-data-2012/">said at Structure: Data in 2012</a> to anyone ambitious enough to tackle the Hadoop-application gap, &#8220;Call me, I’ll connect you with funding. The money is out there.&#8221; </p>
<p>If you want to hear more about the need for Hadoop applications, check out this panel from Structure: Data 2013, where I speak with WibiData&#8217;s Omer Trajman, Continuuity&#8217;s Jonathan Gray and Pivotal&#8217;s Muddu Sudhakar. <span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/z7BhGEQX9BQ?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648663&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=381226"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=381226" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/07/cloud-and-data-second-quarter-2012-analysis-and-outlook-2/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Takeaways from the second quarter in cloud and data</a></li><li><a href="http://pro.gigaom.com/2011/12/why-the-big-data-startup-boom-will-likely-be-short-lived/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Why the big data startup boom will likely be short-lived</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/wibi-founders.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/wibi-founders.png?w=150" medium="image">
			<media:title type="html">wibi founders</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg?w=300" medium="image">
			<media:title type="html">The WibiData architecture</media:title>
		</media:content>
	</item>
		<item>
		<title>Concurrent is building a Hadoop assembly line in open source</title>
		<link>http://gigaom.com/2013/05/22/concurrent-is-building-a-hadoop-assembly-line-in-open-source/</link>
		<comments>http://gigaom.com/2013/05/22/concurrent-is-building-a-hadoop-assembly-line-in-open-source/#comments</comments>
		<pubDate>Wed, 22 May 2013 19:21:16 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cascading]]></category>
		<category><![CDATA[Concurrent]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Lingual]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[Pattern]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[statistical analysis]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=648186</guid>
		<description><![CDATA[Cascading creator Concurrent has developed a new open source tool called Pattern for running machine learning models on Hadoop clusters. When combined with its SQL tool called Lingual, users can move data from one stage to another easily.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648186&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>If you know Java, R or SAS, doing machine learning on Hadoop data just got a lot easier. <a href="http://www.concurrentinc.com/">Concurrent</a> <em>(</em><em>see disclosure)</em>, the company behind the popular <a href="http://www.cascading.org/">Cascading</a> framework for writing big data jobs, has developed a new open source tool called <a href="http://www.cascading.org/pattern/">Pattern</a> that lets users export their models from statistical analysis applications and run THEM? at scale on Hadoop data with little to no code change.</p>
<p>The reason for creating Pattern is pretty simple, according to Concurrent founder and CTO Chris Wensel: &#8220;Hadoop is never used alone.&#8221; It&#8217;s always part of a data environment that also includes databases, visualization tools, analytics software and/or statistical analysis tools that arguably do the really valuable work. Hadoop&#8217;s real value is an integration platform that can feed data into these other systems and, ideally, put their outputs to work across much larger datasets.</p>
<p>Developers <em>can</em> use the Pattern Java API to create machine learning jobs, but they can also simply export a Predictive Model Markup Language (PMML) file from software like R, SAS and MicroStrategy that Pattern will read and run them as a Cascading workflow. Models are useless unless you can run them in production, Wensel said, and Pattern lets them run across more data, stored in Hadoop, than you can use to build them with those other tools.</p>
<p>However, Wensel noted, &#8220;The real takeaway isn&#8217;t Pattern itself.&#8221;</p>
<p>From his perspective, the real story is Pattern plus Cascading plus <a href="http://www.cascading.org/lingual/">Lingual</a>, the open source SQL-to-Hadoop tool that Concurrent recently developed and released. Lingual is the tie that binds everything together, creating a sort of assembly line for data as it works its way from generation to delivering some value. For example, someone might create a Cascading job that adds structure to incoming data, and then pull some of the data into R using Lingual. Once a model is created in R and exported to the Hadoop cluster using Pattern, Lingual can feed the MapReduce output file back to R so a data scientist can test the model&#8217;s accuracy.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/arch-diagram.png"><img  alt="arch-diagram" src="http://gigaom2.files.wordpress.com/2013/05/arch-diagram.png?w=708"   class="aligncenter size-full wp-image-648347" /></a></p>
<p>And actually, Wensel said, Lingual could have a positive effect on companies&#8217; bottom lines. Airbnb recently replaced a departed engineer with Lingual for monthly migrations of data from Hadoop and into SQL environments. Climate Corporation, <a href="http://gigaom.com/2012/05/02/how-climate-corp-is-pitting-big-data-against-mother-nature/">a massive Hadoop and Cascading user</a>, could use Lingual to let its crop-and-weather insurance customers access their data from the company&#8217;s Hadoop store.</p>
<p>Lingual and Pattern should help Concurrent finally make some money, too. Both of them, as well as the Cascading framework that underpins them, will always be open source, Wensel said, but it plans to create &#8220;a suite of products that will make your life much better if &#8230; you standardize on Cascading.&#8221;</p>
<p>For example, the company has the ability to monitor jobs at the application level rather than the cluster level, meaning it can tell you the details of that job that&#8217;s locking up all the resources and whether you really want to kill it (it might be an important report for the CFO &#8230;). &#8220;We can do some really interesting things,&#8221; Wensel said.</p>
<p><em><strong>Disclosure</strong>: Concurrent is backed by True Ventures, a venture capital firm that is an investor in the parent company of this blog, Giga Omni Media. Om Malik, the founder of Giga Omni Media, is also a venture partner at True.</em></p>
<p><em>This post was updated at 2:48pm PT to correct Chris Wensel&#8217;s title. He is CTO.</em></p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-908242p1.html">Shutterstock user PENGYOU91</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648186&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=970075"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=970075" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">Sector RoadMap: SQL-on-Hadoop platforms in 2013</a></li><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">Cloud and data first-quarter 2013: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/report/how-to-use-big-data-to-make-better-business-decisions/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">How to use big data to make better business decisions</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/22/concurrent-is-building-a-hadoop-assembly-line-in-open-source/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_98915513.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_98915513.jpg?w=150" medium="image">
			<media:title type="html">assembly line</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/arch-diagram.png" medium="image">
			<media:title type="html">arch-diagram</media:title>
		</media:content>
	</item>
		<item>
		<title>Database startup Drawn to Scale is closing down</title>
		<link>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/</link>
		<comments>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/#comments</comments>
		<pubDate>Fri, 17 May 2013 21:24:03 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Drawn to Scale]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[SQL on Hadoop]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=646718</guid>
		<description><![CDATA[Database startup Drawn to Scale, creator of the SQL-on-Hadoop technology called Spire, is closing down. The company's product, Spire, was one of the first SQL-on-Hadoop technologies.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646718&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Database startup Drawn to Scale, creator of the SQL-on-Hadoop technology called Spire, is closing down. Co-founder and CEO Bradford Stephens officially <a href="http://www.roadtofailure.com/?p=11">announced the closure in a blog post</a> on Friday.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png"><img  alt="spirearchitecture-015-e1361407038325" src="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png?w=300&#038;h=185" width="300" height="185" class="alignleft size-medium wp-image-646740" /></a>The company&#8217;s product, Spire, which provided full SQL support on top of the HBase NoSQL database, was one of the first products to <a href="http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/">try to blend Hadoop&#8217;s scalability with the robustness and familiarity of SQL</a>. That&#8217;s now <a href="http://gigaom.com/2013/03/05/the-hadoop-ecosystem-the-welcome-elephant-in-the-room-infographic/">an increasingly crowded space</a> (and has grown since that linked graphic was created). In March, Drawn to Scale <a href="http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/">expanded its support to MongoDB</a>, as well.</p>
<p>I wasn&#8217;t shocked when Stephens told me the news &#8212; questions about the four-year-old company&#8217;s financial health had been swirling for a while &#8212; but to hear of its financial woes was a bit surprising. His account in the post pretty much echoes what I had heard from others:</p>
<blockquote id="quote-it-seemed-we-had-eve"><p>&#8220;It seemed we had everything going for us — paid customers such as American Express, Orange Telecom, Flurry, and 4 others. Our technology worked brilliantly, we had a big hiring pipeline, and we had great media presence against our competitors who raised 10-100x more cash.&#8221;</p></blockquote>
<p>He added:</p>
<blockquote id="quote-yet-five-days-before2"><p>&#8220;Yet five days before we signed term sheets for a big A round or sold the company, we started getting hit by a series of black swans — and we just didn’t have what we needed to recover. I’ll leave the public detail at that level, but I will say that paying employees’ health insurance out of your meager savings is a powerful incentive to change course.&#8221;</p></blockquote>
<p>Up to this point, the company <a href="http://gigaom.com/2012/03/08/drawn-to-scale-raises-money-to-make-sql-big-data-ready/">had raised $925,000</a> from RTP Ventures, IA Ventures and SK Ventures. There&#8217;s no word yet on what will come of the company&#8217;s intellectual property.</p>
<p>As Stephens &#8212; who&#8217;s now doing an entrepreneur-in-residence gig at Ping Identity and helping out other startups (including popular wardrobe app <a href="http://www.clothapp.com/">Cloth</a>) &#8212; succinctly put it during a phone discussion, &#8220;We just don&#8217;t have the horsepower to keep running the company.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646718&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=243352"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=243352" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">Sector RoadMap: SQL-on-Hadoop platforms in 2013</a></li><li><a href="http://pro.gigaom.com/report/how-to-use-big-data-to-make-better-business-decisions/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">How to use big data to make better business decisions</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/dtsdragon.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/dtsdragon.png?w=150" medium="image">
			<media:title type="html">dtsdragon</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png?w=300" medium="image">
			<media:title type="html">spirearchitecture-015-e1361407038325</media:title>
		</media:content>
	</item>
		<item>
		<title>How data warehousing is now a cost-effective solution for businesses</title>
		<link>http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/</link>
		<comments>http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/#comments</comments>
		<pubDate>Mon, 13 May 2013 06:55:34 +0000</pubDate>
		<dc:creator>nraden</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ADAPA]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[apache-hadoop]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[clickstream analytics]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[cloud-infrastructure]]></category>
		<category><![CDATA[columnar databases]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[data storage]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[data-analytics]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[database technology]]></category>
		<category><![CDATA[Database theory]]></category>
		<category><![CDATA[distributed processing]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[etl]]></category>
		<category><![CDATA[extraction transform load systems]]></category>
		<category><![CDATA[Ferrari]]></category>
		<category><![CDATA[file systems]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[HDFS]]></category>
		<category><![CDATA[Hewlett-Packard]]></category>
		<category><![CDATA[high-speed technologies]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[information technology]]></category>
		<category><![CDATA[integrated circuit]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[mobile devices]]></category>
		<category><![CDATA[Moore's Law]]></category>
		<category><![CDATA[Netezza]]></category>
		<category><![CDATA[Open Source Software]]></category>
		<category><![CDATA[parallel processing]]></category>
		<category><![CDATA[relational-databases]]></category>
		<category><![CDATA[scalability]]></category>
		<category><![CDATA[social networks]]></category>
		<category><![CDATA[storage devices]]></category>
		<category><![CDATA[storage virtualization technologies]]></category>
		<category><![CDATA[System administration]]></category>
		<category><![CDATA[tco]]></category>
		<category><![CDATA[total-cost-of-ownership]]></category>
		<category><![CDATA[Transaction processing]]></category>
		<category><![CDATA[Truviso]]></category>
		<category><![CDATA[Vertica]]></category>
		<category><![CDATA[Virtualization technology]]></category>

		<guid isPermaLink="false">http://pro.gigaom.com/?post_type=go-report&#038;p=175747/</guid>
		<description><![CDATA[Data-warehouse providers are quickly adding Hadoop distributions, or even their own versions of Hadoop, into their architecture, adding further cost advantages to collections of extremely large data sets. Finding the talent to manage this newly converged environment will not be easy, but it presents tremendous opportunity for companies willing to take some risk.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648494&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The new economics of data warehousing provide attractive alternatives in both costs and benefits. While big data gets most of the attention, evolved data warehousing will play an important role for the foreseeable future. In order to be relevant, data-warehouse design and operation need to be simplified, taking advantage of greatly improved hardware, software, and methods.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648494&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=733051"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=733051" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648494+the-new-economics-of-enterprise-data-warehousing&utm_content=nraden">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648494+the-new-economics-of-enterprise-data-warehousing&utm_content=nraden">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648494+the-new-economics-of-enterprise-data-warehousing&utm_content=nraden">Cloud and data first-quarter 2013: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/report/how-to-use-big-data-to-make-better-business-decisions/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648494+the-new-economics-of-enterprise-data-warehousing&utm_content=nraden">How to use big data to make better business decisions</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/08/datacenter1.jpg?w=150" />
		<media:content url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/08/datacenter1.jpg?w=150" medium="image">
			<media:title type="html">datacenter1</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/fdbbd80432b14e9d84aa12c6fc0cce24?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">nraden</media:title>
		</media:content>
	</item>
		<item>
		<title>Why 3 celebrity data scientists are willing to work for free &#8212; for you</title>
		<link>http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/</link>
		<comments>http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/#comments</comments>
		<pubDate>Wed, 08 May 2013 16:58:30 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hilary Mason]]></category>
		<category><![CDATA[Mortar Data]]></category>
		<category><![CDATA[recommendation engines]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=643353</guid>
		<description><![CDATA[Hadoop startup Mortar Data is offering to build recommendation systems for 10 companies, with help from Hilary Mason, Drew Conway and Max Shron. It's part of a bigger plan to democratize the science behind online recommendations.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643353&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Hadoop-in-the-cloud startup Mortar Data is on a mission to bring recommendation engines to the masses, and it has recruited three well-known data scientists to aid its cause. On Wednesday, the company will start accepting applications <a href="http://mortardata.com/">on its website</a> from companies that would like to have Mortar Data &#8212; as well as Bit.ly&#8217;s <a href="http://www.hilarymason.com/">Hilary Mason</a>, IA Ventures Scientist-in-Residence <a href="http://drewconway.com/">Drew Conway</a> and freelancer (and former OKCupid data scientist) <a href="http://shron.net/about">Max Shron</a> &#8212; build a custom recommendation system for them.</p>
<p>The way it works, said Mortar Co-founder and CEO K Young, is that his company will choose eight companies (in addition to the two it has been working with already) to implement custom systems based on their specific needs and businesses. Mason, Conway and Shron will split their time among the 10 total companies, but will be much more than advisers &#8212; they&#8217;ll actually dig into the data and work hands-on to ensure the right techniques and algorithms are applied in the right places.</p>
<p>The applicant companies will keep any custom code, but the ultimate goal from Mortar&#8217;s perspective is to learn some best practices and create reusable building blocks that will let anyone create recommendation engines without pre-existing data science knowledge. Recommendation engines <a href="http://gigaom.com/2013/01/29/you-might-also-like-to-know-how-online-recommendations-work/">are commonplace on large web sites</a> (Netflix, Spotify, iTunes, Google, Amazon, <a href="http://gigaom.com/2013/03/03/how-and-why-linkedin-is-becoming-an-engineering-powerhouse/">LinkedIn</a>, Eventbrite and the list goes on) but smaller companies can sometimes struggle to do them, or to do them well. Young hopes Mortar can establish an open source reference architecture of sorts that makes it easy to implement everything from building data pipelines to the actual algorithms that power recommendations.</p>
<p>&#8220;They&#8217;re really common and they&#8217;re really useful, but they&#8217;re really hard,&#8221; he said. &#8220;That&#8217;s why [a reference implementation] hasn&#8217;t been done before.&#8221;</p>
<div id="attachment_643436" class="wp-caption aligncenter" style="width: 718px"><a href="http://gigaom2.files.wordpress.com/2013/05/gernres-support-1.jpg"><img  alt="They can get pretty complex, as evidence by this Netflix example." src="http://gigaom2.files.wordpress.com/2013/05/gernres-support-1.jpg?w=708&#038;h=358" width="708" height="358" class="size-large wp-image-643436" /></a><p class="wp-caption-text">They can get pretty complex, as evidence by this Netflix example.</p></div>
<p>Presently, Young explained, anyone wanting to build a recommendation system probably knows some of the algorithms to begin with and then gets to work researching how to implement them with specific processing frameworks (e.g., MapReduce) and on their specific data. Alternatively, they might have to hire a consultant that helps them build the recommendation engine. Either way, he noted, they&#8217;re probably not open sourcing it at the end because it&#8217;s presumed too valuable a competitive edge.</p>
<p>Mortar Data&#8217;s recommendation framework will be based on Pig, Python and Java, <a href="http://gigaom.com/2012/11/28/mortar-data-wants-to-become-a-hadoop-developers-best-friend/">just like the company&#8217;s flagship platform</a> for creating Hadoop jobs. Those languages will make the implementation more accessible and customizable by more people, Young said.</p>
<p>Really, he added, any web site or service that has multiple customers and deals with multiple entities &#8212; be they restaurants, songs, dating profiles, artisan necklaces, what have you &#8212; should have some sort of recommendation engine to help provide a more-intelligent customer experience. &#8220;It should become so ubiquitous that any service you go to knows enough about you to put forward the things you actually want to see,&#8221; Young said.</p>
<p>There is, however, one catch to Mortar&#8217;s plans as they stand: Because the service is hosted on Amazon Web Services, anyone interested in having Mason, Conway, Shron and Mortar work on their systems must have their data in AWS or be able to move it there. The initial reference implementation will likely be AWS-centric, too, but Young hopes contributors will use it and share methods for running it atop other platforms.</p>
<p><em>Feature image of Hilary Mason at Structure: Data 2011 courtesy of Pinar Ozger (www.pinarozger.com).</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643353&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=292819"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=292819" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643353+why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=150" medium="image">
			<media:title type="html">hilarymason</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/gernres-support-1.jpg?w=708" medium="image">
			<media:title type="html">They can get pretty complex, as evidence by this Netflix example.</media:title>
		</media:content>
	</item>
		<item>
		<title>How EMC&#8217;s CTO is trying to keep EMC, VMware and Pivotal orbiting the same sun</title>
		<link>http://gigaom.com/2013/05/07/how-emcs-cto-is-trying-to-keep-emc-vmware-and-pivotal-orbiting-the-same-sun/</link>
		<comments>http://gigaom.com/2013/05/07/how-emcs-cto-is-trying-to-keep-emc-vmware-and-pivotal-orbiting-the-same-sun/#comments</comments>
		<pubDate>Wed, 08 May 2013 01:17:09 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Pivotal]]></category>
		<category><![CDATA[software-defined data center]]></category>
		<category><![CDATA[storage]]></category>
		<category><![CDATA[VMWare]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=643152</guid>
		<description><![CDATA[EMC CTO John Roese has a tough, but important job trying to keep EMC, VMware and Pivotal all moving in the same direction. While the three are separate companies, their fates are also very much aligned.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643152&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>If you&#8217;re confused about all the action with EMC, VMware and Pivotal over the past several months, you&#8217;re not alone. CEOs <a href="http://gigaom.com/2012/07/17/maritz-is-out-as-vmware-ceo-but-takes-strategic-role-at-emc/">have traded places,</a> joint ventures <a href="http://gigaom.com/2013/03/13/the-pivotal-initiative-in-case-you-were-wondering-is-now-official/">have been struck</a>, product lines <a href="http://gigaom.com/2013/05/01/vmware-garage-sale-continues-as-it-offloads-wavemaker-to-pramati/">have been sold</a> and GE <a href="http://gigaom.com/2013/04/24/ge-to-pour-105m-into-emc-and-vmwares-pivotal-initiative/">even came on board</a>. And that&#8217;s before you even start talking about all the new technology.</p>
<p>I sat down with EMC SVP and CTO John Roese on Tuesday at the company&#8217;s annual EMC World conference to find out what&#8217;s up. Here&#8217;s what he had to say.</p>
<h2 id="on-three-companies-under-one-r">On three companies under one roof</h2>
<p>While they&#8217;re technically three separate companies, EMC is really in control. It&#8217;s the majority shareholder in VMware and owns more than 60 percent of Pivotal, its new joint venture with VMware that includes the <a href="http://gigaom.com/2013/02/25/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya/">Greenplum</a>, <a href="http://gigaom.com/2012/03/16/exclusive-emc-buys-pivotal-labs/">Pivotal Labs</a>, <a href="http://gigaom.com/2012/05/15/can-vmware-draw-developers-developers-developers/">SpringSource</a>, <a href="http://gigaom.com/2013/03/07/for-sale-from-pivotal-initiative-cloud-foundry/">Cloud Foundry</a> and <a href="http://gigaom.com/2012/04/24/vmware-buys-big-data-startup-cetas/">Cetas</a> business lines. When it comes to everyone working toward a common goal, Roese said, &#8220;The good news is that while there is independence, Joe Tucci is the chairman of all these companies.&#8221;</p>
<p>Roese calls himself the &#8220;gravitational center&#8221; of the three companies when it comes to technology. This is a reinvention of the CTO role at EMC, which used to be more of a research position. Now, he puts the stake in the ground and generally directs everyone toward it, even if they&#8217;re not all taking the same path to get there.</p>
<h2 id="on-why-pivotal-happened-and-wh">On why Pivotal happened and why it matters</h2>
<p>My takeaway from Roese&#8217;s comments on formation of Pivotal is that Greenplum is really the linchpin of the whole company. At its core, Pivotal is about building big data infrastructure <a href="http://gigaom.com/2013/03/19/the-world-is-ready-for-the-consumer-grade-enterprise/">that can handle next-generation workloads</a>, but it&#8217;s aware that broad adoption is only possible if that high technology becomes easier to consume. That means new higher-level applications, which is where SpringSource, Cloud Foundry and Pivotal Labs come into play.</p>
<p>All of this technically could have been accomplished by just selling Greenplum and Pivotal Labs (the only assets of the new company that was under the EMC umbrella) to VMware, but Roese said VMware wasn&#8217;t the right home because VMware is not so important in the places where next-generation workloads are popping up. There&#8217;s not a lot of VMware inside carriers&#8217; data centers, he acknowledged, but <a href="http://gigaom.com/2013/04/14/rackspace-wants-to-be-the-openstack-provider-to-the-stars/">there is a lot of OpenStack popping up</a>. And there&#8217;s a lot of Amazon Web Services everywhere you look.</p>
<p>&#8220;We would like the big data infrastructure to not care about that,&#8221; Roese explained. From EMC&#8217;s perspective, it doesn&#8217;t need to own the middle &#8212; the cloud operating system, if you will &#8212; if it can still engage customers at the storage and application-platform layers.</p>
<h2 id="on-keeping-independent-while-w">On keeping independent while working an &#8216;unfair advantage&#8217;</h2>
<p>Roese doesn&#8217;t think a vertically integrated approach is the best way to do business in today&#8217;s technology world, which is why EMC, VMware and Pivotal all operate independently and no one relies on another in order to work within customers&#8217; data centers. That&#8217;s why VMware <a href="http://gigaom.com/2013/03/13/vmwares-hybrid-vcloud-takes-on-amazon-kinda/">has its own cloud computing efforts</a> but Pivotal is cloud-agnostic, why EMC storage can operate with any higher-level software and why VMware doesn&#8217;t care about what&#8217;s running underneath or, usually, above it.</p>
<p>However, he added, it&#8217;s only natural the three companies seek an &#8220;unfair advantage&#8221; from the incestuous bonds they share. What he means, of course, is that they should keep a close eye on what the others are doing and work together to ensure they&#8217;re all optimized for the same types of workloads. For example, Roese said, if EMC didn&#8217;t reconsider how storage had to perform given that virtualization is the norm or that technology like Hadoop exists, it would &#8220;become suboptimal or generic.&#8221;</p>
<p>The same holds true for Pivotal and VMware. Pivotal needs to think about <a href="http://gigaom.com/2012/06/13/vmware-aims-for-hadoop-on-vms-with-serengeti-project/">how big data applications run on virtualized resources</a> differently than on big bare metal systems, as well as on flash-based arrays like what EMC is about to roll out based on its <a href="http://gigaom.com/2012/05/10/emc-goes-all-flash-buys-xtremio-for-430m/">XtremIO acquisition</a>. VMware and EMC need to think about how their <a href="http://gigaom.com/2013/03/13/vmware-to-virtualize-networks-with-software-incorporating-niciras-capabilities/">software-defined data center</a> and <a href="http://gigaom.com/2013/05/06/emc-plots-software-defined-data-center-journey-from-vipr-storage-virtualization-base/">software-defined storage</a> approaches can build off each other.</p>
<p>From EMC&#8217;s perspective, it&#8217;s easy to see why this all matters. It is at its core an information infrastructure company, but &#8220;the challenging thing with that is that it&#8217;s a moving target,&#8221; Roese said. A company like EMC can&#8217;t get by on storage arrays alone anymore, but it also can&#8217;t be dumb enough to think it can be everything to everyone and still be good at anything.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643152&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=689926"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=689926" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=643152+how-emcs-cto-is-trying-to-keep-emc-vmware-and-pivotal-orbiting-the-same-sun&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/07/how-emcs-cto-is-trying-to-keep-emc-vmware-and-pivotal-orbiting-the-same-sun/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/john_roese_225.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/john_roese_225.jpg?w=150" medium="image">
			<media:title type="html">John_Roese_225</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>Look, IBM is doing SQL on Hadoop, too</title>
		<link>http://gigaom.com/2013/05/06/look-ibm-is-doing-sql-on-hadoop-too/</link>
		<comments>http://gigaom.com/2013/05/06/look-ibm-is-doing-sql-on-hadoop-too/#comments</comments>
		<pubDate>Mon, 06 May 2013 17:37:41 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[SQL on Hadoop]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=642523</guid>
		<description><![CDATA[IBM's entrant in the SQL-on-Hadoop competition has been flying under the radar, but is available as a technology preview. Called Big SQL, it's a big deal if IBM wants to be a major player in the Hadoop space.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=642523&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Maybe this is just news to me, but IBM has a SQL-on-Hadoop product in the works called Big SQL. The company <a href="https://www.ibm.com/developerworks/community/blogs/SusanVisser/entry/introducing_the_ibm_big_sql_technology_preview1?lang=en">announced the technology preview version in March</a> (well under my radar and, from what I&#8217;ve seen, nearly everyone else&#8217;s radar), and is offering up a cloud-based demo environment for a select group of early users.</p>
<p>As a refresher, the big difference between SQL on Hadoop and the Hadoop connectors that were popular a couple years ago is that SQL-on-Hadoop products query the data where it resides &#8212; in HDFS or HBase &#8212; rather than pulling it into a relational database environment to analyze it. We have been <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">talking for months about the emergence of a large SQL-on-Hadoop market</a>, but IBM&#8217;s name was conspicuously absent from that discussion. The company has Hadoop software called BigInsights and lots of SQL expertise, so it only made sense that IBM would get into the game at some point.</p>
<p>Details on Big SQL are still pretty sparse save for a few high-level blog posts and an instructional video (embedded below), but it looks to take the standard approach, <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">as Cloudera is doing with Impala</a>, of enabling access through traditional tools via JDBC and ODBC drivers.</p>
<p>Ultimately, I think the advent of big data will <a href="http://gigaom.com/2013/05/01/precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data/">enable some new types of querying techniques</a> quite a bit different than the SQL queries we&#8217;ve come to know and love over the past couple decades. But SQL is still the language du jour and might never go away, so there&#8217;s a lot of value to be had if people can put their SQL skills to work on data stored inside Hadoop or other environments, and if companies can work toward a nirvana <a href="http://gigaom.com/2013/02/25/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya/">where all the data is stored in a single place</a> rather than across database environments.</p>
<p>That IBM got this message and got into the game isn&#8217;t surprising at all, but it is important. Lots of large companies buy IBM&#8217;s software.  If it wants them to follow it into the world of big data and Hadoop, it has to give them the tools they need to use it.</p>
<span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/DCWig4-h1F4?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=642523&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=290969"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=290969" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">How to manage big data without breaking the bank</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/06/look-ibm-is-doing-sql-on-hadoop-too/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/02/shutterstock_37622056.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/02/shutterstock_37622056.jpg?w=150" medium="image">
			<media:title type="html">sql statement</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>MapR releases M7, its commercial HBase distro</title>
		<link>http://gigaom.com/2013/05/01/mapr-releases-m7-its-commercial-hbase-distro/</link>
		<comments>http://gigaom.com/2013/05/01/mapr-releases-m7-its-commercial-hbase-distro/#comments</comments>
		<pubDate>Wed, 01 May 2013 23:21:07 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=641425</guid>
		<description><![CDATA[MapR on Wednesday released its commercial version of HBase called M7, the first such product on the market, that the company claims is bigger, faster and better than the open source version.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=641425&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>MapR didn&#8217;t miss the memo about the key to success in the Hadoop space being the creation of a data platform that can do many things. And on Wednesday, the company released its take on HBase, <a href="http://www.mapr.com/products/mapr-editions/m7-edition">called M7.</a></p>
<p>Last week, I <a href="http://gigaom.com/2013/04/22/how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream/">explained how HBase is fast becoming the star of the Hadoop ecosystem</a> because it allows users to build more real-time, almost transactional applications on top of Hadoop. True to its form with its other products, MapR has taken HBase even further with M7 by promising greater availability (99.999 percent), instant recovery, faster operations and the ability to handle 1 trillion tables in a single cluster. In open source versions of HBase, MapR VP of Marketing Jack Norris told me, the accepted table limit per cluster is several hundred.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/m7.jpg"><img  alt="m7" src="http://gigaom2.files.wordpress.com/2013/05/m7.jpg?w=300&#038;h=265" width="300" height="265" class="alignright size-medium wp-image-641471" /></a>Additionally, M7 shares a single data layer with the Hadoop file system, meaning less performance overhead and, presumably, easier management.</p>
<p>As we&#8217;re seeing with other Hadoop vendors, including Cloudera (which <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">released its Impala SQL query engine on Tuesday</a>), the Hadoop market is fast becoming one where each vendor is trying to set itself apart from the rest by building the best platform with the broadest set of capabilities. In furtherance of that mission, MapR also announced on Wednesday full-text search on its Hadoop distribution thanks to a partnership with Lucene specialist LucidWorks. It already has its own Hadoop distribution complete with proprietary code to bolster the file system and speed up MapReduce, as well as an <a href="http://gigaom.com/2012/08/17/for-fast-interactive-hadoop-queries-drill-may-be-the-answer/">open source SQL-on-Hadoop project called Drill</a> in the works.</p>
<p>MapR employees are probably sleeping a lot easier these days as a result of this platform push. Others in the Hadoop market used to talk about the fear of fragmentation and then point at MapR as the example of a company helping foment that outcome with its proprietary software. Now, however, even if everyone else is building open source products, they&#8217;re all still backing their own and largely dismissing the others.</p>
<p>I suspect the result is feature lock-in even there&#8217;s no technological lock-in, kind of <a href="http://gigaom.com/2011/03/16/how-amazon-is-following-apples-lead-to-rule-cloud-computing/">like using Amazon Web Services for cloud computing</a> and then hoping to replicate its various servies elsewhere. It might be easy enough to move your data, but impossible or very difficult to replicate those additional capabilities elsewhere. If MapR can build a better version of HBase and companies are willing to pay for it, then so be it.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=641425&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=411259"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=411259" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Cloud and data first-quarter 2013: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/01/mapr-releases-m7-its-commercial-hbase-distro/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_110961494.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_110961494.jpg?w=150" medium="image">
			<media:title type="html">Database rows</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/m7.jpg?w=300" medium="image">
			<media:title type="html">m7</media:title>
		</media:content>
	</item>
		<item>
		<title>Precog launches with a plan to simplify analytics on unstructured data</title>
		<link>http://gigaom.com/2013/05/01/precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data/</link>
		<comments>http://gigaom.com/2013/05/01/precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data/#comments</comments>
		<pubDate>Wed, 01 May 2013 18:50:32 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Precog]]></category>
		<category><![CDATA[unstructured data]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=641260</guid>
		<description><![CDATA[Analytics startup Precog is on a mission to make analytics on unstructured data as simple as possible with a new line of targeted appliances. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=641260&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.precog.com/">Precog</a>, a Boulder, Colo.-based startup that&#8217;s trying to seed the market for advanced analytics on unstructured data, is coming out of beta on Thursday with a line of appliances designed to let everyday users get started on making sense of social, web and application data. The company&#8217;s underlying technology has remained the same <a href="http://gigaom.com/2012/09/27/startup-precog-says-big-data-doesnt-need-to-be-so-complex/">since we profiled Precog in September</a>, but a journey into the world outside Silicon Valley has changed its thinking about how to market and deliver its product.</p>
<p>Put simply, Precog&#8217;s technology lets users ask questions of their unstructured data (e.g., stuff sitting in Hadoop, MongoDB or any other non-relational data store) in whatever format it was created &#8212; JSON, logfile, XML, what have you. This is different from the standard operating procedure of querying unstructured data &#8212; including <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">the current SQL-on-Hadoop craze</a> &#8212; which usually involves somehow transforming data into a format that a relational engine can read before beginning the analysis. Precog also features visualizations, charts and reports designed with these new types of data, and presumably larger datasets, in mind.</p>
<div id="attachment_641294" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/05/de-goes-and-carr.jpg"><img  alt="CEO John De Goes and COO Jeff Carr" src="http://gigaom2.files.wordpress.com/2013/05/de-goes-and-carr.jpg?w=708"   class="size-full wp-image-641294" /></a><p class="wp-caption-text">CEO John De Goes and COO Jeff Carr</p></div>
<p>However, Founder and CEO John De Goes told me, the company came to realize over the past several months that as much as what it&#8217;s doing might fall under the &#8220;data science&#8221; umbrella, that&#8217;s the wrong messaging. Outside of Silicon Valley, he said, &#8220;a lot of companies don&#8217;t have the technological sophistication to understand the whole data science thing&#8221; &#8212; they just want to know that they can ask deeper questions of the new data types they&#8217;re storing in their NoSQL databases without having to perform ETL operations on it or write a lot of complicated code.</p>
<p>And the bigger those companies are, Precog COO Jeff Carr said, the less likely they are to want a cloud service like Precog initially offered.</p>
<p>So the company took both lessons to heart and is rolling out a line of appliances (physical or virtual) that complement its flagship cloud service, each targeting specific use cases. The first three are social media, web analytics and application data, and the appliances are equipped with baked-in capabilities important to each of those fields. The social media one, for example, will feature advanced sentiment analysis and natural language processing, while the web analytics one will focus on features such as behavioral clustering.</p>
<p>Under the covers, though, each appliance still runs on the broader Precog platform, Carr noted, and someone who buys one just to get started in a specific area can pretty easily (i.e., without reaching &#8220;super-coder&#8221; status) turn it toward other data types and other types of analysis. But right now, De Goes added, no one really knows what it means to have an analytics product designed for unstructured data, so the appliance approach should make it easier for large enterprises and non-tech companies to digest.</p>
<div id="attachment_641292" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/05/precog-web.jpg"><img  alt="precog web" src="http://gigaom2.files.wordpress.com/2013/05/precog-web.jpg?w=300&#038;h=192" width="300" height="192" class="size-medium wp-image-641292" /></a><p class="wp-caption-text">An example of web analytics in Precog.</p></div>
<p>It&#8217;s a &#8220;baby steps&#8221; situation, explained Carr: &#8220;Don&#8217;t sit there and try to think about how to solve every problem all at once. Let&#8217;s try to sit there and think about data types you know you&#8217;re having problems with [now].&#8221;</p>
<p>Analyzing data in its native format has advantages beyond just omitting an extra transformation step, though, and the Precog team thinks companies will get hip to these advantages as they begin to understand the analytic aspects of non-relational databases as well as they do the operational aspects. Often times, these will be new use cases, which is why Precog considers itself more complementary to than competitive with traditional data warehouses, SQL-on-Hadoop tools and BI software.</p>
<p>One early customer is using Precog to match up résumé data &#8212; often enhanced résumé data &#8212; with job openings, which is a tricky proposition in a relational format because résumés can include so much personalized information or content that doesn&#8217;t fit into a schema at all, really. Another user, a large telco, is trying to build new data products for its customers by mashing together all sorts of internal and third-party data in numerous formats.</p>
<p>Carr compared the shift to the shift from just flat files to relational data decades ago. &#8220;It&#8217;s happening again,&#8221; he said. &#8220;It has to happen again &#8230; people are not going to abandon JSON because it does&#8217;t fit neatly inside a table.&#8221;</p>
<p>Precog is telling the right story around why unstructured analytics matters, but one has to assume there will be a major shakeout in the big data analytics space over the next few years. There are only so many new technologies companies can absorb at once &#8212; Hadoop, NoSQL, <a href="http://gigaom.com/2013/02/21/sql-is-whats-next-for-hadoop-heres-whos-doing-it/">SQL on Hadoop</a>, unstructured analytics, <a href="http://gigaom.com/2013/03/26/white-hot-bi-on-hadoop-startup-platfora-now-ga/">Platfora</a>, in-memory, stream processing, <a href="http://gigaom.com/2013/02/19/citusdb-today-sql-on-hadoop-tomorrow-the-world/">next-gen analytic databases</a>, etc. &#8212; and it&#8217;s hard to predict which messages and capabilities will win out.</p>
<p>However, unless Hadoop really does become the lone dumping ground for <em>all </em>non-operational data &#8212; regardless the source &#8212; technologies like Precog that can act as the analytics layer across numerous data stores would seem to have an advantage.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=641260&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=208825"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=208825" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641260+precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641260+precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641260+precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data&utm_content=dharrisstructure">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641260+precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data&utm_content=dharrisstructure">How to manage big data without breaking the bank</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/01/precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/precog-web.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/precog-web.jpg?w=150" medium="image">
			<media:title type="html">precog web</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/de-goes-and-carr.jpg" medium="image">
			<media:title type="html">CEO John De Goes and COO Jeff Carr</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/precog-web.jpg?w=300" medium="image">
			<media:title type="html">precog web</media:title>
		</media:content>
	</item>
		<item>
		<title>How to manage big data without breaking the bank</title>
		<link>http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/</link>
		<comments>http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/#comments</comments>
		<pubDate>Tue, 30 Apr 2013 15:55:18 +0000</pubDate>
		<dc:creator>George Gilbert</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[column store]]></category>
		<category><![CDATA[compute]]></category>
		<category><![CDATA[data archive]]></category>
		<category><![CDATA[data compression]]></category>
		<category><![CDATA[data-analytics]]></category>
		<category><![CDATA[database management systems]]></category>
		<category><![CDATA[DBMS]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[enterprise IT]]></category>
		<category><![CDATA[Exadata]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[HDFS]]></category>
		<category><![CDATA[hive]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[machine data]]></category>
		<category><![CDATA[mapreduce]]></category>
		<category><![CDATA[Nasdaq]]></category>
		<category><![CDATA[network-attached-storage]]></category>
		<category><![CDATA[networking]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[pig]]></category>
		<category><![CDATA[RainStor]]></category>
		<category><![CDATA[service-level-agreements]]></category>
		<category><![CDATA[splunk]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[storage-area-network]]></category>
		<category><![CDATA[Sybase IQ]]></category>
		<category><![CDATA[Telcos]]></category>

		<guid isPermaLink="false">http://pro.gigaom.com/?post_type=go-report&#038;p=174639/</guid>
		<description><![CDATA[In the tsunami of experimentation, investment, and deployment of systems that analyze big data, vendors have seemingly been trying approaches at two extremes—either embracing the Hadoop ecosystem or building increasingly sophisticated query capabilities into database management system (DBMS) engines.For some use cases, there appears to be room for a third approach that lies between the extremes and borrows from the best of each.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648515&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In the tsunami of experimentation, investment, and deployment of systems that analyze big data, vendors have seemingly been trying approaches at two extremes—either embracing the Hadoop ecosystem or building increasingly sophisticated query capabilities into database management system (DBMS) engines.For some use cases, there appears to be room for a third approach that lies between the extremes and borrows from the best of each.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648515&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=608881"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=608881" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648515+how-to-manage-big-data-without-breaking-the-bank&utm_content=techstrategypartners">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648515+how-to-manage-big-data-without-breaking-the-bank&utm_content=techstrategypartners">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648515+how-to-manage-big-data-without-breaking-the-bank&utm_content=techstrategypartners">2012: The Hadoop infrastructure market booms</a></li><li><a href="http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648515+how-to-manage-big-data-without-breaking-the-bank&utm_content=techstrategypartners">Sector RoadMap: SQL-on-Hadoop platforms in 2013</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/04/big-data-on-computer-image.jpg?w=150" />
		<media:content url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/04/big-data-on-computer-image.jpg?w=150" medium="image">
			<media:title type="html">big data on computer image</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/1ad3ba18460ee12eb7567dd78f23756f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">George Gilbert</media:title>
		</media:content>
	</item>
	</channel>
</rss>