<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; Platfora</title>
	<atom:link href="http://gigaom.com/tag/platfora/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Mon, 20 May 2013 13:19:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; Platfora</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Visualization is the future: 6 startups re-imagining how we consume data</title>
		<link>http://gigaom.com/2013/05/13/visualization-is-the-future-6-startups-re-imagining-how-we-consume-data/</link>
		<comments>http://gigaom.com/2013/05/13/visualization-is-the-future-6-startups-re-imagining-how-we-consume-data/#comments</comments>
		<pubDate>Mon, 13 May 2013 18:20:25 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[Ayasdi]]></category>
		<category><![CDATA[BeyondCore]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[ClearStory]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data democratization]]></category>
		<category><![CDATA[Datahero]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[Visualization]]></category>
		<category><![CDATA[Zoomdata]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=643727</guid>
		<description><![CDATA[If the big data era is really going to revolutionize our world, visualizations that let more people make sense of data will be critical. Here are six startups trying to change how we interact with and look at our data.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643727&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Although visualization is hardly the most technologically challenging part of the data-analysis puzzle, it’s arguably the most important.</p>
<p>Storage, databases, query processing and algorithms are all extremely important — heck, visualization is next to nothing without them — but in a data-driven world where is obsessed with insights, they’re just the foundational layers. They are to big data what server and network configurations are to mobile-app development on <a href="http://gigaom.com/2013/04/25/facebook-acquires-mobile-development-platform-parse/">platforms like Parse</a>. If you’re going to find out new things from massive and highly complex data sets, or <a href="http://gigaom.com/2013/04/07/we-need-a-data-democracy-not-a-benevolent-data-dictatorship/">going to give new types of people the ability to analyze even simple data</a>, the presentation of that data and the ability to create consumable presentations are critical.</p>
<p>With that in mind, here are six startups I’ve seen trying to fundamentally change the way that data is visualized. Some are highly complex under the covers, some are not and none are perfect, but they’re all doing their part to make us rethink what it means to look at data and make spreadsheets and static charts look like relics. (And this list is by no means exhaustive, so feel free to add your favorite visualization tools in the comments.) We’ll be highlighting data visualization at our design-focused RoadMap conference in San Francisco in November (<a href="http://event.gigaom.com/gigaomroadmap/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=643727+visualization-is-the-future-6-startups-re-imagining-how-we-consume-data&amp;utm_content=dharrisstructure">sign up here</a> to get first access to tickets this Summer).</p>
<h2 id="ayasdi">Ayasdi</h2>
<p>The idea of network graphs isn’t new, but <a href="http://ayasdi.com/">Ayasdi’s</a> approach to it is. Under the covers, there’s an HBase data store, a technique called <del>topographical</del> topological data analysis and hundreds of machine learning algorithms to churn through complex data sets and determine the similarity among the data points. To the end user, though, <a href="http://gigaom.com/2013/01/16/has-ayasdi-turned-machine-learning-into-a-magic-bullet/">there’s a map of the data set that looks a lot like a network graph</a> (only it’s probably not network data) highlighting clusters of related data points that analysts might want to investigate further.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/tcga.png"><img alt="tcga" src="http://gigaom2.files.wordpress.com/2013/05/tcga.png?w=708"   class="aligncenter size-full wp-image-644682"></a></p>
<h2 id="beyondcore">BeyondCORE</h2>
<p><a href="http://beyondcore.com/">BeyondCore</a> actually operates under the same basic premise as Ayasdi — show users the significant correlations so they don’t have to think of the queries that will uncover them — but it uses some different techniques to get there. It uses a different visualization method, too: BeyondCore sticks to standard charts, but actually offers the option of <a href="http://gigaom.com/2012/11/20/a-startup-asks-what-if-you-didnt-have-to-analyze-data-at-all/">having an avatar talk users through the correlations</a> the software has discovered.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/animatedbriefing.jpg"><img alt="animatedbriefing" src="http://gigaom2.files.wordpress.com/2013/05/animatedbriefing.jpg?w=708"   class="aligncenter size-full wp-image-644685"></a></p>
<h2 id="clearstory">ClearStory</h2>
<p><a href="http://www.clearstorydata.com/">ClearStory</a> has a pretty unique product in the works — even if it’s keeping many details and all of its screenshots under lock and key until its formally launches. Essentially, though, <a href="http://gigaom.com/2012/12/05/clearstory-data-raises-9m-and-might-actually-make-data-your-friend/">it’s trying to tell stories via visualizations</a> that display mashups of numerous data sources, update automatically when the source data changes, and invoke collaboration and social concepts. Here’s Co-founder and CEO Sharmila Mulligan explaining the idea behind ClearStory at Structure: Data in March.</p>
<span class="embed-youtube" style="text-align:center; display: block;"><iframe class="youtube-player" type="text/html" width="604" height="370" src="http://www.youtube.com/embed/O62VVrKD1NE?version=3&amp;rel=1&amp;fs=1&amp;showsearch=0&amp;showinfo=1&amp;iv_load_policy=1&amp;wmode=transparent" frameborder="0"></iframe></span>
<h2 id="datahero">Datahero</h2>
<p>Unlike so many data startups, <a href="http://www.datahero.com/">Datahero</a> isn’t trying to woo people fed up with business-intelligence software or the difficulties of getting insights from Hadoop data. Rather, it’s <a href="http://gigaom.com/2013/04/23/visualization-startup-datahero-opens-its-doors-and-delivers-data-analysis-for-the-masses/">trying to let people with simple business or personal data make simple charts</a> without ever having to enter an Excel function or worry too much about how their spreadsheets are formatted. Early on, Datahero’s visualizations are still pretty commonplace (bars, pies, plots, etc.), but it’s the ease of creating them that’s so unique.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/dh-10-e1366704037117.jpg"><img alt="dh-10-e1366704037117" src="http://gigaom2.files.wordpress.com/2013/05/dh-10-e1366704037117.jpg?w=708&#038;h=402" width="708" height="402" class="aligncenter size-full wp-image-644697"></a></p>
<h2 id="platfora">Platfora</h2>
<p><a href="http://platfora.com/">Platfora</a> has undertaken the ambitious task of trying to make analyzing mountains of data stored in Hadoop clusters as easy as analyzing their own <a href="https://stripe.com/">Stripe</a> data might be for developers using Datahero. It’s <a href="http://gigaom.com/2012/10/23/platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data/">based on a foundation of Hadoop and massively parallel query processing</a>, but is presented like an HTML5 version of <a href="http://gigaom.com/2013/04/03/a-tableau-ipo-could-validate-the-big-data-visualization-push-or-not/">current visualization golden boy Tableau</a> that’s all about dragging, dropping, and visually slicing and dicing through data. The latter capability is actually critical in a big data world where there are likely more data points than you can ever digest at once.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/explore_slide_4.jpg"><img alt="explore_slide_4" src="http://gigaom2.files.wordpress.com/2013/05/explore_slide_4.jpg?w=708&#038;h=375" width="708" height="375" class="aligncenter size-large wp-image-644705"></a></p>
<h2 id="zoomdata">Zoomdata</h2>
<p><a href="http://www.zoomdata.com/">Zoomdata</a> is far from the only analytics company to support mobile devices, but it’s one of the few I know of (<a href="http://www.roambi.com/analytics-overview.html">Roambi</a> also comes to mind) designed primarily for them. Zoomdata connects to standard business data sources, but takes advantage of touch screens and the D3.js visualization project to offer up some visually interesting charts that are <a href="http://gigaom.com/2012/11/13/heres-how-it-looks-when-big-data-goes-mobile-first/">designed to be manipulated like an artist’s palette</a>.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/ticketstatus_101812.jpg"><img alt="ticketstatus_101812" src="http://gigaom2.files.wordpress.com/2013/05/ticketstatus_101812.jpg?w=708&#038;h=531" width="708" height="531" class="aligncenter size-full wp-image-644709"></a></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643727&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=993334"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=993334" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643727+visualization-is-the-future-6-startups-re-imagining-how-we-consume-data&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643727+visualization-is-the-future-6-startups-re-imagining-how-we-consume-data&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/07/cloud-computing-and-trickle-down-analytics/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643727+visualization-is-the-future-6-startups-re-imagining-how-we-consume-data&utm_content=dharrisstructure">Cloud computing and trickle-down analytics</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643727+visualization-is-the-future-6-startups-re-imagining-how-we-consume-data&utm_content=dharrisstructure">A near-term outlook for big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/13/visualization-is-the-future-6-startups-re-imagining-how-we-consume-data/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/tcga.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/tcga.png?w=150" medium="image">
			<media:title type="html">tcga</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/tcga.png" medium="image">
			<media:title type="html">tcga</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/animatedbriefing.jpg" medium="image">
			<media:title type="html">animatedbriefing</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/dh-10-e1366704037117.jpg" medium="image">
			<media:title type="html">dh-10-e1366704037117</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/explore_slide_4.jpg?w=708" medium="image">
			<media:title type="html">explore_slide_4</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/ticketstatus_101812.jpg" medium="image">
			<media:title type="html">ticketstatus_101812</media:title>
		</media:content>
	</item>
		<item>
		<title>White-hot BI-on-Hadoop startup Platfora now GA</title>
		<link>http://gigaom.com/2013/03/26/white-hot-bi-on-hadoop-startup-platfora-now-ga/</link>
		<comments>http://gigaom.com/2013/03/26/white-hot-bi-on-hadoop-startup-platfora-now-ga/#comments</comments>
		<pubDate>Tue, 26 Mar 2013 13:01:31 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[BI]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[SQL on Hadop]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=624260</guid>
		<description><![CDATA[Platfora, the San Mateo, Calif.-based startup that helped spur a general rethinking of business intelligence for a big data world, is finally exiting its beta period and is generally available. It&#8217;s no wonder the company has garnered so much attention given its stated mission to make [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=624260&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://platfora.com/">Platfora</a>, the San Mateo, Calif.-based startup that helped spur a general rethinking of business intelligence for a big data world, is finally exiting its beta period and is generally available. It&#8217;s no wonder the company has garnered so much attention given its stated mission to make Hadoop an interactive experience and to disrupt a multi-billion-dollar data warehouse and BI market.</p>
<p>Unlike legacy BI applications that generally connect to Hadoop but otherwise retain their old-school performance limitations, <a href="http://gigaom.com/2013/03/21/if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue/">Platfora and its ilk have big data at their core</a>. Platfora is built on Hadoop for scale, but the company also has its own IP around in-memory processing to improve the speed of slicing and dicing through data, and its HTML5 interface provides an easy way to navigate through lots of data points.</p>
<div id="attachment_622973" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/03/2jdr1lkkt5mzdquv3eohac6hpes7cwccwjhxtsbri0g.jpg"><img  alt="Justin Borgman Hadapt Tomer Shiran MapR Technologies Ashish Thusoo Qubole Ben Werther Platfora Structure Data 2013" src="http://gigaom2.files.wordpress.com/2013/03/2jdr1lkkt5mzdquv3eohac6hpes7cwccwjhxtsbri0g.jpg?w=300&#038;h=199" width="300" height="199" class="size-medium wp-image-622973" /></a><p class="wp-caption-text">Werther (far left) talking SQL on Hadoop at Structure: Data 2013 along with representatives from Qubole, MapR, Hadapt and Facebook.</p></div>
<p>I&#8217;ve compared this general family of products &#8212; in which I&#8217;d also include <a href="http://gigaom.com/2012/12/05/clearstory-data-raises-9m-and-might-actually-make-data-your-friend/">ClearStory</a>, <a href="http://gigaom.com/2012/09/27/startup-precog-says-big-data-doesnt-need-to-be-so-complex/">Precog</a>, <a href="http://gigaom.com/2012/10/30/as-sandy-strikes-another-big-data-opportunity-emerges/">SiSense</a> and <a href="http://gigaom.com/2012/10/17/batten-down-the-analysts-its-a-big-data-bi-storm/">Birst</a>, among others I&#8217;m sure &#8212; to Tableau, albeit slightly (sometimes significantly) rethought and then jacked up on steroids to handle big data scale and/or speed. The big difference with Platfora, though, is that it&#8217;s built on top of Hadoop and is therefore <a href="http://gigaom.com/2013/03/05/the-hadoop-ecosystem-the-welcome-elephant-in-the-room-infographic/">part of an even bigger movement around that open source platform</a> and <a href="http://gigaom.com/2013/02/21/sql-is-whats-next-for-hadoop-heres-whos-doing-it/">a quest to build native SQL queries</a> into a system designed for MapReduce.</p>
<p>We have been covering Platfora since its inception, <a href="http://gigaom.com/2011/06/30/stealth-startup-platfora-wants-to-do-hadoop-for-the-rest-of-us/">from stealth mode</a> to <a href="http://gigaom.com/2012/10/23/platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data/">launch</a>, and then <a href="http://gigaom.com/2012/11/13/plotting-a-bi-coup-hadoop-startup-platfora-raises-20m/">a whopping $20 million VC investment</a> in November.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=624260&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=188164"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=188164" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624260+white-hot-bi-on-hadoop-startup-platfora-now-ga&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624260+white-hot-bi-on-hadoop-startup-platfora-now-ga&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624260+white-hot-bi-on-hadoop-startup-platfora-now-ga&utm_content=dharrisstructure">2012: The Hadoop infrastructure market booms</a></li><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624260+white-hot-bi-on-hadoop-startup-platfora-now-ga&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/26/white-hot-bi-on-hadoop-startup-platfora-now-ga/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/segmentation.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/segmentation.jpg?w=150" medium="image">
			<media:title type="html">Segmentation</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/03/2jdr1lkkt5mzdquv3eohac6hpes7cwccwjhxtsbri0g.jpg?w=300" medium="image">
			<media:title type="html">Justin Borgman Hadapt Tomer Shiran MapR Technologies Ashish Thusoo Qubole Ben Werther Platfora Structure Data 2013</media:title>
		</media:content>
	</item>
		<item>
		<title>If the future of BI is Hadoop, SQL and the cloud are the glue</title>
		<link>http://gigaom.com/2013/03/21/if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue/</link>
		<comments>http://gigaom.com/2013/03/21/if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue/#comments</comments>
		<pubDate>Thu, 21 Mar 2013 18:18:10 +0000</pubDate>
		<dc:creator>Kevin C. Tofel</dc:creator>
				<category><![CDATA[Ashish Thusoo]]></category>
		<category><![CDATA[Ben Werther]]></category>
		<category><![CDATA[Hadapt]]></category>
		<category><![CDATA[Justin Borgman]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[Qubole]]></category>
		<category><![CDATA[Ravi Murthy]]></category>
		<category><![CDATA[Structure Data 2013]]></category>
		<category><![CDATA[Tomer Shiran]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=622888</guid>
		<description><![CDATA[There's no doubt that Hadoop is the data tool of the present and future, but more can be done to make it really shine for business intelligence.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622888&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Starting with the well-known quote — “A good way to predict the future is to invent it” – Ravi Murthy, engineering manager at Facebook, kicked off an interesting panel discussion at <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=622888+if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue&amp;utm_content=kevintofel">GigaOM Structure:Data 2013</a> Thursday with four industry experts on business intelligence (BI) and Hadoop. Hadoop has a big place in that future, but not by itself. The conclusion? Applications and SQL databases built atop Hadoop are needed for better BI, noted the panel.</p>
<p>“Why are so many systems being built in the BI landscape? If Hadoop can deliver the promise, why have all these other solutions?” asked Murthy.</p>
<p>Ashish Thusoo, co-Founder and CEO at <a href="http://www.qubole.com/">Qubole,</a> said that putting SQL on top of Hadoop just makes sense. “As a system, Hadoop is not a low-latency system, opening the need for faster SQL-based systems to query the data. And there’s probably only space for half-dozen of these solutions in the market; not dozens.”</p>
<p>Agreeing with Thusoo was Tomer Shiran, director, product management at <a href="http://www.mapr.com/">MapR Technologies</a>. “With our open source Apache Drill we’re enabling lots of differing BI use cases allowing companies to do different things with Hadoop. One use case is ability to interactively query and explore data.” Apache Drill is an interactive, low-latency SQL way to get at the data reservoir in Hadoop. Ben Werther, founder and CEO, <a href="http://www.platfora.com/">Platfora</a> completely agreed, saying that customers looking for much more agile approaches to data exploration without building more IT work.</p>
<p>But Hadoop is still an important underlying part of the puzzle. Justin Borgman, CEO, <a href="http://hadapt.com/">Hadapt</a> noted that “Hadoop scales so cost effectively; it’s a landfill where you can dump everything. That opens up new opportunities to explore that data including indexing to boost performance and interactivity across a broader data set.”</p>
<p>When asked for a use case of the benefits, Werther pointed out an unnamed customer. “They had 50 analysts working against SQL stores in a very siloed fashion. We moved them to a Hadoop-based stack and built a data reservoir. Only 5 of the 50 were able to be productive before. Within a week, all 50 became productive.”</p>
<p>Of course, the cloud is also part of BI’s future, although it’s not without risks. Sure, running Hadoop in the cloud is very elastic so that you can use as many resources as you need in near real-time. But the issues of security and data gravity in particular are worth noting: Generating data in the cloud could make it tough to move out in the future and may require more apps build on this data to also be in the cloud.</p>
<p>Check out <a href="http://gigaom.com/2013/03/20/structuredata-2013-live-coverage/">the rest of our Structure:Data 2013 live coverage here</a>, and a video embed of the session follows below:</p>
<p><span class="embed-youtube" style="text-align:center; display: block;"><iframe class="youtube-player" type="text/html" width="560" height="315" src="http://www.youtube.com/embed/neo6TE41I8I?version=3&amp;rel=1&amp;fs=1&amp;showsearch=0&amp;showinfo=1&amp;iv_load_policy=1&amp;wmode=transparent" frameborder="0"></iframe></span><br>
A transcription of the video follows on the next page</p>
<p><a href="http://gigaom.com/2013/03/21/if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue/2/">Go to page 2 (of 2) on GigaOM .</a></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622888&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=561923"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=561923" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622888+if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue&utm_content=kevintofel">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622888+if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue&utm_content=kevintofel">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/03/why-service-providers-matter-for-the-future-of-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622888+if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue&utm_content=kevintofel">Why service providers matter for the future of big data</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622888+if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue&utm_content=kevintofel">A near-term outlook for big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/21/if-the-future-of-bi-is-hadoop-sql-and-the-cloud-are-the-glue/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/03/2jdr1lkkt5mzdquv3eohac6hpes7cwccwjhxtsbri0g.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/03/2jdr1lkkt5mzdquv3eohac6hpes7cwccwjhxtsbri0g.jpg?w=150" medium="image">
			<media:title type="html">Justin Borgman Hadapt Tomer Shiran MapR Technologies Ashish Thusoo Qubole Ben Werther Platfora Structure Data 2013</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6cbb45abac59965c2626e40155358d1b?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">Kevin C. Tofel</media:title>
		</media:content>
	</item>
		<item>
		<title>Plotting a BI coup, Hadoop startup Platfora raises $20M</title>
		<link>http://gigaom.com/2012/11/13/plotting-a-bi-coup-hadoop-startup-platfora-raises-20m/</link>
		<comments>http://gigaom.com/2012/11/13/plotting-a-bi-coup-hadoop-startup-platfora-raises-20m/#comments</comments>
		<pubDate>Tue, 13 Nov 2012 12:30:41 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Platfora]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=583875</guid>
		<description><![CDATA[Less than a month after unveiling its Hadoop analytics engine to great fanfare, Platfora has raised $20 million to realize its mission of displacing the business intelligence and data warehouse incumbents. It'll need every dime: although the competition is old, it's also rich, entrenched and determined.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=583875&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.platfora.com/">Platfora</a>, the white-hot San Mateo, Calif., startup that has built a next-generation business intelligence engine on top of Hadoop, has raised a $20 million Series B round from Battery Ventures along with Andreessen Horowitz and Sutter Hill Ventures. The new investment brings Platfora&#8217;s total funding to $25.7 million just over a year <a href="http://gigaom.com/cloud/platfora-gets-5-7m-to-make-hadoop-mainstream/">after exiting stealth mode</a>.</p>
<p>Platfora finally <a href="http://gigaom.com/data/platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data/">took the wraps off its product</a> in October. The product combines Hadoop scale, in-memory speed and HTML5 flexibility, resulting in something that shows Hadoop&#8217;s promise as a technology that can be accessible by everyday business users and that <a href="http://gigaom.com/data/5-trends-that-are-changing-how-we-do-big-data/">can do more than perform batch processing</a>. Essentially, says Founder and CEO Ben Werther, it&#8217;s a better way to do data warehousing and business intelligence than what legacy technologies can provide.</p>
<p>Platfora&#8217;s focus on disrupting the BI incumbents is actually somewhat unique in the Hadoop space, where most well-funded companies provide the core Hadoop software and actually partner with those legacy vendors in order to position themselves as complementary technologies. Even Impala, <a href="http://gigaom.com/data/cloudera-makes-sql-a-first-class-citizen-in-hadoop/">Cloudera&#8217;s new interactive query engine</a>, is still too slow for true interactivity like what Platfora can provide, Werther said, and just &#8220;seems like a lifeline&#8221; for the incumbent companies who are now &#8220;on the wrong side of history.&#8221;</p>
<div id="attachment_583899" class="wp-caption aligncenter" style="width: 614px"><a href="http://gigaom2.files.wordpress.com/2012/11/explore_slide_5.jpg"><img  title="explore_slide_5" alt="" src="http://gigaom2.files.wordpress.com/2012/11/explore_slide_5.jpg?w=708"   class="size-full wp-image-583899" /></a><p class="wp-caption-text">A screenshot of Platfora in action.</p></div>
<p>If Werther sounds overconfident, maybe he is. Or maybe Platfora really has struck a nerve with the IT buyers and data analysts whose devotion the company will require if it&#8217;s going to live up to Werther&#8217;s plans of becoming a billion-dollar public company. Platfora has several dozen customers queued up that it just hasn&#8217;t had time to engage with yet, he said, and he and others have received standing ovations after presenting the technology to potential users.</p>
<p>Investors seem to get it, too &#8212; a good thing considering Platfora&#8217;s big plans. Werther said this funding round went from earnest discussions to signed term sheets in three weeks and was generally a stress-free experience. That probably has something to do with the consensus the company has seen among venture capitalists, who project Hadoop will take about 20 percent of a $30 billion legacy BI market and are looking for the startups with the vision to win that business.</p>
<p>However, although Platfora&#8217;s legacy BI competition might be many things, they&#8217;re not underfunded. &#8220;We&#8217;re going to be going effectively head to head with some very large incumbents,&#8221; Werther said. And it takes the kind of money Platfora is raising in order to build a company that scale to compete with the like of <a href="http://gigaom.com/2011/10/02/oracle-exalytics-attacks-big-data-analytics/">Oracle</a> or <a href="http://gigaom.com/data/batten-down-the-analysts-its-a-big-data-bi-storm/">Teradata</a> in terms of developing its product, spreading its message and supporting its customers. If those companies want a piece of the big data market &#8212; and they do &#8212; perhaps we shouldn&#8217;t be so fast to dismiss their chances of getting it.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=583875&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=430495"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=430495" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=583875+plotting-a-bi-coup-hadoop-startup-platfora-raises-20m&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=583875+plotting-a-bi-coup-hadoop-startup-platfora-raises-20m&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=583875+plotting-a-bi-coup-hadoop-startup-platfora-raises-20m&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=583875+plotting-a-bi-coup-hadoop-startup-platfora-raises-20m&utm_content=dharrisstructure">A near-term outlook for big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/11/13/plotting-a-bi-coup-hadoop-startup-platfora-raises-20m/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/11/ben_werther_platfora-e1350965445298.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/11/ben_werther_platfora-e1350965445298.jpg?w=150" medium="image">
			<media:title type="html">ben_werther_platfora-e1350965445298</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/11/explore_slide_5.jpg" medium="image">
			<media:title type="html">explore_slide_5</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloudera makes SQL a first-class citizen in Hadoop</title>
		<link>http://gigaom.com/2012/10/24/cloudera-makes-sql-a-first-class-citizen-in-hadoop/</link>
		<comments>http://gigaom.com/2012/10/24/cloudera-makes-sql-a-first-class-citizen-in-hadoop/#comments</comments>
		<pubDate>Wed, 24 Oct 2012 13:00:52 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytic database]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Drill]]></category>
		<category><![CDATA[Hadapt]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hortonworks]]></category>
		<category><![CDATA[Impala]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[SQL]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=576626</guid>
		<description><![CDATA[Cloudera has joined the fray of Hadoop companies trying to turn the big data platform into an engine for exploring data interactively using standard SQL. As the biggest company in the space, its new technology called Impala could go a long way toward changing Hadoop's image.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=576626&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Not content to watch its competitors leave it in the dust, veteran big data startup Cloudera is fundamentally changing the face of its flagship Hadoop distribution into something much more appealing. The company has developed a real-time <a href="http://www.cloudera.com/content/cloudera/en/products/cloudera-enterprise-core/cloudera-enterprise-RTQ.html">SQL query engine called Impala</a> that will sit aside MapReduce as a native processing option within Cloudera&#8217;s version of Hadoop. Cloudera is biggest and most well-known Hadoop vendor around, so opening its platform up to the wide world of SQL-trained data analysts is a really big deal &#8212; even if Cloudera is a bit late to the SQL party.</p>
<h2>From batch processing to data interaction</h2>
<p>The business world regularly laments the circumstances that spurred Impala&#8217;s creation. I summed them up last week and again yesterday when reporting similar products <a href="http://gigaom.com/data/hadapt-does-big-love-for-big-data-and-hints-at-hadoops-future/">from startups Hadapt</a> and <a href="http://gigaom.com/data/platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data/">Platfora</a>, but the gist is that although Hadoop is more scalable and more flexible than traditional data warehouses or analytic databases, it&#8217;s also slower, harder to learn and designed for batch processing an entire data set rather than interactively querying a data set. Until now, the common methods for querying Hadoop were to <a href="http://hive.apache.org/">use a custom-built language such as Hive</a>, or to transport data to a data warehouse from Hadoop and then analyze it using traditional business intelligence software.</p>
<p>However, Cloudera&#8217;s Cloud VP of Products Charles Zedlewski was quick to point out during a recent conversation that Impala isn&#8217;t a replacement for other BI tools, just a new data source into which they can connect. If anything, it&#8217;s a replacement for Hive, which Facebook built to bring data warehouse capabilities to Hadoop, but which wasn&#8217;t really developed for public consumption as a software product. For the sake of uniformity, Impala actually uses the same SQL set as Hive, but is on average 10 times faster thanks to its purpose-built query engine that foregoes reliance on MapReduce. Small queries, Zedlewski said, can run in less than a second.</p>
<p>Impala has been in the making for almost two years, and Cloudera &#8220;took a a lot of pains to stitch this really well in with the rest of the Hadoop stack,&#8221; Zedlewski said. Users still store data in the Hadoop Distributed File System of the HBase database, and they can still store whatever types of structured, semi-structured on unstructured data they please. Impala uses the same metadata as the other Hadoop components, the same drivers and &#8212; like almost everything else in the Hadoop world &#8212; is open source under the Apache Software Foundation license.</p>
<p>Unlike some other Hadoop startups, though, Cloudera isn&#8217;t interested in selling BI or other analytic applications. Impala (which is called Real-Time Query for customers who pay for support) is the execution engine, but it still relies on software from Cloudera partners such as <a href="http://gigaom.com/cloud/thanks-to-consumerization-its-ipo-season-in-analytics/">Tableau, QlikTech</a> and MicroStrategy in order to ask questions and visualize the results. &#8220;We&#8217;re sticking to our knitting as a platform vendor,&#8221; said Zedlewski, echoing a sentiment on which his boss, Cloudera CEO Mike Olson, <a href="http://gigaom.com/2012/03/21/cloudera-structure-data-2012/">has been bullish for years</a>.</p>
<p style="text-align: center;"><a href="http://gigaom2.files.wordpress.com/2012/10/impala.jpg"><img  title="impala" alt="" src="http://gigaom2.files.wordpress.com/2012/10/impala.jpg?w=708"   class="size-full wp-image-576688 aligncenter" /></a></p>
<h2>Different strokes move the world</h2>
<p>I can&#8217;t underscore enough how critical all of this innovation is for Hadoop, which in order to add substance to its unparalleled hype needed to become far more useful to far more users. But the sudden shift from Hadoop as a batch-processing engine built on MapReduce into an ad hoc SQL querying engine might leave industry analysts and even Hadoop users scratching their heads.</p>
<p>Cloudera, now with more than 300 employees and annual revenue rumored to be in hundreds of millions, is the 800-pound gorilla in the Hadoop market, and its implementation of Impala has to make it look even better for prospective customers. But Cloudera doesn&#8217;t have this space to itself. Assuming your goal is to use Hadoop as the platform for running SQL queries (as opposed to, for example, <a href="http://gigaom.com/data/metamarkets-open-sources-druid-its-in-memory-database/">using it for ETL before putting it in an in-memory system</a>), there are plenty of choices on the table. And everyone&#8217;s approach is different.</p>
<p>For starters, bitter distribution-level rival MapR announced in August that it&#8217;s <a href="http://gigaom.com/cloud/for-fast-interactive-hadoop-queries-drill-may-be-the-answer/">leading an open source project called Drill</a> that provides essentially the same functionality as Impala. MapR is <a href="http://gigaom.com/cloud/amazon-taps-mapr-for-high-powered-elastic-mapreduce/">getting a lot of love from Hadoop users right now</a>, and a future implementation of Drill into its product lineup would add even more legitimacy. Not wanting to cede the innovation edge to Cloudera of MapR, one has to suspect <a href="http://gigaom.com/cloud/hortonworks-teams-with-vmware-to-keep-hadoop-running/">Yahoo spinoff Hortonworks</a> will also get into the query engine game at some point. (We&#8217;ll leave the debate over whether the myriad different flavors of Hadoop constitute the beginning of a community fracture for another day.)</p>
<p>Like Cloudera, however, if MapR and Hortonworks decide to integrate query engines in their products, they&#8217;ll likely rely on application providers to deliver the user experience on top. For better or worse, that presently means reliance on legacy vendors until startups can get familiar with the source code and start building BI products designed to take advantage of the new capabilities. When asked about Impala as a technology for disrupting the traditional data warehouse market, Cloudera&#8217;s Zedlewski noted that existing products are often very good at what they do.</p>
<p>&#8220;I think it&#8217;s highly unlikely that something like Impala would really be considered an alternative of that,&#8221; he said. Those vendors don&#8217;t seem to think so either, as companies like Teradata and EMC Greenplum (e emc) are <a href="http://gigaom.com/cloud/emc-throws-lots-of-hardware-at-hadoop/">telling always-improving stories</a> about integrating their existing product lines with Hadoop.</p>
<div id="attachment_576706" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2012/10/2-drill_down-11.jpg"><img  title="2-drill_down-1" alt="" src="http://gigaom2.files.wordpress.com/2012/10/2-drill_down-11.jpg?w=300&#038;h=144" height="144" width="300" class="size-medium wp-image-576706" /></a><p class="wp-caption-text">Running a sentiment analysis in Tableau with Hadapt</p></div>
<p>On the other end of the spectrum are startups such as Hadapt, Platfora and <a href="http://gigaom.com/data/batten-down-the-analysts-its-a-big-data-bi-storm/">Birst</a>, which have built Hadoop-based query engines on their own, independent of loyalty to any particular Hadoop distribution. These companies have a lot of smart people on board, and their technologies are for real. Platfora CEO Ben Werther, in particular, makes no bones about his goal of unseating the BI incumbents with analytics applications built from the ground up to analyze big data stored in Hadoop.</p>
<p>Similar, although not necessarily competitive, technologies include <a href="http://gigaom.com/cloud/how-one-startup-wants-to-inject-hadoop-into-your-sql/">Spire (from Drawn to Scale)</a> and <a href="http://www.splicemachine.com">Splice Machine</a>. Both support some level of SQL querying and/or BI integration, although their real value comes in leveraging HBase to provide transactional capabilities that analytic databases aren&#8217;t designed to do.</p>
<p>Even though all these choices and approaches might add to the confusion over how to use Hadoop and which products to choose, the result is a net gain for Hadoop <a href="http://gigaom.com/cloud/the-state-of-hadoop-strong-and-poised-to-explode/">as the de facto platform for big data environments</a> even in the face of some alternative approaches. It has changed from a batch system to an interactive query engine pretty much overnight, so although he wouldn&#8217;t comment on the competition, Zedlewski wasn&#8217;t just blowing vendor smoke when told me, &#8220;I would argue Impala is a proof point that Hadoop as a platform has an ability to grow that no other data management platform has.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=576626&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=788560"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=788560" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=576626+cloudera-makes-sql-a-first-class-citizen-in-hadoop&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=576626+cloudera-makes-sql-a-first-class-citizen-in-hadoop&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/03/why-service-providers-matter-for-the-future-of-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=576626+cloudera-makes-sql-a-first-class-citizen-in-hadoop&utm_content=dharrisstructure">Why service providers matter for the future of big data</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=576626+cloudera-makes-sql-a-first-class-citizen-in-hadoop&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/10/24/cloudera-makes-sql-a-first-class-citizen-in-hadoop/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/impala1-e1351083747709.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/impala1-e1351083747709.jpg?w=150" medium="image">
			<media:title type="html">impala</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/impala.jpg" medium="image">
			<media:title type="html">impala</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/2-drill_down-11.jpg?w=300" medium="image">
			<media:title type="html">2-drill_down-1</media:title>
		</media:content>
	</item>
		<item>
		<title>Platfora shows a whole new way to do business intelligence on big data</title>
		<link>http://gigaom.com/2012/10/23/platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data/</link>
		<comments>http://gigaom.com/2012/10/23/platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data/#comments</comments>
		<pubDate>Tue, 23 Oct 2012 11:30:07 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[tableau]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=575975</guid>
		<description><![CDATA[Analytics startup Platfora is finally showing off its next-generation business intelligence software to the world, combining Hadoop, in-memory processing and HTML5 into an impressive product. It's entering a competitive market full of large incumbents and other innovative startups all trying to change how we do BI.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=575975&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://platfora.com">Platfora</a> Founder and CEO Ben Werther doesn’t want to make Hadoop less painful to use; he wants Hadoop to underpin a whole new way of analyzing and visualizing business data. Platfora’s flagship product, which it unveiled publicly for the first time on Tuesday after <a href="http://gigaom.com/cloud/platfora-gets-5-7m-to-make-hadoop-mainstream/">launching in September 2011</a>, looks to do just that by turning the business intelligence experience on its head. It’s <a href="http://pro.gigaom.com/2012/07/cloud-computing-and-trickle-down-analytics/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=575975+platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data&amp;utm_content=dharrisstructure">part of a bigger trend toward democratizing data analysis</a>, but with an emphasis on scale.</p>
<div id="attachment_576119" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2012/10/ben_werther_platfora.jpg"><img title="Ben_Werther_Platfora" alt="" src="http://gigaom2.files.wordpress.com/2012/10/ben_werther_platfora-e1350965445298.jpg?w=300&#038;h=282" height="282" width="300" class="size-medium wp-image-576119"></a><p class="wp-caption-text">Ben Werther</p></div>
<p>Werther, who spent time at DataStax and Greenplum before starting Platfora, describes the current state of business intelligence as being like a double-edged sword. Traditional data warehouses are mature, he explained, but limited because strict processes for adding data and changing schema essentially force analysts to “live inside the constraints of whatever’s available to them.” Hadoop, on the other hand, is irresistible but flawed — it stores lots of raw data without fixed schema, but tools such as Hive that try to add a familiar facade “don’t in any way make it interactive or suitable for general use.”</p>
<p>The state of affairs has Fortune 500 companies “literally screaming out for an answer,” Werther said.</p>
<h2>Big, fast data</h2>
<p>With its eponymous software product, Platfora thinks it has that answer. It uses Hadoop as a scalable data store from which users can grab data sets and manipulate which variables are shown and how that data set relates to others. Platfora calls this data-management process building a “lens.”</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/10/platfora-arch.jpg"><img title="platfora arch" alt="" src="http://gigaom2.files.wordpress.com/2012/10/platfora-arch.jpg?w=202&#038;h=300" height="300" width="202" class="alignright size-medium wp-image-576121"></a>What makes the product really hum, though, is how users can interact with data once it’s visualized into a graph. <a href="http://gigaom.com/cloud/thanks-to-consumerization-its-ipo-season-in-analytics/">Like with Tableau</a>, they can drag and drop new variables into a graph and watch it automatically account for them. However, Platfora also utilizes its own massively parallel in-memory database to store more than a terabyte of related metadata, which is the underpinning of what the company calls its “Fractal Cache” technology. It means users can change their minds about what data to include or how it’s related, and then start analyzing it anew without a hitch.</p>
<p>Based on what I saw in a demonstration, Platfora’s HTML5 canvas rendering makes the process as visually rich as it is fast in terms of processing speed. You can highlight a portion of a graph and drill down into just the data points included in that zone, and then drill down even further or just start a whole new analysis using the smaller data set as a the starting point. Using concepts from the “<a href="http://www.amazon.com/Grammar-Graphics-Statistics-Computing/dp/0387245448">grammar of graphics</a>,” Platfora VP of of Products and Marketing Peter Schlampp said users can create just about any type of visualization they can imagine.</p>
<p>Essentially, Werther said, Platfora has turned Hadoop into a sub-second interactive engine that operates much faster than any Hadoop-to-data-warehouse connector or Hive-based approach could ever hope to do. (Hive <a href="http://gigaom.com/cloud/exclusive-the-brains-behind-hive-launch-on-demand-hadoop-service/">is the SQL-like query language developed for Hadoop</a> that companies such as Facebook use to turn Hadoop into a data warehouse for unstructured data.) “At the point where you can synthesize on the fly,” he said, “[legacy BI tools] start to look like relics of another age.”</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/10/segmentation.jpg"><img title="Segmentation" alt="" src="http://gigaom2.files.wordpress.com/2012/10/segmentation.jpg?w=604&#038;h=431" height="431" width="604" class="aligncenter size-large wp-image-576120"></a></p>
<h2><em>If </em>legacy dies, who wins?</h2>
<p>As proof that its approach works, Werther points to the tens of well-known customers taking part in Platfora’s private beta and the hundreds it hopes to let in now that it has officially taken the lid off the product. One customer, he said, was able to get up and running on a petabyte of data in just four hours. Another, a large media company, is using Platfora to analyze 2.4 petabytes and more than 2 billion user records. In that deployment, Platfora replaced a BI strategy that included Hadoop, Hive, Vertica and Tableau, and that required users to request a new Vertica database every time they wanted to change the data underlying their analyses.</p>
<p>However, there are many similar data environments out there, and getting companies to make the switch to something new won’t necessarily be easy. Even when it’s relatively slow and cumbersome, a combination of Hadoop, a data warehouse and a BI tool generally serves its purpose, even if it’s an old-world stack, Werther acknowledges. And it’s not as if legacy companies such as Informatica or MicroStrategy <a href="http://gigaom.com/cloud/big-data-is-a-big-deal-and-getting-bigger-for-retailers/">are going gently into that good night </a>and just letting the big data revolution pass them by.</p>
<p>Even if companies really are looking to make a change, they might also look at more-modern alternatives <a href="http://gigaom.com/data/hadapt-does-big-love-for-big-data-and-hints-at-hadoops-future/">from vendors including Hadapt</a>, <a href="http://clearstorydata.com/">ClearStory</a>, <a href="http://gigaom.com/data/batten-down-the-analysts-its-a-big-data-bi-storm/">Birst and even Teradata</a>. Although they’re more batch-oriented and less interactive BI, established Hadoop startups such as Datameer and Karmasphere <a href="http://gigaom.com/cloud/is-2013-the-year-hadoop-uptake-turns-into-a-tornado/">are offering some impressive products as well</a>. Datameer, in particular, has put a focus on letting business users create top-notch graphics to illustrate their data, and we’ll see if <a href="http://gigaom.com/cloud/data-hero-aims-to-turn-us-all-into-analytics-stars/">young startups such as Datahero</a> decide to tackle big data in the future.</p>
<p>Despite some stiff competition, though, Werther isn’t sweating Platfora’s future. He knows his company has promise, and he knows every one of his competitors is part of a larger movement to disrupt a lucrative market for BI products that analysts <a href="http://www.informationweek.com/software/business-intelligence/if-bi-is-dead-whats-next/240003837">estimate at $12 billion a year and growing</a>. “We all win if we can accelerate Hadoop usage,” Werther said.</p>
<p>That’s true even if some companies in the Hadoop distribution space are happy propping up legacy BI vendors for as long as companies still want to buy those products. However, he cautioned, “The old guys never win, they never survive the change.”</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=575975&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=779087"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=779087" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=575975+platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/07/cloud-computing-and-trickle-down-analytics/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=575975+platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data&utm_content=dharrisstructure">Cloud computing and trickle-down analytics</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=575975+platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=575975+platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/10/23/platfora-shows-a-whole-new-way-to-do-business-intelligence-on-big-data/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/full-screen-shot-e1350966738447.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/full-screen-shot-e1350966738447.jpg?w=150" medium="image">
			<media:title type="html">Full Screen Shot</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/ben_werther_platfora-e1350965445298.jpg?w=300" medium="image">
			<media:title type="html">Ben_Werther_Platfora</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/platfora-arch.jpg?w=202" medium="image">
			<media:title type="html">platfora arch</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/segmentation.jpg?w=604" medium="image">
			<media:title type="html">Segmentation</media:title>
		</media:content>
	</item>
		<item>
		<title>How 0xdata wants to help everyone become data scientists</title>
		<link>http://gigaom.com/2012/08/14/how-0xdata-wants-to-help-everyone-become-data-scientists/</link>
		<comments>http://gigaom.com/2012/08/14/how-0xdata-wants-to-help-everyone-become-data-scientists/#comments</comments>
		<pubDate>Tue, 14 Aug 2012 19:45:04 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[0xdata]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[HDFS]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=552457</guid>
		<description><![CDATA[Although it's still a work in progress, 0xdata thinks it has the answer to the problem of doing advanced statistical analysis at scale: Build on HDFS for scale, use the widely known R programming language and hide it all under a simple interface.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=552457&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>There&#8217;s a trend afoot in the big data space <a href="http://gigaom.com/cloud/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/">to turn data science from black magic into child&#8217;s play</a>, and one of the newest companies trying to pull of this technological alchemy is <a href="http://www.0xdata.com/index.html">0xdata</a>. The bootstrapped startup, pronounced &#8220;hexadata,&#8221; is the brainchild of former DataStax engineer, and Platfora co-founder, SriSatish Ambati, and it&#8217;s trying to blend Hadoop, R and Google BigQuery into the ultimate tool for statistical analysis. Scientists, data analysts or whoever ultimately uses the product only need to be experts in their domains, not in statistics.</p>
<p>At its core, <a href="http://www.0xdata.com/faq.html">oxdata&#8217;s flagship product, called H2O</a>, is a statistical analysis engine that uses the Hadoop Distributed File System (HDFS) as its storage platform, but the goal is to make it <a href="http://gigaom.com/cloud/google-opens-up-its-biq-query-data-analytics-service-to-all/">as simple as using a Google service such as BigQuery</a>. Users will interact with H2O via a simple web-search-like bar and standard <a href="http://www.r-project.org/">R statistical-analysis</a> syntax, but H2O will run machine-learning algorithms behind the scenes. Alternatively, users can call out to H2O from Microsoft Excel or the <a href="http://rstudio.org/">RStudio</a> integrated development environment using a REST API.</p>
<div id="attachment_552941" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2012/08/big_banner-copy.jpg"><img  title="big_banner copy" src="http://gigaom2.files.wordpress.com/2012/08/big_banner-copy.jpg?w=300&#038;h=114" alt="" width="300" height="114" class="size-medium wp-image-552941" /></a><p class="wp-caption-text">Although BigQuery is a SQL service hosted by Google, 0xdata follows a similar theory on simplicity.</p></div>
<p>However they choose to leverage the product, Ambati said, the scale of the underlying data and the complexity of running advanced analysis are details that need to be hidden. It&#8217;s the same theory that underlies Platfora, the company Ambati co-founded last year with his former DataStax colleague Ben Werther, although their approaches appear to be different. Whereas Platfora is <a href="http://gigaom.com/cloud/platfora-gets-5-7m-to-make-hadoop-mainstream/">trying to disrupt the data warehouse market</a> by building a next-generation user experience atop Hadoop, 0xdata is trying to change the way users interact with popular statistical software such as R.</p>
<p>But either way, Ambati says of new data-analysis products, &#8220;[There are] no bragging rights for making it simple. If you don&#8217;t do that, you won&#8217;t be able to go forward.&#8221;</p>
<p>oxdata is also putting a focus on speed, both in terms of how fast it processes data and how fast it lets users react. Google search changed our thinking around how many questions people can ask successively, Ambati explained, and data analysts should have the same experience. That&#8217;s why H2O provides approximate results at every step in the analysis process. Rather than wait for the entire job to run and the exact results to be computed, users can get a general idea of results and kill the job and start over quicker if they&#8217;re completely outside the expected range.</p>
<p>But it will be a while before the public gets a chance to see whether H2O lives up to its promises. Ambati said the product is just four months into development and won&#8217;t have its first set of algorithms available for another few months. His team of eight engineers has &#8220;built a lot of cool stuff,&#8221; but now it needs to round out the process and turn its code for H2O into an actual product.</p>
<p>Still, having decided to tackle data as a system, Ambati and his team are having a lot of fun. &#8220;We are live-and-die-with-infrastructure people,&#8221; he said, but for a bunch of folks who spent a lot of time learning math, it&#8217;s like going back to the their days as computer science students.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-11418p1.html">Shutterstock user Bruce Rolff</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=552457&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=196586"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=196586" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=552457+how-0xdata-wants-to-help-everyone-become-data-scientists&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=552457+how-0xdata-wants-to-help-everyone-become-data-scientists&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=552457+how-0xdata-wants-to-help-everyone-become-data-scientists&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=552457+how-0xdata-wants-to-help-everyone-become-data-scientists&utm_content=dharrisstructure">2012: The Hadoop infrastructure market booms</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/08/14/how-0xdata-wants-to-help-everyone-become-data-scientists/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/08/shutterstock_107081264-e1344971009541.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/08/shutterstock_107081264-e1344971009541.jpg?w=150" medium="image">
			<media:title type="html">shutterstock_107081264</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/08/big_banner-copy.jpg?w=300" medium="image">
			<media:title type="html">big_banner copy</media:title>
		</media:content>
	</item>
		<item>
		<title>Want to ditch your data scientists? Here are 7 startups that can help</title>
		<link>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/</link>
		<comments>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/#comments</comments>
		<pubDate>Thu, 05 Jul 2012 21:59:04 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[BigML]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Datahero]]></category>
		<category><![CDATA[Datameer]]></category>
		<category><![CDATA[karmasphere]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[Prior Knowledge]]></category>
		<category><![CDATA[tableau]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=539784</guid>
		<description><![CDATA[If we want to big data revolution to scale, then we need to make it as easy as Netscape made the web surfing experience. Here are 7 startups making that happen.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=539784&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2011/09/3968092988_7644769b52_z-e1315427882461.jpg"><img  title="scientists " src="http://gigaom2.files.wordpress.com/2011/09/3968092988_7644769b52_z-e1315427882461.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignright size-medium wp-image-402634" /></a>Ford&#8217;s data chief joined many other top executives who are <a href="http://www.zdnet.com/fords-big-data-chief-sees-massive-possibilities-but-the-tools-need-work-7000000322/">bemoaning the lack of simple tools</a> to solve big data problems &#8212; namely the fact that running Hadoop clusters or performing analytics is still a job that requires a specialist. If we want to big data revolution to scale, then we need to make it as easy as Netscape made the web surfing experience. Here are 7 startups making that happen.</p>
<p>Ford&#8217;s data chief John Ginder, did an interview with <a href="http://www.zdnet.com/fords-big-data-chief-sees-massive-possibilities-but-the-tools-need-work-7000000322/">ZDNet in which</a> he says:</p>
<blockquote><p>&#8220;That&#8217;s a great endpoint I&#8217;d love us to move toward,” said Ginder, “but there aren&#8217;t enough of us and there aren&#8217;t enough of those tools out there to enable us to do that yet. We have our own specialists who are working with the tools and developing some of our own in some cases and applying them to specific problems. But, there is this future state where we&#8217;d like to be where all that data would be exposed. [And] where data specialists &#8212; but not computer scientists &#8212; could go in and interrogate it and look for correlations that might not have been able to look at before. That&#8217;s a beautiful future state, but we&#8217;re not there yet.&#8221;</p></blockquote>
<p><strong><a href="http://data-hero.com/">Datahero</a></strong>: This <a href="http://gigaom.com/cloud/data-hero-aims-to-turn-us-all-into-analytics-stars/">startup is all about visualization</a> &#8212; namely making it easy to take data and turn it into pretty pictures that can then generate new understanding or convince someone to take action. Users bring their datasets files and Datahero does the rest.</p>
<p><strong><a href="https://www.priorknowledge.com/">Prior Knowledge</a></strong>: Relative newcomer <a href="http://gigaom.com/cloud/exclusive-prior-knowledge-wants-to-be-your-data-oracle/">Prior Knowledge is the brainchild of MIT grads</a> who wanted to let non data scientists play around with data. The company offers a service that lets people upload their data and hook into PK&#8217;s database API. The service then assess the information for correlations as well as helps app developers build predictive models. It&#8217;s raised $1.4 million in funding from Founders Fund and angels.</p>
<p><strong><a href="http://www.platfora.com/">Platfora</a></strong>: Hadoop is everyone&#8217;s favorite big data batch processing platform, but it&#8217;s not easy enough for everyone to use. Like others <a href="http://gigaom.com/cloud/platfora-gets-5-7m-to-make-hadoop-mainstream/">Platfora wants to make Hadoop so easy</a> even I could use it, through an intuitive user interface that has advanced data science functions built in, rather than making users perform queries. It has raised $5.7 million and its product will be out next year.</p>
<p><strong><a href="http://clearstorydata.com/">ClearStory</a></strong>: Big names back this startup, which is also a service as opposed to software. Google Ventures, Andreeseen Horowitz, and Khosla Ventures have funded ClearStory, which aims to help funnel data from a variety of source (including Hadoop!) into one place, where employees can then use a GUI to interact with and visualize that data.</p>
<p><strong><a href="https://karmasphere.com/">Karamasphere</a></strong>: The Karmasphere product is designed to ease the process of developing Hadoop workloads and applications, even from the desktop. It lets users write SQL-like queries while also connecting to their favorite BI tools and analytics software to the software to perform analysis.</p>
<p><strong><a href="http://www.datameer.com/">Datameer</a></strong>: Like others on this list Datameer is out to make Hadoop more relatable to non nerds. In this case it does this by creating a familiar spreadsheet overlay so businesspeople can analyze their Hadoop jobs and then let&#8217;s people create visualizations and draw correlations. It&#8217;s closest to Karamsphere, but its latest feature that allows someone to run it on a single machine is a differentiator.</p>
<p><strong><a href="http://gigaom.com/cloud/your-data-has-a-secret-but-you-yes-you-can-make-it-talk/">BigML</a></strong>: Much like Prior Knowledge, BigML is a startup that combines data with machine learning to help give normal people access to the smarts to help them answer questions with their data. It hopes to let people do machine learning in four easy steps: set up a data source; create a dataset; create a model; and generate predictions. It’s in private-beta mode right now.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=539784&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=416990"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=416990" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539784+want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539784+want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help&utm_content=shigginbotham">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539784+want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help&utm_content=shigginbotham">2012: The Hadoop infrastructure market booms</a></li><li><a href="http://pro.gigaom.com/2012/04/aws-storage-gateway-jolts-cloud-storage-ecosystem/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539784+want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help&utm_content=shigginbotham">AWS Storage Gateway jolts cloud-storage ecosystem</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/05/want-to-ditch-your-data-scientists-heres-are-7-startups-that-can-help/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/09/3968092988_7644769b52_z-e1315427882461.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/09/3968092988_7644769b52_z-e1315427882461.jpg?w=150" medium="image">
			<media:title type="html">scientists</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/09/3968092988_7644769b52_z-e1315427882461.jpg?w=300" medium="image">
			<media:title type="html">scientists </media:title>
		</media:content>
	</item>
		<item>
		<title>Does function trump form in application design?</title>
		<link>http://gigaom.com/2012/05/15/does-function-trump-form-in-application-design/</link>
		<comments>http://gigaom.com/2012/05/15/does-function-trump-form-in-application-design/#comments</comments>
		<pubDate>Wed, 16 May 2012 01:20:25 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[application design]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[ClearStory]]></category>
		<category><![CDATA[consumerization]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Platfora]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=521878</guid>
		<description><![CDATA[There's a principle of application design that beautiful means usable, but a new study out of Google suggests that while beauty doesn't necessarily affect perceived usability, poor usability can negatively affect perceived beauty. Nobody wants a reputation as selling a product that's both unusable and ugly.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=521878&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/05/utility-knife.jpg"><img title="utility knife" src="http://gigaom2.files.wordpress.com/2012/05/utility-knife.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-521946"></a>There’s a principle of application design that beautiful means usable, but a <a href="http://googleresearch.blogspot.com/2012/05/is-beautiful-usable-what-is-influence.html">new study out of Google</a> suggests that’s not always the case. Rather, the researchers found, while beauty doesn’t necessarily affect perceived usability, poor usability can negatively affect perceived beauty. The study is far from conclusive, but as consumerization continues to sweep through software design, it might offer some warning worth heeding. Nobody wants a reputation as selling a product that’s both unusable <em>and</em> ugly.</p>
<p>The subject is on the top of my mind after recently writing a piece for GigaOM Pro in which I <a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=521878+does-function-trump-form-in-application-design&amp;utm_content=dharrisstructure">look at the evolution of user interfaces and user experiences within big data applications</a> (subscription req’d). In that piece, I looked at the current state of the art in terms of creating broadly accessible big data applications, and how startups such as Platfora and ClearStory are looking to looking to take it to the next level by creating visually compelling applications usable even by everyday businesspeople. They’ll soon be joined by even more companies trying to make big data even more intuitive.</p>
<p>After reading the study (which focused on e-commerce sites, not business apps) my concern, which isn’t limited to big data by any means, is that companies striving for broad appeal might place beauty too far ahead of usability. As they design for mobile devices first, and as they do everything to stand apart from the enterprise applications and even the successful next-generation applications they’re trying to replace, there might be a temptation to focus too much on the surface level.</p>
<p>Even Google — the master of utilitarian design — has been guilty of this from time to time, rolling out a new Gmail or Reader interface, or search results layout, <a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=521878+does-function-trump-form-in-application-design&amp;utm_content=dharrisstructure">that initially has users saying it’s ugly</a> because they haven’t yet adjusted to using it. But Google is Google, it’s entrenched and ubiquitous. Startups are not.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/05/chart-for-blog-001.png"><img title="Chart for Blog.001" src="http://gigaom2.files.wordpress.com/2012/05/chart-for-blog-001.png?w=300&#038;h=225" alt="" width="300" height="225" class="alignright size-medium wp-image-521937"></a>The study suggests the answer to effective design may be to reconceive how we think about design in the first place. Aesthetic attributes such as “clean” and “organized,” they say, might also be viewed as usability atributes, which might help explain the correlation between poor usability and the perception of bad design.</p>
<p>Or maybe, they note, it’s just a matter of poor usability putting participants in a bad mood: “the evaluation of aesthetics is influenced by the user’s affective response caused by the interaction experience.”</p>
<p>Whatever the case, the study raises some questions to ponder for anyone building applications, especially those trying to reach a broad audience with consumer-friendly interfaces. Those applications often gain traction from the bottom up, garnering more users as positive word of mouth grows, which means reputation is everything. The study suggests poor usability will make even the most-beautiful applications seem less beautiful — imagine those that are only marginally beautiful to begin with.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-360643p1.html">Shutterstock user megastocker</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=521878&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=518791"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=518791" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=521878+does-function-trump-form-in-application-design&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=521878+does-function-trump-form-in-application-design&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=521878+does-function-trump-form-in-application-design&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=521878+does-function-trump-form-in-application-design&utm_content=dharrisstructure">2012: The Hadoop infrastructure market booms</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/05/15/does-function-trump-form-in-application-design/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/05/utility-knife.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/05/utility-knife.jpg?w=150" medium="image">
			<media:title type="html">utility knife</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/05/utility-knife.jpg?w=300" medium="image">
			<media:title type="html">utility knife</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/05/chart-for-blog-001.png?w=300" medium="image">
			<media:title type="html">Chart for Blog.001</media:title>
		</media:content>
	</item>
		<item>
		<title>The importance of putting the U and I in visualization</title>
		<link>http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/</link>
		<comments>http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/#comments</comments>
		<pubDate>Fri, 04 May 2012 06:55:34 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[1010data]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[apache-hive]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Aster Data Systems]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[BigQuery]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[ClearStory]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[Datameer]]></category>
		<category><![CDATA[datamine]]></category>
		<category><![CDATA[DataStax]]></category>
		<category><![CDATA[dive]]></category>
		<category><![CDATA[Excel]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Hadapt]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[infochimps]]></category>
		<category><![CDATA[infochimps-platform]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[jaspersoft]]></category>
		<category><![CDATA[Kontagent]]></category>
		<category><![CDATA[mapreduce]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Mortar Data]]></category>
		<category><![CDATA[odbc]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[pentaho]]></category>
		<category><![CDATA[pig]]></category>
		<category><![CDATA[Platfora]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[qliktech]]></category>
		<category><![CDATA[ruby]]></category>
		<category><![CDATA[spreadsheets]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[Stanford]]></category>
		<category><![CDATA[Startups]]></category>
		<category><![CDATA[tableau]]></category>
		<category><![CDATA[Teradata]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[UI]]></category>
		<category><![CDATA[UIS]]></category>
		<category><![CDATA[User interface]]></category>
		<category><![CDATA[vc]]></category>
		<category><![CDATA[venture capital]]></category>
		<category><![CDATA[wukong]]></category>

		<guid isPermaLink="false">http://pro.gigaom.com/?p=104734</guid>
		<description><![CDATA[Ask a VC about big data and she will probably tell you about visualization of the user interface. We're talking about intuitive UIs that let users visually work with data using charts and tools, not algorithms. It's hard to do right, but the payoff could be huge.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=517773&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Ask a venture capitalist about big data and she will probably tell you about visualization. Only it won&#8217;t be visualization in the usual sense. Instead, it will be about visualization of the user interface. We&#8217;re talking about strikingly intuitive UIs that let users visually work with data using charts and tools instead of with algorithms and code. It&#8217;s hard work to do right — especially when you&#8217;re talking about massive data sets and complex computations — but the payoff could be huge for businesses.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=517773&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=282727"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=282727" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=517773+the-importance-of-putting-the-u-and-i-in-visualization&utm_content=gigaguest">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=517773+the-importance-of-putting-the-u-and-i-in-visualization&utm_content=gigaguest">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=517773+the-importance-of-putting-the-u-and-i-in-visualization&utm_content=gigaguest">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=517773+the-importance-of-putting-the-u-and-i-in-visualization&utm_content=gigaguest">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/4411542bbd7a2a9a2fc2a1b38809e45c?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigaguest</media:title>
		</media:content>
	</item>
	</channel>
</rss>
