<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; database</title>
	<atom:link href="http://gigaom.com/tag/database/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Wed, 22 May 2013 07:25:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; database</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>How to make a less creepy robot? Simple, just add data</title>
		<link>http://gigaom.com/2013/05/20/how-to-make-a-less-creepy-robot-simple-just-add-big-data/</link>
		<comments>http://gigaom.com/2013/05/20/how-to-make-a-less-creepy-robot-simple-just-add-big-data/#comments</comments>
		<pubDate>Mon, 20 May 2013 21:22:00 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[database]]></category>
		<category><![CDATA[Disney Research]]></category>
		<category><![CDATA[robots]]></category>
		<category><![CDATA[UI]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=647315</guid>
		<description><![CDATA[Deep inside the House of Mouse researchers are solving computer science and mechanical engineering problems -- like how to build a robot that can hand you a drink without creeping you out.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647315&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Disney&#8217;s research arm has <a href="http://www.disneyresearch.com/wp-content/uploads/Disney-Research-Human-to-Robot-Handoff-FINAL.pdf"> solved a problem that you probably didn&#8217;t even know robots have</a> &#8212; their inability to accept objects from people in a natural way. The Disney Research team, working with funding from the International Center for Advanced Communication Technologies (interACT) at Carnegie Mellon and the University and Karlsruhe Institute of Technology (KIT), believe that robots who can&#8217;t naturally accept &#8220;handoffs&#8221; of objects from people are creepy. In a <a href="http://www.disneyresearch.com/wp-content/uploads/icra13_RecMoHumanoidRobotics_final.pdf">paper</a> presented this month, Disney and its partners detailed how they used several motion-sensitive cameras, a database of gestures and some fancy algorithms to solve this handoff problem.</p>
<p>From the <a href="http://www.disneyresearch.com/project/objectreceivingrobots/">press release announcing the findings</a>:</p>
<blockquote id="quote-%e2%80%9cif-a-robot-"><p>“If a robot just sticks out its hand blindly, or uses motions that look more robotic than human, a person might feel uneasy working with that robot or might question whether it is up to the task,” Katsu Yamane, Disney Research, Pittsburgh senior research scientist explained. “We assume human-like motions are more user-friendly because they are familiar.”</p></blockquote>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/recmo_image-1024x174.png"><img  alt="RecMo_image-1024x174" src="http://gigaom2.files.wordpress.com/2013/05/recmo_image-1024x174.png?w=708&#038;h=120" width="708" height="120" class="aligncenter size-large wp-image-647348" /></a></p>
<p>Despite the robot pictured on the Disney page touting this research looking like the mechanical, blue-haired skeleton that haunted my childhood nightmares, its attempts to grab the purse from the person do seem reactive to the human&#8217;s gestures, as opposed to the robot just sticking his arm out there and the person having to accommodate it. And that sort of naturalism will be important as we bring more robots into our homes and workplaces.</p>
<p>For example, an MIT group used a dancer&#8217;s motions to build a <a href="http://gigaom.com/2013/05/13/just-add-robots-mit-and-coke-show-off-a-smartphone-controlled-bartender/">robotic bartender</a> in a quest for naturalism &#8212; even though that robot doesn&#8217;t interact with people.</p>
<p>Today, designers try to <a href="http://www.nytimes.com/2013/01/27/opinion/sunday/our-talking-walking-objects.html">endear robots to us</a> with quirky noises (like R2D2) and maybe light displays or LED faces &#8212; anything to help anthropomorphize them. But as robots become more human-looking they can also become more sinister &#8212; <a href="http://en.wikipedia.org/wiki/Uncanny_valley">achieving that same uncanny valley</a> that Disney and other content companies have struggled with in animation. Remember the dead-eyed stars of the Polar Express that you probably couldn&#8217;t empathize with? The jerky movements of a home health robot might engender similar feelings &#8212; or worse &#8212; they may scare people.</p>
<p>Building the natural gestures of the Disney robot took the creation of a hierarchical gesture database that the robot can access as it detects the person passing something to it. In the Disney paper research, the robot is not only able to reach for the handbag, but when the human attempts a fake pass to the robot, the <del datetime="2013-05-20T19:18:59+00:00">blue-haired monstrosity</del> robot is able to adapt. From the release:</p>
<blockquote id="quote-to-enable-a-robot-to2"><p>To enable a robot to access a library of human-to-human passing motions with the speed necessary for robot-human interaction, the researchers developed a hierarchical data structure. Using principal component analysis, the researchers first developed a rough estimate of the distribution of various motion samples. They then grouped samples of similar poses and organized them into a binary tree structure. With a series of “either/or” decisions, the robot can rapidly search this database, so it can recognize when the person initiates a handing motion and then refine its response as the person follows through.</p></blockquote>
<p>Even if you don&#8217;t have an opinion on how naturally robots should move, this research brings home the awesome amount of work it takes to build computers and robots that mimic the capabilities of a person. Much like computer visualization, the science of robotic interaction takes a problem the size of a mountain and has to chip it down into grains of sand using a toothpick to find solutions. It&#8217;s a testament to human curiosity that people are willing to try.</p>
<p>Also, I expect Disney might be lured by the idea of natural-looking robots roaming its theme parks. My only question is would they be dressed up as characters or working the cash register at the gift stores. </p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647315&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=663301"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=663301" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647315+how-to-make-a-less-creepy-robot-simple-just-add-big-data&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647315+how-to-make-a-less-creepy-robot-simple-just-add-big-data&utm_content=shigginbotham">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/01/newnet-q4-platform-mania-and-social-commerce-shakeout/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647315+how-to-make-a-less-creepy-robot-simple-just-add-big-data&utm_content=shigginbotham">NewNet Q4: Platform mania and social commerce shakeout</a></li><li><a href="http://pro.gigaom.com/2012/01/newnet-q4-platform-mania-and-social-commerce-shakeout/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647315+how-to-make-a-less-creepy-robot-simple-just-add-big-data&utm_content=shigginbotham">NewNet Q4: Platform mania and social commerce shakeout</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/20/how-to-make-a-less-creepy-robot-simple-just-add-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/disneyrobot.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/disneyrobot.jpg?w=150" medium="image">
			<media:title type="html">disneyrobot</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/recmo_image-1024x174.png?w=708" medium="image">
			<media:title type="html">RecMo_image-1024x174</media:title>
		</media:content>
	</item>
		<item>
		<title>Database startup Drawn to Scale is closing down</title>
		<link>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/</link>
		<comments>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/#comments</comments>
		<pubDate>Fri, 17 May 2013 21:24:03 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Drawn to Scale]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[SQL on Hadoop]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=646718</guid>
		<description><![CDATA[Database startup Drawn to Scale, creator of the SQL-on-Hadoop technology called Spire, is closing down. The company's product, Spire, was one of the first SQL-on-Hadoop technologies.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646718&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Database startup Drawn to Scale, creator of the SQL-on-Hadoop technology called Spire, is closing down. Co-founder and CEO Bradford Stephens officially <a href="http://www.roadtofailure.com/?p=11">announced the closure in a blog post</a> on Friday.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png"><img  alt="spirearchitecture-015-e1361407038325" src="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png?w=300&#038;h=185" width="300" height="185" class="alignleft size-medium wp-image-646740" /></a>The company&#8217;s product, Spire, which provided full SQL support on top of the HBase NoSQL database, was one of the first products to <a href="http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/">try to blend Hadoop&#8217;s scalability with the robustness and familiarity of SQL</a>. That&#8217;s now <a href="http://gigaom.com/2013/03/05/the-hadoop-ecosystem-the-welcome-elephant-in-the-room-infographic/">an increasingly crowded space</a> (and has grown since that linked graphic was created). In March, Drawn to Scale <a href="http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/">expanded its support to MongoDB</a>, as well.</p>
<p>I wasn&#8217;t shocked when Stephens told me the news &#8212; questions about the four-year-old company&#8217;s financial health had been swirling for a while &#8212; but to hear of its financial woes was a bit surprising. His account in the post pretty much echoes what I had heard from others:</p>
<blockquote id="quote-it-seemed-we-had-eve"><p>&#8220;It seemed we had everything going for us — paid customers such as American Express, Orange Telecom, Flurry, and 4 others. Our technology worked brilliantly, we had a big hiring pipeline, and we had great media presence against our competitors who raised 10-100x more cash.&#8221;</p></blockquote>
<p>He added:</p>
<blockquote id="quote-yet-five-days-before2"><p>&#8220;Yet five days before we signed term sheets for a big A round or sold the company, we started getting hit by a series of black swans — and we just didn’t have what we needed to recover. I’ll leave the public detail at that level, but I will say that paying employees’ health insurance out of your meager savings is a powerful incentive to change course.&#8221;</p></blockquote>
<p>Up to this point, the company <a href="http://gigaom.com/2012/03/08/drawn-to-scale-raises-money-to-make-sql-big-data-ready/">had raised $925,000</a> from RTP Ventures, IA Ventures and SK Ventures. There&#8217;s no word yet on what will come of the company&#8217;s intellectual property.</p>
<p>As Stephens &#8212; who&#8217;s now doing an entrepreneur-in-residence gig at Ping Identity and helping out other startups (including popular wardrobe app <a href="http://www.clothapp.com/">Cloth</a>) &#8212; succinctly put it during a phone discussion, &#8220;We just don&#8217;t have the horsepower to keep running the company.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646718&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=188339"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=188339" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/07/scaling-hadoop-clusters-the-role-of-cluster-management/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">Scaling Hadoop clusters: the role of cluster management</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">2012: The Hadoop infrastructure market booms</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/dtsdragon.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/dtsdragon.png?w=150" medium="image">
			<media:title type="html">dtsdragon</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png?w=300" medium="image">
			<media:title type="html">spirearchitecture-015-e1361407038325</media:title>
		</media:content>
	</item>
		<item>
		<title>Teradata gets into the in-memory biz to take on SAP&#8217;s HANA</title>
		<link>http://gigaom.com/2013/05/08/teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana/</link>
		<comments>http://gigaom.com/2013/05/08/teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana/#comments</comments>
		<pubDate>Wed, 08 May 2013 14:07:43 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[in-memory database]]></category>
		<category><![CDATA[Teradata]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=643284</guid>
		<description><![CDATA[Teradata is trying to steal some thunder in the in-memory analytics space with a new technology called Intelligent Memory that places hot data in RAM while dispersing the rest across solid-state drives and disk.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643284&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Data analytics veteran Teradata will not let the new era of data-analysis architectures pass it by without a fight. It has already built products to address massive data volumes and Hadoop, and on Wednesday it announced an in-memory database technology to answer the industry’s latest call.</p>
<p>Speed is the driving factory behind the in-memory analytics push that spans everyone from classic Teradata <a href="http://gigaom.com/2013/01/11/sap-marries-transaction-processing-with-analytics-by-putting-business-suite-on-hana/">rivals like SAP</a> and Oracle to startups <a href="http://gigaom.com/2013/01/18/can-a-new-database-help-get-zynga-back-on-track/">such as MemSQL</a>. Estimates vary as to the exact speed difference between data access in RAM versus hard disk, but Teradata is claiming RAM is 3,000 times faster. The speed difference between RAM and solid-state drives or flash memory is smaller, although still significant.</p>
<p>Of course, cost also comes into play, as the speed and cost tend to go hand in hand when it comes to storage media. That’s one reason Teradata says its new technology, called Intelligent Memory, doesn’t operate fully in-memory like some competitive offerings do. Rather, it places only the “hottest” data in memory for super-fast analysis and spreads the rest between solid-state drives and disk within a Teradata environment.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/tdc2.jpg"><img alt="tdc2" src="http://gigaom2.files.wordpress.com/2013/05/tdc2.jpg?w=708"   class="aligncenter size-full wp-image-643317"></a></p>
<p>This concept of intelligent data placement has been around a while in the storage space (it’s part of <a href="http://gigaom.com/2013/05/06/emc-plots-software-defined-data-center-journey-from-vipr-storage-virtualization-base/">EMC’s new ViPR software-defined storage platform</a>, too), but the advent of big data and abundant flash has given it some new life. Many companies <a href="http://gigaom.com/2012/09/10/nimble-storage-gets-40m-as-ipo-approaches/">desire a tiered system</a> in which they can pay more for fast access to their important or hot data, while saving some cash on lower-performance for their older and less-accessed data. Facebook is <a href="http://gigaom.com/2013/01/16/why-facebook-might-put-blu-ray-to-use-on-big-data/">really pushing the envelope here</a> with its cold storage initiative — something VP of Engineering Jay Parikh will likely discuss at our <a href="http://event.gigaom.com/structure/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=643284+teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana&amp;utm_content=dharrisstructure">Structure conference</a> June 19 and 20 in San Francisco.</p>
<p>In analytics, though, RAM, not flash, is the fastest medium out there. Whether someone goes all-RAM or a tiered approach like Teradata pushing probably depends on how much performance they need across how much data, as well as how much they’re willing to pay. But if you’re doing interactive analytics in the next decade, they’re almost certain to be in-memory to some degree.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-537709p1.html">Shutterstock user Hellen Sergeyeva</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643284&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=535240"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=535240" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643284+teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2010/09/the-red-hot-data-warehouse-market-whos-buying-next/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643284+teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana&utm_content=dharrisstructure">The Red-Hot Data Warehouse Market: Who&#8217;s Buying Next?</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643284+teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2010/10/will-hadoop-vendors-profit-from-banks-big-data-woes/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643284+teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana&utm_content=dharrisstructure">Will Hadoop Vendors Profit from Banks&#8217; Big Data Woes?</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/08/teradata-gets-into-the-in-memory-biz-to-take-on-saps-hana/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_127075202.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_127075202.jpg?w=150" medium="image">
			<media:title type="html">ram memory</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/tdc2.jpg" medium="image">
			<media:title type="html">tdc2</media:title>
		</media:content>
	</item>
		<item>
		<title>The promise of better data has MetLife investing $300M in new tech</title>
		<link>http://gigaom.com/2013/05/07/with-300m-earmarked-for-tech-innovation-metlife-wants-to-remake-insurance/</link>
		<comments>http://gigaom.com/2013/05/07/with-300m-earmarked-for-tech-innovation-metlife-wants-to-remake-insurance/#comments</comments>
		<pubDate>Tue, 07 May 2013 14:00:31 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[10Gen]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[insurance industry]]></category>
		<category><![CDATA[MetLife]]></category>
		<category><![CDATA[MongoDB]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=642824</guid>
		<description><![CDATA[MetLife is building new products on new technologies thanks to a $300 million investment in new technology and new skills. One of the first products is a MongoDB-based app that puts all of customers' information in one place.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=642824&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The insurance industry hasn&#8217;t always been a beacon of technological innovation. Then again, its major providers haven&#8217;t always earmarked $300 million for investments in new technology and new talent like MetLife has. The strategy has already borne its first fruit in the form of a new database system and application that lets the company see everything it knows about a customer in a single place.</p>
<p>The new application, called The Wall, is essentially a way to make the customer service experience more palatable for consumers and to lower the burden of hiring new representatives. Because it&#8217;s designed to look and function like Facebook, MetLife CIO and SVP of Regional Application Development Gary Hoberman told me, The Wall means new hires don&#8217;t have to be trained on complex enterprise call center software. For customers calling MetLife to discuss a claim or their coverage, it means fewer annoying waits as an agent accesses data from any of dozens of different places.</p>
<p>&#8220;Instead of seeing what someone had for dinner, [The Wall is] all a customer&#8217;s transactions,&#8221; Hoberman said. Claims, records, status, possible cross-sell information (e.g., if someone lives in an apartment and might need renter&#8217;s insurance) &#8212; it&#8217;s all in there. Looking forward, he said, it might even contain other publicly available information from social media and certain mobile apps that would give the company even greater visibility into its customers&#8217; lives.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/metlife-screen-shot_active-contract.png"><img  alt="MetLife Screen Shot_Active Contract" src="http://gigaom2.files.wordpress.com/2013/05/metlife-screen-shot_active-contract.png?w=708&#038;h=720" width="708" height="720" class="aligncenter size-large wp-image-642871" /></a></p>
<h2 id="up-and-running-in-3-months-on-">Up and running in 3 months, on MongoDB</h2>
<p>From a business perspective, though, the most-impressive part of The Wall is how quickly it was implemented and what a divergence from classic large-enterprise IT practices it represents. For Hoberman, who spent 16 years at Citi before joining MetLife in mid-2012, the process was eye-opening. If you told someone in the financial services industry that it would take just five days to get servers up and running for the prototype of such a big application, he said, &#8220;they&#8217;d look at you like you had two heads.&#8221;</p>
<p>But that&#8217;s exactly what MetLife did. In fact, it had the entire prototype built just two weeks after devising it and the production system up and running in just three months. It came together so fast because of MetLife&#8217;s new focus on cutting-edge IT and clear mission to build a useful product rather than, as Hoberman put it, &#8220;doing big data for big data&#8217;s sake.&#8221; The tech team was willingly working nights and weekends and the leadership team was directly involved because everyone understood what a fundamental change the application could have on the business.</p>
<p>&#8220;In insurance,&#8221; Hoberman said, &#8220;&#8230; working in months, not years, is really a startup mentality.&#8221;</p>
<p>How big an undertaking was it? Built atop MongoDB, The Wall brings together data from more than 70 legacy systems and merges it into a single record. It runs across six servers in two data centers and presently stores about 24 terabytes of data. That includes MetLife&#8217;s entire U.S. customer base (some 45 million agreements in total), although the goal is to expand it to international customers and multiple languages, as well, and maybe even create a customer-facing version. It updates in near real time, just like the Facebook wall, as new customer data is entered.</p>
<p>Building a production database system on NoSQL technology isn&#8217;t commonplace in insurance or other large industries, but it was about the only way to pull this off. Going with the relational model, Hoberman explained, would have meant figuring out a common set of schema across such a wide range of products (insurance products and terms vary from state to state and country to country) that it would have been nearly impossible to actually achieve that coveted 360-degree customer view. MongoDB let Hoberman&#8217;s team build some light schema to give the app order, but to be able to take in all the data it had available.</p>
<h2 id="bringing-in-new-tech-and-new-b">Bringing in new tech, and new blood</h2>
<p>This is only a part of what MetLife is doing with new information technologies, though, and only a fraction of what it wants to do. With The Wall, specifically, MetLife Hoberman wants to build next-best action models that will give agents guidance on how to best deal with customers. Elsewhere, the company has already used its new centralized MongoDB system to build models for predicting attrition, and it&#8217;s using Hadoop and HBase for some other workloads where they&#8217;re a better fit.</p>
<p>It&#8217;s all thanks to a company mandate to save $450 million from its bloated technology and operations budget and then invest two-thirds of it back into new technology. &#8220;We literally have a $300 million investment to decide what&#8217;s going to be the future of MetLife,&#8221; Hoberman said. It&#8217;s kind of like being in a startup, he added, only with the resources to make sure everything is done right (much <a href="http://gigaom.com/2012/09/16/how-disney-built-a-big-data-platform-on-a-startup-budget/">like with other large enterprises embracing open source</a>, Hoberman&#8217;s team prototyped The Wall using open source MongoDB but brought in <a href="http://gigaom.com/2013/04/09/mongodb-ftw-fast-growing-10gen-hires-first-cfo/">10gen</a> when it came time to build a production system).</p>
<p>It might be easy to mock that statement, except that Hoberman and his peers are putting their money where their mouths are by bringing in new talent, as well. It&#8217;s setting up a team in the Research Triangle region of North Carolina and bringing in employees with expertise in areas such as social, mobile and big data. And Hoberman is far less concerned with specific technical skills than he is with motivation.</p>
<p>It&#8217;s all about &#8220;attitude and aptitude,&#8221; he said. &#8220;They can learn anything.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=642824&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=354571"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=354571" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642824+with-300m-earmarked-for-tech-innovation-metlife-wants-to-remake-insurance&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2013/01/cloud-and-data-fourth-quarter-2012-analysis/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642824+with-300m-earmarked-for-tech-innovation-metlife-wants-to-remake-insurance&utm_content=dharrisstructure">The fourth quarter of 2012 in cloud</a></li><li><a href="http://pro.gigaom.com/2012/11/unlocking-big-datas-potential-with-search/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642824+with-300m-earmarked-for-tech-innovation-metlife-wants-to-remake-insurance&utm_content=dharrisstructure">How search can unlock the power of big data</a></li><li><a href="http://pro.gigaom.com/2012/07/cloud-and-data-second-quarter-2012-analysis-and-outlook-2/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642824+with-300m-earmarked-for-tech-innovation-metlife-wants-to-remake-insurance&utm_content=dharrisstructure">Takeaways from the second quarter in cloud and data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/07/with-300m-earmarked-for-tech-innovation-metlife-wants-to-remake-insurance/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/metlife-screen-shot_active-contract1-e1367933585875.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/metlife-screen-shot_active-contract1-e1367933585875.png?w=150" medium="image">
			<media:title type="html">MetLife Screen Shot_Active Contract</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/metlife-screen-shot_active-contract.png?w=708" medium="image">
			<media:title type="html">MetLife Screen Shot_Active Contract</media:title>
		</media:content>
	</item>
		<item>
		<title>Look, IBM is doing SQL on Hadoop, too</title>
		<link>http://gigaom.com/2013/05/06/look-ibm-is-doing-sql-on-hadoop-too/</link>
		<comments>http://gigaom.com/2013/05/06/look-ibm-is-doing-sql-on-hadoop-too/#comments</comments>
		<pubDate>Mon, 06 May 2013 17:37:41 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[SQL on Hadoop]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=642523</guid>
		<description><![CDATA[IBM's entrant in the SQL-on-Hadoop competition has been flying under the radar, but is available as a technology preview. Called Big SQL, it's a big deal if IBM wants to be a major player in the Hadoop space.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=642523&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Maybe this is just news to me, but IBM has a SQL-on-Hadoop product in the works called Big SQL. The company <a href="https://www.ibm.com/developerworks/community/blogs/SusanVisser/entry/introducing_the_ibm_big_sql_technology_preview1?lang=en">announced the technology preview version in March</a> (well under my radar and, from what I&#8217;ve seen, nearly everyone else&#8217;s radar), and is offering up a cloud-based demo environment for a select group of early users.</p>
<p>As a refresher, the big difference between SQL on Hadoop and the Hadoop connectors that were popular a couple years ago is that SQL-on-Hadoop products query the data where it resides &#8212; in HDFS or HBase &#8212; rather than pulling it into a relational database environment to analyze it. We have been <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">talking for months about the emergence of a large SQL-on-Hadoop market</a>, but IBM&#8217;s name was conspicuously absent from that discussion. The company has Hadoop software called BigInsights and lots of SQL expertise, so it only made sense that IBM would get into the game at some point.</p>
<p>Details on Big SQL are still pretty sparse save for a few high-level blog posts and an instructional video (embedded below), but it looks to take the standard approach, <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">as Cloudera is doing with Impala</a>, of enabling access through traditional tools via JDBC and ODBC drivers.</p>
<p>Ultimately, I think the advent of big data will <a href="http://gigaom.com/2013/05/01/precog-launches-with-a-plan-to-simplify-analytics-on-unstructured-data/">enable some new types of querying techniques</a> quite a bit different than the SQL queries we&#8217;ve come to know and love over the past couple decades. But SQL is still the language du jour and might never go away, so there&#8217;s a lot of value to be had if people can put their SQL skills to work on data stored inside Hadoop or other environments, and if companies can work toward a nirvana <a href="http://gigaom.com/2013/02/25/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya/">where all the data is stored in a single place</a> rather than across database environments.</p>
<p>That IBM got this message and got into the game isn&#8217;t surprising at all, but it is important. Lots of large companies buy IBM&#8217;s software.  If it wants them to follow it into the world of big data and Hadoop, it has to give them the tools they need to use it.</p>
<span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/DCWig4-h1F4?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=642523&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=105517"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=105517" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">2012: The Hadoop infrastructure market booms</a></li><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=642523+look-ibm-is-doing-sql-on-hadoop-too&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/06/look-ibm-is-doing-sql-on-hadoop-too/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/02/shutterstock_37622056.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/02/shutterstock_37622056.jpg?w=150" medium="image">
			<media:title type="html">sql statement</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>Wikipedia is now drawing facts from the Wikidata repository, and so can you</title>
		<link>http://gigaom.com/2013/04/26/wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you/</link>
		<comments>http://gigaom.com/2013/04/26/wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you/#comments</comments>
		<pubDate>Fri, 26 Apr 2013 09:10:56 +0000</pubDate>
		<dc:creator>David Meyer</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Google Now]]></category>
		<category><![CDATA[Knowledge Graph]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[siri]]></category>
		<category><![CDATA[structured data]]></category>
		<category><![CDATA[Wikimedia]]></category>
		<category><![CDATA[Wikipedia]]></category>
		<category><![CDATA[Wolfram Alpha]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=634673</guid>
		<description><![CDATA[The Wikimedia Foundation's first major new project in 7 years is now feeding the biggest project in that stable, Wikipedia itself. But anyone can take structured data from Wikidata, due to its open license.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=634673&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="https://www.wikidata.org/">Wikidata</a>, a centralized structured data repository for facts and Wikimedia&#8217;s first big new project in the last 7 years, is now feeding the foundation&#8217;s main project, Wikipedia.</p>
<p>The Wikidata project was kicked off around a year ago by the German chapter of Wikimedia, which is still steering its gradual development. For Wikipedia, the advantage is simple and powerful &#8212; if there&#8217;s a central, machine-readable source for facts, such as the population of a city, then any update to that data can be instantly reflected across all the articles in which the facts are included.</p>
<p>To posit a morbid example: a singer may have dozens or even hundreds of language versions of her Wikipedia entry and, if she were to die, the addition of a date of death to the Wikidata database would immediately propagate across all those versions, with no need to manually update each one (yes, I can also see how this might go horribly wrong). </p>
<p>Indeed, Wikidata is now being used as a common data source for all 286 Wikipedia language versions. <a href="https://www.wikidata.org/wiki/Q159">Here&#8217;s the under-development &#8220;item&#8221; page for Russia</a>, if you want to see what Wikidata looks like in practise.</p>
<p><a href="http://gigaom.com/2013/04/26/wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you/wikidata-russia/" rel="attachment wp-att-634675"><img src="http://gigaom2.files.wordpress.com/2013/04/wikidata-russia.jpg?w=708&#038;h=471" alt="Wikidata Russia" width="708" height="471"  class="aligncenter size-large wp-image-634675" /></a></p>
<p>But the really interesting thing with Wikidata is that it&#8217;s not just for Wikipedia – although it&#8217;s worth remembering that its API is still under development, the database can be used by anyone as it is published under a <a href="http://creativecommons.org/publicdomain/zero/1.0/">Creative Commons 0 public domain dedication</a>. Here&#8217;s how Wikidata project director Denny Vrandečić put it in a statement:</p>
<blockquote id="quote-it-is-the-goal-of-wi3"><p>&#8220;It is the goal of Wikidata to collect the world&#8217;s complex knowledge in a structured manner so that anybody can benefit from it, whether that&#8217;s readers of Wikipedia who are able to be up to date about certain facts or engineers who can use this data to create new products that improve the way we access knowledge.&#8221;</p></blockquote>
<p>There are already some pretty cool (if bare-bones) examples of what people can do with Wikidata. One is GeniaWiki, which is trying to map the family relationships between famous people (the first and so far only example is that of the <a href="https://toolserver.org/~magnus/ts2/geneawiki/?q=Q1339">Bach family</a>), while a <a href="http://simia.net/treeoflife/">Tree of Life project</a> is trying to put together a viable, Wikidata-based &#8220;taxonomy of all life&#8221;.</p>
<p>It&#8217;s worth noting that the initial funding for Wikidata&#8217;s development has come from Google, the Gordon and Betty Moore Foundation, and the Allen Institute for Artificial Intelligence. Ultimately, Wikidata is precisely the sort of venture that is needed to feed the nascent semantic web and AI movement. </p>
<p>It&#8217;s far from the only venture in this space – I&#8217;d also recommend keeping a close eye on Google&#8217;s <a href="http://www.google.com/insidesearch/features/search/knowledge.html">Knowledge Graph</a>, which powers Google Now, and <a href="http://blog.stephenwolfram.com/2013/03/talking-about-the-computational-future-at-sxsw-2013/">Wolfram|Alpha</a>, which partly powers Siri – but all these (often intertwined) projects are essentially trying to do the same thing: to turn facts into something that machines can understand. </p>
<p>And that, in conjunction with advances in natural language processing and machine learning, will ultimately help us converse with machines. These are the building blocks of artificial intelligence and the future of search, and Wikidata&#8217;s very permissive license should act as an open <a href="https://www.wikidata.org/wiki/Wikidata:Contribute">invitation</a> to anyone dabbling in this space. </p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=634673&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=903518"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=903518" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634673+wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you&utm_content=superglaze">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/10/siri-say-hello-to-the-coming-invisible-interface/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634673+wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you&utm_content=superglaze">Siri: Say hello to the coming &#8220;invisible interface&#8221;</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634673+wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you&utm_content=superglaze">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/09/the-future-of-mobile-a-segment-analysis-by-gigaom-pro/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634673+wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you&utm_content=superglaze">The future of mobile: a segment analysis by GigaOM Pro</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/26/wikipedia-is-now-drawing-facts-from-the-wikidata-repository-and-so-can-you/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/04/geneawiki.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/04/geneawiki.jpg?w=150" medium="image">
			<media:title type="html">GeneaWiki</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6599daccfd7e897e68744fe0065e5a2e?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">superglaze</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/wikidata-russia.jpg?w=708" medium="image">
			<media:title type="html">Wikidata Russia</media:title>
		</media:content>
	</item>
		<item>
		<title>The ex-MySQL gang is back together, pushing MariaDB as a neutral &#8216;bridge&#8217;</title>
		<link>http://gigaom.com/2013/04/24/the-ex-mysql-gang-is-back-together-pushing-mariadb-as-a-neutral-bridge/</link>
		<comments>http://gigaom.com/2013/04/24/the-ex-mysql-gang-is-back-together-pushing-mariadb-as-a-neutral-bridge/#comments</comments>
		<pubDate>Wed, 24 Apr 2013 16:54:13 +0000</pubDate>
		<dc:creator>David Meyer</dc:creator>
				<category><![CDATA[database]]></category>
		<category><![CDATA[MariaDB]]></category>
		<category><![CDATA[Monty Widenius]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[SkySQL]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=634013</guid>
		<description><![CDATA[MySQL and MariaDB services company SkySQL has brought Monty Widenius and other MariaDB players on board. The result, says CEO Patrik Sallner, will be "a new form of database platform that ties together other databases."<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=634013&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Bad news for Oracle, maybe: some of the key pre-Sun-takeover MySQL players are back together, and their MariaDB fork of MySQL looks like it&#8217;s gaining serious traction.</p>
<p>The reunion comes courtesy of a merger between open source database services firm SkySQL (which supports both MySQL and MariaDB deployments for customers ranging from Harvard to Shutterstock) and a company called Monty Program &#8212; yes, as in Monty Widenius, who named MySQL after his oldest daughter My and its fork after his younger daughter, Maria. </p>
<p>So now we have Widenius and other ex-MySQLers such as Colin Charles back together with players such as MySQL co-founder David Axmark and former MySQL sales director Magnus Stenberg. Actually, that&#8217;s underselling the magnitude of what&#8217;s happened here: out of the 70 employees of the fused operation (which is continuing under the SkySQL name), 50 used to be at the original MySQL firm. </p>
<h2 id="open-appeal">Open appeal</h2>
<p>At the same time, MariaDB seems to be capitalizing on the <a href="http://www.theregister.co.uk/2012/11/29/monty_oracle_eu_promises/">disillusionment</a> of some in the open source community with Oracle&#8217;s stewardship of MySQL &#8212; doing things like releasing extensions for the commercial version but not the free version was never going to win favor in that scene. Wikipedia <a href="http://blog.wikimedia.org/2013/04/22/wikipedia-adopts-mariadb/">migrated to MariaDB</a> in the last few days, and the Fedora and OpenSUSE Linux distros will <a href="http://www.zdnet.com/oracle-who-fedora-and-opensuse-will-replace-mysql-with-mariadb-7000010640/">both make the jump</a> in their next releases. </p>
<p>The MariaDB Foundation, which is <a href="http://blog.mariadb.org/mariadb-foundation-takes-next-steps-to-community-governance/">busy sorting out its governance structure</a> and which now claims SkySQL as an early member, also <a href="http://webmink.com/2013/04/18/taking-mariadb-foundation-forward/">took on</a> former Sun Chief Open Source Officer Simon Phipps as its CEO a week ago.</p>
<p>&#8220;It is a pleasure to have a company representing the reunited core team of our code base joining the Foundation at its inception,&#8221; Phipps said in a statement this week.</p>
<h2 id="mariadb-the-bridge">MariaDB the &#8220;bridge&#8221;</h2>
<p>The fused team has a unique NewSQL proposition: not only is MariaDB fully compatible with MySQL, but it can also interface with newer NoSQL databases such as Cassandra and LevelDB. According to SkySQL CEO Patrik Sallner, SkySQL will continue to service both MySQL and MariaDB customers and won&#8217;t be forcing anyone to jump to MariaDB &#8212; but he expects many customers to make that leap nonetheless:</p>
<blockquote id="quote-right-now-because-my"><p>&#8220;Right now, because MySQL belongs to Oracle, it&#8217;s not necessarily perceived as independent. Linux is the default operating system in most enterprise contexts. Oracle, IBM and Microsoft control the vast majority of business in databases and most companies have at least two of these, which are not compatible with each other. And, as companies deploy new applications, they use new [NoSQL] database technologies to meet their needs.</p>
<p>&#8220;We believe that MariaDB has an opportunity to become a truly independent and interoperable open source database, meaning we can provide a solution that&#8217;s a neutral ground for companies. &#8230; Our aspiration is to start building this into a new form of database platform that ties together other databases in a seamless manner. By providing a bridge, we believe we can create more innovation.&#8221;</p></blockquote>
<p>Sallner noted that there isn&#8217;t currently a great deal of difference between MySQL and MariaDB, apart from the latter&#8217;s &#8220;pluggable&#8221; approach to storage engines. &#8220;Using the SQL language allows us to be compatible with other databases, and we have a connect engine which allows us to add on-the-fly support for other data formats,&#8221; he said.</p>
<p>As a next step, Sallner said he hoped to see other database providers join the MariaDB Foundation, in order to maintain this open common ground. &#8220;We&#8217;re not competing against DB2 or Oracle or Microsoft today &#8212; we&#8217;re all serving different needs,&#8221; he said. So does he want to sign up Oracle itself? &#8220;That&#8217;ll be a stretch, but it would be a huge sign of success,&#8221; he laughed.</p>
<p>It&#8217;s not all bonhomie, though &#8212; Sallner reckons large internet companies will engage with MariaDB in a way that they haven&#8217;t with Oracle&#8217;s MySQL.</p>
<p>&#8220;We believe those companies are willing to contribute the work they&#8217;ve done back to MariaDB,&#8221; he said. &#8220;Facebook and Twitter have contributed substantial new features to MariaDB. They probably wouldn&#8217;t have contributed that to Oracle.&#8221;</p>
<p><em>UPDATE (10.55am PT): This piece originally and incorrectly stated that Widenius is the new SkySQL CTO, whereas he is in fact the CTO of the MariaDB Foundation. Widenius is on the board of SkySQL, but his role is non-operational.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=634013&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=818632"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=818632" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634013+the-ex-mysql-gang-is-back-together-pushing-mariadb-as-a-neutral-bridge&utm_content=superglaze">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/11/breaking-down-barriers-and-reducing-cycle-times-with-devops-and-continuous-delivery/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634013+the-ex-mysql-gang-is-back-together-pushing-mariadb-as-a-neutral-bridge&utm_content=superglaze">How devops can reduce cycle times</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634013+the-ex-mysql-gang-is-back-together-pushing-mariadb-as-a-neutral-bridge&utm_content=superglaze">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2010/10/is-the-future-of-enterprise-completely-open-source/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=634013+the-ex-mysql-gang-is-back-together-pushing-mariadb-as-a-neutral-bridge&utm_content=superglaze">Is the Future of Enterprise Completely Open Source?</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/24/the-ex-mysql-gang-is-back-together-pushing-mariadb-as-a-neutral-bridge/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/04/mariadb.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/04/mariadb.jpg?w=150" medium="image">
			<media:title type="html">MariaDB</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6599daccfd7e897e68744fe0065e5a2e?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">superglaze</media:title>
		</media:content>
	</item>
		<item>
		<title>DataStax pushes NoSQL into Europe with new London-based subsidiary</title>
		<link>http://gigaom.com/2013/03/27/datastax-pushes-into-europe-with-new-london-based-subsidiary/</link>
		<comments>http://gigaom.com/2013/03/27/datastax-pushes-into-europe-with-new-london-based-subsidiary/#comments</comments>
		<pubDate>Wed, 27 Mar 2013 09:00:25 +0000</pubDate>
		<dc:creator>David Meyer</dc:creator>
				<category><![CDATA[apache]]></category>
		<category><![CDATA[Billy Bosworth]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[DataStax]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=624455</guid>
		<description><![CDATA[Having realized that 10 percent of its customer base is in the EMEA region, DataStax has launched a subsidiary there to further push its bundle of Hadoop, Cassandra and Solr.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=624455&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Last year was a good year for NoSQL outfit <a href="http://gigaom.com/2011/09/20/datastax-gets-11m-fuses-nosql-and-hadoop/">DataStax</a>. The big data company&#8217;s customer base increased roughly tenfold to 270, including 20 Fortune 100 firms and names such as eBay, Netflix and Thomson Reuters. It also picked up a <a href="http://www.datastax.com/2012/10/datastax-raises-25-million-in-third-round-of-funding">$25 million C round</a> in October, with one of the intended uses of that funding being global expansion. Now it&#8217;s making good on that promise by opening a European subsidiary.</p>
<p>The <a href="http://www.datastax.com/">DataStax</a> Enterprise 3 big data bundle fuses Hadoop with the Apache Cassandra database and Apache Solr enterprise search platform, creating what CEO Billy Bosworth claims is &#8220;the first viable alternative to Oracle since Oracle.&#8221; The big selling points here are linear scalability, operational simplicity and an emphasis on business continuity.</p>
<p>As the company has noticed that much of its new customer base was sited in Europe, the Middle East and Africa (EMEA), its latest move makes sense: DataStax has opened up a London office, and it&#8217;s a full-on subsidiary rather than just a branch office.</p>
<p>As Bosworth told me, the idea here is to be able to respond quickly to European market demands, which range from language variation to a different style of partnership:</p>
<blockquote id="quote-without-any-presence"><p>&#8220;Without any presence in EMEA, we ended up in 2012 with 10 percent of our customers located in the EMEA region – that was 100 percent inbound; we didn&#8217;t do any programs or outbound activity. We have <a href="http://www.scoreloop.com/">Scoreloop</a> in Germany, the mobile gaming platform, and <a href="http://gigaom.com/2012/08/31/report-40-percent-of-mobile-clicks-are-fraud-or-accidents/">Trademob</a>, the mobile app platform. We have mobile carriers who are decommissioning Oracle because they have to have a multi-data-center solution, and a London-based bank chose DataStax over Oracle for their ecommerce platform.</p>
<p>&#8220;In the UK, the business aspect of it is not that different from the U.S. &#8230; but as you move into the European continent, you do want to have some local language skills. And when you move into France and Spain and Italy, now you&#8217;re into a very boutique partner network. Those partners have very good relationships with their customers but are often not on the same scale as a big [systems integrator] like Accenture. The only way to really get close enough to that partner network is for us to be in the region as well.&#8221;</p></blockquote>
<p>With a portfolio as open-source-centric as DataStax&#8217;s is, Bosworth added, the company is also looking forward to hosting &#8220;a ton of meet-ups in the region&#8221; in the coming months.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=624455&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=500841"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=500841" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624455+datastax-pushes-into-europe-with-new-london-based-subsidiary&utm_content=superglaze">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624455+datastax-pushes-into-europe-with-new-london-based-subsidiary&utm_content=superglaze">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624455+datastax-pushes-into-europe-with-new-london-based-subsidiary&utm_content=superglaze">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/09/emerging-trends-in-the-non-relational-database-market/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=624455+datastax-pushes-into-europe-with-new-london-based-subsidiary&utm_content=superglaze">Emerging trends in the non-relational database market</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/27/datastax-pushes-into-europe-with-new-london-based-subsidiary/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/shutterstock_97676150.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/shutterstock_97676150.jpg?w=150" medium="image">
			<media:title type="html">London Underground sign and Big Ben clock at Houses of Parliament</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6599daccfd7e897e68744fe0065e5a2e?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">superglaze</media:title>
		</media:content>
	</item>
		<item>
		<title>Drawn to Scale wants to make MongoDB scale like Hadoop</title>
		<link>http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/</link>
		<comments>http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/#comments</comments>
		<pubDate>Tue, 19 Mar 2013 17:00:13 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Drawn to Scale]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[MongoDB]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[scalability]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=621885</guid>
		<description><![CDATA[Database startup Drawn to Scale has extended its Spire distributed data platform from SQL to MongoDB. That means users can get high performance from the latter even across hundreds of terabytes.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=621885&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>If you love MongoDB but are tired of trying to scale it past a handful of machines and a few hundred gigabytes, database startup <a href="http://drawntoscale.com/">Drawn to Scale</a> says it has you covered. The company has <a href="http://drawntoscale.com/announcing-spire-for-mongo/">expanded the functionality of its distributed data platform from SQL to MongoDB</a>, meaning users of the popular NoSQL database can import their data to Spire and see high performance on hundreds of terabytes.</p>
<p>Drawn to Scale’s flagship product, called Spire, is a distributed data platform that’s built atop an optimized version of the Hadoop-based HBase database. HBase is what lets Spire scale cheaply and easily across. Its fully distributed index is what lets Spire read and write data at speeds that other approaches to scaling databases (e.g., sharding) can’t handle while maintaining the ability to handle rich queries.</p>
<p>To date, <a href="http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/">the company has been focused on letting users run massive SQL databases</a>, but it has finally completed a lengthy process of rewriting parts of MongoDB to work with Spire, Founder and CEO Bradford Stephens (who’ll be participating in our <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=621885+drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems&amp;utm_content=dharrisstructure">Structure: Data event</a> this week in New York) told me. The company had been keeping the work under tight wraps “because we didn’t know how long it was going to take to build,” he added.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/03/spiremongo-230x300.png"><img alt="SpireMongo-230x300" src="http://gigaom2.files.wordpress.com/2013/03/spiremongo-230x300.png?w=708"   class="alignright size-full wp-image-621963"></a>“Our big vision for the market is providing people with a universal data platform,” Stephens said. After SQL — which accounts for the vast majority of databases in existence — MongoDB is a logical next step (although Spire also supports queries using Hadoop MapReduce). It’s the most-widely used NoSQL database by a longshot, but although many users love its functionality and tooling, <a href="http://gigaom.com/2012/05/29/with-42m-more-10gen-wants-to-take-mongodb-mainstream/">the database is notoriously poor at scaling</a> to meet the demands of big data or high performance.</p>
<p>“You just sort of top out once you max out the memory,” Stephens explained, adding that MongoDB often starts getting inefficient as it’s forced to scale across 50 or 10 servers. “[T]hat’s where we <em>start</em> getting efficient.”</p>
<p>Now, without changing a single line of code, he claims, MongoDB users can import their data onto Spire and start handing 200-plus terabytes with ease. Of course, he noted, this doesn’t mean MongoDB users will abandon the database entirely. It might be they keep it for running applications that don’t require it to scale beyond a single server, and then use Spire to store big data for analytical purposes.</p>
<p>Initially, Spire will just support data importation and the basic CRUD (create, read, update, delete) functions of MongoDB, Stephens said. Later this year, assuming users want it, Drawn to Scale will implement MongoDB’s native MapReduce functionality as well as its management features.</p>
<p>As data volumes and data stores continue to proliferate, though, Drawn to Scale isn’t the only startup trying to provide a one-stop shop experience. At least for analytics, Citus Data is building a Postgres-based database <a href="http://gigaom.com/2013/02/19/citusdb-today-sql-on-hadoop-tomorrow-the-world/">capable of analyzing SQL, Hadoop and MongoDB data</a>, although each data store remains external. And there’s a <a href="http://gigaom.com/2013/03/05/the-hadoop-ecosystem-the-welcome-elephant-in-the-room-infographic/">whole group of companies merging SQL and Hadoop</a> for analytic workloads that might be wise to consider supporting operational data stores such as MongoDB, as well.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=621885&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=202838"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=202838" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621885+drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621885+drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621885+drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li><li><a href="http://pro.gigaom.com/2010/10/with-scalable-data-stores-around-is-nosql-a-non-starter/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621885+drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems&utm_content=dharrisstructure">With Scalable Data Stores Around, Is NoSQL a Non-Starter?</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_113600470.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_113600470.jpg?w=150" medium="image">
			<media:title type="html">Shiny database</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/03/spiremongo-230x300.png" medium="image">
			<media:title type="html">SpireMongo-230x300</media:title>
		</media:content>
	</item>
		<item>
		<title>5 reasons why the future of Hadoop is real-time (relatively speaking)</title>
		<link>http://gigaom.com/2013/03/07/5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking/</link>
		<comments>http://gigaom.com/2013/03/07/5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking/#comments</comments>
		<pubDate>Thu, 07 Mar 2013 13:00:37 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[real-time processing]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=616972</guid>
		<description><![CDATA[In Part III of our look at all things Hadoop, we examine the trends driving Hadoop's future. At the end of the day, everything is pushing Hadoop toward being just generally faster and easier to consume.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=616972&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In some ways, Hadoop is a like a fine wine: It gets better with age as rough edges (or flavor profiles) are smoothed out, and those who wait to consume it will probably have a better experience. The only problem with this is that Hadoop exists in a world that’s more about <a href="http://www.urbandictionary.com/define.php?term=md+20%2F20">MD 20/20</a> than it is about <a href="http://www.winespectator.com/display/show?id=47374">Relentless Napa Valley 2008</a>: Companies often want to drink their big data fast, get drunk on insights, and then have some more — maybe something even stronger. And with data — unlike technology and tannins — it turns out older isn’t always better.</p>
<p>That’s a crude analogy, of course, but it gets at the essence of what’s currently plaguing Hadoop adoption and what will propel it forward in the next couple years. The work being done by companies like Cloudera and Hortonworks at the distribution level is great and important, as is MapReduce as a processing framework for certain types of batch workloads. But not every company can afford to be concerned about managing Hadoop on a day-to-day basis. And <a href="http://gigaom.com/2012/07/07/why-the-days-are-numbered-for-hadoop-as-we-know-it/">not every analytic job pairs well with MapReduce</a>.</p>
<p>In Part I of our four-part series on Hadoop, we <a href="http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4-nodes-to-the-future-of-data/">looked at how the technology was born</a> and grew into the juggernaut it is today. In Part II, <a href="http://gigaom.com/2013/03/05/the-hadoop-ecosystem-the-welcome-elephant-in-the-room-infographic/">we laid out the map of the current products and projects</a> that comprise the Hadoop ecosystem. In this installment, we’ll take a closer look at some of them and how they’re positioning themselves to be important players down the road. Finally, <a href="http://gigaom.com/2013/03/08/hadoop-through-the-years-a-gigaom-retrospective/">Part IV will highlight some the best Hadoop applications and seminal moments in Hadoop history</a>, as reported by GigaOM over the years.</p>
<p>If there’s one big Hadoop theme at our <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&amp;utm_content=dharrisstructure">Structure: Data conference</a> March 20-21 in New York, it’s the new realization that people shouldn’t be asking “What’s next after Hadoop?” but rather “What will Hadoop become next?”. Based on what’s transpiring today, the answer to that question is that Hadoop will become faster in all regards and more useful as a result.</p>
<h2 id="interactivity-big-data-style">Interactivity, big-data-style</h2>
<div id="attachment_612788" class="wp-caption alignright" style="width: 310px"><img alt="Source: Shutterstock user hauhu." src="http://gigaom2.files.wordpress.com/2013/02/shutterstock_37622056.jpg?w=300&#038;h=225" width="300" height="225" class="size-medium wp-image-612788"><p class="wp-caption-text">Source: Shutterstock user hauhu.</p></div>
<p>As I explained with some detail a couple weeks ago, <a href="http://gigaom.com/2013/02/21/sql-is-whats-next-for-hadoop-heres-whos-doing-it/">SQL is what’s next for Hadoop</a>, and that’s not because of familiarity alone or the types of queries permitted by SQL <del datetime="2013-03-07T02:30:39+00:00"></del>on relational data<del datetime="2013-03-07T02:30:39+00:00"></del>. It’s also because the types of massively parallel processing engines developed to analyze relational data over the years are very fast. That means analysts can ask questions and get answers at speeds much closer to the speed of their intuitions than is possible when querying entire data sets using standard MapReduce.</p>
<p>But just as SQL and its processing techniques bring something to Hadoop, Hadoop (the Hadoop Distributed File System, specifically) brings something to the table, too. Namely, it brings scale and flexibility that don’t exist in the traditional data warehouse world, where new hardware and licenses can be expensive; so only the “valuable” data makes its way inside and only after it has been fitted to a pre-defined structure. Hadoop, on the other hand, provides virtually unlimited scale and schema-free storage, so companies can store however much information they want in whatever format they want and worry later about what they’ll actually use it for. (Actually, though, most Hadoop jobs do require some sort of structure in order to run, and Hadoop co-creator Mike Cafarella is <a href="http://cloudera.github.com/RecordBreaker/">working on a project called RecordBreaker</a> that aims to automate this process for certain data types.)</p>
<p>How hot is SQL-on-Hadoop space? I profiled the companies and projects working on it on Feb. 21, and since then EMC Greenplum <a href="http://gigaom.com/2013/02/25/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya/">announced a completely rewritten Hadoop distribution</a> that fuses its analytic database to Hadoop, and an entirely new player called <a href="http://jethrodata.com/">JethroData</a> emerged along with $4.5 million in funding. Even if there’s a major shakeout, there will be a few lucky companies left standing to capitalize on a shift to Hadoop as <em>the</em> center of data gravity that EMC Greenplum’s Scott Yara (albeit a biased source) thinks will be the data equivalent of the mainframe’s demise.</p>
<h2 id="this-is-your-database-this-is-">This is your database. This is your database on HDFS</h2>
<p>The SQL versus NoSQL debate appears to be dying down as companies and developers begin to realize there’s definitely a place for both in most environments, but a new debate — with Hadoop at the center — might be about to start up. At its core is <a href="http://datagravity.org/">the concept of data gravity</a> and the large, attractive (in a gravitational sense) entity that is HDFS. Here’s the underlying question that might be posed: If I’m already storing my unstructured data in HDFS and am expected to replace my data warehouse with it, too, why would I also run a handful of other databases that require a separate data store?</p>
<p>This is in part why <a href="http://hbase.apache.org/">HBase</a> has attracted such a strong following despite its relative technical and commercial immaturity compared with comparable NoSQL database <a href="http://cassandra.apache.org/">Cassandra</a>. For applications that would benefit from a relational database, startups such as <a href="http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/">Drawn to Scale</a> and <a href="http://gigaom.com/2012/10/17/batten-down-the-analysts-its-a-big-data-bi-storm/http://gigaom.com/2012/10/17/batten-down-the-analysts-its-a-big-data-bi-storm/">Splice Machine</a> have turned HBase into a transactional SQL system. Wibidata, the <a href="http://gigaom.com/2012/02/07/hadoop-startup-wibidata-raises-5m-to-power-web-analytics/">new startup from Cloudera C0-founder Christophe Bisciglia and Aaron Kimball</a>, is <a href="http://gigaom.com/2012/11/14/wibidata-open-sources-kiji-to-make-hbase-more-useful/">pushing an open source framework called Kiji</a> to make it easier to develop applications that use HBase.</p>
<p>“If you talk to anyone from Cloudera or any of the platform vendors, I think they will tell you that a large percentage of their customers use HBase,” Bisciglia said. “It’s something that I only expect to see increasing.”</p>
<p>MapR seems to think so, too: the Hadoop-distribution vendor is getting ahead of the game by <a href="http://www.mapr.com/products/mapr-editions/m7-edition">selling an enterprise-grade version of HBase called M7</a>. Should hot startups such as <a href="http://gigaom.com/2012/04/13/meet-tempodb-a-database-startup-with-an-eye-for-time/">TempoDB</a> and <a href="http://gigaom.com/2013/01/16/has-ayasdi-turned-machine-learning-into-a-magic-bullet/">Ayasdi</a> decide to take their HBase-reliant cloud services into the data center, they’ll tap into Hadoop clusters, too.</p>
<p>And the National Security Agency built <a href="http://accumulo.apache.org/">Apache Accumulo</a>, a key-value database similar to HBase but designed for fine-grained security and massive scale. It’s now <a href="http://sqrrl.com/">being sold commercially by a startup called Sqrrl</a>. There’s even a graph-processing project called <a href="http://incubator.apache.org/giraph/">Giraph</a> that relies on HBase or Accumulo as the database layer.</p>
<h2 id="whatever-real-time-means-to-yo">Whatever “real-time” means to you</h2>
<p>Real-time is one of those terms that means different things to different people and different applications. The interactivity that SQL-on-Hadoop technologies promise is one definition, as is the type of stream processing <a href="http://gigaom.com/2011/08/04/twitter-to-open-source-hadoop-like-tool/">enabled by technologies like Storm</a>. When it comes to the latter, there’s a lot of excitement around YARN as the innovation will make it happen.</p>
<p><a href="http://hortonworks.com/blog/introducing-apache-hadoop-yarn/">YARN, aka MapReduce 2.0</a>, is a resource scheduler and distributed application framework that allows Hadoop users to run processing paradigms other than MapReduce. This could mean things, from traditional parallel-processing methods such as MPI to graph processing to newly developed stream-processing engines such as Storm and <a href="http://incubator.apache.org/s4/">S4</a>. Considering for how many years <em>Hadoop </em>meant <em>HDFS and MapReduce</em>, this type flexibility is certainly a big deal.</p>
<p><img alt="figure1" src="http://gigaom2.files.wordpress.com/2013/03/figure1.gif?w=300&#038;h=216" width="300" height="216" class="size-medium wp-image-617741 alignleft">Stream processing, of course, is the antithesis of batch processing, for which Hadoop is known, and which is inherently too slow for workloads such as serving real-time ads or monitoring sensor data. And even if Storm and other stream-processing platforms somehow don’t make their way onto Hadoop clusters, <a href="http://gigaom.com/2013/02/14/hstreaming-ready-to-show-the-world-its-real-time-hadoop/">a startup called HStreaming has made it its mission</a> to deliver stream processing to Hadoop, and <a href="http://www.continuuity.com/technology">it’s on other companies’ radars, as well</a>.</p>
<p>For what it’s worth, though, <a href="http://verticloud.com/">VertiCloud</a> Founder and CEO and former Yahoo CTO Raymie Stata thinks we should do away with terms such as <em></em>batch, real-time and interactive altogether. Instead, he prefers the terms synchronous and asynchronous to describe the human experience with the data rather than the speed of processing it. Synchronous computing happens at the speed of human activity, generally speaking, while asynchronous computing is largely decoupled from the idea of someone sitting in front of a computer screen awaiting a result.</p>
<p>The change in terms is associated with a change in how you manage SLAs for applications. Uploading photos to Flickr: synchronous. Running a MapReduce job: most likely asynchronous. Ironically, according to Stata, stream processing data with Storm is often asynchronous, too. That’s because there’s probably not someone on the other end waiting for a page to update or a query to return. And unless you’re doing something where guaranteed real-time latency is <em>necessary</em>, the occasional difference between milliseconds and 1 second probably isn’t critical.</p>
<iframe width="100%" height="166" scrolling="no" frameborder="no" src="http://w.soundcloud.com/player?url=http%3A%2F%2Fapi.soundcloud.com%2Ftracks%2F80972108%253Fsecret_token%253Ds-1QBTa"></iframe>
<h2 id="time-to-insight-starts-at-the-">Time to insight starts at the planning phase</h2>
<p>Even when MapReduce is the answer, though, not everyone is game for a long Hadoop deployment process coupled with a consulting deal to identify uses and build applications or workflows. Sometimes, you just want to buy some software and get going.</p>
<p>Already, companies such as Wibidata and Continuuity are trying to make it easier for companies to build Hadoop applications specific to their own needs, and Wibidata’s Bisciglia said his company is doing less and less customization the more it deals with customers in the same vertical markets. “I think it’s still a couple years out before you can buy a generic application that runs on Hadoop,” he told me, but he does see opportunity for billion-dollar businesses at this level, possibly selling the Hadoop equivalent of an ERP or CRM application.</p>
<div id="attachment_603561" class="wp-caption alignright" style="width: 310px"><img alt="Structure Data 2012: Michael Olson – CEO, Cloudera" src="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=300&#038;h=200" width="300" height="200" class="size-medium wp-image-603561"><p class="wp-caption-text">Cloudera CEO Mike Olson at Structure: Data 2012<br>(c) 2012 Pinar Ozger pinar@pinarozger.com</p></div>
<p>And Cloudera CEO Mike Olson <a href="http://gigaom.com/2012/03/21/cloudera-structure-data-2012/">told the audience at our Structure: Data conference last year</a> that he’ll connect startups trying to build Hadoop-based applications with funding opportunities. In fact, Cloudera backer Accel Partners <a href="http://gigaom.com/2011/11/08/accel-forms-100m-fund-to-feed-big-data-apps/">launched a Big Data Fund in 2011</a> with the sole purpose of funding application-level big data startups.</p>
<p>But maybe Cloudera, like database vendor Oracle before it, will just get into the application space itself: According to Hadoop creator and Cloudera chief architect Doug Cutting:</p>
<blockquote id="quote-i-wouldnt-be-surpris"><p>“I wouldn’t be surprised if you see vendors, like Cloudera, starting to creep up the stack and sell some applications. You’ve seen that before from Red Hat, from Oracle. You could argue that the relational database is a platform for Oracle and they’ve sold a lot of applications on top. So I think that happens as the market matures. When it’s young, we don’t want to stomp on potential collaborators at this point, we want to open that up to other people to really enhance the platform.”</p></blockquote>
<p>Cloud computing is proving to be a big help in getting Hadoop projects off the ground, too. Even low-level services such as Amazon Elastic MapReduce can <a href="http://gigaom.com/2012/02/22/how-infochimps-wants-to-become-heroku-for-hadoop/">ease the burden of managing a physical Hadoop cluster</a>, and there are already a handful of cloud services <a href="http://gigaom.com/2012/04/05/kontagent-turns-data-mining-into-saas-for-mobile-apps/">exposing Hadoop as a SaaS application</a> for business intelligence and analytics. The easier it gets to store, process and analyze data in the cloud, the more appealing Hadoop looks to potential users who can’t be bothered to invest in yet another IT project.</p>
<h2 id="google-and-microsoft-a-guiding">Google (and Microsoft): A guiding light</h2>
<p>Lest we forget, Hadoop is based on a set of Google technologies, and it seems likely its future will also be influenced by what Google is doing. Already, <a href="http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/Federation.html">improvements to HDFS</a> seem to mirror <a href="http://www.theregister.co.uk/2009/08/12/google_file_system_part_deux/">changes to the Google File System a few years bac</a>k, and YARN will enable some new types of non-MapReduce processing similar to what <a href="http://research.google.com/pubs/pub36726.html">Google’s new Percolator framework</a> does. (Google claims Percolator lets it “process the same number of documents per day, while reducing the average age of documents in Google search results by 50%.”) The MapR-led Apache Drill project <a href="http://gigaom.com/2012/08/17/for-fast-interactive-hadoop-queries-drill-may-be-the-answer/">is a Hadoop-based version of Google’s Dremel tool</a>; Giraph was likely inspired by Google’s <a href="http://googleresearch.blogspot.com/2009/06/large-scale-graph-computing-at-google.html">Pregel graph-processing technology</a>.</p>
<p>Cutting is particularly excited about Google Spanner, a database system that <a href="http://gigaom.com/2012/09/17/googles-spanner-a-database-that-knows-what-time-it-is/">spans data geographies while still maintaining transactional consistency</a>. “It’s a matter of time before somebody implements that in the Hadoop ecosystem,” he said. “That’s a huge change.”</p>
<p>It’s possible Microsoft could be an inspiration to the Hadoop community, too, especially if it begins to surface pieces of its Bing search infrastructure as products like a couple of company executives have told me it will. Bing <a href="http://research.microsoft.com/en-us/events/fs2011/helland_cosmos_big_data_and_big_challenges.pdf">runs on a combination of tools called Cosmos, Tiger and Scope</a>, and it’s part of the Online Services division ran by former Yahoo VP and Hadoop backer Qi Lu. Lu said that Microsoft (like Google) is looking beyond just search — Hadoop’s original function — and into building an information fabric that changes how data is indexed, searched for and presented.</p>
<p>However it evolves, though, it’s becoming pretty obvious that Hadoop is no longer just a technology for doing cheap storage and some MapReduce processing. “I think there’s still some doubt in people’s minds about whether Hadoop is a flash in the pan … and I think they’re missing the point,” Cutting said. “I think that’s going to be proven to people in the next year.”</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=616972&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=367173"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=367173" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/07/scaling-hadoop-clusters-the-role-of-cluster-management/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&utm_content=dharrisstructure">Scaling Hadoop clusters: the role of cluster management</a></li><li><a href="http://pro.gigaom.com/2012/04/sector-roadmap-hadoop-platforms-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&utm_content=dharrisstructure">2012: The Hadoop infrastructure market booms</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/07/5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/03/gigaom-hadoop-icon-final.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/03/gigaom-hadoop-icon-final.jpg?w=150" medium="image">
			<media:title type="html">gigaom hadoop icon final</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/02/shutterstock_37622056.jpg?w=300" medium="image">
			<media:title type="html">Source: Shutterstock user hauhu.</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/03/figure1.gif?w=300" medium="image">
			<media:title type="html">figure1</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/1z5o1503.jpg?w=300" medium="image">
			<media:title type="html">Structure Data 2012: Michael Olson – CEO, Cloudera</media:title>
		</media:content>
	</item>
	</channel>
</rss>
