<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; Databases</title>
	<atom:link href="http://gigaom.com/tag/databases/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Mon, 20 May 2013 15:50:08 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; Databases</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>MapR releases M7, its commercial HBase distro</title>
		<link>http://gigaom.com/2013/05/01/mapr-releases-m7-its-commercial-hbase-distro/</link>
		<comments>http://gigaom.com/2013/05/01/mapr-releases-m7-its-commercial-hbase-distro/#comments</comments>
		<pubDate>Wed, 01 May 2013 23:21:07 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=641425</guid>
		<description><![CDATA[MapR on Wednesday released its commercial version of HBase called M7, the first such product on the market, that the company claims is bigger, faster and better than the open source version.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=641425&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>MapR didn&#8217;t miss the memo about the key to success in the Hadoop space being the creation of a data platform that can do many things. And on Wednesday, the company released its take on HBase, <a href="http://www.mapr.com/products/mapr-editions/m7-edition">called M7.</a></p>
<p>Last week, I <a href="http://gigaom.com/2013/04/22/how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream/">explained how HBase is fast becoming the star of the Hadoop ecosystem</a> because it allows users to build more real-time, almost transactional applications on top of Hadoop. True to its form with its other products, MapR has taken HBase even further with M7 by promising greater availability (99.999 percent), instant recovery, faster operations and the ability to handle 1 trillion tables in a single cluster. In open source versions of HBase, MapR VP of Marketing Jack Norris told me, the accepted table limit per cluster is several hundred.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/m7.jpg"><img  alt="m7" src="http://gigaom2.files.wordpress.com/2013/05/m7.jpg?w=300&#038;h=265" width="300" height="265" class="alignright size-medium wp-image-641471" /></a>Additionally, M7 shares a single data layer with the Hadoop file system, meaning less performance overhead and, presumably, easier management.</p>
<p>As we&#8217;re seeing with other Hadoop vendors, including Cloudera (which <a href="http://gigaom.com/2013/04/30/with-impala-now-ga-clouderas-ceo-sizes-up-the-sql-on-hadoop-market/">released its Impala SQL query engine on Tuesday</a>), the Hadoop market is fast becoming one where each vendor is trying to set itself apart from the rest by building the best platform with the broadest set of capabilities. In furtherance of that mission, MapR also announced on Wednesday full-text search on its Hadoop distribution thanks to a partnership with Lucene specialist LucidWorks. It already has its own Hadoop distribution complete with proprietary code to bolster the file system and speed up MapReduce, as well as an <a href="http://gigaom.com/2012/08/17/for-fast-interactive-hadoop-queries-drill-may-be-the-answer/">open source SQL-on-Hadoop project called Drill</a> in the works.</p>
<p>MapR employees are probably sleeping a lot easier these days as a result of this platform push. Others in the Hadoop market used to talk about the fear of fragmentation and then point at MapR as the example of a company helping foment that outcome with its proprietary software. Now, however, even if everyone else is building open source products, they&#8217;re all still backing their own and largely dismissing the others.</p>
<p>I suspect the result is feature lock-in even there&#8217;s no technological lock-in, kind of <a href="http://gigaom.com/2011/03/16/how-amazon-is-following-apples-lead-to-rule-cloud-computing/">like using Amazon Web Services for cloud computing</a> and then hoping to replicate its various servies elsewhere. It might be easy enough to move your data, but impossible or very difficult to replicate those additional capabilities elsewhere. If MapR can build a better version of HBase and companies are willing to pay for it, then so be it.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=641425&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=522371"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=522371" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li><li><a href="http://pro.gigaom.com/2012/12/big-data-2013-key-trends-and-companies-to-watch/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=641425+mapr-releases-m7-its-commercial-hbase-distro&utm_content=dharrisstructure">Big data 2013: key trends and companies to watch</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/01/mapr-releases-m7-its-commercial-hbase-distro/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_110961494.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_110961494.jpg?w=150" medium="image">
			<media:title type="html">Database rows</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/m7.jpg?w=300" medium="image">
			<media:title type="html">m7</media:title>
		</media:content>
	</item>
		<item>
		<title>The growing importance of timing in data centers</title>
		<link>http://gigaom.com/2013/04/28/the-growing-importance-of-timing-in-data-centers/</link>
		<comments>http://gigaom.com/2013/04/28/the-growing-importance-of-timing-in-data-centers/#comments</comments>
		<pubDate>Sun, 28 Apr 2013 18:00:45 +0000</pubDate>
		<dc:creator>Jim Theodoras, ADVA Optical Networking</dc:creator>
				<category><![CDATA[ADVA]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Jim Theodoras]]></category>
		<category><![CDATA[Spanner]]></category>
		<category><![CDATA[storage]]></category>
		<category><![CDATA[timing]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=634743</guid>
		<description><![CDATA[Accurate timing has grown more important in distributed systems, not just for mobile networks, but also for tracking data between data centers. Our love of digital junk is pushing storage to the edge.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=634743&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><em><strong>Editor&#8217;s note</strong>: This is the second of a two-part series on the importance of timing in today&#8217;s distributed infrastructures. The <a href="http://gigaom.com/2013/04/27/timing-is-not-just-for-traders-anymore-networks-need-it-too/">first</a> ran on Saturday.</em></p>
<p>Like a bad episode of <a href="http://www.aetv.com/hoarders/"><em>Hoarders</em></a>, people love to store all things digital, most of which will never be accessed again. And, like a bad episode of <a href="http://www.aetv.com/storage-wars/"><em>Storage Wars</em></a>, our love of storing crap means we need more places to store it. Today’s content has outgrown even the hydro-electric dam powered Mega Data Centers built just yesteryear. Increasingly, operators are turning to distributing their information across multiple geographically dispersed data centers. As the number, size, and distances between the data centers have steadily grown, timing distribution and accuracy has likewise grown in importance in keeping the data centers in sync.</p>
<p>In a <a href="http://gigaom.com/2013/04/27/timing-is-not-just-for-traders-anymore-networks-need-it-too/">previous article</a> I discussed new standards being developed to increase the accuracy of timing for the internet and other IP-based networks. Current systems and protocols offer milliseconds of accuracy. But that just isn’t enough as we depend more on real-time information and compute, storage and communications networks become more distributed. While people often cite the importance of timing on mobile backhaul networks in the next-genration LTE-Advanced networks,there has been less publicity around the need for these new timing technologies in the continued growth of data centers. </p>
<h2 id="the-rise-of-hadoop-in-an-age-o">The rise of Hadoop in an age of digital garbage</h2>
<p><a href="http://gigaom2.files.wordpress.com/2011/12/13250237_1a49b5a7a3_z.png"><img src="http://gigaom2.files.wordpress.com/2011/12/13250237_1a49b5a7a3_z.png?w=708" alt="Dinosaurs"    class="aligncenter size-full wp-image-459351" /></a><br />
Massive storage of data appears to occur in periods, very analogous to <a href="http://dinosaurs.about.com/od/dinosaurbasics/a/dinosaurages.htm">dinosaur evolution</a>. A database architecture will rise to the forefront, based upon its advantages, until it scales to the breaking point and is completely superseded by a new architecture. At first, databases were simply serial listed values with row/column arrangements. Database technology leapt forward and became a self-sufficient business with the advent of relational databases. It appeared for a while <a href="http://computer.howstuffworks.com/question599.htm">relational databases</a> would be the end word in information storage, but then came Web 2.0, social media, and <a href="http://en.wikipedia.org/wiki/Cloud_computing">the cloud</a>. Enter Hadoop.</p>
<p>A centralized database works, as the name suggests, by having all the data located in a single indexed repository with massive computational power to run operations on it. But a centralized database cannot hope to scale to the size needed by today’s cloud apps. Even if it could, the time needed to perform a single lookup would be unbearable to an end user at a browser window. </p>
<p><a href="http://strata.oreilly.com/2011/01/what-is-hadoop.html">Hadoop de-centralizes the storage</a> and lookup, as well as computational power. There is no index, per se. Content is distributed across a wide array of servers, each with their own storage and CPU’s, and the location and relation of each piece of data mapped. When a lookup occurs, the map is read, and all the pieces of information are fetched and pieced together again. The main benefit of Hadoop is scalability. To grow a database (and computational power), you simply keep adding servers and growing your map.</p>
<h2 id="even-hadoop-is-buried-under-mo">Even Hadoop is buried under mounds of digital debris </h2>
<p><a href="http://gigaom2.files.wordpress.com/2013/04/hadoop-timing.jpg"><img src="http://gigaom2.files.wordpress.com/2013/04/hadoop-timing.jpg?w=708&#038;h=364" alt="hadoop timing" width="708" height="364"  class="aligncenter size-full wp-image-634756" /></a><br />
It looked like Hadoop would reign supreme for generations to come, with extensions continuously breathing new life into the protocol. Yet, after only a decade, databases based upon Hadoop such as Facebook are at the breaking point. Global traffic is growing beyond exponential, and most of it is trash. Today’s databases look more like landfills than the great <a href="http://starwars.wikia.com/wiki/Jedi_Archives">Jedi Archives</a>. And recently hyped trends such as <a href="https://www.facebook.com/lifeboxapp">lifelogging</a> suggest the problem will get much worse long before it gets better. </p>
<p>The main limitation of Hadoop is that it works great within the walls of a single massive data center, but is less than stellar once that database outgrows the walls of a single data center and has to be run across geographically separated databases. It turns out the main strength of Hadoop is also its Achilles heel. With no index to search, every piece of data must be sorted through, a difficult proposition once databases stretch across the globe. A piece of retrieved data might be stale by the time it reaches a requester, or mirrored copies of data might conflict with one another.</p>
<p>Enter an idea keep widely dispersed data centers in sync &#8212; <a href="http://gigaom.com/2012/09/17/googles-spanner-a-database-that-knows-what-time-it-is/">Google True Time</a>. To grossly oversimplify the concept, True Time API adds time attributes to data being stored, not just for expiration dating, but also so that all the geographically disparate data centers’ content can be time aligned. For database aficionados, this is sacrilegious, as all leading database protocols are specifically designed to ignore time to prevent conflicts and confusion. Google True Time completely turns the concept of data storage inside out.</p>
<h2 id="introducing-spanner">Introducing Spanner </h2>
<p>In True Time, knowing the accurate “age” of each piece of information, in other words where it falls on the timeline of data, allows data centers that may be 100ms apart to synchronize not just the values stored in memory locations, but the timeline of values in memory locations. In order for this to work, Google maintains an accurate “global wall-clock time” across their entire global Spanner network. </p>
<p>Transactions that write are time stamped and use strict two phase locking (<a href="http://en.wikipedia.org/wiki/Two-phase_locking">S2PL</a>) to manage access. The commit order is always the timestamp order. Both commit and timestamp orders respect global wall-clock time. This simple set of rules maintains coordination between databases all over the world. </p>
<p>However, there is an element of uncertainty introduced into each data field, the very reason that time has been shunned in database protocols since the dawn of the data itself. </p>
<p><a href="http://webworkerdaily.files.wordpress.com/2010/02/clocktower.jpg"><img src="http://webworkerdaily.files.wordpress.com/2010/02/clocktower.jpg?w=708" alt="clocktower"    class="aligncenter size-full wp-image-239761" /></a></p>
<p>Google calls this “network-induced uncertainty”, denoted with an epsilon, and actively monitors and tracks this metric. As of summer 2012, this value was running 10ms for 99.9 percent (3 nines) certainty. Google’s long term goal is to reduce this below 1ms. Accomplishing this will require a state of the art timing distribution network, leveraging the same technologies being developed and deployed for 4G LTE backhaul networks.</p>
<h2 id="a-modest-proposal">A modest proposal </h2>
<p>While True Time was most likely developed to improve geographic load balancing, now that accurate time stamping of data exists, the possibilities are profound. The problems associated with large databases go beyond simply managing the data. The growth rate itself is unsustainable. Data storage providers must do more than grow their storage, they must also come up with ways to improve efficiencies and ebb the tsunami of waste that is common in the age of relatively free storage.</p>
<p>It&#8217;s a dangerous notion, one simply must challenge the basic tenet that all data is forever. Our minds don’t work that way, why should computers? We only hold on to key memories, and the further the time from an event, the fewer the details are held. Perhaps data storage could work similarly. Rather than delete a picture that hasn’t been accessed in a while, a search is performed for similar photos and then only one kept. And as time passes, perhaps rather than simple deletion, a photo is continuously compressed, with less information kept, until the photo memory fades into oblivion. Like that old <a href="http://en.wikipedia.org/wiki/Instant_camera">Polaroid</a> hung on the refrigerator door.</p>
<p><em>Jim Theodoras is director of technical marketing at ADVA Optical Networking, working on Optical+Ethernet transport products. </em></p>
<p><em> Dinosaur image courtesy of <a href="http://www.flickr.com/photos/denn/13250237/">Flickr user Denise Chen</a>. </em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=634743&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=566329"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=566329" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=634743+the-growing-importance-of-timing-in-data-centers&utm_content=gigaguest">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=634743+the-growing-importance-of-timing-in-data-centers&utm_content=gigaguest">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/11/dissecting-the-data-5-issues-for-our-digital-future/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=634743+the-growing-importance-of-timing-in-data-centers&utm_content=gigaguest">Dissecting the data: 5 issues for our digital future</a></li><li><a href="http://pro.gigaom.com/2010/12/9-companies-that-pushed-the-infrastructure-discussion-in-2010/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=634743+the-growing-importance-of-timing-in-data-centers&utm_content=gigaguest">9 Companies that Pushed the Infrastructure Discussion in 2010</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/28/the-growing-importance-of-timing-in-data-centers/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2010/11/clock-prime-time.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2010/11/clock-prime-time.jpg?w=150" medium="image">
			<media:title type="html">clock prime time</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/4411542bbd7a2a9a2fc2a1b38809e45c?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigaguest</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/12/13250237_1a49b5a7a3_z.png" medium="image">
			<media:title type="html">Dinosaurs</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/hadoop-timing.jpg" medium="image">
			<media:title type="html">hadoop timing</media:title>
		</media:content>

		<media:content url="http://webworkerdaily.files.wordpress.com/2010/02/clocktower.jpg" medium="image">
			<media:title type="html">clocktower</media:title>
		</media:content>
	</item>
		<item>
		<title>How HBase converted MySpace&#8217;s MySQL champion and is driving Hadoop mainstream</title>
		<link>http://gigaom.com/2013/04/22/how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream/</link>
		<comments>http://gigaom.com/2013/04/22/how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream/#comments</comments>
		<pubDate>Mon, 22 Apr 2013 18:14:03 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Gravity]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[Myspace]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=632738</guid>
		<description><![CDATA[ Gravity CTO Jim Benedetto knows his way around MySQL after managing a 600-instance cluster at MySpace, but he has found HBase religion as his real-time content-recommendation platform grew. And he's not alone.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=632738&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>How&#8217;s this for an understatement: Operational databases are important for many, if not the majority, of web applications. And if you&#8217;re doing big business on the web, finding one that can scale with your data volumes and still perform like you need it to is critical. MapReduce for batch data processing and analysis? Not so much, actually.</p>
<p>That&#8217;s why as Hadoop keeps <a href="http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4-nodes-to-the-future-of-data/">thundering toward its destination as the de facto data platform</a> for next-generation applications, companies such as Cloudera and Hortonworks that are making a killing off it might want to stop and thank <a href="http://www.searchenginecaffe.com/2007/05/hbase-powersets-bigtable.html">the guys from Powerset for building HBase</a>. Because the database &#8212; <a href="http://hbase.apache.org/">a columnar Google BigTable clone that runs on top of the Hadoop Distributed File System</a> &#8212; is so fast and scalable, it&#8217;s helping Hadoop find a home in companies and with applications that HDFS and MapReduce alone might not have been able to penetrate so easily.</p>
<p>The latest HBase user I&#8217;ve come across is <a href="http://www.gravity.com/">Gravity</a>, the <a href="http://gigaom.com/2012/03/15/the-personalized-web-is-just-an-interest-graph-away/">interest graph</a> company that powers content recommendations for some of the biggest publishers on the web.</p>
<h2 id="from-big-mysql-at-myspace-to-b">From big MySQL at MySpace to big data with HBase</h2>
<p>Its co-founders were all senior executives at MySpace, including Gravity CTO Jim Benedetto, who was SVP of technology for the social networking pioneer. He was actually MySpace&#8217;s first architect and helped build platform&#8217;s MySQL database. Although MySpace never reached <a href="http://gigaom.com/2011/12/06/facebook-shares-some-secrets-on-making-mysql-scale/">Facebook&#8217;s scale</a>, it did have 150 millions users at its peak, all able to store unlimited numbers of wall posts, messages and photos. Benedetto eventually oversaw a 600-instance cluster that required about 30 database adminstrators to keep it up and running.</p>
<div id="attachment_603574" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/01/1z5o2256.jpg"><img  alt="Structure Data 2012: Jim Benedetto – CTO, Gravity Ashlie Beringer – Partner, Gibson, Dunn &amp; Crutcher" src="http://gigaom2.files.wordpress.com/2013/01/1z5o2256.jpg?w=300&#038;h=200" width="300" height="200" class="size-medium wp-image-603574" /></a><p class="wp-caption-text">Benedetto (center) at Structure: Data 2012. (c) Pinar Ozger</p></div>
<p>So naturally, when it came time to build out the Gravity architecture, Benedetto opted for the MySQL he knew so well. Until about three years ago, he told me recently, that database held about 95 percent of the company&#8217;s data. At some point, though, Benedetto and his team realized they were spending way too much time keeping their MySQL environment up insteading of building new things, so it was time for a change.</p>
<p>It ultimately opted for HBase, but the decision wasn&#8217;t easy. &#8220;For us,&#8221; Benedetto said, &#8220;our data and algorithms are our company,&#8221; so making the move from a relational database to a column-based database that can serve MapReduce jobs was nerve-racking. After all, he explained, &#8220;You never want to migrate your data &#8230; and if you have to, you never want to migrate it more than once.&#8221; In fact, he added, &#8220;you&#8217;re not going back.&#8221;</p>
<p>But Benedetto says the move to HBase as Gravity&#8217;s primary data store has been &#8220;life-saving,&#8221; and it&#8217;s arguably a more important component of the company&#8217;s infrastructure than is Hadoop MapReduce. HBase handles the company&#8217;s real-time recommendation algorithms, and it does it across the entire Gravity platform rather than on a site-by-site basis. And although it&#8217;s not banking-grade when it comes to the consistency of transactions, Benedetto says it&#8217;s about 99.95 percent consistent in real time. Later on, batch MapReduce jobs swoop in and pick up whatever HBase dropped earlier, and process it all against the company&#8217;s graph algorithms.</p>
<div id="attachment_633095" class="wp-caption aligncenter" style="width: 718px"><a href="http://gigaom2.files.wordpress.com/2013/04/canvas-copy.jpg"><img  alt="interest graph" src="http://gigaom2.files.wordpress.com/2013/04/canvas-copy.jpg?w=708&#038;h=708" width="708" height="708" class="size-large wp-image-633095" /></a><p class="wp-caption-text">An example of an interest graph from Gravity,</p></div>
<h2 id="scalable-for-sure-and-getting-">Scalable for sure, and getting easier to use</h2>
<p>And although it took some serious engineering effort to get HBase operational when Gravity began working with it three years ago, Benedetto thinks HBase is getting to the point (as is rival NoSQL database Cassandra, he acknowledged) where one could safely call it &#8220;enterprise-ready.&#8221; Right now, he noted, &#8220;You&#8217;re not gonna to see HBase in a company that just buys Oracle because Oracle is the name and Oracle has been around for 20 years,&#8221; but for web startups that hope to reach a certain scale and even for existing companies that are running into the MySQL wall, he sees a shift occurring.</p>
<p>&#8220;The web farm is the easiest part of your infrastructure to scale because all it does is cost more money,&#8221; Benedetto explained. Databases, on the other hand, require a lot of thinking about how to migrate data, shard the database and otherwise make a piece of software likely designed for a handful of servers, max, spread across dozens or hundreds. HBase really eases the scaling process, as well as the subsequent management, he said. Now, Gravity&#8217;s 100-node HBase cluster has only two operations engineers dedicated to it.</p>
<p>Indeed, there are startups trying to capitalize on HBase by <a href="http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/">using it to power SQL and even MongoDB-compliant databases</a> that can scale beyond what most relational databases can do.</p>
<p>Aside from scale HBase might soon start catching on because of the work companies like Gravity have been doing to make it more user-friendly. It might scale easily, but, as Benedetto noted, it&#8217;s not always easy to get started with &#8212; especially without some deep understanding of the intricacies of the underlying HDFS infrastructure. Last year, eBay VP of Experience, Search and Platforms Hugh Williams <a href="http://gigaom.com/2012/01/31/under-the-covers-of-ebays-big-data-operation/">told me that although HBase is one of the big data tools the company is most excited about</a>, it&#8217;s also the area where he&#8217;d like to see the most improvement.</p>
<p>To help alleviate some of the learning curve, Gravity has <a href="http://www.gravity.com/labs/hpaste/">developed an open-source tool called HPaste</a> that lets developers access data and run jobs on HBase data using Scala rather than the more-bloated Java programming language on which Hadoop and HBase are built. One of the biggest benefits of HPaste, Benedetto said, is that it lets new HBase developers see the data in a way that makes sense to them: HBase stores everything in byte arrays, he explained, and &#8220;when a human tries to read a byte array, it looks like ancient hieroglyphics.&#8221;</p>
<div id="attachment_633093" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/04/kiji-org-architecture1.png"><img  alt="Kiji architecture" src="http://gigaom2.files.wordpress.com/2013/04/kiji-org-architecture1.png?w=300&#038;h=275" width="300" height="275" class="size-medium wp-image-633093" /></a><p class="wp-caption-text">The Kiji architecture</p></div>
<p>Elsewhere, a startup called WibiData has <a href="http://gigaom.com/2012/11/14/wibidata-open-sources-kiji-to-make-hbase-more-useful/">created an open-source framework called Kiji</a> that aims to provide a collection of high-level APIs that should make it easier to store different data types in and develop applications on HBase. The company envisions Kiji being to HBase what the Spring Framework has become to Java over the course of the past decade.</p>
<h2 id="hadoops-weapon-for-the-mainstr">Hadoop&#8217;s weapon for the mainstream?</h2>
<p>But user experience aside, a lot of companies already invested in Hadoop &#8212; aside from <a href="http://gigaom.com/2011/03/04/how-facebook-is-powering-real-time-analytics/">expert users such as Facebook</a> &#8212; are starting to see the promise of HBase and are incorporating it into their architectures.</p>
<p>WibiData co-founder Christophe Bisciglia, who also co-founded Hadoop pioneer Cloudera in 2008, gave me his take on the state of HBase while <a href="http://gigaom.com/2013/03/12/hadoops-past-present-and-future-a-gigaom-special-report/">discussing its role in the future of Hadoop</a> earlier this year. &#8221;If you talk to anyone from Cloudera or any of the platform vendors, I think they will tell you that a large percentage of their customers use HBase. It’s something that I only expect to see increasing,&#8221;  he explained. &#8220;&#8230; HBase is gonna be what takes Hadoop from an ETL and BI platform into a real-time application platform.&#8221;</p>
<div id="attachment_633120" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/04/cloudera_enterprise_diagram.png"><img  alt="The Cloudera Hadoop stack (Gravityu uses Cloudera's distro)." src="http://gigaom2.files.wordpress.com/2013/04/cloudera_enterprise_diagram.png?w=300&#038;h=165" width="300" height="165" class="size-medium wp-image-633120" /></a><p class="wp-caption-text">The Cloudera Hadoop stack (Gravity uses Cloudera&#8217;s distro).</p></div>
<p>Benedetto appears to agree. He considers Hadoop as a whole incredibly important, almost on par with what Amazon Web Services did for computing resources, because it lets startups use commercial-grade open source software to do data storage and processing that previously was only available to massive web companies. &#8220;More and more &#8230; the shining star in that suite is HBase,&#8221; he said. &#8220;If I were Oracle, I&#8217;d be scared.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=632738&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=490768"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=490768" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=632738+how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=632738+how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/04/infrastructure-q1-iaas-comes-down-to-earth-big-data-takes-flight/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=632738+how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream&utm_content=dharrisstructure">Infrastructure Q1: IaaS Comes Down to Earth; Big Data Takes Flight</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=632738+how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/22/how-hbase-converted-myspaces-mysql-champion-and-is-driving-hadoop-mainstream/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_113600470.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_113600470.jpg?w=150" medium="image">
			<media:title type="html">Shiny database</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/1z5o2256.jpg?w=300" medium="image">
			<media:title type="html">Structure Data 2012: Jim Benedetto – CTO, Gravity Ashlie Beringer – Partner, Gibson, Dunn &#38; Crutcher</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/canvas-copy.jpg?w=708" medium="image">
			<media:title type="html">interest graph</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/kiji-org-architecture1.png?w=300" medium="image">
			<media:title type="html">Kiji architecture</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/cloudera_enterprise_diagram.png?w=300" medium="image">
			<media:title type="html">The Cloudera Hadoop stack (Gravityu uses Cloudera&#039;s distro).</media:title>
		</media:content>
	</item>
		<item>
		<title>No SQL or DynamoDB: Airbnb goes with Memcached for Neighborhoods feature</title>
		<link>http://gigaom.com/2013/04/12/no-sql-or-dynamodb-airbnb-goes-with-memcached-for-neighborhoods-feature/</link>
		<comments>http://gigaom.com/2013/04/12/no-sql-or-dynamodb-airbnb-goes-with-memcached-for-neighborhoods-feature/#comments</comments>
		<pubDate>Fri, 12 Apr 2013 19:44:32 +0000</pubDate>
		<dc:creator>Jordan Novet</dc:creator>
				<category><![CDATA[Airbnb]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[memcached]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=630672</guid>
		<description><![CDATA[To serve up data quickly inside its Neighborhoods feature, Airbnb engineers cycled through a few database choices before choosing Memcached.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=630672&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>One of Airbnb&#8217;s neat features, <a href="https://www.airbnb.com/neighborhoods">Neighborhoods</a>, shows people elegant pages on neighborhoods within big cities that can help them choose exactly where to stay. Actual homes where visitors can stay the night are directly tied to the neighborhood pages. The idea sounds obvious, but it took some engineering tinkering to figure out how to make it all work accurately and quickly.</p>
<p>On the Airbnb Nerd Blog on Thursday, engineers Andy Kramolisch and Ben Hughes, who worked on Neighborhoods and previously founded <a href="http://web.archive.org/web/20120503201059/http://nabewise.com/">NabeWise</a>, a neighborhood guide for American cities, <a href="http://nerds.airbnb.com/behind-the-scenes-airbnb-neighborhoods">explained</a> the back-end process of aligning locations with neighborhoods.</p>
<p>Behind the scenes, Kramolisch said, a cartographer carves out the borders of neighborhoods. Then it&#8217;s time to match up hosts&#8217; homes with the neighborhoods listed on Airbnb. On the site&#8217;s back end, Kramolisch said, the latitude and longitude of available homes are regularly associated with the various neighborhoods in a given city, if those neighborhoods are represented on Airbnb, through an internal system called Glop, short for Genome Location Pipeline. &#8220;For example, say you list your place, which is located at (12.333568650219718, 45.43647998034738). The next time Glop runs, it will correctly identify your listing as being in San Marco,&#8221; he said.</p>
<p>It&#8217;s not as if Neighborhoods works with &#8220;insane amounts of data,&#8221; Kramolisch said. Still, up-to-date data on places to stay in neighborhoods needs to be served up quickly, so users aren&#8217;t kept waiting in front of their screens. Data changes fast, and an SQL database wouldn&#8217;t work because of &#8220;mass updates,&#8221; Kramolisch said. So an internal NoSQL database in cooperation with Amazon Web Services&#8217; managed DynamoDB NoSQL database service was considered. But DynamoDB couldn&#8217;t handle Airbnb&#8217;s storage needs. So the engineers turned to the Memcached key-value store for quickly serving up data by keeping it in memory.</p>
<p>In going with Memcached, Airbnb is making the same choice as <a href="http://gigaom.com/2013/03/05/facebook-kisses-dram-goodbye-builds-memcached-for-flash/">Facebook</a>, <a href="http://gigaom.com/2012/08/31/etsy-unveils-its-infrastructure-and-its-supermicro-love/">Etsy</a> and other companies that operate at webscale. Location is the top criterion for Airbnb travelers, Kramolisch says, and the fast service Memcached enables &#8212; 35 milliseconds on average, to be precise &#8212; is the kind of solution that could help Airbnb focus on giving customers more of what they want from the site, when they want it.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=630672&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=348638"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=348638" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=630672+no-sql-or-dynamodb-airbnb-goes-with-memcached-for-neighborhoods-feature&utm_content=gigajordan">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/01/12-tech-leaders-resolutions-for-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=630672+no-sql-or-dynamodb-airbnb-goes-with-memcached-for-neighborhoods-feature&utm_content=gigajordan">12 tech leaders’ resolutions for 2012</a></li><li><a href="http://pro.gigaom.com/2011/11/themes-for-a-connected-world-gigaom-roadmap-review/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=630672+no-sql-or-dynamodb-airbnb-goes-with-memcached-for-neighborhoods-feature&utm_content=gigajordan">Themes for a connected world: GigaOM RoadMap review</a></li><li><a href="http://pro.gigaom.com/2011/08/flash-analysis-collaborative-consumption-a-first-look-at-the-new-web-sharing-economy/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=630672+no-sql-or-dynamodb-airbnb-goes-with-memcached-for-neighborhoods-feature&utm_content=gigajordan">Flash analysis: Collaborative consumption &#8211; a first look at the new web-sharing economy</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/12/no-sql-or-dynamodb-airbnb-goes-with-memcached-for-neighborhoods-feature/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/04/screen-shot-2013-04-12-at-10-48-11-am.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/04/screen-shot-2013-04-12-at-10-48-11-am.png?w=150" medium="image">
			<media:title type="html">Screen shot 2013-04-12 at 10.48.11 AM</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/c00ab753df107b639e76ed4c3ab07ba7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigajordan</media:title>
		</media:content>
	</item>
		<item>
		<title>MarkLogic nets $25M to keep up enterprise NoSQL pitch</title>
		<link>http://gigaom.com/2013/04/10/marklogic-nets-25m-to-keep-up-enterprise-nosql-pitch/</link>
		<comments>http://gigaom.com/2013/04/10/marklogic-nets-25m-to-keep-up-enterprise-nosql-pitch/#comments</comments>
		<pubDate>Wed, 10 Apr 2013 11:30:53 +0000</pubDate>
		<dc:creator>Jordan Novet</dc:creator>
				<category><![CDATA[Databases]]></category>
		<category><![CDATA[marklogic]]></category>
		<category><![CDATA[NoSQL]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=629466</guid>
		<description><![CDATA[MarkLogic has raised $25 million in new venture funding to add more customers for its NoSQL database. It wants to go after companies that have looked to longtime software vendors for relational solutions.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=629466&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>When MarkLogic Founder Christopher Lindblad started working on a database for unstructured data in 2001, his efforts were prescient. Since then, the database market has since seen a proliferation of non-relational, or NoSQL, startups to handle the wide variety of data types that new data sources such as web applications and digital documents generate. The space has grown so big, in fact, that it has <a href="http://gigaom.com/2013/03/21/no-not-every-database-was-created-equal-heres-how-theyre-stand-out/2/">already started to consolidate</a>. Amid all this, MarkLogic has managed to stand out by generating more revenue than pretty much any other vendor, according to <a href="http://wikibon.org/w/images/2/21/Forecast-BigDataDatabasebyVendor.png">figures</a> Wikibon released in February.</p>
<p>On Wednesday, MarkLogic&#8217;s success was validated again, as the company announced a $25 million round of venture funding, bringing the total it has raised to $71.2 million. Sequoia Capital and Tenaya Capital led the round; CEO Gary Bloom and other MarkLogic executives also contributed.</p>
<p>MarkLogic like to tout the fact that it&#8217;s geared for enterprise use. Features such as high availability, replication, clustering and ACID compliance help differentiate the company from other NoSQL databases, Bloom told me. And although the company is taking in revenue and looks robust enough to <a href="http://gigaom.com/2011/04/05/with-a-new-ceo-marklogic-eyes-big-data-ipo/">go public</a> now, Bloom said he would rather boost revenues to the point that MarkLogic could sustain success after an IPO.</p>
<p>Rather than go after the revenues that open-source NoSQL databases generate, Bloom said he wants to take away database marketshare from legacy companies peddling SQL databases, including IBM, SAP and Bloom&#8217;s previous employer, Oracle. That means MarkLogic salespeople will have to convince slower-to-change enterprises on the reality that relational databases might not be the best choice if they want to take advantage of unstructured data. MarkLogic also will have to put up with fellow NoSQL players that are adding enterprise functions, such as <a href="http://gigaom.com/2013/03/19/10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb/">MongoDB</a>,</p>
<p>But if MarkLogic&#8217;s plan turns out to be fruitful, a public offering could come within a year or two, Bloom said.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=629466&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=157373"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=157373" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629466+marklogic-nets-25m-to-keep-up-enterprise-nosql-pitch&utm_content=gigajordan">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2013/01/cloud-and-data-fourth-quarter-2012-analysis/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629466+marklogic-nets-25m-to-keep-up-enterprise-nosql-pitch&utm_content=gigajordan">The fourth quarter of 2012 in cloud</a></li><li><a href="http://pro.gigaom.com/2011/03/putting-big-data-to-work-opportunities-for-enterprises/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629466+marklogic-nets-25m-to-keep-up-enterprise-nosql-pitch&utm_content=gigajordan">Putting Big Data to Work: Opportunities for Enterprises</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629466+marklogic-nets-25m-to-keep-up-enterprise-nosql-pitch&utm_content=gigajordan">A near-term outlook for big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/10/marklogic-nets-25m-to-keep-up-enterprise-nosql-pitch/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/04/marklogic-ceo.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/04/marklogic-ceo.jpg?w=150" medium="image">
			<media:title type="html">MarkLogic CEO</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/c00ab753df107b639e76ed4c3ab07ba7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigajordan</media:title>
		</media:content>
	</item>
		<item>
		<title>Promising to remake cloud databases for web scale, ParElastic gets $5.7M</title>
		<link>http://gigaom.com/2013/04/09/promising-to-remake-cloud-databases-for-web-scale-parelastic-gets-5-7m/</link>
		<comments>http://gigaom.com/2013/04/09/promising-to-remake-cloud-databases-for-web-scale-parelastic-gets-5-7m/#comments</comments>
		<pubDate>Tue, 09 Apr 2013 15:38:41 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[iaas]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NewSQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Parelastic]]></category>
		<category><![CDATA[scaliability]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=629103</guid>
		<description><![CDATA[A startup called ParElastic thinks it can change the cloud database game by helping companies scale their MySQL environments without resorting to sharding or deploying an entirely new database.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=629103&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Cloud computing and scalability are often mentioned in the same sentence, but often not when talking about databases. Especially not MySQL databases. A Boston-based startup called <a href="http://www.parelastic.com/">ParElastic</a> hopes to change that, and has raised a $5.7 million Series A led by General Catalyst Partners (former VMware CTO <a href="http://gigaom.com/2013/01/16/vmware-cto-herrod-leaves-to-join-vc-firm/">Steve Herrod&#8217;s new home</a>) to help fund its cause.</p>
<p><a href="http://gigaom.com/2012/03/02/five-boston-database-startups-to-watch/">ParElastic </a>sits in between the application and the underlying database and lets developers scale without having to resort to complicated sharding or maybe even moving the database back in-house where they can run it on a bigger server. Architecturally, Founder and CEO Ken Rugg told me, ParElastic&#8217;s Database Virtualization Engine is similar to a parallel database system, although it functions more like middleware that manages multiple database instances as one and is designed for operational rather than analytic workloads.</p>
<p>Because it intelligently balances database load and distributed data across servers, ParElastic is ideal for multitenant situations where multiple users, applications or services are accessing the database simultaneously, Rugg added.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/04/parelastic-architecture-chart.jpg"><img  alt="parelastic-architecture-chart" src="http://gigaom2.files.wordpress.com/2013/04/parelastic-architecture-chart.jpg?w=708"   class="aligncenter size-full wp-image-629151" /></a></p>
<p>Now, anyone familiar with the next-generation database market might think they&#8217;ve heard this story before, and they kind of have. The NoSQL database movement rode into town on the promise of high scalability, and <a href="http://gigaom.com/2013/01/15/upstart-nuodb-paints-picture-of-database-nirvana-for-the-cloud-era/">the NewSQL movement furthered that story</a> by bringing scale-out performance to SQL. Some of these databases <a href="http://gigaom.com/2012/07/20/cloud-databases-101-who-builds-em-and-what-they-do/">are even available as cloud services</a>.</p>
<p>However, Rugg explained, there&#8217;s a big difference between these options and what ParElastic does. Namely, while NoSQL and NewSQL options require deploying an entirely new database and likely rewriting some application code, ParElastic&#8217;s software just overlays customers&#8217; existing cloud databases. Rugg said about half of its early users are running standard MySQL versions on Amazon Web Services, while the rest are spread across cloud providers such as Rackspace, Joyent and LiquidWeb.</p>
<p>Some ParElastic users actually manage existing SQL services such as Amazon&#8217;s Relational Database Service and Google Cloud SQL. One even uses it to manage an in-house database environment. And technically, Rugg noted, ParElastic could manage cross-cloud database deployments but, because of the inherent latency hit that would entail, &#8220;we wouldn&#8217;t recommend that.&#8221;</p>
<p>However, he said, the biggest beneficiary of ParElastic aside from the company itself might well be AWS. It is by far the most widely used cloud in the world, but when users reach the limites of their single database instances, Amazon usually tells them to look into sharding or perhaps transitioning to DynamoDB. &#8220;None of those are really too friendly for Amazon keeping their customers moving forward in their cloud,&#8221; Rugg said.</p>
<p>Further, although certain cloud providers offer better CPU, IO or network performance than AWS does (Rugg cited Rackspace as being particularly strong on IO performance, for example, and <a href="http://gigaom.com/2012/09/10/profitbricks-says-it-can-out-amazon-amazons-cloud/">ProfitBricks</a> as looking promising on the network front), &#8220;Amazon is sort of the lowest common denominator in a number of ways,&#8221; Rugg explained. The economics and performance requirements vary from application to application, of course, but ParElastic could help stitch together a number of commodity AWS instances to provide suitable performance at a lower cost than might be possible using the biggest, fastest instances from other providers.</p>
<p>Having watched the cloud market unfold as it has, though, Rugg and ParElastic aren&#8217;t banking on AWS &#8212; which <a href="http://gigaom.com/2013/02/17/what-to-do-when-amazon-decides-to-jump-into-your-business/">has a reputation for launching services</a> that compete with startup ecosystem partners &#8212; as the future of the business. By supporting other cloud providers that are gaining acceptance (aside from the ones Rugg noted, Google has been impressing some <a href="http://gigaom.com/2013/03/15/by-the-numbers-how-google-compute-engine-stacks-up-to-amazon-ec2/">with the performance of its Compute Engine service</a>), ParElastic is in a pretty good position to handle whatever cloud-database market shifts might occur.</p>
<p>&#8220;Even if Amazon comes out and says &#8216;We&#8217;re going to replace you with something we built back in the lab,&#8217; that puts us in a great position in terms of validating the market,&#8221; Rugg said.</p>
<p>ParElastic&#8217;s existing investors &#8212; Point Judith Capital,  CommonAngels and LaunchCapital &#8212; also participated in the Series A round, which brings the company&#8217;s total venture capital to $8.7 million.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=629103&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=32796"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=32796" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=629103+promising-to-remake-cloud-databases-for-web-scale-parelastic-gets-5-7m&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2013/01/cloud-and-data-fourth-quarter-2012-analysis/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=629103+promising-to-remake-cloud-databases-for-web-scale-parelastic-gets-5-7m&utm_content=dharrisstructure">The fourth quarter of 2012 in cloud</a></li><li><a href="http://pro.gigaom.com/2012/12/big-data-2013-key-trends-and-companies-to-watch/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=629103+promising-to-remake-cloud-databases-for-web-scale-parelastic-gets-5-7m&utm_content=dharrisstructure">Big data 2013: key trends and companies to watch</a></li><li><a href="http://pro.gigaom.com/2012/08/understanding-and-managing-the-cost-of-the-cloud/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=629103+promising-to-remake-cloud-databases-for-web-scale-parelastic-gets-5-7m&utm_content=dharrisstructure">Understanding and managing the cost of the cloud</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/09/promising-to-remake-cloud-databases-for-web-scale-parelastic-gets-5-7m/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_113600470.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_113600470.jpg?w=150" medium="image">
			<media:title type="html">Shiny database</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/parelastic-architecture-chart.jpg" medium="image">
			<media:title type="html">parelastic-architecture-chart</media:title>
		</media:content>
	</item>
		<item>
		<title>MongoDB FTW: Fast-growing 10gen hires first CFO</title>
		<link>http://gigaom.com/2013/04/09/mongodb-ftw-fast-growing-10gen-hires-first-cfo/</link>
		<comments>http://gigaom.com/2013/04/09/mongodb-ftw-fast-growing-10gen-hires-first-cfo/#comments</comments>
		<pubDate>Tue, 09 Apr 2013 14:00:43 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[10Gen]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[MongoDB]]></category>
		<category><![CDATA[NoSQL]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=629056</guid>
		<description><![CDATA[MongoDB creator 10gen is growing again, this time with the addition of Sydney Carey as the company's first CFO. She'll help lay the infrastructure for plans to more than double in size in two years as it eyes an eventual IPO.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=629056&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.10gen.com/">10gen</a>, the creator and proprietor of the extremely popular MongoDB NoSQL database, is growing up fast and on Tuesday announced it has hired Sydney Carey as the company&#8217;s first-ever chief financial officer. Carey comes from enterprise software company Tibco where she was executive vice president and CFO.</p>
<p>According to 10gen CEO Max Schireson, Carey&#8217;s presence will be important for the company, which has more than 200 employees and should top 500 in the next couple years. While she helps grow and build the corporate infrastructure in the finance, legal and HR departments,  Schireson can spend more time working directly with customers, partners and products.</p>
<p>Carey told me she&#8217;s excited about getting back into a high-growth company, especially one like 10gen that has a disruptive technology and open-source business model. Granted, open-source business models do bring their own unique set of challenges on top of those associated with startup businesses, she noted, but they also open up doors for new business.</p>
<p>&#8220;I think 10gen has all the elements there to make that a really good fit for me,&#8221; Carey said.</p>
<p>Schireson said Carey&#8217;s hire isn&#8217;t indicative of a forthcoming 10gen IPO, but that is something on which the company is focused. However, he added, some great companies have went public rather late in their lives, so 10gen isn&#8217;t rushing that decision and is instead keeping its attention of product development. The company <a href="http://gigaom.com/2012/11/14/10gen-gets-more-dough-for-mongodb-this-time-from-intel-and-red-hat/">has raised $81 million in venture capital</a> since launching in 2007.</p>
<p>The addition of Carey is just the latest in a series of executive hires at 10gen that includes <a href="http://gigaom.com/2012/10/22/10gen-staffs-up-for-bigger-mongodb-push/">a handful of veteran big data and database industry vice presidents</a>, as well as Senior Vice President of Worldwide Sales Phillip Carty.</p>
<p>Despite some <a href="http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/">criticism of its ability to scale beyond a handful of nodes</a>, MongoDB is <a href="http://gigaom.com/2012/02/28/theres-a-lotta-mongodb-out-there-hadoop-too-infographic/">easily the most-used NoSQL database around</a>, largely because it&#8217;s so easy for even novice NoSQL developers to work with and performs well with smaller data volumes. And the MongoDB ecosystem that 10gen helped catalyze is big, growing and potentially very lucrative. There are a handful of popular cloud services around <a href="http://gigaom.com/2012/10/31/mongolab-explains-why-everyone-loves-mongodb-and-raises-5m/">such as MongoLab</a> and MongoHQ, and Rackspace just <a href="http://gigaom.com/2013/02/26/rackspace-buys-its-way-into-mongodb-market-with-objectrocket/">bought its way into the mix</a> by acquiring ObjectRocket.</p>
<p>Schireson said 10gen might consider some acquisitions that help advance the company&#8217;s MongoDB mission, but isn&#8217;t really looking at buying its way into the position of a one-stop NoSQL shop right now.</p>
<p>&#8220;Right now we&#8217;re 100 percent focused on MongoDB,&#8221; he said. &#8220;Everything we do is somehow related to that.&#8221;</p>
<p><em>This story was updated at 9:22 a.m. to correct the amount of venture capital 10gen has raised to $81 million from $86 million. The company changed the official amount between the time this post was written and the time of its publication.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=629056&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=287874"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=287874" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629056+mongodb-ftw-fast-growing-10gen-hires-first-cfo&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2013/01/cloud-and-data-fourth-quarter-2012-analysis/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629056+mongodb-ftw-fast-growing-10gen-hires-first-cfo&utm_content=dharrisstructure">The fourth quarter of 2012 in cloud</a></li><li><a href="http://pro.gigaom.com/2012/11/breaking-down-barriers-and-reducing-cycle-times-with-devops-and-continuous-delivery/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629056+mongodb-ftw-fast-growing-10gen-hires-first-cfo&utm_content=dharrisstructure">How devops can reduce cycle times</a></li><li><a href="http://pro.gigaom.com/2012/04/aws-storage-gateway-jolts-cloud-storage-ecosystem/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=629056+mongodb-ftw-fast-growing-10gen-hires-first-cfo&utm_content=dharrisstructure">AWS Storage Gateway jolts cloud-storage ecosystem</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/09/mongodb-ftw-fast-growing-10gen-hires-first-cfo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/04/mongodb.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/04/mongodb.jpg?w=150" medium="image">
			<media:title type="html">mongodb</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>Facebook builds a database benchmark for a graph-powered world</title>
		<link>http://gigaom.com/2013/04/01/facebook-builds-a-database-benchmark-for-a-graph-powered-world/</link>
		<comments>http://gigaom.com/2013/04/01/facebook-builds-a-database-benchmark-for-a-graph-powered-world/#comments</comments>
		<pubDate>Mon, 01 Apr 2013 22:28:39 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Benchmarks]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[graph databases]]></category>
		<category><![CDATA[LinkBench]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=626218</guid>
		<description><![CDATA[Facebook has built a new open source tool for benchmarking graph databases, called LinkBench. And although the chances are your infrastructure and workloads look nothing like Facebook's, the good news is LinkBench was built with configurability in mind.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=626218&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>If you&#8217;re doing any sort of social-media application, you might want to take note of what Facebook just built. The company has <a href="http://www.facebook.com/notes/facebook-engineering/linkbench-a-database-benchmark-for-the-social-graph/10151391496443920">created a benchmarking tool called LinkBench</a> that measures the performance of databases tasked with serving graph-structured data, which, presumably, is the lifeblood of every startup around that&#8217;s concerned with who&#8217;s connected to whom.</p>
<p>Although, of all LinkBench&#8217;s features &#8212; and you can read all about them in a Facebook Engineer wall post from Monday morning &#8212; probably the biggest is <a href="https://github.com/facebook/linkbench">that it&#8217;s open source</a> and built to be extensible. One of the biggest problems with benchmarks overall is that they rarely align with actual production workloads inside the companies that are supposed to care about them. In this case, for example, a benchmark for measuring the performance of <a href="http://gigaom.com/2011/12/06/facebook-shares-some-secrets-on-making-mysql-scale/">Facebook&#8217;s massive MySQL</a>+memcached+<a href="http://www.facebook.com/note.php?note_id=388112370932">Flashcache</a> database architecture against its massive social graph and transaction activity would be all but worthless unless someone was just planning to rebuild Facebook.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/04/linkbench-copy.jpg"><img  alt="linkbench copy" src="http://gigaom2.files.wordpress.com/2013/04/linkbench-copy.jpg?w=708&#038;h=610" width="708" height="610" class="aligncenter size-large wp-image-626252" /></a></p>
<p>I&#8217;ve written in the past that <a href="http://gigaom.com/2012/07/26/why-crowdsourced-computing-benchmarks-are-the-future/">perhaps crowdsourced benchmarks are the wave of the futur</a>e: essentially a compiled set of statistics and best practices as more companies test different database (or Hadoop) technologies on different hardware setups against different workloads and publish the results. Everything will of course vary by the exact details within any given environment, but it would be a good way to get a sense of how a particular stack might, or perhaps should, fare.</p>
<p>But an open source benchmark tuned for a specific use case &#8212; social graphs &#8212; by probably the world&#8217;s foremost expert on that use case is interesting, too. Anyone else trying to serve data from their own social graphs can benefit from some of LinkBench&#8217;s more-prominent features, such as its ability to generate &#8220;large synthetic social graphs,&#8221; while tuning it to the specifics of their own infrastructure. After all, it might be that your app has different requirement around reading versus writing data, and <a href="http://gigaom.com/2011/07/21/is-stonebraker-right-why-sql-isnt-the-choice-du-jour-for-many-apps/">it&#8217;s very possible you&#8217;re not using MySQL</a>, either.</p>
<p>Or maybe you are using MySQL and want to see how a newer database technology might handle your graph workload. That, by the way, is one of the reasons Facebook built LinkBench, according to this post.</p>
<p>At any rate, the social web is all about graphs, and database performance really matters for anyone trying to build a service that stays online and provides a pleasant user experience. Say what you want about Facebook, but its services perform, so the bar is set high for anyone trying to dethrone it or at least to build something than can attract an equally large and devout following.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=626218&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=71768"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=71768" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=626218+facebook-builds-a-database-benchmark-for-a-graph-powered-world&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2010/10/is-the-future-of-enterprise-completely-open-source/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=626218+facebook-builds-a-database-benchmark-for-a-graph-powered-world&utm_content=dharrisstructure">Is the Future of Enterprise Completely Open Source?</a></li><li><a href="http://pro.gigaom.com/2012/11/breaking-down-barriers-and-reducing-cycle-times-with-devops-and-continuous-delivery/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=626218+facebook-builds-a-database-benchmark-for-a-graph-powered-world&utm_content=dharrisstructure">How devops can reduce cycle times</a></li><li><a href="http://pro.gigaom.com/2011/12/migrating-media-applications-to-the-private-cloud-best-practices-for-businesses/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=626218+facebook-builds-a-database-benchmark-for-a-graph-powered-world&utm_content=dharrisstructure">Migrating media applications to the private cloud: best practices for businesses</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/01/facebook-builds-a-database-benchmark-for-a-graph-powered-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/04/graph-copy-e1364854754198.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/04/graph-copy-e1364854754198.jpg?w=150" medium="image">
			<media:title type="html">graph copy</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/linkbench-copy.jpg?w=708" medium="image">
			<media:title type="html">linkbench copy</media:title>
		</media:content>
	</item>
		<item>
		<title>No, not every database was created equal. Here&#8217;s how they stand out</title>
		<link>http://gigaom.com/2013/03/21/no-not-every-database-was-created-equal-heres-how-theyre-stand-out/</link>
		<comments>http://gigaom.com/2013/03/21/no-not-every-database-was-created-equal-heres-how-theyre-stand-out/#comments</comments>
		<pubDate>Thu, 21 Mar 2013 22:24:11 +0000</pubDate>
		<dc:creator>Jordan Novet</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[MemSQL]]></category>
		<category><![CDATA[Neo4j]]></category>
		<category><![CDATA[SQLStream]]></category>
		<category><![CDATA[Structure Data 2013]]></category>
		<category><![CDATA[tempoDB]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=623116</guid>
		<description><![CDATA[In-memory, SQL, NoSQL and graph databases were on display in a fiesty discussion about databases that don't involve Hadoop. The distinctions stand out amid growing interest in specialized databases in a big-data age.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=623116&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>SQL or NoSQL? In-memory or hard disks? Graph? These questions have been top of mind in recent years as developers and IT administrators check out new-age databases capable of handling scale-out data sets. Executives from four databases showed how they stand out in a hot market at <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=623116+no-not-every-database-was-created-equal-heres-how-theyre-stand-out&amp;utm_content=gigajordan">GigaOM’s Structure:Data</a> conference on Thursday.</p>
<p>Emil Eifrem, CEO of Neo Technology, touted the power of <a href="http://gigaom.com/2012/11/02/graph-startup-neo-raises-11m-as-specialized-databases-take-hold/">Neo4j</a> and other graph databases to show relationships among disparate varieties of data with nodes, edges and key-value properties. (Think of Facebook’s <a href="http://gigaom.com/2013/03/14/facebook-tweaks-its-algorithms-to-improve-graph-search-comment-search-coming/">Graph Search</a> as one version.) The style takes inspiration from the connections among neurons and synapses inside the brain, Eifrem said. But, like other NoSQL databases, Neo Technology’s Neo4j product doesn’t use the SQL programming language, which could limit its adoption among enterprises. </p>
<p>Damian Black, CEO of <a href="http://gigaom.com/2012/10/17/why-you-should-care-about-data-flow-computings-big-comeback/">SQLstream</a>, touted his database’s use of SQL, calling it “lingua franca for data management.” Sure, it isn’t the easiest language to use. Still, “you know it’s going to save, it’s going to work,” he said. “It’s auto-optimizing. It’s proven.” Plus, it might be easier to find developers who can use it. As specialized databases get more attention, that’s become a more important point, said the moderator of the talk, GigaOM Research Analyst David Linthicum. </p>
<p>Different databases have different sweet spots. For Ryan Garrett, vice president of product of <a href="http://gigaom.com/2012/06/18/ex-facebookers-launch-memsql-to-make-your-database-fly/">MemSQL</a>, it’s comparing real-time data — from the trading floor, say — with recent historical data from perhaps a day or a week ago. Andrew Cronk, CEO of <a href="http://gigaom.com/2012/04/13/meet-tempodb-a-database-startup-with-an-eye-for-time/">TempoDB</a>, said his database excels at crunching time-series data in long columns coming off of many new connected devices. </p>
<p>Black believes storing data in memory provides a clear advantage. “It’s obviously going to be faster if you’re pulling it from memory,” he said. Eifrem took issue with that notion, saying that Neo4j runs on wherever sufficient memory is available. “Generally speaking, we want to be as horizontal as possible,” he said.</p>
<p>Legacy database vendors such as Oracle still command large swaths of the database market. But specialized databases such as the four on display here could keep chipping away as data sets get larger and larger. Because there are so many flavors, a few databases could become leaders, rather than just one, as they really do have different strengths and weaknesses and use-case sweet spots. At least they do for now.</p>
<p>Check out the rest of our Structure:Data 2013 live coverage here, and a video embed of the session follows below.</p>
<p><iframe src="http://new.livestream.com/accounts/74987/events/1927733/videos/14389771/player?autoPlay=false&amp;height=360&amp;mute=false&amp;width=640" width="640" height="360" frameborder="0" scrolling="no"></iframe><br>
A transcription of the video follows on the next page</p>
<p><a href="http://gigaom.com/2013/03/21/no-not-every-database-was-created-equal-heres-how-theyre-stand-out/2/">Go to page 2 (of 2) on GigaOM .</a></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=623116&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=467602"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=467602" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=623116+no-not-every-database-was-created-equal-heres-how-theyre-stand-out&utm_content=gigajordan">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/09/listening-platforms-finding-the-value-in-social-media-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=623116+no-not-every-database-was-created-equal-heres-how-theyre-stand-out&utm_content=gigajordan">Listening platforms: finding the value in social media data</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=623116+no-not-every-database-was-created-equal-heres-how-theyre-stand-out&utm_content=gigajordan">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/01/12-tech-leaders-resolutions-for-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=623116+no-not-every-database-was-created-equal-heres-how-theyre-stand-out&utm_content=gigajordan">12 tech leaders’ resolutions for 2012</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/21/no-not-every-database-was-created-equal-heres-how-theyre-stand-out/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/03/zdohbmuhpqwqix-zongi7soh_mwfhplbcatbm4rvbzq.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/03/zdohbmuhpqwqix-zongi7soh_mwfhplbcatbm4rvbzq.jpg?w=150" medium="image">
			<media:title type="html">Structure Data 2013 Ryan Garrett MemSQL Emil Eifrem Neo Technology/Neo4j Andrew Cronk TempoDB Damian Black SQLstream</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/c00ab753df107b639e76ed4c3ab07ba7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigajordan</media:title>
		</media:content>
	</item>
		<item>
		<title>10gen rolls out new features to woo more enterprises to MongoDB</title>
		<link>http://gigaom.com/2013/03/19/10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb/</link>
		<comments>http://gigaom.com/2013/03/19/10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb/#comments</comments>
		<pubDate>Tue, 19 Mar 2013 14:00:52 +0000</pubDate>
		<dc:creator>Jordan Novet</dc:creator>
				<category><![CDATA[10Gen]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[MongoDB]]></category>
		<category><![CDATA[NoSQL]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=621703</guid>
		<description><![CDATA[10gen is introducing an enterprise edition of MongoDB, making the NoSQL database more appealing to larger-scale users. The company shares the space with plenty of competitors, and more features could be coming soon.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=621703&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Honing in further on the enterprise market, MongoDB creator <a href="http://www.10gen.com/">10gen</a> is bringing out features for enterprise customers and announcing upgrades of existing products for all users of its open-source non-relational database.</p>
<p>10gen had greater business adoption in mind last year when it <a href="http://gigaom.com/2012/05/29/with-42m-more-10gen-wants-to-take-mongodb-mainstream/">raised</a> $42 million and vowed to focus on research and development to improve MongoDB. Now, around 60 percent of customers are enterprises, said Kelly Stirman, the company&#8217;s direct of product marketing.</p>
<p>Once signed up for the MongoDB Enterprise software, customers can use their own on-premise hardware to run an extension of the <a href="http://www.10gen.com/products/mongodb-monitoring-service">MongoDB Monitoring Service</a> to track MongoDB deployments with more than 100 metrics and receive alerts. MongoDB Enterprise comes certified for deployment on several operating systems. It also supports the <a href="http://web.mit.edu/kerberos/">Kerberos authentication protocol</a>, which is popular among insurance companies and banks, and can hook in to customers&#8217; existing monitoring services, such as Nagios. And it introduces roles for giving certain abilities to certain database users.</p>
<p>New features in the MongoDB 2.4 release available to all users include full-text search for querying the database, an option to evenly shard data across machines, more accurate measurements of the distance between locations, the ability to count items in a database 20 times faster than before and the ability to maintain and query leaderboards of, say, the top 50 scorers in a baseball league.</p>
<p>The <a href="http://gigaom.com/2012/01/20/should-nosql-startups-be-afraid-of-dynamodb/">NoSQL database market</a> is crowded, and <a href="http://gigaom.com/2012/12/20/confused-by-the-glut-of-new-databases-heres-a-map-for-you/">differentiation is important</a>. That&#8217;s why it&#8217;s a good thing 10gen, which is based in New York and Palo Alto, Calif.-and has other offices in Australia, England, Ireland and Spain, will increase its headcount by 75 percent in the next year, Stirm<a href="http://gigaom.com/2012/12/20/confused-by-the-glut-of-new-databases-heres-a-map-for-you/">an</a> said. The time between product releases is getting shorter and shorter, he said, which means that still more improvements could be just a few months away.</p>
<p><em>Feature image courtesy of <a href="http://www.flickr.com/photos/sooey/5745780202/">Flickr user junyaogura</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=621703&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=61851"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=61851" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621703+10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb&utm_content=gigajordan">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2013/01/cloud-and-data-fourth-quarter-2012-analysis/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621703+10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb&utm_content=gigajordan">The fourth quarter of 2012 in cloud</a></li><li><a href="http://pro.gigaom.com/2012/11/breaking-down-barriers-and-reducing-cycle-times-with-devops-and-continuous-delivery/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621703+10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb&utm_content=gigajordan">How devops can reduce cycle times</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=621703+10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb&utm_content=gigajordan">A near-term outlook for big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/19/10gen-rolls-out-new-features-to-woo-more-enterprises-to-mongodb/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/mongo-stickers-e1351701544214.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/mongo-stickers-e1351701544214.jpeg?w=150" medium="image">
			<media:title type="html">mongo stickers</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/c00ab753df107b639e76ed4c3ab07ba7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigajordan</media:title>
		</media:content>
	</item>
	</channel>
</rss>
