<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; webscale</title>
	<atom:link href="http://gigaom.com/tag/webscale/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Sat, 18 May 2013 17:37:21 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; webscale</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>How Dell should go big now that it has gone private</title>
		<link>http://gigaom.com/2013/02/05/how-dell-should-go-big-now-that-it-has-gone-private/</link>
		<comments>http://gigaom.com/2013/02/05/how-dell-should-go-big-now-that-it-has-gone-private/#comments</comments>
		<pubDate>Wed, 06 Feb 2013 00:00:08 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Data Centers]]></category>
		<category><![CDATA[Dell]]></category>
		<category><![CDATA[low-power servers]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[OpenStack]]></category>
		<category><![CDATA[webscale]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=607355</guid>
		<description><![CDATA[Free from the scrutiny of public markets, Dell should let its freak flag fly and take some real risks to distinguish itself from the server-vendor pack. I think that means doubling down on next-generation software and server design.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=607355&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>With news the this morning that <a href="http://gigaom.com/2013/02/05/dell-deal-is-done/">Dell&#8217;s board has agreed to a buyout</a> that will make the company private again, the speculation about how the company will capitalize on its newfound freedom can officially begin. Divesting its flailing consumer PC business seems like a wise decision &#8212; if not a foregone conclusion &#8212; but I think Dell also needs to make some <em>investments </em>in order to really position itself as a leader in the enterprise IT space.</p>
<p>Some of these suggestions are bold, but it&#8217;s hard to see how Dell can continue to compete against IBM, HP and Cisco on one end and cloud services on the other without taking some risks. I actually think Dell is well positioned to be a big part of the future of IT if it just <a href="http://gigaom.com/2011/09/08/dells-golden-opportunity-isnt-in-servers/">embraces a future that&#8217;s all about software and next-generation applications</a>. Hardware, well, it&#8217;s largely just a necessary evil <a href="http://gigaom.com/2012/05/09/vmware-the-software-defined-data-center-is-coming/">on which increasingly intelligent and virtualized software needs to run</a>.</p>
<h2 id="1-one-word-occasionally-hyphen">1. One word (occasionally hyphenated): Webscale.</h2>
<p>Here&#8217;s an interesting fact: <a href="http://www.idc.com/getdoc.jsp?containerId=prUS23808612#.URE-jOhrqGg">According to IDC</a>, while the rest of the big four server vendors lost both market share and revenue in the third quarter of 2012, Dell gained in both. Why? Because it&#8217;s selling webscale servers like crazy &#8212; thousands at a time to web companies such as <a href="http://gigaom.com/2012/04/06/making-the-web-more-efficient-a-thousand-servers-at-a-time/">eBay</a>, cloud providers such as Microsoft and even high-performance computing shops. In fact, Dell has <a href="http://gigaom.com/2012/10/30/dell-has-sold-1m-webscale-servers-in-five-years/">shipped more than 1 million of these no-frills, scale-out boxes</a> since it began selling them in 2007.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/02/idc.jpg"><img  alt="idc" src="http://gigaom2.files.wordpress.com/2013/02/idc.jpg?w=708&#038;h=391" width="708" height="391" class="aligncenter size-large wp-image-607706" /></a></p>
<p>It seems only logical that Dell should keep riding this horse on the hardware side (including by continuing <a href="http://gigaom.com/2013/01/16/facebook-and-open-compute-just-blew-up-the-server-and-disrupted-a-55b-market/">its close relationship with the Open Compute Project</a>), but there&#8217;s a real opportunity in software, too. Although companies like Facebook and Google have the engineers to <a href="http://gigaom.com/2011/10/14/could-facebook-be-your-next-software-vendor/">build their own tools for managing large server environments</a> &#8212; both in terms of operations and power consumption &#8212; many smaller and more-mainstream companies don&#8217;t. If it&#8217;s already selling the servers, Dell might as well be selling the software that ensures they&#8217;re easy to manage and troubleshoot, too.</p>
<h2 id="2-make-a-big-bet-on-big-data">2. Make a big bet on big data.</h2>
<p>If Dell wants to get a piece of a big data market that&#8217;s <a href="http://gigaom.com/2013/01/08/idc-says-big-data-will-be-24b-market-in-2016-i-say-its-bigger/">going to be worth tens of billions</a> in the next few years, it will have to do more than just sell servers. Dell needs a database or platform like Hadoop that it can deliver as a service on top of its commodity hardware &#8212; much like what Oracle, EMC, IBM and others are trying, only with a lower price tag and a focus on scaling out versus scaling up.</p>
<p>I actually <a href="http://gigaom.com/2011/01/28/5-cloud-software-vendors-that-dell-should-buy/">suggested Dell make a big data investment (and cloud and systems-management investments) two years ago</a>, and although a lot of potentially good fits have been bought in the meantime, there&#8217;s still a variety of options available. The Hadoop distribution vendors are probably out of its price range (so closer partnerships with Cloudera, Hortonworks and/or MapR should be in order), but there are whole markets full of analytic, NoSQL and NewSQL databases just waiting to be snatched up.</p>
<h2 id="3-buy-up-some-real-cloud-knowl">3. Buy up some real cloud knowledge.</h2>
<p style="text-align:left;">Sorry to burst any bubbles, but it&#8217;s hard to see how Dell&#8217;s proposed <a href="http://gigaom.com/2011/08/29/dell-launches-a-vmware-based-cloud-azure-next/">Microsoft, VMware and OpenStack-based public cloud offerings</a> will amount to much unless they&#8217;re part of a division with an ingrained cloud mentality. The company is competing in this space against not only Amazon Web Services, but also the companies (and Rackspace) who have developed and are selling the very platforms Dell is looking to resell. And <a href="http://gigaom.com/2013/01/18/hps-cloud-chief-exits-sparking-more-confusion/">the debacle that is HP&#8217;s cloud business</a> should highlight the perils of trying to build something from scratch <a href="http://gigaom.com/2012/09/08/why-winning-in-openstack-means-knowing-your-buyers-and-your-code/">when operating cloud platforms isn&#8217;t in your corporate DNA</a>.</p>
<p>Still, I think OpenStack is probably Dell&#8217;s best bet (if only because it has already <a href="http://slashdot.org/topic/datacenter/dell-announces-private-cloud-built-on-openstack/">made its bed with the open source platform</a>). Like <a href="http://gigaom.com/2011/02/11/lew-moorman-talks-anso-labs-openstack-and-cloud-revenue/">Rackspace with Anso Labs</a> (and unlike the DIY approach by HP), Dell could acquire some OpenStack talent to help it build quality cloud infrastructure offerings &#8212; both public and private &#8212; and services. And even outside of OpenStack, there are still a few startups that have real institutional knowledge of building and running cloud infrastructure, but that might not come with a huge price tag.</p>
<h2 id="4-push-the-envelope-with-arm">4. Push the envelope with ARM.</h2>
<div id="attachment_607711" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/02/armserver_des_sled_4hdd.jpg"><img  alt="One of Dell's ARM server designs" src="http://gigaom2.files.wordpress.com/2013/02/armserver_des_sled_4hdd.jpg?w=300&#038;h=225" width="300" height="225" class="size-medium wp-image-607711" /></a><p class="wp-caption-text">One of Dell&#8217;s ARM server designs</p></div>
<p>Dell&#8217;s focus on alternative server architectures shouldn&#8217;t stop with its webscale business, but should color its entire hardware strategy. And if it wants to be the server maker for next-generation applications, doubling down on its ARM investment is probably a good thing to do. Dell has already designed a couple of ARM servers, but they&#8217;ve generally <a href="http://gigaom.com/2012/05/29/see-what-cloud-can-do-dell-unveils-arm-servers/">been reserved for webscale and research customers</a>, as well as <a href="http://gigaom.com/2012/10/24/dell-wants-to-tune-big-data-apps-for-arm-servers/">for the Apache Software Foundation</a>.</p>
<p>However, as even Dell has acknowledged, many big data and other applications that rely on distributed infrastructure and parallel processing don&#8217;t require brawny x86 chips. <a href="http://gigaom.com/2011/06/13/big-data-on-micro-servers-you-bet/">They just needs lots of cores</a>, which is something ARM&#8217;s server designs can deliver at a fraction of the energy footprint. Assuming software vendors are on board &#8212; and <a href="http://www.computerworld.com/s/article/9233231/Red_Hat_Xen_Java_Cloudera_prepped_for_64_bit_ARM">a growing number appear to be</a> &#8212; Dell could actually distinguish itself by being the only big server vendor delivering tomorrow&#8217;s applications on gear actually designed for their unique needs.</p>
<h2 id="5-tell-microsoft-to-suck-it">5. Tell Microsoft to suck it.</h2>
<p>Or, rather, gently ask its new creditor and likely board member to take a back seat and hope the rest of the board agrees. Because if there&#8217;s one thing Dell can&#8217;t afford to do, it&#8217;s get locked up in the same last-millenium relationship <a href="http://gigaom.com/2013/01/23/a-microsoft-investment-in-dell-would-show-how-vulnerable-both-it-giants-are/">that has both companies reeling today</a>. Yes, there might &#8212; and maybe should &#8212; be some deals between the two companies to deliver Microsoft software on Dell hardware, but in an era of open source, cloud computing, VMware and Apple, there&#8217;s no real future for Dell in pushing Windows anything down its customers&#8217; throats.</p>
<p>Besides, I don&#8217;t think Microsoft actually needs to push its agenda in order for the Dell investment to pay off (and it&#8217;s not just because a successful Dell is the only way to make Microsoft&#8217;s investment pay off). Microsoft, too, is struggling to find its direction in a world increasingly dominated by alternative operating systems (mobile, desktop and server) and it should realize there&#8217;s precious little Dell hardware can do to get people to buy Windows.</p>
<p>If anything, Microsoft should view Dell as a great server vendor to power its Windows Azure and Office 365 cloud offerings and provide the physical medium for some next-generation <em>enterprise</em> software.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-776821p1.html">Shutterstock user Dusit</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=607355&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=196948"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=196948" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=607355+how-dell-should-go-big-now-that-it-has-gone-private&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/04/infrastructure-q1-iaas-comes-down-to-earth-big-data-takes-flight/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=607355+how-dell-should-go-big-now-that-it-has-gone-private&utm_content=dharrisstructure">Infrastructure Q1: IaaS Comes Down to Earth; Big Data Takes Flight</a></li><li><a href="http://pro.gigaom.com/2012/11/an-overview-of-the-software-defined-networking-market/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=607355+how-dell-should-go-big-now-that-it-has-gone-private&utm_content=dharrisstructure">The promise of SDNs in the enterprise</a></li><li><a href="http://pro.gigaom.com/2012/06/cloud-computing-infrastructure-2012-and-beyond/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=607355+how-dell-should-go-big-now-that-it-has-gone-private&utm_content=dharrisstructure">Cloud computing infrastructure: 2012 and beyond</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/02/05/how-dell-should-go-big-now-that-it-has-gone-private/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/02/shutterstock_106100273.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/02/shutterstock_106100273.jpg?w=150" medium="image">
			<media:title type="html">risk chart</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/02/idc.jpg?w=708" medium="image">
			<media:title type="html">idc</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/02/armserver_des_sled_4hdd.jpg?w=300" medium="image">
			<media:title type="html">One of Dell&#039;s ARM server designs</media:title>
		</media:content>
	</item>
		<item>
		<title>Pinterest, Flipboard and Yelp tell how to save big bucks in the cloud</title>
		<link>http://gigaom.com/2012/12/02/pinterest-flipboard-and-yelp-tell-how-to-save-big-bucks-in-the-cloud/</link>
		<comments>http://gigaom.com/2012/12/02/pinterest-flipboard-and-yelp-tell-how-to-save-big-bucks-in-the-cloud/#comments</comments>
		<pubDate>Sun, 02 Dec 2012 21:30:39 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Amazon EC2]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[AWS re: Invent]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[elastic-mapreduce]]></category>
		<category><![CDATA[Flipboard]]></category>
		<category><![CDATA[Greg Scallan]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[iaas]]></category>
		<category><![CDATA[Pinterest]]></category>
		<category><![CDATA[Web Infrastructure]]></category>
		<category><![CDATA[webscale]]></category>
		<category><![CDATA[yelp]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=590008</guid>
		<description><![CDATA[At the AWS Re: Invent conference, engineers from Pinterest, Flipboard and Yelp detailed some of the strategies their companies employ in order to keep costs low as computing demand increases. The keys are keeping an eagle eye on usage and using the right types of resources.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=590008&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Amazon Web Services can be a great platform for startups when they&#8217;re small, but costs can outpace revenue growth pretty quick &#8212; especially if you&#8217;re offering a a free consumer service. At AWS&#8217;s Re: Invent user conference last week, engineers from Pinterest, Flipboard and Yelp shared their impressive and sometimes ingenious techniques for keeping costs under control and their bottom lines healthy.</p>
<p>Pinterest Operations Engineer Ryan Park had the stage to himself for a session on Wednesday, while Flipboard Chief Architect Greg Scallan and Yelp Engineering Manager Jim Blomo teamed up with Kleiner Perkins Caufield Byers Partner Ray Bradford to form a trifecta of wisdom on Thursday.</p>
<h2>Know &#8212; and measure &#8212; your costs</h2>
<p>Flipboard&#8217;s Scallan had a paradoxical lesson for the audience when it comes to managing cloud-based infrastructure: Embrace the cloud, but be afraid of the cloud. Yes, it&#8217;s flexible and affordable if done right, but all it takes is poor planning or a handful of servers left running ad infinitum, and the costs can begin to grow out of control. That&#8217;s why Flipboard assigns members of its engineering team the title of &#8220;chief miser,&#8221; which means they&#8217;re the ones who decide that applications are using the right resources and using them wisely.</p>
<p>Thanks to a variety of practices, including its miserly ways, Scallan said Flipboard is now running about 900 instances at any given time. That&#8217;s down from a peak of about 1,500.</p>
<div id="attachment_590210" class="wp-caption aligncenter" style="width: 614px"><a href="http://gigaom2.files.wordpress.com/2012/12/20121129_1528212.jpg"><img  alt="Some stats on Flipboard's AWS usage" src="http://gigaom2.files.wordpress.com/2012/12/20121129_1528212.jpg?w=604&#038;h=453" height="453" width="604" class="size-large wp-image-590210" /></a><p class="wp-caption-text">Some stats on Flipboard&#8217;s AWS usage</p></div>
<p>One way to help ensure this sort lean operation is to understand your business inputs and outputs, Kleiner Perkins&#8217;s Bradford explained. He suggests companies ask, for example, what it costs them to serve a free user on their platform and how does that change with scale or affect the experience they can offer premium users. Pick metrics that really matter, he said (e.g., infrastructure cost per user per month) and then consider how long your current  architecture can sustain that cost before it&#8217;s time to retool.</p>
<h2>The secret weapon: Source your instances wisely</h2>
<p>Pinterest, Yelp and Flipboard all swear by <a href="http://aws.amazon.com/ec2/reserved-instances/">AWS&#8217;s pre-paid Reserved Instances</a> in order to save money over the long haul. In fact, Flipboard&#8217;s Scallan said, the e-reading startup sees cost savings of about 80 percent over three years by using heavy-duty Reserved Instances instead of on-demand instances for its base workloads, and the break-even point might be only eight or nine months. Pinterest&#8217;s Park cited savings of about 70 percent over three years using them.</p>
<div id="attachment_590209" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2012/12/20121129_154538.jpg"><img  alt="20121129_154538" src="http://gigaom2.files.wordpress.com/2012/12/20121129_154538.jpg?w=300&#038;h=225" height="225" width="300" class="size-medium wp-image-590209" /></a><p class="wp-caption-text">The trick is queuing another job to take up the waste.</p></div>
<p>Yelp&#8217;s Blomo said his company is a heavy Elastic MapReduce (EMR) user, peaking at more than 350 Elastic MapReduce instances when many developers run their Hadoop jobs simultaneously or when it&#8217;s doing nightly analysis of its log files. In order to keep costs in check, Yelp uses Reserved Instances whenever possible to save on hourly bills and has implemented a job-flow pooling system to keep Hadoop jobs running continuously as resources become available. This helps avoid the situation where a job completes in 61 minutes, for example, thus triggering the charge for a full hour of resources even though it only used a minute worth of the second hour.</p>
<p>In order to best gauge when it should use what type instance, Yelp <a href="http://engineeringblog.yelp.com/2012/07/introducing-emrio-optimize-your-aws-bills.html">created a tool called EMRio</a> that analyzes past usage to determine what resources are the most-efficient choice for any given job.</p>
<div id="attachment_590216" class="wp-caption aligncenter" style="width: 614px"><a href="http://gigaom2.files.wordpress.com/2012/12/emrio.jpg"><img  alt="emrio" src="http://gigaom2.files.wordpress.com/2012/12/emrio.jpg?w=604&#038;h=455" height="455" width="604" class="size-large wp-image-590216" /></a><p class="wp-caption-text">The results of EMRio</p></div>
<p>When it comes to optimizing costs on AWS, though, Pinterest appears to have it all figured out &#8212; even how to make use of the somewhat tricky <a href="http://aws.amazon.com/ec2/spot-instances/">Spot Instances</a> that are priced based on demand and can be terminated without notice if the market price outgrows a user&#8217;s bid. Park explained how Pinterest uses the heck out of Reserved Instances and created its own auto-scaling &#8220;watchdog&#8221; service that decides whether to use Spot Instances or on-demand instances when more resources are required.</p>
<div id="attachment_590236" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2012/12/20121128_105509.jpg"><img  alt="Ryan Park dropping knowledge -- and graphs" src="http://gigaom2.files.wordpress.com/2012/12/20121128_105509.jpg?w=300&#038;h=225" height="225" width="300" class="size-medium wp-image-590236" /></a><p class="wp-caption-text">Ryan Park dropping knowledge &#8212; and graphs</p></div>
<p>Although Spot Instance prices <a href="http://gigaom.com/2011/12/27/how-to-deal-with-amazons-spot-server-price-spikes/">occasionally spike through the roof</a>, Park&#8217;s experience is that they typically remain stable and can result in &#8220;massive&#8221; savings if you know how to use them effectively. Using Spot Instances to power Pinterest&#8217;s approximately 80 front-end servers costs only about $20 per hour, he said. All told, Pinterest has reduced its daily computing bill to about $440 from about $1,200.</p>
<p>All this being said, though, Park, Blomo and Scallan all acknowledged that the flexibility of being able to mix on-demand, reserved and spot servers might not be all it&#8217;s cracked up to be if you don&#8217;t understand how they all work. Reserved Instances are inflexible in terms of size and region once you reserve them, and Spot Instances must be used wisely for jobs or applications that can handle their easy come, easy go nature. And now there&#8217;s even more to consider <a href="http://gigaom.com/cloud/want-to-buy-or-sell-amazon-instances-now-you-can/">because Reserved Instances can be resold</a> via AWS&#8217;s spot marketplace.</p>
<p>&#8220;It gets a little tricky,&#8221; Blomo said.</p>
<h2>Pick your challenges</h2>
<p>Although decisions such database type and structure are largely architectural, there might be elements of cost efficiency at play, as well. Maybe Kleiner Perkins&#8217;s Bradford put it best while leading off the session with Scallan and Blomo. Bradford presented a slide containing a simple quote from Instagram Founder Mike Krieger: &#8220;Your users around the world don&#8217;t care that you wrote your own database.&#8221; Sometimes, Bradford added, it might be best to use what works &#8212; maybe even a managed service &#8212; rather than whatever&#8217;s trending highest on Hacker News.</p>
<p>Pinterest&#8217;s Park expressed a similar sentiment during his session, citing a lesson his team learned about trying out too many new databases. The site used to use MongoDB, Cassandra, Redis and other databases simultaneously, but learning all the new technologies and managing them became burdensome. Now, he said, Pinterest uses good, old-fashioned MySQL (granted, <a href="http://www.slideshare.net/eonarts/mysql-meetup-july2012scalingpinterest">it sharded MySQL 4,000 times</a>) and memcached &#8212; as well as Redis &#8212; because they have strong communities and new engineers are more likely to know how to work with them.</p>
<p>After explaining EMRio and some other custom-built Hadoop tools to the crowd, Yelp&#8217;s Blomo noted that companies should carefully consider whether the time and money it takes to build stuff will actually result in commensurate savings once those tools or systems are in production. That can require some tough balancing of criteria such as cost, performance, flexibility and user experience.</p>
<p>But it&#8217;s important to use human resources wisely. As Bradford said during his presentation, &#8220;There&#8217;s no free lunch when it comes to developer time.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=590008&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=920689"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=920689" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=590008+pinterest-flipboard-and-yelp-tell-how-to-save-big-bucks-in-the-cloud&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/07/cloud-and-data-second-quarter-2012-analysis-and-outlook-2/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=590008+pinterest-flipboard-and-yelp-tell-how-to-save-big-bucks-in-the-cloud&utm_content=dharrisstructure">Takeaways from the second quarter in cloud and data</a></li><li><a href="http://pro.gigaom.com/2012/12/how-direct-access-solutions-can-speed-up-cloud-adoption/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=590008+pinterest-flipboard-and-yelp-tell-how-to-save-big-bucks-in-the-cloud&utm_content=dharrisstructure">How direct-access solutions can speed up cloud adoption</a></li><li><a href="http://pro.gigaom.com/2012/08/understanding-and-managing-the-cost-of-the-cloud/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=590008+pinterest-flipboard-and-yelp-tell-how-to-save-big-bucks-in-the-cloud&utm_content=dharrisstructure">Understanding and managing the cost of the cloud</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/12/02/pinterest-flipboard-and-yelp-tell-how-to-save-big-bucks-in-the-cloud/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/12/20121129_154230-e1354479446379.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/12/20121129_154230-e1354479446379.jpg?w=150" medium="image">
			<media:title type="html">Yelp chart</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/12/20121129_1528212.jpg?w=604" medium="image">
			<media:title type="html">Some stats on Flipboard&#039;s AWS usage</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/12/20121129_154538.jpg?w=300" medium="image">
			<media:title type="html">20121129_154538</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/12/emrio.jpg?w=604" medium="image">
			<media:title type="html">emrio</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/12/20121128_105509.jpg?w=300" medium="image">
			<media:title type="html">Ryan Park dropping knowledge -- and graphs</media:title>
		</media:content>
	</item>
		<item>
		<title>Facebook open sources Corona &#8212; a better way to do webscale Hadoop</title>
		<link>http://gigaom.com/2012/11/08/facebook-open-sources-corona-a-better-way-to-do-webscale-hadoop/</link>
		<comments>http://gigaom.com/2012/11/08/facebook-open-sources-corona-a-better-way-to-do-webscale-hadoop/#comments</comments>
		<pubDate>Thu, 08 Nov 2012 20:01:15 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Corona]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[mapreduce]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[scalability]]></category>
		<category><![CDATA[Web Infrastructure]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=582252</guid>
		<description><![CDATA[Facebook has open sourced a new system called Corona for scheduling and managing Hadoop jobs. Corona attempts to do away with many of the problems that come along with massive-scale Hadoop operations, and soon looks to take Facebook's Hadoop deployment beyond just MapReduce.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=582252&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Facebook is at it again, building more software to make Hadoop a better way to do big data at web scale. Its latest creation, which the company <a href="https://github.com/facebook/hadoop-20/tree/master/src/contrib/corona">has also open sourced</a>, is called Corona and aims to make Hadoop more efficient, more scalable and more available by re-inventing how jobs are scheduled.</p>
<p>As with <a href="http://www.facebook.com/note.php?note_id=468211193919">most of its changes to Hadoop over the years</a> &#8212; including the <a href="http://gigaom.com/cloud/how-facebook-keeps-100-petabytes-of-hadoop-data-online/">recently unveiled AvatarNode</a> &#8212; Corona came to be because Hadoop simply wasn&#8217;t designed to handle Facebook&#8217;s scale or its broad usage of the platform. What kind of scale are we talking about? According to Facebook engineers Avery Ching, Ravi Murthy, Dmytro Molkov,‎ Ramkumar Vadali, and Paul Yang <a href="https://www.facebook.com/notes/facebook-engineering/under-the-hood-scheduling-mapreduce-jobs-more-efficiently-with-corona/10151142560538920">in a blog post detailing Corona on Thursday</a>, the company&#8217;s largest cluster is more than 100 petabytes; it runs more than 60,000 Hive queries a day; and its data warehouse has grown 2,500x in four years.</p>
<p>Further, Ching and company note &#8212; echoing something Facebook VP of Infrastructure Engineering Jay Parikh told me in September when <a href="http://gigaom.com/data/for-the-future-of-big-data-startups-look-to-facebook/">discussing the future of big data startups</a> &#8212; Hadoop is responsible for a lot of how Facebook runs both its platform and its business:</p>
<blockquote><p>Almost every team at Facebook depends on our custom-built data infrastructure for warehousing and analytics, with roughly 1,000 people across the company &#8212; including both technical and non-technical personnel &#8212; using these technologies every day. Over half a petabyte of new data arrives in the warehouse every 24 hours, and ad-hoc queries, data pipelines, and custom MapReduce jobs process this raw data around the clock to generate more meaningful features and aggregations.</p></blockquote>
<h2>So, what is Corona?</h2>
<p>In a nutshell, Corona represents a new system for scheduling Hadoop jobs that makes better use of a cluster&#8217;s resources and also makes it more amenable to multitenant environments like the one Facebook operates. Ching et al explain the problems and the solution in some detail, but the short explanation is that Hadoop&#8217;s JobTracker node is responsible for both cluster management and job-scheduling, but has a hard time keeping up with both tasks as clusters grow and the number of jobs sent to them increase.</p>
<p>Further, job-scheduling in Hadoop involves an inherent delay, which is problematic for small jobs that need fast results. And a fixed configuration of &#8220;map&#8221; and &#8220;reduce&#8221; slots means Hadoop clusters run inefficiently when jobs don&#8217;t fit into the remaining slots or when they&#8217;re not MapReduce jobs at all.</p>
<p>Corona resolves some of these problems by creating individual job trackers for each job and a cluster manager focused solely on tracking nodes and the amount of available resources. Thanks to this simplified architecture and a few other changes, the latency to get a job started is reduced and the cluster manager can make fast scheduling decisions because it&#8217;s not also responsible for tracking the progress of running jobs. Corona also incorporates a feature that divvies a cluster into resource pools to ensure every group within the company gets its fair share of resources.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/11/corona.jpg"><img  title="corona" alt="" src="http://gigaom2.files.wordpress.com/2012/11/corona.jpg?w=708"   class="aligncenter size-full wp-image-582351" /></a></p>
<p>The results have lived up to expectations since Corona went into full production in mid-2012: the average time to refill idle resources improved by 17 percent; resource utilization over regular MapReduce improved to 95 percent from 70 percent (in a simulation cluster); resource unfairness dropped to 3.6 percent with Corona versus 14.3 percent with traditional MapReduce; and latency on a test job Facebook runs every four minutes has been</p>
<p>Despite the hard work put into building and deploying Corona, though, the project still was a way to go. One of the biggest improvements currently being developed is to enable resource management based on CPU, memory and other job requirements rather than just the number of &#8220;map&#8221; and &#8220;reduce&#8221; slots needed. This will open Corona up to running non-MapReduce jobs, therefore making a Hadoop cluster more of a general-purpose parallel computing cluster.</p>
<p>Facebook is also trying to incorporate online upgrades, which would mean a cluster doesn&#8217;t have to come down every time part of the management layer undergoes an update.</p>
<h2>Why Facebook sometimes must re-invent the wheel</h2>
<p>Anyone deeply familiar with the Hadoop space might be thinking that a lot of what Facebook has done with Corona sounds familiar &#8212; and that&#8217;s because it kind of is. The <a href="http://hortonworks.com/blog/apache-hadoop-yarn-concepts-and-applications/">Apache YARN project</a> that has been integrated into the latest version of Apache Hadoop similarly splits the JobTracker into separate cluster-management and job-tracking components, and already allows for non-MapReduce workloads. Further, there <a href="http://gigaom.com/cloud/the-unsexy-side-of-big-data-6-tools-to-manage-your-hadoop-cluster/">is a whole class of commercial and open source cluster-management tools</a> that have their own solutions to the problems Corona tries to solve, including <a href="http://incubator.apache.org/mesos/index.html">Apache Mesos</a>, which is <a href="http://gigaom.com/cloud/twitter-backs-fave-big-data-projects-with-apache-sponsorship/">Twitter&#8217;s tool of choice</a>.<br />
However, anyone who&#8217;s familiar with Facebook knows the company isn&#8217;t likely to buy software from anyone. It also has reached a point of customization with its Hadoop environment where even open-source projects from Apache won&#8217;t be easy to adapt to Facebook&#8217;s unique architecture. From the blog post:</p>
<blockquote><p>It’s worth noting that we considered Apache YARN as a possible alternative to Corona. However, after investigating the use of YARN on top of our version of HDFS (a strong requirement due to our many petabytes of archived data) we found numerous incompatibilities that would be time-prohibitive and risky to fix. Also, it is unknown when YARN would be ready to work at Facebook-scale workloads.</p></blockquote>
<p>So, Facebook plods forward, a Hadoop user without equal (save for maybe Yahoo) left building its own tools in isolation. What will be interesting to watch as Hadoop adoption picks up and more companies beging building applications atop it is how many actually utilize the types of tools that companies like Facebook, <a href="http://gigaom.com/cloud/how-twitter-is-doing-its-part-to-democratize-big-data/">Twitter</a> and <a href="http://gigaom.com/data/quantcast-releases-bigger-faster-stronger-hadoop-file-system/">Quantcast</a> have created and open sourced. They might not have commercial backers behind them, but they&#8217;re certainly built to work well at scale.</p>
<p><em>Feature image courtesy of Shutterstock user <a href="http://www.shutterstock.com/gallery-10991p1.html">Johan Swanepoel</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=582252&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=421074"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=421074" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=582252+facebook-open-sources-corona-a-better-way-to-do-webscale-hadoop&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=582252+facebook-open-sources-corona-a-better-way-to-do-webscale-hadoop&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/11/dissecting-the-data-5-issues-for-our-digital-future/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=582252+facebook-open-sources-corona-a-better-way-to-do-webscale-hadoop&utm_content=dharrisstructure">Dissecting the data: 5 issues for our digital future</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=582252+facebook-open-sources-corona-a-better-way-to-do-webscale-hadoop&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/11/08/facebook-open-sources-corona-a-better-way-to-do-webscale-hadoop/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/11/shutterstock_42996799.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/11/shutterstock_42996799.jpg?w=150" medium="image">
			<media:title type="html">herd of elephants</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/11/corona.jpg" medium="image">
			<media:title type="html">corona</media:title>
		</media:content>
	</item>
		<item>
		<title>Dell has shipped 1M webscale servers in five years to arm the cloud</title>
		<link>http://gigaom.com/2012/10/30/dell-has-sold-1m-webscale-servers-in-five-years/</link>
		<comments>http://gigaom.com/2012/10/30/dell-has-sold-1m-webscale-servers-in-five-years/#comments</comments>
		<pubDate>Tue, 30 Oct 2012 21:27:57 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[data center]]></category>
		<category><![CDATA[Dell]]></category>
		<category><![CDATA[ebay]]></category>
		<category><![CDATA[energy efficiency]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[servers]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=578807</guid>
		<description><![CDATA[Dell's Data Center Solutions group recently shipped its 1 millionth server just five years after coming into existence. It's proof of how important webscale buyers have become to the server market, as well as how different their demand are than those of traditional IT buyers.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=578807&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Dell is reporting today (on no less than three corporate blog channels) that the company&#8217;s <a href="http://www.dell.com/content/topics/global.aspx/sitelets/solutions/cluster_grid/dcs_landingpage?c=us&amp;l=en">Data Center Solutions</a> unit shipped its 1 millionth server earlier this week. The DCS division, which sells stripped-down, energy-efficient servers by the thousands to hyperscale customers, has been a shining star for Dell over the past few years.</p>
<p>DCS doesn&#8217;t sell to just anyone, though. The unit&#8217;s banner customer used to be Facebook (although it has <a href="http://gigaom.com/cloud/facebook-open-sources-its-servers-and-data-centers/">since begun building much of its own gear</a>, and <a href="http://gigaom.com/cloud/facebook-has-220-billion-of-your-photos-to-put-on-ice/">claims to be off vendor gear entirely</a> for its newest data center), and others include Microsoft, Salesforce.com and eBay. In fact, Dell provided the bulk of the web servers for <a href="http://gigaom.com/cloud/making-the-web-more-efficient-a-thousand-servers-at-a-time/">eBay&#8217;s Project Mercury data center that I profiled in April</a>. Outside the web space, Dell&#8217;s DCS customers include large oil &amp; gas companies and research centers.</p>
<p>Those types of customers are important. As cloud computing and large web sites have shifted server-sales dynamics over the years, <a href="http://gigaom.com/cloud/open-compute-builds-a-business-model-for-the-next-era-of-the-web/">fewer customers account for an ever-increasing percentage of server sales.</a> Thanks to Facebook&#8217;s Open Compute Project and other efforts, they&#8217;re increasingly demanding high-density gear that packs as much power as possible into the most-efficient footprint. In fact, <a href="http://gigaom.com/cloud/with-sales-booming-dell-sees-a-micro-server-future/">the company claimed in 2011</a> that its DCS division, combined with its off-the-shelf PowerEdge C line of energy-efficient servers, would be the fourth-largest x86 server vendor in the world if it were spun off from Dell.</p>
<p>To give you a sense just how many servers 1 million is, consider that Google <a href="https://plus.google.com/114250946512808775436/posts/VaQu9sNxJuY">runs about a million servers</a> (give or take, if estimates are accurate), while Facebook is <a href="http://www.tomshardware.com/news/facebook-servers-power-wattage-network,16961.html">estimated to be running about 181,000 servers</a>. Dell has produced an infographic putting the number into context in terms of energy savings, as well.</p>
<p>And you can check <a href="http://bartongeorge.net/2012/10/30/one-miiiiiiiiiiiiillion-cloud-servers/">this post from Dell&#8217;s Barton George</a> for a celebratory video, as well as a photo of the napkin on which the whole DCS idea was hatched.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/10/2500-dell-dcs-infographic_v9.jpeg"><img  title="2500.Dell DCS infographic_v9" alt="" src="http://gigaom2.files.wordpress.com/2012/10/2500-dell-dcs-infographic_v9.jpeg?w=708"   class="aligncenter size-full wp-image-578821" /></a></p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-348181p1.html">Shutterstock user Oleksiy Mark</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=578807&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=745333"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=745333" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=578807+dell-has-sold-1m-webscale-servers-in-five-years&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/12/migrating-media-applications-to-the-private-cloud-best-practices-for-businesses/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=578807+dell-has-sold-1m-webscale-servers-in-five-years&utm_content=dharrisstructure">Migrating media applications to the private cloud: best practices for businesses</a></li><li><a href="http://pro.gigaom.com/2010/01/facebook-apple-building-new-data-centers-but-why/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=578807+dell-has-sold-1m-webscale-servers-in-five-years&utm_content=dharrisstructure">Facebook, Apple Building New Data Centers, But Why?</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=578807+dell-has-sold-1m-webscale-servers-in-five-years&utm_content=dharrisstructure">A near-term outlook for big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/10/30/dell-has-sold-1m-webscale-servers-in-five-years/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_108857858.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_108857858.jpg?w=150" medium="image">
			<media:title type="html">Servers in the cloud</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/2500-dell-dcs-infographic_v9.jpeg" medium="image">
			<media:title type="html">2500.Dell DCS infographic_v9</media:title>
		</media:content>
	</item>
		<item>
		<title>Quantcast releases bigger, faster, stronger Hadoop file system</title>
		<link>http://gigaom.com/2012/09/27/quantcast-releases-bigger-faster-stronger-hadoop-file-system/</link>
		<comments>http://gigaom.com/2012/09/27/quantcast-releases-bigger-faster-stronger-hadoop-file-system/#comments</comments>
		<pubDate>Thu, 27 Sep 2012 13:38:34 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[HDFS]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[Quantcast]]></category>
		<category><![CDATA[Web Infrastructure]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=567220</guid>
		<description><![CDATA[It's not for everyone, but if you're storing petabytes of data Hadoop, Quantcast thinks it has the cure to your woes. Its newly open sourced Quantcast File System promises smaller clusters and better performance, and it has proven itself over exabytes of data inside Quantcast.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567220&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The Quantcast File System is like the Six-Million Dollar Man of distributed data stores for Hadoop. An implementation of the <a href="http://en.wikipedia.org/wiki/CloudStore">Kosmix Distributed File System (aka CloudStore)</a> that had largely been written off and forgotten, <a href="http://quantcast.com">Quantcast</a> has built QFS to be bigger, faster and stronger than the Hadoop Distributed File System most commonly associated with the popular big data platform. Now, QFS <a href="http://quantcast.github.com/qfs">is open source and ready for use</a> in the webscale world.</p>
<p>According to Quantcast VP of Research and Development Jim Kelly, the web-audience measurement specialist began working with Hadoop in 2006 and experienced problems almost from the start. However, while the early problems with HDFS might have been symptoms of its immaturity, the problems soon began centering around the two things Hadoop is supposed to be best at &#8212; size and speed. So, in 2008, Quantcast began experimenting with, and actually sponsoring, the Kosmix project.</p>
<p>It turns out that wasn&#8217;t a moment too soon. By 2010, after Quantcast began integrating with ad networks, its data flow really began picking up into the tens of terabytes a day range. It turned on QFS as its production Hadoop file system in 2011 and now receives about 40TB a day and processes a whopping 20 petabytes. Kelly said Quantcast has pushed 4 exabytes &#8212; or 4 billion gigabytes &#8212; through QFS since turning it on.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/09/qfs.jpg"><img  title="qfs" src="http://gigaom2.files.wordpress.com/2012/09/qfs.jpg?w=604&#038;h=296" alt="" width="604" height="296" class="aligncenter size-large wp-image-567289" /></a></p>
<h2>Faster, yes. Bigger, not so much.</h2>
<p>At Quantcast&#8217;s scale, the problem with HDFS wasn&#8217;t so much its scalability, but the sheer size of the cluster required to handle petabyte-scale data stores. HDFS stores three copies of each piece of data to ensure they&#8217;re always available, although it tries to make up for the size issue with data locality (i.e., putting data directly on the computing nodes so it doesn&#8217;t have to traverse the network in order to be processed). Kelly thinks those techniques are relics of a bygone era.</p>
<p>&#8220;When HDFS [was created], disk drives and networks were tied for being the slowest things in the cluster,&#8221; he said.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/09/qfs-2.jpg"><img  title="QFS 2" src="http://gigaom2.files.wordpress.com/2012/09/qfs-2.jpg?w=300&#038;h=206" alt="" width="300" height="206" class="alignright size-medium wp-image-567292" /></a>Enter <a href="http://en.wikipedia.org/wiki/Reed%E2%80%93Solomon_error_correction">Reed-Solomon error correction</a>, QFS&#8217;s chosen method for assuring reliable access to data that Kelly says actually ends up shrinking the size of Hadoop clusters while improving their performance. (It&#8217;s actually the same method used on CDs and DVDs.) Rather than storing three full versions of each file like HDFS, resulting in the need for three times more storage, QFS only needs 1.5x the raw capacity because it stripes data across nine different disk drives. Quantcast believes smaller cluster size, combined with today&#8217;s 10-gigabit networks and the ability to read and write data in parallel make QFS significantly faster than HDFS at large scale.</p>
<p>QFS also comes equipped with other features that Quantcast had to implement to make it production-ready. Among them: it is written in C++ and has fixed-footprint memory management; it has access control based on users and groups; and it intelligently detects node failures, as opposed to planned maintenance, and invokes data recovery accordingly.</p>
<h2>It&#8217;s not for everyone, though</h2>
<p>Despite its claimed improvements over HDFS though, Kelly is quick to point out that QFS is probably not the best choice for everyone. It&#8217;s really designed for Hadoop users operating at petabyte scale, who have the technical prowess to handle a migration away from HDFS, and for whom data-processing costs are hitting the six-to-seven-figure range monthly once things such as energy bills accounted for.</p>
<p>&#8220;If you&#8217;re cluster only has 10 disk drives,&#8221; Kelly said, &#8220;[QFS] will save you $500, which is nice but &#8230;&#8221;</p>
<p>Likewise, if high availability is very important, the <a href="http://hadoop.apache.org/releases.html#17+September%2C+2012%3A+Release+0.23.3+available">latest version of HDFS</a> might be preferable. &#8220;There&#8217;s a standby [in QFS]; it&#8217;s not quite as hot as theirs,&#8221; Kelly said. But availability isn&#8217;t super important to Quantcast, he said, it hasn&#8217;t had any real problems with QFS going down anyhow. When it does, it actually recovers pretty fast.</p>
<p>As for the <a href="http://gigaom.com/cloud/because-hadoop-isnt-perfect-8-ways-to-replace-hdfs/">rest of the file systems touting themselves as better alternatives for HDFS</a>, Kelly didn&#8217;t have much to say. Quantcast&#8217;s efforts are focused on mega-scale Hadoop deployments, and he doesn&#8217;t see anything better for that use case. Although, he noted, Hadoop vendors probably shouldn&#8217;t get too upset over all the competition.</p>
<p>&#8220;I think some diversity in the ecosystem is probably not a bad thing,&#8221; he said, &#8220;and is probably a sign of healthy evolution.&#8221;</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-244369p1.html">Shutterstock user Lobke Peers</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567220&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=292371"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=292371" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567220+quantcast-releases-bigger-faster-stronger-hadoop-file-system&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567220+quantcast-releases-bigger-faster-stronger-hadoop-file-system&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567220+quantcast-releases-bigger-faster-stronger-hadoop-file-system&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li><li><a href="http://pro.gigaom.com/2012/11/unlocking-big-datas-potential-with-search/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567220+quantcast-releases-bigger-faster-stronger-hadoop-file-system&utm_content=dharrisstructure">How search can unlock the power of big data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/09/27/quantcast-releases-bigger-faster-stronger-hadoop-file-system/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_55825522.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/09/shutterstock_55825522.jpg?w=150" medium="image">
			<media:title type="html">cyborg</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/09/qfs.jpg?w=604" medium="image">
			<media:title type="html">qfs</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/09/qfs-2.jpg?w=300" medium="image">
			<media:title type="html">QFS 2</media:title>
		</media:content>
	</item>
		<item>
		<title>Why Twitter should open up about its infrastructure</title>
		<link>http://gigaom.com/2012/07/26/why-twitter-should-open-up-about-its-infrastructure/</link>
		<comments>http://gigaom.com/2012/07/26/why-twitter-should-open-up-about-its-infrastructure/#comments</comments>
		<pubDate>Thu, 26 Jul 2012 22:58:22 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Data Centers]]></category>
		<category><![CDATA[infrastructure]]></category>
		<category><![CDATA[Outage]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Web Infrastructure]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=547209</guid>
		<description><![CDATA[If Twitter wants to remain opaque about its practices, that's fine -- but it shouldn't expect any slack from upset users or investors. Blaming a two-hour outage on an "infrastructural double-whammy" after remaining mum on even where its data centers are located doesn't exactly inspire confidence.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=547209&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Whatever investments Twitter is making to improve the reliability of its system aren&#8217;t working, or at least not as well as they should be. The world&#8217;s favorite micro-blogging site blamed Thursday morning&#8217;s approximately two-hour outage on problems within its data centers &#8212; specifically the parallel failure of its running system and its backup system &#8212; and it&#8217;s the second time in less than two month&#8217;s Twitter&#8217;s infrastructure has brought the site down. Maybe it&#8217;s time for Twitter to talk openly about what its doing in there.</p>
<p>Don&#8217;t get me wrong, Twitter has been nothing if not generous in talking about the software it builds. The company has <a href="http://gigaom.com/cloud/twitter-open-sources-its-mysql-secret-sauce/">open sourced numerous data-management tools</a> and other pieces of code. It has occasionally (at least in 2009) been willing to share <a href="http://highscalability.com/scaling-twitter-making-twitter-10000-percent-faster/">how it handles, stores, searches, and analyzes billions of data points</a> relating to users and their tweets. But when it comes to the actual infrastructure on which this software runs?</p>
<p>Let&#8217;s just say Twitter is less than forthcoming. The <a href="http://blog.twitter.com/2012/07/our-apologies-for-todays-outage.html">explanation of today&#8217;s outage</a>:</p>
<blockquote><p>The cause of today’s outage came from within our data centers. Data centers are designed to be redundant: when one system fails (as everything does at one time or another), a parallel system takes over. What was noteworthy about today’s outage was the coincidental failure of two parallel systems at nearly the same time.</p></blockquote>
<p>It was minimally more descriptive in <a href="http://blog.twitter.com/2012/06/todays-turbulence-explained.html">explaining the cascading bug</a> that took the site down temporarily last month.</p>
<p>And after <a href="http://gigaom.com/cloud/twitter-moves-into-data-center-goodbye-fail-whale/">publicly detailing the migration into its new data center</a> last March, Twitter has been <a href="http://gigaom.com/cloud/twitters-data-center-mystery-deepens/">mysteriously mum about where it&#8217;s actually running</a>. In Salt Lake City? In Sacramento? The response from a Twitter spokesperson when asked where it&#8217;s data centers are located last June: &#8220;I can also confirm that we have multiple sites, but I won’t go into further detail.” Apparently, it <a href="http://gigaom.com/cloud/twitters-ever-changing-infrastructure-story/">now has space in Atlanta</a>, too.</p>
<p>If Twitter wants to remain opaque about its practices, that&#8217;s fine &#8212; but it shouldn&#8217;t expect any slack from upset users or investors. By contrast, we have a pretty good idea where Google&#8217;s data centers are and what&#8217;s going on inside them, and we know nearly everything about Facebook&#8217;s operations. When Amazon Web Services has an outage, it might take days, but <a href="http://gigaom.com/cloud/how-to-deal-with-cloud-failure-live-learn-fix-repeat/">provides a detailed post-mortem report</a> explaining what went wrong.</p>
<p>Even if Twitter&#8217;s infrastructure team is filled with very smart engineers, there&#8217;s certainly benefit to be derived from public discussion about what it might be doing right and wrong. Clearly, something isn&#8217;t right; the site is down too often considering how much smaller it is than the aforementioned services. While it&#8217;s not disruptive enough to anyone&#8217;s business to warrant an AWS-style explanation, we gotta get something better than blaming two hours of downtime on an &#8220;infrastructural double-whammy.&#8221;</p>
<p><em>Image courtesy of <a href="http://www.shutterstock.com/gallery-67766p1.html">Shutterstock user Elnur</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=547209&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=628922"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=628922" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=547209+why-twitter-should-open-up-about-its-infrastructure&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/07/cloud-and-data-second-quarter-2012-analysis-and-outlook-2/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=547209+why-twitter-should-open-up-about-its-infrastructure&utm_content=dharrisstructure">Takeaways from the second quarter in cloud and data</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=547209+why-twitter-should-open-up-about-its-infrastructure&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/02/what-converged-infrastructure-means-for-the-future-of-the-data-center-staff/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=547209+why-twitter-should-open-up-about-its-infrastructure&utm_content=dharrisstructure">What converged infrastructure means for the future of the data center staff</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/26/why-twitter-should-open-up-about-its-infrastructure/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/hardhat.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/hardhat.jpg?w=150" medium="image">
			<media:title type="html">hardhat</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>How one startup wants to inject Hadoop into your SQL</title>
		<link>http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/</link>
		<comments>http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/#comments</comments>
		<pubDate>Tue, 24 Jul 2012 18:40:52 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[database]]></category>
		<category><![CDATA[Drawn to Scale]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[Mapr]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[scalability]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=545849</guid>
		<description><![CDATA[Drawn to Scale's Spire database is meant to be all things to all people -- it combines Hadoop, HBase and SQL to provide a fast, scalable, robust experience -- and now it has integrated with MapR's Hadoop distribution. It's no surprise the young company already claims big customers.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=545849&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/07/hard_drives.jpg"><img  title="hard drives" src="http://gigaom2.files.wordpress.com/2012/07/shutterstock_108064520.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-545921" /></a>Forget about standalone Hadoop clusters running batch workloads to churn through massive datasets a few times per day or per week. San Francisco-based startup <a href="http://drawntoscale.com">Drawn to Scale</a> thinks Hadoop&#8217;s real home within the enterprise will be in your database. No, it&#8217;s not not trying to sell you some limited-use NoSQL data store &#8212; it wants to turn your good, old-fashioned SQL database into a real-time analytic engine that can grow like a weed.</p>
<p>And it wants to make this transition as easy as possible for customers that don&#8217;t want to get their hands dirty working with Hadoop or trying to turn a relational database into something it isn&#8217;t.</p>
<p>Drawn to Scale&#8217;s product, called Spire, is built atop HBase (a distributed NoSQL database that uses the Hadoop Distributed File System), so it scales with ease. Because it uses SQL, users can write robust queries in a familiar manner. Because it uses a distributed index and knows exactly where to go to find the right data, it&#8217;s fast as heck. Because it leverages Chef to configure the database, there&#8217;s no need to learn the finer points deploying and managing a Hadoop cluster.</p>
<p>&#8220;We&#8217;re sort of the only game in town when it comes to cloud-scale databases that run on top of Hadoop,&#8221; says Drawn to Scale Founder and CEO Bradford Stephens. There are plenty of companies that have lots of data &#8212; petabyes of it, in some cases &#8212; and want to find new ways to analyze it, but they don&#8217;t necessarily want to learn how to deploy Hadoop or write MapReduce workloads.</p>
<p>As of Tuesday, Drawn to Scale has another carrot to offer prospective customers because Spire now <a href="http://www.marketwire.com/press-release/drawn-scale-delivers-real-time-sql-applications-on-hadoop-with-mapr-partnership-1683188.htm">ships with MapR&#8217;s M3 distribution</a> pre-integrated as the product&#8217;s Hadoop platform. Although it&#8217;s the bane of open source devotees such as fellow Hadoop vendors Cloudera and Hortonworks because it uses a proprietary file system in place of HDFS, MapR is gaining quite a following because of the performance benefits its technology brings. In fact, Drawn to Scale is just the latest MapR partner after Amazon Web Services recently <a href="http://gigaom.com/cloud/amazon-taps-mapr-for-high-powered-elastic-mapreduce/">tagged it for inclusion in its Elastic MapReduce offering</a> and Google <a href="http://www.mapr.com/blog/google-mapr">did the same for its Compute Engine cloud</a>.</p>
<div id="attachment_545918" class="wp-caption alignright" style="width: 130px"><a href="http://gigaom2.files.wordpress.com/2012/07/bradford-hathead-copy.jpg"><img  title="bradford-hathead copy" src="http://gigaom2.files.wordpress.com/2012/07/bradford-hathead-copy.jpg?w=708" alt=""   class="size-full wp-image-545918" /></a><p class="wp-caption-text">Bradford Stephens</p></div>
<p>When you&#8217;re building a distributed database, you need a fast distributed file system, he added, and MapR is the fastest Hadoop file system around. Because it&#8217;s fully API-compliant with HDFS, though, M3 still works just fine with Spire&#8217;s HBase underpinnings. &#8220;Despite our small size,&#8221; Stephens said, &#8220;we are more than willing to pick a horse and really advocate loudly for it.&#8221;</p>
<p>Although Drawn to Scale is relatively young &#8212; it&#8217;s still in private beta and <a href="http://gigaom.com/cloud/drawn-to-scale-raises-money-to-make-sql-big-data-ready/">just raised a $925,000 first round in March</a> &#8212; large companies are already buying into its message. Already, Stephens told me, it&#8217;s engaged in seven <em>paid</em> pilot programs with large credit card companies, telcos and other &#8220;traditional&#8221; types of enterprises. One of them is already trying to negotiate a deal for a 1,000-node production deployment, he added. The company&#8217;s skeleton crew of mostly engineers couldn&#8217;t keep up the support workload if it opened the floodgates on all the inbound interest, Stephens said.</p>
<p>That kind of demand isn&#8217;t surprising when you consider Hadoop&#8217;s <a href="http://gigaom.com/cloud/the-state-of-hadoop-strong-and-poised-to-explode/">promise as the platform for a new generation</a> of data-driven platforms. Whether or not <a href="http://gigaom.com/cloud/why-the-days-are-numbered-for-hadoop-as-we-know-it/">anyone actually wants to use MapReduce</a> &#8212; the programming framework that made Hadoop famous &#8212; they do want to take advantage of its nearly limitless scale on cheap, commodity hardware. When companies such as Drawn to Scale bake Hadoop into an otherwise useful product as a technological component rather than as the product itself, customers should experience the best of both worlds.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-188515p1.html">Shutterstock user Jakub Pavlinec</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=545849&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=437674"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=437674" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=545849+how-one-startup-wants-to-inject-hadoop-into-your-sql&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=545849+how-one-startup-wants-to-inject-hadoop-into-your-sql&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2010/10/with-scalable-data-stores-around-is-nosql-a-non-starter/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=545849+how-one-startup-wants-to-inject-hadoop-into-your-sql&utm_content=dharrisstructure">With Scalable Data Stores Around, Is NoSQL a Non-Starter?</a></li><li><a href="http://pro.gigaom.com/2012/12/big-data-2013-key-trends-and-companies-to-watch/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=545849+how-one-startup-wants-to-inject-hadoop-into-your-sql&utm_content=dharrisstructure">Big data 2013: key trends and companies to watch</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/shutterstock_108064520.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/shutterstock_108064520.jpg?w=150" medium="image">
			<media:title type="html">hard drives</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/shutterstock_108064520.jpg?w=300" medium="image">
			<media:title type="html">hard drives</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/bradford-hathead-copy.jpg" medium="image">
			<media:title type="html">bradford-hathead copy</media:title>
		</media:content>
	</item>
		<item>
		<title>Meet the company building AOL&#8217;s micro data centers</title>
		<link>http://gigaom.com/2012/07/12/meet-the-company-building-aols-micro-data-centers/</link>
		<comments>http://gigaom.com/2012/07/12/meet-the-company-building-aols-micro-data-centers/#comments</comments>
		<pubDate>Thu, 12 Jul 2012 19:25:35 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[AOL]]></category>
		<category><![CDATA[Data Centers]]></category>
		<category><![CDATA[Elliptical Mobile Solutions]]></category>
		<category><![CDATA[energy efficiency]]></category>
		<category><![CDATA[Green IT]]></category>
		<category><![CDATA[modular data centers]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=542099</guid>
		<description><![CDATA[Elliptical Mobile Solutions is hardly a household name in the data center world, but don't bet against it. While bigger data centers seem to be better for webscale companies such as Google and Facebook, many are happy to grow about 105 cubic feet at a time.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=542099&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.ellipticalmedia.com/">Elliptical Mobile Solutions</a> is hardly a household name in the data center world, but don&#8217;t bet against it becoming one. The Chandler, Ariz.-based company that started inside a founder&#8217;s garage builds one of the world&#8217;s smallest data centers and has already secured some big-name customers including, <a href="http://gigaom.com/cloud/aol-building-refrigerator-sized-data-centers/">most famously, AOL</a>. While bigger data centers seem to be better for webscale companies such as Google and Facebook, many are happy to grow on a lot smaller scale &#8212; about 105 cubic feet at a time.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/04/dsc_4650-hr-small.jpg"><img  title="DSC_4650 - HR-small" src="http://gigaom2.files.wordpress.com/2012/04/dsc_4650-hr-small.jpg?w=300&#038;h=187" alt="" width="300" height="187" class="alignleft size-medium wp-image-508507" /></a>Granted, EMS&#8217;s boxes are nowhere near as powerful as a massive data center chock full of computing gear, but that&#8217;s kind of the point. Modular data centers are all the rage right now because they let companies grow capacity as its needed, whether that&#8217;s <a href="http://gigaom.com/cloud/how-io-is-trying-to-build-modular-data-centers-for-the-rest-of-us/">a rack at a time inside an IO Data Centers unit</a> or <a href="http://gigaom.com/cloud/making-the-web-more-efficient-a-thousand-servers-at-a-time/">1,920 servers at a time</a> inside one of eBay&#8217;s specially designed modular units.</p>
<p>But modular doesn&#8217;t necessarily mean easy or flexible. At the smallest, a standard shipping container unit needs 120 square feet of floor space and can weigh 100,000 pounds, while one of IO&#8217;s units needs 500 square feet.</p>
<p>If you just need a rack full of dense computing power that can go quite literally anywhere you have room, EMS might be your provider. The company&#8217;s biggest unit, the 42U and roughly 165-cubic-foot R.A.S.E.R. HD, can handle up to 80 kilowatts at a PUE of 1.1. Its mid-range unit &#8212; the R.A.S.E.R. DX, which is the 105-cubic-foot unit AOL uses &#8212; can support 12 kilowatts and can fit through a doorway. Its smallest product, the C3-S.P.E.A.R. comes on wheels.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/07/datacenter2copy.jpg"><img  title="datacenter2copy" src="http://gigaom2.files.wordpress.com/2012/07/datacenter2copy.jpg?w=604&#038;h=289" alt="" width="604" height="289" class="aligncenter size-large wp-image-542168" /></a></p>
<p>Aside from their size, the micro-modular data centers (as the company calls them) are so versatile because they&#8217;re fully contained units complete with state-of-the-art cooling and fire-supression technologies. EMS CEO Bill Stockwell explained their appeal to me like this: &#8220;Can you imagine taking a gallon of milk out of your refrigerator, putting it on the counter, and cooling your whole house [from the coolness it puts off]?&#8221; That&#8217;s how many traditional data centers function, he said, with central cooling that has to handle all the racks within the facility. &#8220;We make a refrigerator unit for IT deployment.&#8221;</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/07/nibiru-copy.jpg"><img  title="nibiru copy" src="http://gigaom2.files.wordpress.com/2012/07/nibiru-copy.jpg?w=300&#038;h=224" alt="" width="300" height="224" class="alignright size-medium wp-image-542165" /></a>Tony Cole, EMS&#8217;s vice president of North American sales, told me many customers put them inside office building in lieu of building out new rooms capable of handling computing gear. Royal Caribbean puts EMS units on its cruise ships. AOL appears content putting them on concrete slabs outside its offices.</p>
<p>Others that have their own data centers will deploy a micro data center when they want to increase density for a certain application but their current infrastructure can&#8217;t handle tens of kilowatts per rack. That&#8217;s what happened with a missile range in New Mexico that bought a load of high-density gear but then found out it would cost $900,000 to retrofit their data center to support it. Instead, they consolidated 10 racks into two R.A.S.E.R. HD units and actually acquired more computing capacity as a result.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/07/raserhd-1.jpg"><img  title="RASERHD-1" src="http://gigaom2.files.wordpress.com/2012/07/raserhd-1.jpg?w=245&#038;h=300" alt="" width="245" height="300" class="alignleft size-medium wp-image-542169" /></a>Because it&#8217;s liquid-cooled, &#8220;there isn&#8217;t an IT package on the market today that the R.A.S.E.R. HD couldn&#8217;t support from a cooling perspective,&#8221; Cole said.</p>
<p>Currently, Cole said, EMS&#8217;s micro data centers often act as disaster-recovery sites for mission-critical applications or sometimes get placed at remote offices. But EMS co-founder and Chief Technology Evangelist Simon Rohrich said he sees a shift happening as more companies get interested in owning their own cloud computing infrastructures, but don&#8217;t have the budgets or expertise to build and run their own data centers.</p>
<p>In this regard, AOL could turn out to be one heck of an important customer as both a use case and a cheerleader. AOL&#8217;s use case of deploying EMS units globally as part of a single cloud platform shows customers how easy it can be to make 10 racks across 5 continents look like a single location with the right software, Rohrich said. And having the company out in public <a href="http://loosebolts.wordpress.com/2012/07/05/aols-data-center-independence-day/">touting 5 times the computing capacity for 10 percent of the cost</a> is sure to turn some heads, too.</p>
<p>Already, he added, customers are starting to realize they can deploy multiple EMS units at a single location to achieve some powerful systems. Cole noted that EMS has deals in place for 13- and 16-unit arrays already, but suspects bigger arrays will come. The company is still in the early stages, he said, &#8220;so people are just kind of kicking the tires.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=542099&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=567566"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=567566" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542099+meet-the-company-building-aols-micro-data-centers&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/06/3-baby-steps-toward-greener-data-centers/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542099+meet-the-company-building-aols-micro-data-centers&utm_content=dharrisstructure">3 baby steps toward greener data centers</a></li><li><a href="http://pro.gigaom.com/2011/05/how-a-snapshot-of-a-green-data-center-can-be-misleading/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542099+meet-the-company-building-aols-micro-data-centers&utm_content=dharrisstructure">How a Snapshot of a Green Data Center Can Be Misleading</a></li><li><a href="http://pro.gigaom.com/2010/11/how-to-make-cloud-computing-greener/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542099+meet-the-company-building-aols-micro-data-centers&utm_content=dharrisstructure">How to Make Cloud Computing Greener</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/12/meet-the-company-building-aols-micro-data-centers/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/datacenter2copy1-e1342119591822.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/datacenter2copy1-e1342119591822.jpg?w=150" medium="image">
			<media:title type="html">datacenter2copy</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/04/dsc_4650-hr-small.jpg?w=300" medium="image">
			<media:title type="html">DSC_4650 - HR-small</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/datacenter2copy.jpg?w=604" medium="image">
			<media:title type="html">datacenter2copy</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/nibiru-copy.jpg?w=300" medium="image">
			<media:title type="html">nibiru copy</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/raserhd-1.jpg?w=245" medium="image">
			<media:title type="html">RASERHD-1</media:title>
		</media:content>
	</item>
		<item>
		<title>Fusion-io turns NAND into DRAM for developers</title>
		<link>http://gigaom.com/2012/07/10/fusion-io-turns-nand-into-dram-for-developers/</link>
		<comments>http://gigaom.com/2012/07/10/fusion-io-turns-nand-into-dram-for-developers/#comments</comments>
		<pubDate>Tue, 10 Jul 2012 16:28:40 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[DRAM]]></category>
		<category><![CDATA[Flash storage]]></category>
		<category><![CDATA[Fusion-io]]></category>
		<category><![CDATA[in-memory database]]></category>
		<category><![CDATA[memory]]></category>
		<category><![CDATA[nand flash]]></category>
		<category><![CDATA[SAP]]></category>
		<category><![CDATA[ssds]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=541069</guid>
		<description><![CDATA[Flash-based storage pioneer Fusion-io says it has developed a method for extending a system's memory from DRAM into Fusion-io's NAND-based storage tier, enabling the possibility of bigger, cheaper in-memory applications than are currently possible. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=541069&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/07/dram.jpg"><img  title="dram" src="http://gigaom2.files.wordpress.com/2012/07/dram.jpg?w=300&#038;h=225" alt="" width="300" height="225" class="alignleft size-medium wp-image-541119" /></a>Flash-based storage pioneer Fusion-io says it has developed a method for extending a system&#8217;s memory from DRAM into Fusion-io&#8217;s NAND-based storage tier, enabling the possibility of bigger, cheaper in-memory applications than are currently possible. This type of capability could become increasingly important as companies expect ever-faster performance of their data-analysis systems but don&#8217;t want to pay to store massive data sets in pricey DRAM.</p>
<p>Here&#8217;s how the company explains the technology in the press release announcing it:</p>
<blockquote><p>The Extended Memory subsystem dynamically moves frequently accessed data pages into memory on-demand while transparently migrating rarely accessed data pages from DRAM into ioMemory. This allows developers to simplify application design by assuming that entire datasets are in-memory, without the costs associated with DRAM purchase and operation. Application developers are able to further tune performance through software development kit tools that lock selected pages into DRAM, giving access to NAND flash as memory, instead of treating it as an extension of disk storage.</p></blockquote>
<p>Fusion-io was able to pull this off in part because of an industry trend toward higher-capacity, less-expensive flash storage and in part because Fusion-io doesn&#8217;t claim to offer solid-state drives. When I covered the release of the company&#8217;s ioDrive2 and ioDrive Duo2 (which plug into a server&#8217;s PCI slot and operate as repositories for data needing high performance) in October, the products represented a more than 2x capacity gain over the previous generation and continued to boost IOPS performance, <a href="http://gigaom.com/cloud/how-consumer-demands-drive-enterprise-flash-storage/">despite the smaller and cheaper NAND technologies on which they&#8217;re built</a>.</p>
<p>Fusion-io Founder and CEO David Flynn attributed his company&#8217;s ability to continually raise performance on new, consumer-driven NAND designs with its decision to to provide memory controller software rather than trying to act like a hard-disk drive (as other solid-state drive providers try to do) and use a microcontroller. That same decision, the company says, also helped make Extended Memory possible because it didn&#8217;t have to try and make disk-storage protocols function at the memory tier.</p>
<p>In-memory databases and analytic systems (such as SAP&#8217;s high-performance HANA appliance) are nothing new, but they&#8217;re limited in size to the system&#8217;s memory footprint and can be rather expensive. All-flash storage arrays are also <a href="http://gigaom.com/cloud/emc-goes-all-flash-buys-xtremio-for-430m/">becoming much more popular and getting less expensive</a>, but they still exist as a separate storage tier and are often aimed at legacy enterprise applications. If what Fusion-io and Princeton have created works as promised, it could create a middle ground approach that resonates with customers &#8212; such as Fusion-io mega-user Facebook &#8212; that need a workable combination of massive scalability and high performance.</p>
<p><em>Image courtesy of <a href="http://www.shutterstock.com/gallery-497053p1.html">Shutterstock user Mark Schwettmann</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=541069&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=597999"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=597999" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=541069+fusion-io-turns-nand-into-dram-for-developers&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/04/infrastructure-q1-cloud-and-big-data-woo-the-enterprise/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=541069+fusion-io-turns-nand-into-dram-for-developers&utm_content=dharrisstructure">Infrastructure Q1: Cloud and big data woo enterprises</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=541069+fusion-io-turns-nand-into-dram-for-developers&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/01/infrastructure-q4-big-data-gets-bigger-and-saas-startups-shine/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=541069+fusion-io-turns-nand-into-dram-for-developers&utm_content=dharrisstructure">Infrastructure Q4: Big data gets bigger and SaaS startups shine</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/10/fusion-io-turns-nand-into-dram-for-developers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/dram.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/dram.jpg?w=150" medium="image">
			<media:title type="html">dram</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/dram.jpg?w=300" medium="image">
			<media:title type="html">dram</media:title>
		</media:content>
	</item>
		<item>
		<title>AOL building refrigerator-sized data centers</title>
		<link>http://gigaom.com/2012/07/05/aol-building-refrigerator-sized-data-centers/</link>
		<comments>http://gigaom.com/2012/07/05/aol-building-refrigerator-sized-data-centers/#comments</comments>
		<pubDate>Fri, 06 Jul 2012 01:10:25 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[AOL]]></category>
		<category><![CDATA[containers]]></category>
		<category><![CDATA[Data Center Efficiency]]></category>
		<category><![CDATA[Data Centers]]></category>
		<category><![CDATA[ebay]]></category>
		<category><![CDATA[Green IT]]></category>
		<category><![CDATA[modular data centers]]></category>
		<category><![CDATA[Web Infrastructure]]></category>
		<category><![CDATA[webscale]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=539908</guid>
		<description><![CDATA[AOL is taking its flexible infrastructure strategy to a whole new level of flexibility by building data centers that are about the size of French door refrigerators. Now, AOL will be able to deploy infrastructure where needed with little more than an electrical outlet required.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=539908&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/07/nibiru.jpg"><img  title="nibiru" src="http://gigaom2.files.wordpress.com/2012/07/nibiru.jpg?w=300&#038;h=224" alt="" width="300" height="224" class="alignright size-medium wp-image-539989" /></a><strong>Updated: </strong>AOL is taking its flexible infrastructure strategy to a whole new level of flexibility by building data centers about the size of French door refrigerators. AOL Services CTO Mike Manos wrote about the units &#8212; part of a project code-named &#8220;Nibiru&#8221; internally &#8212; in his blog on Thursday, <a href="http://loosebolts.wordpress.com/2012/07/05/aols-data-center-independence-day/">proclaiming July 4 (the day the first one arrived) AOL&#8217;s Data Center Independence Day</a>. If they work as planned, AOL will be able to deploy new services and infrastructure when and where needed with little more than an electrical outlet required.</p>
<p>The Nibiru project, he explains, is a set of &#8220;incredibly game-changing&#8221; goals for transforming the way AOL&#8217;s services division carries out the work of managing the company&#8217;s infrastructure, and the newly materialized mini data centers we&#8217;re high on the list:</p>
<blockquote><p>Our primary “Nibiru” goal was to develop and deliver a data center environment without the need of a physical building. The environment needed to require as minimal amount of physical “touch” as possible and allow us the ultimate flexibility in terms of how we delivered capacity for our products and services. We called this effort the Micro Data Center. If you think about the amount of things that need to change to evolve to this type of strategy it’s a bit mind-boggling.</p></blockquote>
<p>Among those changes, Manos writes, are capabilities such as being able to deploy infrastructure wherever it&#8217;s needed regardless of temperature and humidity, and the &#8220;ability to fit into the power envelope of a normal office building.&#8221; The units will also be part of AOL&#8217;s automated cloud computing infrastructure, which means they&#8217;re managed as part of a greater pool of resources without the need for dedicated staff.</p>
<p>But these mini data centers aren&#8217;t just about a cool operations project; they could end up paying big dividends for AOL&#8217;s business lines, as well. Not only can AOL save money by taking up less space and power within a traditional colocation facility &#8212; if it decides to place a Nibiru box in such a facility at all &#8212; but the box gives AOL the flexibility to move into new geographies with ease. Among the benefits Manos points to are:</p>
<blockquote>
<ul>
<li>It allows us an incredibly flexible platform for driving and addressing privacy laws, regulatory oversight, and other such concerns allowing us to respond rapidly.</li>
<li>Gives us the ability to drive Edge Computing delivery to potentially bypass CDNs for certain content.</li>
<li>Gives us the capability to drive ‘Community-in-a-box’ whereby we can quickly launch new products in markets, quickly expand existing footprints like Patch in a low cost, but still hyper-local platform, allow the Huffington Post a platform to rapidly partner and enter new markets with minimal cost turn ups.</li>
</ul>
</blockquote>
<div id="attachment_539994" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2012/07/dsc_6365.jpeg"><img  title="dsc_6365" src="http://gigaom2.files.wordpress.com/2012/07/dsc_6365.jpeg?w=708" alt=""   class="size-full wp-image-539994" /></a><p class="wp-caption-text">Containers atop eBay&#8217;s Project Mercury data center in Phoenix.</p></div>
<p>AOL&#8217;s new boxes might be about the smallest data-center-in-a-box units around, but they&#8217;re actually part of a great move toward modular data centers across companies of all types. As I reported in April, eBay is <a href="http://gigaom.com/cloud/making-the-web-more-efficient-a-thousand-servers-at-a-time/">buying custom-built containers full of thousands of servers</a> that it can drop into (or on top of) its data centers as capacity dictates. IO Data Centers <a href="http://gigaom.com/cloud/how-io-is-trying-to-build-modular-data-centers-for-the-rest-of-us/">has built an entire business around modular data centers</a> that can sit just about anywhere.</p>
<p><strong>Update: </strong>AOL&#8217;s micro data centers were built by <a href="http://www.ellipticalmedia.com/index.html">Elliptical Mobile Solutions</a>, according to a spokesperson for that company. It has also built units, which it calls micr0-modular data centers, for NATO, the U.S. government and the Canadian Department of Defense.</p>
<p>All these efforts share the same goal of letting companies grow their server count when and where needed rather than trying to predict necessary capacity and relevant geographical locations years in advance.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=539908&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=321517"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=321517" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539908+aol-building-refrigerator-sized-data-centers&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/01/big-data-arm-and-legal-troubles-transformed-infrastructure-in-q4/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539908+aol-building-refrigerator-sized-data-centers&utm_content=dharrisstructure">Big Data, ARM and Legal Troubles Transformed Infrastructure in Q4</a></li><li><a href="http://pro.gigaom.com/2012/10/social-third-quarter-2012-analysis-and-outlook/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539908+aol-building-refrigerator-sized-data-centers&utm_content=dharrisstructure">Social third-quarter 2012: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/2012/08/the-economics-of-clean-data-center-innovation/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539908+aol-building-refrigerator-sized-data-centers&utm_content=dharrisstructure">The economics of clean-data-center innovation</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/05/aol-building-refrigerator-sized-data-centers/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/nibiru.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/nibiru.jpg?w=150" medium="image">
			<media:title type="html">nibiru</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/nibiru.jpg?w=300" medium="image">
			<media:title type="html">nibiru</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/dsc_6365.jpeg" medium="image">
			<media:title type="html">dsc_6365</media:title>
		</media:content>
	</item>
	</channel>
</rss>
