<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; surge conference</title>
	<atom:link href="http://gigaom.com/tag/surge-conference/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Fri, 24 May 2013 20:29:10 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; surge conference</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>To scale web services, devops devotees should consider economics</title>
		<link>http://gigaom.com/2012/10/01/to-scale-web-services-devops-devotees-should-consider-economics/</link>
		<comments>http://gigaom.com/2012/10/01/to-scale-web-services-devops-devotees-should-consider-economics/#comments</comments>
		<pubDate>Mon, 01 Oct 2012 22:59:09 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[devops]]></category>
		<category><![CDATA[infrastructure]]></category>
		<category><![CDATA[operations]]></category>
		<category><![CDATA[real-time monitoring services]]></category>
		<category><![CDATA[surge conference]]></category>
		<category><![CDATA[web services]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=567787</guid>
		<description><![CDATA[For most people scaling out a web service is a matter of thinking about hardware and software. But the recent Surge conference taught me that most devops folk need to look down to the physical infrastructure as well as the economic tradeoffs of building a service.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567787&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Truly scaling a web service doesn&#8217;t just require computer science expertise &#8212; the best operations and devops employees will also have an understanding of economics. The big picture at this year&#8217;s <a href="http://omniti.com/surge/2012/sessions">Surge Conference</a> in Baltimore, Md. seemed to be about optimizing lower-level services such as DNS or even networking protocols, and optimizing at the higher level via an understanding of economics.</p>
<p>The talks at Surge are designed as how-to discussions for engineers and can get pretty deep in the weeds. But after spending two days at the show last week, I&#8217;ve realized that many of the tweaks engineers make to their cloud-based services occur in a relatively narrow band. They&#8217;re focused mainly on code tweaks or rethinking the architecture to optimize for a specific cloud (almost always Amazon Web Services).</p>
<p>But lower-level tweaks required buying services or maybe even certain impossibilities such as owning the client software. One of the most-effective elements people discussed went mostly unsaid, but seems obvious to those in the business world &#8212; optimizing your architecture for the clouds you&#8217;re on. Several of the presenters, such as Joe Kottke of BrightTag, took attendees through their particular usage scenario and explained how they chose a different instance or changed their app architecture to cut costs.</p>
<p>One of the most effective of these was a chat by <a href="http://omniti.com/surge/2012/sessions/cluster-computing-a-case-study-in-scaling-a-fuzzy-search">Riley Berton of Viggle</a> who shared his social TV startup changed the way the service matched the sound prints of TV shows submitted by users to its database. Changing the way his application was built enabled him to go from spending $180,000 a month on Amazon to spending $25,000. One might argue that his application was poorly architected in the first place if he was spending that much, but it&#8217;s not like most developers have a sense of how their app should perform and what it should cost when they get started. That&#8217;s part of the problem.</p>
<p>And that was probably the most important takeaway from the show. As common as it is to find startups building out services in the cloud, there is till a lot to learn about understanding how certain architecture decisions translate into costs as your app scales. Services like Cloudability are hoping to help companies track this, as are <a href="http://gigaom.com/cloud/how-big-data-will-change-networking/">real-time monitoring services</a> such as Boundary, but there&#8217;s still plenty of low-hanging fruit in just thinking about the best instance types for a memory-intensive application as opposed to what one should think about using if you <a href="http://gigaom.com/cloud/pulse-on-kindle-fire-powered-by-google/">need rapid IO</a>.</p>
<p>In addition to that, most programmers should also think about whether or not they should even optimize their app for a certain criteria if the cost will turn out to be astronomical, or if perhaps they should trade down to a different optimization that comes at a lower price point. Most engineers are accustomed to thinking about these tradeoffs when it comes to hardware or a certain database, but translating it beyond just the application&#8217;s performance and into its cost is a new way of thinking. So maybe those CS majors might want to consider a business or economics class in addition to their normal coursework.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567787&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=715392"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=715392" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567787+to-scale-web-services-devops-devotees-should-consider-economics&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/10/continuous-delivery-and-the-world-of-devops/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567787+to-scale-web-services-devops-devotees-should-consider-economics&utm_content=shigginbotham">Continuous delivery and the world of devops</a></li><li><a href="http://pro.gigaom.com/2011/12/migrating-media-applications-to-the-private-cloud-best-practices-for-businesses/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567787+to-scale-web-services-devops-devotees-should-consider-economics&utm_content=shigginbotham">Migrating media applications to the private cloud: best practices for businesses</a></li><li><a href="http://pro.gigaom.com/2009/04/as-devices-converge-chip-vendors-girding-for-a-fight/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567787+to-scale-web-services-devops-devotees-should-consider-economics&utm_content=shigginbotham">As Devices Converge, Chip Vendors Girding For a Fight</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/10/01/to-scale-web-services-devops-devotees-should-consider-economics/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/20120927_090643-e1349132318964.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/20120927_090643-e1349132318964.jpg?w=150" medium="image">
			<media:title type="html">20120927_090643</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>
	</item>
		<item>
		<title>How Facebook solves the IT culture wars and scales its site</title>
		<link>http://gigaom.com/2012/09/28/how-facebook-solves-the-it-culture-wars-and-scales-its-site/</link>
		<comments>http://gigaom.com/2012/09/28/how-facebook-solves-the-it-culture-wars-and-scales-its-site/#comments</comments>
		<pubDate>Fri, 28 Sep 2012 21:59:31 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[employees]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[scale]]></category>
		<category><![CDATA[surge conference]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=567912</guid>
		<description><![CDATA[People and processes are just as important as servers when it comes to scale. A Facebook engineer explains how the social network built a culture and tools to help it keep up with its ever-expanding hardware infrastructure and software toolsets.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567912&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Scaling isn&#8217;t just a matter of software and code, there&#8217;s also a huge cultural issue at play. At Facebook, solving problems between the engineering and operations teams, quelling fears about job loss related to automation at the employee level, and delivering tools to monitor the company&#8217;s IT operations all play big roles in helping the site scale to 950 million users.</p>
<p>In a talk at the <a href="http://omniti.com/surge/2012">Surge Conference</a> in Baltimore, Md., Pedro Canahuati, director of production engineering and site reliability at Facebook, explained how the social network keeps the site available, reliable and efficient.</p>
<h2>Scale smart not not just fast. </h2>
<p>Adding servers is essential to keeping Facebook available to the rapidly growing user base, but to remain reliable for the long term, Facebook needed a system that could scale up in a hurry if it needed to add tens of thousands of servers at a time. In August 2009 when Canahuati arrived at Facebook, he said the site spent seven weeks to get 10,000 servers into production once they were plugged in. Thanks to the <a href="http://gigaom.com/cloud/how-facebook-brings-a-new-data-center-online/">development of Triforce</a>, it now takes seven days to get the site running on 10,000 servers. But in Canahuati&#8217;s opinion, the rate of getting software on servers is still too slow, so there are two Facebook employees still working on making that process faster.</p>
<p>Building a tool itself saves Facebook the headache of dealing with open source code that might not be able to hack the strain of its infrastructure demands, while also keeping Facebook from paying a vendor a license fee that could become astronomical as it scaled. Understanding the process of building the tools to keep the site&#8217;s infrastructure up and running, and the impact that process plays on the ultimate goal of reliability, is where Facebook has carved out another advantage that other large web services could learn from.</p>
<p>Like his <a href="http://gigaom.com/2012/09/27/scaling-pinterest-and-adventures-in-database-sharding/">peers at Pinterest</a>, Canahuati stressed that when you are at massive scale, you need to keep it simple.</p>
<h2>Make it easy to see what&#8217;s wrong. </h2>
<p>He also added a few more tenets, including &#8220;instrument your world.&#8221; He explained how Facbeook collects a lot of data across many of its systems in order to understand how the different services that comprise the site are performing and interacting. Tools such as <a href="http://gigaom.com/cloud/exclusive-facebooks-scuba-project-dives-into-performance-data/">Scuba</a> and <a href="http://gigaom.com/cloud/at-facebook-cache-is-king-heres-how-it-keeps-it-up/">Claspin</a> are examples of this effort to take complex operational data and make it easy to understand.</p>
<p>&#8220;Smart visualizations are often overlooked,&#8221; said Canahuati, but scanning data at a glance and then acting quickly on it is an essential ingredient for keeping systems up and running.</p>
<h2>Automation can be a dream &#8212; or a nightmare. </h2>
<p>One of Facebook&#8217;s secrets to scale is automation, but automation can create its own problems. At its worst, one could create a system that automatically brings down the whole site. Elsewhere, automation can mask a more systemic problem.</p>
<p>However, a more persistent worry is that engineers and operations people working to automate certain actions might think they&#8217;re coding themselves out of a job. To alleviate those fears, Canahuati says Facebook has implemented several strategies that boil down into keeping a lean team works on multiple jobs and problems. This ensures that when automation solves one problem, there&#8217;s another one waiting in the wings to be solved, and also sets the expectation that employees are responsible for the whole site broadly and not just one tool set.</p>
<p>This approach is carried all the way to the way Facebook manages and hires its employees. It expects the engineering team that builds Facebook products to be aware of the operations side of things and build tools that help the operations team out. Operations employees are expected to be able to code and work with the engineering teams. &#8220;The software guys who build the code must take ownership for it working on a big system at scale,&#8221; Canahuati said.</p>
<p>But these operationally aware engineering teams and engineering-aware operational teams must have buy in at the top because people who code generally cost more, so hiring operational team members who code requires a bigger budget.</p>
<p>The tactic seems to work for Facebook, which is clearly trying to build a culture of responsibility and effectiveness that can scale the same way its servers do. Hence its continued release of tools that others can use to monitor their own giant deployments, as well as its operational slogan: &#8220;Fix more, whine less.&#8221;</p>
<p>In many ways, Canahuati&#8217;s points are good policies for any corporate culture today. Traits such as communication between teams, blame avoidance and employees that think strategically instead of just about their skillsets are helpful no matter how many servers you have.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567912&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=97672"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=97672" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567912+how-facebook-solves-the-it-culture-wars-and-scales-its-site&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567912+how-facebook-solves-the-it-culture-wars-and-scales-its-site&utm_content=shigginbotham">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/12/migrating-media-applications-to-the-private-cloud-best-practices-for-businesses/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567912+how-facebook-solves-the-it-culture-wars-and-scales-its-site&utm_content=shigginbotham">Migrating media applications to the private cloud: best practices for businesses</a></li><li><a href="http://pro.gigaom.com/2011/11/dissecting-the-data-5-issues-for-our-digital-future/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=567912+how-facebook-solves-the-it-culture-wars-and-scales-its-site&utm_content=shigginbotham">Dissecting the data: 5 issues for our digital future</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/09/28/how-facebook-solves-the-it-culture-wars-and-scales-its-site/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/09/20120928_145723-e1348859016128.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/09/20120928_145723-e1348859016128.jpg?w=150" medium="image">
			<media:title type="html">Pedro Canahuati at Facebook</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>
	</item>
		<item>
		<title>Scaling Pinterest and adventures in database sharding</title>
		<link>http://gigaom.com/2012/09/27/scaling-pinterest-and-adventures-in-database-sharding/</link>
		<comments>http://gigaom.com/2012/09/27/scaling-pinterest-and-adventures-in-database-sharding/#comments</comments>
		<pubDate>Fri, 28 Sep 2012 04:56:38 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[database]]></category>
		<category><![CDATA[Marty Weiner]]></category>
		<category><![CDATA[Pinterest]]></category>
		<category><![CDATA[surge conference]]></category>
		<category><![CDATA[web servers]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=567359</guid>
		<description><![CDATA[Pinterest has learned about scaling the way most popular sites do -- the architecture works until one day it doesn't. But in a talk at the Surge Conference two Pinterest engineers shared their wars stories. Here's what they learned about keeping it simple and database sharding.
<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567359&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>What happens when a site goes viral and more than doubles it user base every month? It breaks of course. Here’s how popular photo-based social network <a href="http://pinterest.com/">Pinterest</a>, handled the problem, and a few tips from Pinterest engineers on how others might avoid the same sort of trouble.</p>
<p>Marty Weiner and Yashh Nelapati of Pinterest shared the lessons from that experience on Thursday at the Surge Conference in Baltimore, Md., with most of the tips being about how the site has scaled its MySQL database. It’s a presentation the guys have given before, and slides can be found <a href="http://www.percona.com/live/mysql-conference-2012/sessions/scaling-pinterest">here</a>. But for those who want the big picture in a few bullet points, here ya go. (And if you’re interested in the design of Pinterest, CEO Ben Silbermann will be speaking at our <a href="http://event.gigaom.com/gigaomroadmap/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=567359+scaling-pinterest-and-adventures-in-database-sharding&amp;utm_content=shigginbotham">RoadMap conference</a>).</p>
<p>Simply put, learned quickly that too much complexity was its enemy if it wanted its infrastructure to scale as fast as the site was growing. Pinterest began in March 2010 hosted on Rackspace using one MySQL database and one small web engine. Once it launched in January 2011, it had migrated to Amazon’s Ec2, a few more MySQL databases, a few Nginx web servers, MongoDB and TaskQueue. As it transitioned to its big-growth stage, it began running more and more tools, including Memcached, Redis and at <a href="http://highscalability.com/blog/2012/2/16/a-short-on-the-pinterest-stack-for-handling-3-million-users.html">least three other tools</a>.</p>
<p>So the first lesson Weiner shared was not to do that: Instead of running a bunch of tools, simplify. The tools he decided to focus on shared the following characteristics: they were free; they had a large and happy user base; and they all had good or really good performance. Those tools were Amazon, Memcached, Redis and MySQL. Granted, getting them to scale properly wasn’t an engineering-free task, but at least everything was manageable when the work was done.</p>
<p>One of the tougher choices Pinterest had to make was a decision between clustering and sharding. Weiner described a continuum between the two, portraying clustering as automatic distribution of data through tools like Cassandra, HBase and Membase, and sharding as a manual act of deciding where to put data on a machine-by-machine basis. Given his choice — <a href="http://en.wikipedia.org/wiki/Shard_(database_architecture)">sharding</a> — he’s clearly a fan of control for his database technologies.</p>
<p>He complained that while the automatic distribution of data across servers was cool and easy, it also came with a big point of failure. Because the cluster management software that ran on his databases and handled how the database scaled across multiple servers, bugs and errors in that code would be automatically replicated across the entire cluster.</p>
<p>The rest of his talk focused on how to shard a lot of data and keep growing. For those who are interested in a deep dive on that technology, check out a <a href="http://www.youtube.com/watch?v=GW2cnp2tdh8">video of the same talk he gave in May</a>. For most of those who are thinking about building new scalable apps, the key lessons are probably that when building a web site that you hope to scale, you should keep it simple, go for popular and well-liked tools that are free, and seriously consider the tradeoffs between control and ease of use.</p>
<p>And for those of you who just like pinning photos to Pinterest, you can sleep well with the knowledge that thanks to the way the site sharded its database, all of your pins likely reside on the same server right next to your user ID. And that’s a good thing, because it makes it much easier to then scale the service to more users, without everything breaking down.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567359&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=298893"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=298893" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567359+scaling-pinterest-and-adventures-in-database-sharding&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/12/how-direct-access-solutions-can-speed-up-cloud-adoption/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567359+scaling-pinterest-and-adventures-in-database-sharding&utm_content=shigginbotham">How direct-access solutions can speed up cloud adoption</a></li><li><a href="http://pro.gigaom.com/2012/10/the-state-of-cross-platform-measurement-across-tv-online-and-social/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567359+scaling-pinterest-and-adventures-in-database-sharding&utm_content=shigginbotham">The state of cross-platform media measurement</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=567359+scaling-pinterest-and-adventures-in-database-sharding&utm_content=shigginbotham">The importance of putting the U and I in visualization</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/09/27/scaling-pinterest-and-adventures-in-database-sharding/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/05/screen-shot-2012-05-08-at-2-21-25-pm-e1336501389631.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/05/screen-shot-2012-05-08-at-2-21-25-pm-e1336501389631.png?w=150" medium="image">
			<media:title type="html">Pinterest Bottica</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>
	</item>
		<item>
		<title>Seen in the wild: An Apple trade show booth</title>
		<link>http://gigaom.com/2012/09/27/seen-in-the-wild-an-apple-trade-show-booth/</link>
		<comments>http://gigaom.com/2012/09/27/seen-in-the-wild-an-apple-trade-show-booth/#comments</comments>
		<pubDate>Thu, 27 Sep 2012 15:34:26 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[Apple]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[icloud]]></category>
		<category><![CDATA[scale]]></category>
		<category><![CDATA[surge conference]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=567346</guid>
		<description><![CDATA[Apple doesn't often make a formal appearance at trade shows or conferences outside of its own. But it looks like getting engineers for its iCloud may have preempted that policy since the device maker has a sleek presence at the Surge Conference in Baltimore.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567346&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Here at the <a href="http://omniti.com/surge/2012">Surge Conference</a> in Baltimore, Md. I stumbled upon this Apple booth on the trade show floor. Sure, many of the developers and operations engineers attending the conference toted MacBooks of some sort, but Apple doesn&#8217;t show up at trade shows pitching its wares anymore. No, the guy was here recruiting for engineers to help with Apple&#8217;s iCloud product, which <a href="http://gigaom.com/apple/with-imessage-outage-another-hiccup-for-icloud/">has had struggles</a> that seem odd for a company that spends as much effort on the user experience as Apple.</p>
<p>The Surge Conference is billed as a conference about scalability, and is one of the cooler and nerdier events on the East Coast for those trying to build big websites and services. The premise of the conference is to get engineers onstage talking about times they have failed and what they learned from it. Engineers from Facebook, Pinterest and Yammer are giving talks. So it attracts a cluster of engineers that Apple would love to have working on its own cloud services, which have so far <a href="http://www.theregister.co.uk/2012/07/20/apple_fails_at_cloud_leaving_market_open_to_android/">struggled to scale</a> to the hundred of millions of dedicated users the company has.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/09/20120927_090512.jpg"><img  title="20120927_090512" src="http://gigaom2.files.wordpress.com/2012/09/20120927_090512.jpg?w=604&#038;h=453" alt="" width="604" height="453" class="alignright size-large wp-image-567348" /></a></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=567346&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=521585"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=521585" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=567346+seen-in-the-wild-an-apple-trade-show-booth&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/07/new-strategies-in-consumer-media-cloud-storage/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=567346+seen-in-the-wild-an-apple-trade-show-booth&utm_content=shigginbotham">The evolution of consumer-media cloud storage</a></li><li><a href="http://pro.gigaom.com/2012/04/connected-consumer-q1-controversy-courtrooms-and-the-cloud/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=567346+seen-in-the-wild-an-apple-trade-show-booth&utm_content=shigginbotham">Controversy, courtrooms and the cloud in Q1</a></li><li><a href="http://pro.gigaom.com/2012/02/ces-2012-a-recap-and-analysis/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=567346+seen-in-the-wild-an-apple-trade-show-booth&utm_content=shigginbotham">CES 2012: a recap and analysis</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/09/27/seen-in-the-wild-an-apple-trade-show-booth/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/09/20120927_0905121-e1348757654654.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/09/20120927_0905121-e1348757654654.jpg?w=150" medium="image">
			<media:title type="html">appletradebooth</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/09/20120927_090512.jpg?w=604" medium="image">
			<media:title type="html">20120927_090512</media:title>
		</media:content>
	</item>
		<item>
		<title>Here&#8217;s what happens when Heroku goes down</title>
		<link>http://gigaom.com/2011/09/30/heroku-exec-takes-us-behind-the-scenes-when-clouds-fail/</link>
		<comments>http://gigaom.com/2011/09/30/heroku-exec-takes-us-behind-the-scenes-when-clouds-fail/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 22:00:55 +0000</pubDate>
		<dc:creator>Stacey Higginbotham</dc:creator>
				<category><![CDATA[Amazon]]></category>
		<category><![CDATA[andy parsons]]></category>
		<category><![CDATA[devops]]></category>
		<category><![CDATA[Heroku]]></category>
		<category><![CDATA[salesforce]]></category>
		<category><![CDATA[surge conference]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=413465</guid>
		<description><![CDATA[Delivering a cloud service isn't easy and figuring out how to handle things when they go wrong marks a huge leap in maturity for a company as guys from Heroku and Opscode explain. So what do webscale companies do when things go wrong? <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=413465&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<div id="attachment_413603" class="wp-caption alignleft" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2011/09/imag0206-e1317347131245.jpg"><img  title="imbriaco" src="http://gigaom2.files.wordpress.com/2011/09/imag0206-e1317347131245.jpg?w=300&#038;h=199" alt="" width="300" height="199" class="size-medium wp-image-413603" /></a><p class="wp-caption-text">Mark Imbriaco of Heroku</p></div>
<p>Delivering a Platform as a Service isn&#8217;t easy and figuring out how to handle things when they go wrong marks a huge leap in maturity for a company. When you&#8217;re small, panicking and then having one or two of the few people who built the software fix the problem may work, but as the company grows you&#8217;re going to have to figure out how to react when things fail. Luckily, Mark Imbriaco, the <a href="http://omniti.com/surge/2011/speakers/mark-imbriaco">director of operations at Heroku</a>, shared some of the steps it takes when a PaaS fumbles.</p>
<p>When disaster strikes, such as the <a href="http://gigaom.com/cloud/more-than-100-sites-went-down-with-ec2-including-your-paas-provider/">67-hour Amazon outage during April</a>, Heroku follows a set of policies designed to get the service back up quickly and to reduce stress on the engineers dealing with the problem. Because the company requires all its engineers, even software engineer to take time on call, Heroku believes that a notification peg must arrive with a link telling the on-call engineer what to do. At that point, the solution should take less than five minutes to solve. Otherwise, the primary on-call person must call in a person who is acting as the &#8220;incident commander.&#8221;</p>
<p>In most cases the pages, which arrive about two or three times during a 24-hour on-call period, require the engineer to take down the problematic instance and restart it. To keep engineers happy (in addition to sharing the on-call burden among the 20 people on the engineering staff), Imbriaco said Heroku relentlessly focuses on eliminating false positive notifications so on-call engineers won&#8217;t suffer from &#8220;pager fatigue.&#8221;</p>
<p>What surprised me was how actively the audience participated in the discussion about how Heroku does things, as opposed to how their company handles staffing, processes and whatnot around outages. Someone mentioned his company has people on call for a week as opposed to 24 hours. In another talk at the Surge conference, Adam Jacob, president and co-founder of Opscode, pushed for greater involvement from software developers whose code might be the problem-causing agent. Audience members at both talks were amazed that these men would give <del datetime="2011-10-01T02:31:23+00:00">systems administrators</del> software developers the ability to mess around on production servers.</p>
<p>The discussions made clear that if people are going to follow<a href="http://gigaom.com/cloud/scaling-lessons-from-googles-cio/"> Google CIO Ben Fried&#8217;s advice</a> and become generalists, then the culture inside engineering organizations will have to change. That may mean that like they do at Heroku, all engineers have to take on-call duties and almost-automated rules in place for someone to handle a failure in a few minutes or less. After that point, it gets escalated to someone who knows more if needed.</p>
<p>This gives everyone a stake in creating the best code running on the best systems, and it also allows ideas and understanding to flow across an organization. Folks may not want to call it devops, but whatever they call it, building applications that will scale requires new ways of thinking and responding.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=413465&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=822629"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=822629" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=413465+heroku-exec-takes-us-behind-the-scenes-when-clouds-fail&utm_content=shigginbotham">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/07/infrastructure-q2-big-data-and-paas-gain-more-momentum/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=413465+heroku-exec-takes-us-behind-the-scenes-when-clouds-fail&utm_content=shigginbotham">Infrastructure Q2: Big data and PaaS gain more momentum</a></li><li><a href="http://pro.gigaom.com/2011/06/a-field-guide-to-cloud-computing-current-trends-future-opportunities/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=413465+heroku-exec-takes-us-behind-the-scenes-when-clouds-fail&utm_content=shigginbotham">A field guide to cloud computing: current trends, future opportunities</a></li><li><a href="http://pro.gigaom.com/2011/04/infrastructure-q1-iaas-comes-down-to-earth-big-data-takes-flight/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=413465+heroku-exec-takes-us-behind-the-scenes-when-clouds-fail&utm_content=shigginbotham">Infrastructure Q1: IaaS Comes Down to Earth; Big Data Takes Flight</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2011/09/30/heroku-exec-takes-us-behind-the-scenes-when-clouds-fail/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/09/imag0206-e1317347131245.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/09/imag0206-e1317347131245.jpg?w=150" medium="image">
			<media:title type="html">imbriaco</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/09/imag0206-e1317347131245.jpg?w=300" medium="image">
			<media:title type="html">imbriaco</media:title>
		</media:content>
	</item>
	</channel>
</rss>
