Google: The Data Center Is the Computer

By Stacey Higginbotham | Monday, June 15, 2009 | 11:47 AM PT | 1 comment |

Server room and devices

As folks increasingly store and access information online, the data centers powering cloud services need to be managed more like a single computing entity rather than a bunch of servers, according to a Google white paper (Google calls it a mini-book) released today.

The paper lays out the concept of warehouse-scale computers (which we have previously referred to as both web-scale computing and mega data centers), specifically how to build out the infrastructure to support Internet services managed across thousands of servers. Google’s Luiz Barroso, a distinguished engineer, and Urs Hölzle, SVP of operations, both of whom help manage and build out Google’s data center, lay out their definition of WSCs:

The name is meant to call attention to the most distinguishing feature of these machines:
the massive scale of their software infrastructure, data repositories, and hardware platform. This perspective is a departure from a view of the computing problem that implicitly assumes a model where one program runs in a single machine. In warehouse-scale computing, the program is an Internet service, which may consist of tens or more individual programs that interact to implement complex end-user services such as email, search, or maps. These programs might be implemented and maintained by different teams of engineers, perhaps even across organizational, geographic, and company boundaries (as is the case with mashups, for example).

The 100-page paper also notes that there are a few other characteristics of these warehouse-scale computers, which are owned by companies such as Microsoft, Google, Amazon and Yahoo. These computers are mostly built in-house, are fairly flexible and are focused on cost efficiency. The Google engineers’ overview includes chapters analyzing the power consumption and costs associated with running such a warehouse-scale computer, and how to architect the hardware and software in these machines.

Commodity hardware gets a boost here, which doesn’t bode well for Cisco’s (c csco) new blade servers or IBM’s efforts to create specialized hardware for different workloads in the cloud. The paper also discusses the challenges of building software that can run across a heterogeneous hardware layer, as well as programming for redundancy and resiliency. Given how much publicity Google’s various failures and traffic slowdowns have received, the chart below illustrating the causes behind such glitches is notable.

Many in the industry are positioning warehouse-scale computing as the future of IT, but this paper underscores how it may be an evolving style of computing that will likely coexist with more traditional models for some time to come. In the same way mammals coexisted with dinosaurs for millions of years, perhaps.

googfail

Digg

Comments (1)

Link to this article using http://om.bit.ly/1vbel

Linkbacks (5)

  • [...] GigaOM: So in IBM’s view the workloads dictate the hardware, rather than the idea of commodity servers being used to build out a general purpose cloud? [...]

     
  • [...] easier for IT departments to consume. At higher-level, Cloudera is figuring out how to deliver "warehouse scale computing" beyond the walls of Google, who recently quipped the [...]

     
  • [...] Clementi: Google is not virtualized and virtualization is not sufficient to qualify as a cloud — it’s a better use of the physical infrastructure, but the real bang comes from modeling out the whole data center and taking energy, labor, software, and hardware and acting on all those levels. It’s like Google’s idea that the data center has become the computer. [...]

     
  • [...] Stacey Higginbotham | Friday, September 4, 2009 | 10:23 AM PT | 0 comments If the data center is the new computer, then the job of providing the de facto operating system of that new computer is up for grabs, as [...]

     
  • [...] So virtualization and cloud-based  infrastructure as a service are not the ubiquitous answers. At times, more efficient, streamlined access to raw hardware simply provides the best bang for the buck. For those interested in another perspective, Stacey recently spoke with Erich Clementi, the head of IBM’s cloud computing efforts, about Big Blue’s cloud strategy. Clementi pointed out, “Many people equate cloud computing to virtualization. It is not virtualization.” He further said: Google is not virtualized, and virtualization is not sufficient to qualify as a cloud — it’s a better use of the physical infrastructure, but the real bang comes from modeling out the whole data center and taking energy, labor, software and hardware and acting on all those levels. It’s like Google’s idea that the data center has become the computer. [...]

     

Subscribe to comments feed

Leave a Reply


Post to GigaOM with your Facebook account

Editorial Masthead

Sebastian Rupley
Editor in Chief
Carolyn Pritchard
Managing Editor
Celeste LeCompte
Special Projects Editor
Desiree DeNunzio
Copyeditor
Om Malik
Senior Writer
Stacey Higginbotham
Staff Writer
Ryan Lawler
Staff Writer
Wagner James Au
Contributing Editor
Liz Gannes
Staff Writer
Chris Albrecht
Staff Writer
Katie Fehrenbacher
Staff Writer
Josie Garthwaite
Staff Writer
Close
E-mail It