More Hadoop Stories

Hadoop, thanks to the growing importance of Big Data analytics, is gaining traction inside the enterprise. What’s been missing for Big Data analytics has been a LAMP-like stack. Fortunately, a stack for Big Data aggregation, processing and analytics is on its way. Read more »

Subscriber Content

infrastructure

The second quarter of 2010 belonged to the little guys and the new guys. Almost across the board, from processors to virtualization to cloud services, relatively small vendors and startups had the market cornered on innovation and mindshare. And where there’s tinder in the form of customer demand, products, funding and a greater societal movement toward environmentalism, something is bound to catch fire. Read more at GigaOM Pro »

Commercial Hadoop champion Cloudera is building a connector to enable movement of data between Netezza’s data warehousing appliance and Cloudera’s Hadoop clusters. It’s just the latest instance of an analytics vendor integrating Hadoop support, and further evidence that Hadoop has legs as a commercial technology. Read more »

Subscriber Content

A few months ago, I posited that additional funding for Cloudera and Karmasphere signifies a large market opportunity for solutions that utilize the open-source analytics tool Hadoop. This week, Yahoo hosted its third annual Hadoop Summit, and the sheer amount of news that generated only affirmed ... Read more at GigaOM Pro »

A few months ago, I posited that additional funding for Cloudera and Karmasphere signifies a large market opportunity for solutions that utilize the open-source analytics tool Hadoop. The news generated this week by Yahoo’s third annual Hadoop Summit has only affirmed that belief. Read more »

The race is on to find relevance in the reams of social data produced every day, but one problem is the sheer quantity of information involved. Researchers now say they have developed software that can analyze that data in a matter of seconds using cloud computing. Read more »

Hadoop creator and champion Yahoo is taking advantage of its annual Hadoop Summit today by rolling out some new features for its open-source Hadoop distribution. The new features tackle security and workflow management, which Yahoo hopes will help Hadoop continue its proliferation among mainstream users. Read more »

Google revamped its search indexing methodology this week, which was quickly eclipsed by the chatter about background images on its home page. But those images were a red herring distracting us from technology changes that could influence those delivering the real-time web for years to come. Read more »

Subscriber Content

In a world of billion-dollar web companies and VC-backed startups trying to forever change human interaction via software, IBM tends to look a little staid. But don’t let its deliberate pace, legacy-software-mongering ways and suited executives fool you. If you pull back the covers, you’ll find ... Read more at GigaOM Pro »


Twitter today open-sourced the code that it used to build its database of users and manage their relationships to one another, called FlockDB. The move comes shortly after Twitter released its Gizzard framework, which it uses to send thousands of queries a second to FlockDB. Read more »

Appistry today added another element to its cloud-computing application platform, announcing the April availability of CloudIQ Storage. With it, St. Louis-based Appistry joins the growing ranks of companies seizing on demand for cloud storage solutions that maintain performance in the face of rapidly growing data volumes. Read more »

There are a few widespread misconceptions about Cloudera, the promising, well-funded Burlingame, Calif.-based startup that offers services, training and support for the open-source software framework Hadoop. At least that’s what I found out during a talk earlier today with the company’s CEO, Mike Olson. Read more »

Google, nearly six years since it first applied for it, has finally received a patent for its MapReduce parallel programming model. The question now is how this will affect the various products and projects that utilize MapReduce, such as Apache’s MapReduce-inspired Hadoop project. Read more »

Berkeley Labs has been working on an open source version of a system for demand response services for the power grid (called openADR) for more than five years. But only one company in that time has commercialized a version of the open source platform — a […] Read more »

Can an open source data management system do for the smart grid what Google’s open mobile operating system Android is doing for cell phones — spawn innovation and low cost development? Execs at the Tennessee Valley Authority (TVA), the largest public power provider in the U.S., […] Read more »

Love it or fear it, there is no denying the impact cloud computing is having on IT practices. Despite a summer full of high-profile outages, cloud computing spent the season continuing its march toward ubiquity, as our third-quarter wrap-up at GigaOM Pro showed (subscription required). Read more »

Cloudera, a startup based in Burlingame, Calif., today announced the release of its first commercial product, Cloudera Desktop. It’s a graphical interface for managing Hadoop, the open-source framework that is catalyzing the data mining renaissance. Cloudera’s Hadoop now works on almost all major cloud platforms: Amazon […] Read more »

Hadoop, as a pivotal piece of the data mining renaissance, offers the ability to tackle large data sets in ways that weren’t previously feasible due to time and dollar constraints. But Hadoop can’t do everything quite yet, especially when it comes to real-time work flow. Fortunately, […] Read more »

With two major acquisitions announced today (the $420 million acquisition of SpringSource by VMware and Facebook buying Friendfeed for $50 million), I almost forgot to note that two good friends of this blog have switched jobs. Doug Cutting, creator of open-source software framework Hadoop, has left […] Read more »

At the Hadoop Summit in Silicon Valley today, Yahoo announced the availability of the Yahoo Distribution of Hadoop, a source-only version of Apache Hadoop that Yahoo uses within its own search engine. That’s more good news for Cloudera, a Burlingame, Calif.-based startup that builds commercial services […] Read more »

Updated: Hadoop, the open-source software framework, is one of the technologies we have been following closely. If you are equally interested in Hadoop, then we have 10 free tickets for The Hadoop Summit that is going to be held this Wednesday, June 10, at the Marriott […] Read more »

Hadoop, an open-source software program that helps process incredibly large data sets, has been generating plenty of buzz. The upcoming Hadoop Summit on June 10 marks a midway point in an eventful year for the technology. Cloudera, a high-profile startup that’s building commercial services around Hadoop, just […] Read more »

At first glance it’s hard to see how the open-source software framework Hadoop, which was developed for analyzing large data sets generated by web sites, would be useful for the power grid — open-source tools and utilities don’t often mix. But that was before the smart […] Read more »

“Hadoop is going to find potential markets in any industry where there are large data sets that need complex analysis,” Mike Olson, chief executive officer and one of the four co-founders of Cloudera, the startup that’s commercializing the open-source software framework Hadoop, told me earlier today. […] Read more »

Cloudera, a Burlingame, Calif.-based startup that is building commercial services around open-source software framework Hadoop, has closed $6 million in Series B funding, bringing the total raised by the company to $11 million. The latest round of funding was led by Greylock Partners. Current investor Accel […] Read more »

Earlier today, I stopped by at the Social Graph Symposium at Sun Microsystems’ Menlo Park campus. The event, which attracted some of the most well-known experts on social networks and social graphs, was organized to look at the various challenges and opportunities being presented by […] Read more »

We are in the midst of a data mining renaissance. Traditionally, data warehousing implementations were large, complex and expensive, meaning only the top-ranking companies could afford them. Teradata pioneered the initial market for corporate data warehousing solutions and still maintains a segment lead, something HP’s CEO […] Read more »

You know the saying, “If you build it, they will come”? Well that certainly holds true for GPS functionality and mobile phones. Nearly 48 percent of the mobile app developers surveyed by Boston-based Skyhook Wireless said that location is what “sets their app apart, or is […] Read more »

Hadoop, Cassandra, HBase, Hypertable, Open Neptune… these are some open source projects that are being pursued by web technologists in order to deal with the explosion of digital data in a post-terabyte world. The traditional way to deal with unstructured data isn’t working. What we need is a structured means of finding, accessing, and retrieving files and objects. Read more »

Cloudera, a Burlingame, Calif.-based company offering services around the open source software framework Hadoop, has raised $5 million in Series A funding led by Accel Partners. It has also attracted funding from seasoned infrastructure executives and Web veterans such as Caterina Fake (co-founder, Flickr), Dr. Qi […] Read more »

Today IBM announced that six universities are using its cloud computing expertise to set up and manage clouds located in Qatar, Africa and Japan. It is using Hadoop for allocating resources in the cloud — something it first began doing in 2007 when it teamed […] Read more »

Last week, OStatic noted the rumor, first reported by VentureBeat, that Microsoft intended to buy Silicon Valley semantic search engine Powerset for $100 million. Lo and behold, Microsoft and Powerset are confirming today that an acquisition agreement has been signed. The terms of the deal have […] Read more »

Parascale, a Cupertino, Calif.-based start-up that has developed a storage file system for a cloud of computers, announced that it had attracted $11.37 million in Series A funding from Charles River Ventures and Menlo Ventures. The company recently changed its chief executive and brought in Sajai […] Read more »

We are only ten days away from Structure’08, our web infrastructure conference. As part of our preparation for this event, our team of reporters & bloggers is finding new & interesting open source projects that are tackling various aspects of Cloud Computing, a concept popularized by […] Read more »

As part of our renewed focus on technologies that matter, we are launching a series of events called GigaOM PM, occasional meetups at which we will gather to discuss topical and important technology breakthroughs. I will host these gatherings, and we will keep them small and […] Read more »
