Hadoop - Tech News Articles: GigaOM

Hadoop

Why You Should Care

Hadoop is an open source software framework that was initially used for data-intensive queries at sites such as Yahoo, but has spread out to data crunching tasks of many types at all kinds of organizations. Hadoop can bring great efficiencies to dealing with large data sets, and is available in several distributions. Keep up with your Hadoop news here.

It’s no secret that Yahoo analyzes a lot of user data, but today it’s giving the world a striking peek into how all that data is used. A new tool lets visitors work their way through demographic data to see which news stories are the most popular. Read More »

Cray's XK6 supercomputer

Looks like Oracle has some competition when it comes to selling big iron for big data. On Wednesday, Cray, the Seattle-based company best known for building some of the world’s fastest supercomputers, announced it’s getting into the big data game. Read More »

WibiData, a Hadoop-based startup focused on making it easier to analyze user behavior, has raised $5 million from New Enterprise Associates. The company, formerly known as Odiago, launched in late 2011 already claiming Wikipedia and Atlassian among its early customers. Read More »

Hadoop features front and center in the discussion of how to implement a big data strategy, one of the biggest trends in IT. There’s just one problem that keeps cropping up: many people don’t seem to know exactly what it means when somebody says “Hadoop.” Read More »

Facebook’s S-1 filing shows the company is all about infrastructure. The ad revenue and user experience it relies on to exist mean Facebook can’t afford to take it easy on IT, which means shareholders and users will both find plenty of reasons to get upset. Read More »

If you like the idea of your analytics system getting more accurate with each piece of data it ingests, you are in for an exciting run, because machine learning appears to be catching fire across the ecosystem of big data vendors. Read More »

For eBay, big data is serious business. Every day, the site stores and analyzes data from millions of users buying, selling and searching for hundreds of millions of products. It handles all this data with lots of Hadoop, although a good data warehouse doesn’t hurt either. Read More »

More Must Reads

As promised, storage kingpin EMC has integrated its Isilon NAS product with Hadoop in a way that will bring Isilon’s OneFS file system to bear on data. EMC isn’t alone. Vendors from Amazon to Oracle are trying to tame this big data beast. Read More »

Was Bill Gates, chairman and co-founder of Microsoft, the power behind the proprietary Windows-and-Office juggernaut, really an open source champion? A new Wired article lays Microsoft’s wider embrace of open source technologies — including Node.js and Hadoop — squarely at Gates’ feet. Read More »

Pentaho is moving its business intelligence tools to the Apache license to make them more compatible with big data technologies that already operate under that license. Pentaho’s Kettle extract, transform, load (ETL) technology was previously available under the LGPL, the GNU Lesser General Public License. Read More »

Analytics and big data tools will remake energy in 2012, from helping curb energy consumption, to reducing energy loss, to adding more clean power to the grid. Here are 10 ways this will happen: Read More »

The great thing about big data is that there’s still plenty of room for new blood, especially for companies that want to leave infrastructure in the rearview mirror. At this point, the data-infrastructure space, including Hadoop, is well-funded and nearly saturated, but it also needs help. Read More »

With all the talk of big data, cynics think the whole notion has jumped the shark. Get ready, they say, for the next tech bubble to burst. EMC CMO Jeremy Burton is not among them. Granted, he’s a marketer, but what he says makes sense. Read More »

Big data has gotten very, very big if the elite talking heads at the World Economic Forum in Davos, Switzerland, are talking about it. And they are talking about it. Sessions include “Decoding the data deluge” and “Personal data: the ‘new oil’ of the 21st century.” Read More »

Ad-targeting company 33Across is acquiring link-tracking specialist Tynt Multimedia, resulting in a combined user graph spanning 1.25 billion users. Both are storing and analyzing billions of transactions daily, and they will use that data to help publishers compete on ad sales against mega sites like Google. Read More »

Joe Coyle, CTO of global integrator Capgemini, sees a lot of cloud pitches from all the major technology vendors — and God knows they all have a cloud strategy. Here’s what he thinks of the current state of the market. Read More »

IBM is working the reins of its Smarter Commerce initiative by rolling out a new Netezza analytics appliance designed to help retailers churn through potentially petabytes of consumer sales data in real time. It’s trying to capitalize on the increased importance of e-commerce to revenues. Read More »
