Hadoop - Tech News Articles: GigaOM GigaOM

Hadoop

Why You Should Care

Hadoop is an open source software framework that was initially used for data-intensive queries at sites such as Yahoo, but has spread out to data crunching tasks of many types at all kinds of organizations. Hadoop can bring great efficiencies to dealing with large data sets, and is available in several distributions. Keep up with your Hadoop news here.

If you’ve ever wondered what big data means at an individual level, this realization about sums it up: “I could either keep dying my hair or retire a year earlier.” It’s those types of realizations Intuit hopes its heavy big data use will help uncover. Read More »

It’s neither easy nor glamorous — data scientists get all the love — but making sure your Hadoop cluster is properly configured and applications are running optimally is necessary, especially as applications move into production. Here are five tools to help you do it. Read More »

Facebook’s hyperinflated valuation heading into its IPO has everything to do with its promise, and very little to do with its actual profits. Here are some numbers we know about Facebook’s infrastructure that speak to its promise perhaps as much as its 900 million users. Read More »

There’s nothing quite like a hypothetical about someone setting a whole block on fire after cutting off the fire department’s electric supply in order to slow its response. Is it comforting to know that smart people and smart analytics could help stop it from happening? Read More »

Yahoo is looking to leverage its big data prowess with a new tool for marketers called Genome. It looks like an acknowledgement that while Yahoo might not rule the the web anymore, it knows a heck of a lot about analytics. Read More »

As the world once again starts analyzing Yahoo’s myriad woes after Sunday morning’s ouster of embattled CEO Scott Thompson, I’m left wondering if its investment in Hadoop didn’t aid in the company’s demise, even if it’s a way down the long list of Yahoo’s mistakes. Read More »

Big data: The quick and the dead

The IT hype machine has everyone jumping on the big data bandwagon. But before we start saving every scrap of data in the enterprise for fear that we will miss a nugget of insight, shouldn’t we focus on what we already have? Read More »

More Must Reads

Paul Doscher, CEO of Lucid Imagination wants you to know that when it come to enterprise-class search, open-source Lucene is a contender. And a strong contender that can face off against Google, Amazon and Microsoft in the big data search arena. Read More »

Finally the worlds of big data geeks and clean energy nerds have collided. Researchers have proposed building a “GreenHadoop,” that is a version of the MapReduce programming framework that could manage a data center’s computing workload to optimize clean energy from a solar system. Read More »

A cadre of DevOps experts will gather later this week at an undisclosed location in Northern California. The goal: To hash out issues they see in their own shops, to compare notes on problems and talk in a way that they cannot in vendor-driven conferences. Read More »

Market research firm IDC released the first legitimate market forecast for Hadoop on Monday, claiming the ecosystem around the de facto big data platform will sell almost $813 million worth of software by 2016. But Hadoop’s actual economic impact is likely much, much larger. Read More »

Ask a VC about big data and she will probably tell you about visualization of the user interface. We’re talking about intuitive UIs that let users visually work with data using charts and tools, not algorithms. It’s hard to do right, but the payoff could be …

Known for integration and embeddable databases, Pervasive has all kinds of exciting technology plans for cloud and big data on its roadmap. ... Read More »

When your business is to insure farmers against the effects of bad weather, you’d better have some seriously accurate data on your side. Mother Nature, after all, can be somewhat unpredictable. The Climate Corporation thinks the answer is lots of data and lots computing power. Read More »

Cloud computing and big data are in the enterprise to stay, but making the most of them presents challenges for IT decision makers. The future belongs to those companies who can work through legacy tools, ongoing security issues and the data scientist shortage.

There are now more than half a dozen commercial Hadoop distributions in the market, and almost every enterprise with big data challenges is tinkering with the Apache Foundation-licensed software. A new report examines the key disruptive trends shaping the Hadoop platform market.

IBM’s big data platform will support the Cloudera Hadoop distribution, a surprising decision given the reservations the two companies had expressed about each other before. That gives IBM and rival Oracle at least one thing in common: Oracle’s Big Data Appliance runs Cloudera too. Read More »

It’s beginning to look like there will be no free-standing analytics companies left. IBM is buying Vivisimo for the “discovery and navigation” expertise that companies use to access and analyze (what else?) big data. The news come a week after IBM bought Varicent, another analytics … Read More »

VMware has acquired Cetas, a startup that provides analytics atop the Hadoop platform. Terms of the deal haven’t been disclosed, but Cetas is an 18-month-old company with tens of paying customers that didn’t need to rush into an acquisition. So, why did VMware buy it? Read More »

Big data and the marketing world go together like peanut butter and jelly. Marketers want to present their brands in the most-effective manner possible and always put the right ad in front of the right person. Big data makes that possible at a whole new level. Read More »

This quarter saw Amazon Web Services finally relaxing its public-cloud-only stance and launching services to support hybrid-cloud deployments. Meanwhile, Hadoop players moved to make their platforms more accessible to mainstream BI analysts and database administrators. A new quarterly report analyzes these trends and provides a near-term …

Skybox Imaging, a startup that wants to capture and analyze high-resolution photos and videos of the Earth, has raised $70 million in Series C funding. The money will help Skybox its lineup of software engineers and data scientists that might be its secret sauce. Read More »

This quarter the EV market struggled to find its footing. Meanwhile, the smart-grid sector solidified and low-power technology proved itself important in the data center. Read more to learn what these news pieces and others mean for the larger space over the next few months.

If you’re an amateur poet and love big data, high-performance system vendor AMAX has a deal for you. The company is conducting a contest to find the best haiku on big data. But I’m sharing my poems right here. Read More »

TempoDB, a startup out of Chicago, has build a database as a service offering specifically for time-series data thrown off by thermostats, servers, automotive telematics. Does the world (or the Internet of Things) need a specialty time series database hosted in the cloud? Read More »

The headline might sound like buzzword stew, but it couldn’t be any truer. For companies willing to make the leap to cloud services, there will be a lot of companies willing to make big data as easy as paying your bill every month. Read More »

For years, Oracle has wowed Wall Street with fat software margins: Large companies depending on Oracle relational databases pay what it takes to keep them up and running. It’s unclear whether Oracle can carry that dominance over into the Big Data era, however. Read More »

If your organization doesn’t have a strategy for big data now, you will need one in the future. Here we discuss the difference between big data and traditional business intelligence, as well as the considerations executives should take into account as they plan their big data …

In a webscale data center, peak efficiency feels like a blast furnace. I stepped into the hot aisle of Dell Modular Data Center and 1,920 servers blasted 115-degree air right in my face. If eBay’s Dean Nelson has his way, that was just the beginning. Read More »

Social and mobile analytics startup Kontagent has expanded its business to include a data-mining service powered by Hive, the SQL-like interface for querying data stored within Hadoop. It’s a smart move by the company, and one that other cloud-based analytics providers would be wise to replicate. Read More »

The federal government talked a lot about grand scientific visions when it unveiled its big data agenda last week, but the government has consumers on its mind, too. Specifically, it doesn’t want to unduly hinder innovation, and it might even be willing to provide data. Read More »

If you have a lot of unstructured data, don’t have (or want) a Hadoop cluster and can write Python jobs, Mortar Data has got the service for you. The New York-based startup is jumping into the fray with possibly the most lightweight Hadoop service yet. Read More »

Managed hosting provider Sungard is getting into the big data space with a new Hadoop service that gives users on-demand access to the popular data-storage and processing platform. Called Unified Analytics Service, Sungard’s new offering joins the growing ranks of cloud-based Hadoop offerings. Read More »

By pumping hundreds of millions of dollars into big data research and development, the Obama administration thinks it can push the current state of the art well beyond what’s possible today, and into entirely new research areas. It’s a noble goal, but also a necessary one. … Read More »

Key to understanding big data is to move beyond simply examining the technology for data storage and analytics engines. Organizations preparing for a data-centric economy should also examine the roles of data quality, data obesity and data markets in the future of modern enterprises.

The problem for many companies is that user information is spread across hundreds or even thousands of different fields in various databases, and it’s difficult to compile it in real time. But doing that successfully is becoming increasingly important, says WiBiData at Structure:Data. Read More »

Two star hires and a well-reviewed phone-and-tablet operating system do not necessarily remake a company, but they do ease the perception — prevalent in recent years — that Microsoft is on its last legs. Could the once-dominant software giant be on the comeback trail? Read More »

One solution to the big data skills shortage has been consulting firms that specialize in deploying big data systems companies need to make sense of their information. These companies will continue to play a vital role in helping us make sense of the the data deluge.

Pivotal Labs will keep on doing what it does best after EMC’s acquisition, according to Pivotal CEO Rob Mee. There has been concern over Pivotal’s future as part of the EMC behemoth, which Rob Mee did his best to alleviate at Structure: Data 2012. Read More »

Big data now touches everything from enterprises to smart-meter startups, while Hadoop is fast becoming the leading tool to analyze that data, and debates around privacy abound. GigaOM Pro analysts offer insights on what to consider when it comes to big data decisions for your business.

EMC’s acquisition of Pivotal Labs proves the company really understands the big data market. Namely, that big data won’t go anywhere without great applications, and EMC isn’t the company to help customers figure out how to build theirs. Read More »

With many utilities facing the task of storing petabytes of smart meter data for as long as seven years in order to satisfy regulatory requirements, the ability to house and leverage the massive load of data accumulating from the smart grid is a significant IT challenge. … Read More »

Raghu Ramakrishnan, who was the top scientist for several of Yahoo’s key technology efforts, is now a technical fellow with Microsoft’s server and tools unit. This is the latest sign that Yahoo is struggling to retain key technologists. Read More »

Web companies like Google and Facebook invest incredible resources in making sure they know everything about their infrastructures and how server-level issues are affecting the applications that comprise their lifeblood. The rest of the business world is now catching on. Read More »

In September, I profiled six companies doing big data in the cloud, and here are nine more. One of the themes of Structure:Data is “putting big data to work,” and there’s no easier way to get started doing so than with a cloud service. Read More »

loading external resource
Click to log in with: Not you?
Comment as guest:
By continuing you are agreeing to our Terms of Service and Privacy Policy.
Submitting comment...
results