There are a lot of Hadoop deployments out there. A lot of MongoDB ones, too. And on Tuesday, Hadoop specialist Cloudera and the eponymous MongoDB announced a multi-faceted partnership they claim will significantly improve the experience for customers using both companies’ technologies.
From a user perspective, Hadoop and MongoDB are very different technologies but very complementary. Hadoop provides a cheap, large-scale data store, as well as tools for processing and analyzing the unstructured data often stored in it. MongoDB is an operational database, largely for web applications, designed to handle JSON documents. They’re both open source, and many companies use both.
“We have an enormous overlap in our customer bases,” MongoDB Director of Product Marketing Kelly Stirman said.
Hence the partnership, the first fruit of which is the certification of the MongoDB Connector for Hadoop to run on the latest version of Cloudera’s Hadoop distribution. However, he added, there will be a lot more to come, including some meaningful co-developed technology improvements that will be announced at the MongoDB World conference in June.
Beyond that, Stirman said users can expect a lot more work between the companies, some stuff “obvious” — think customer support deals — and other stuff more “clever and nuanced.” He didn’t rule out the possibility of jointly developed products rather than just a slew of integrations, and suggested operations, security and data management as potential areas of focus.
“[There are] potentially many years of engineering opportunity in front of this partnership,” Stirman said. He added, “None of those decision are set in stone at this point. Everything is sort of an option for us right now.”
From a business perspective, though, the partnership between Cloudera and MongoDB is yet another dividing line in an already fractured Hadoop marketplace. Even if some of the work is done as open source, the messages are pretty clear as Cloudera tightens its relationships with companies such as MongoDB and Intel, and Hortonworks with Microsoft, Teradata, Red Hat and others. It’s not just the technological difference between distributions that matter, but also the ecosystems shaping up around each.
Feature image courtesy of Flickr user Cesar Rodas.