1 Comment

Summary:

Microsoft, which looked as if it might be the odd man out on Hadoop, might actually play a big role in taking the platform into the mainstream. On Tuesday, it exposed its plans to make Hadoop data analyzable via both a JavaScript framework and Microsoft Excel.

bull elephant

Microsoft, which looked as if it might be the odd man out on Hadoop, might actually play a big role in taking the red-hot storage-and-processing platform for unstructured data into the mainstream. On Tuesday, it exposed its plans (with help from Hortonworks) to make Hadoop data analyzable via both a JavaScript framework and Microsoft Excel, meaning many millions of developers and business users will be able to work with Hadoop data using their favorite tools.

The Excel connector is especially significant because there are millions of business users skilled in Excel that just don’t have the chops to analyze data stored in Hadoop. There are plenty of startups trying make Hadoop accessible to business users — including Datameer, which provides a spreadsheet interface (and is even a Microsoft partner) — but it’s unlikely that any combination of efforts will match the impact of accessing and analyzing Hadoop data right from the familiar Excel (and PowerPivot) application.

Since October, Microsoft has been publicly working with Hadoop startup Hortonworks to make Hadoop work better in Windows environments. In the process it all but killed the Dryad big data framework it spent years developing. This new strategy includes Hadoop distributions for Windows Servers and the Windows Azure cloud computing platform, as well as a connection to SQL Server. Hadoop on Windows Azure, in fact, is already available in Developer Preview mode and should be generally available by the end of this quarter, according to ZDNet Microsoft insider Mary Jo Foley.

The SQL Server distribution, she reported, should hit preview mode by the end of the quarter and be generally available by the end of June.

On Monday, I quoted Hortonworks CEO Rob Bearden claiming Hadoop could be a billion-dollar market in a couple of years, ultimately ending up as the de facto data platform for next-generation applications. Not everyone buys into this rosy prediction, but Microsoft’s commitment to Hadoop makes it seem a lot more plausible. Sure, EMC, Oracle and IBM already push Hadoop-based products, but when it comes to sheer number of employees sitting at desktops and hacking away at spreadsheets, it’s hard to discount what Microsoft could mean for Hadoop as a whole.

No doubt, Microsoft’s increased involvement will come up at our Structure:Data conference next month in New York, where execs from several Hadoop vendors, including Hortonworks, will take the stage on March 22 to talk about the future of Hadoop.

Image courtesy of Brian Robert Marshall.

You’re subscribed! If you like, you can update your settings

  1. Great to see Microsoft working on an enhanced Hive ODBC driver as part of their growing commitment to Hadoop. This will provide a useful complement to Cloudscale’s service (cloudscale.com) which offers realtime and batch big data analytics in the cloud from right inside Excel. Cloudcel, our patented Excel-driven cloud architecture can be used with S3, Hadoop HDFS, MapR and Lustre backends. The Lustre version is the world’s fastest big data solution. Unlike the other spreadsheet big data solutions, Cloudcel is the only one that offers drag-and-drop big data app development for business users. No coding.

Comments have been disabled for this post