37 Comments

Summary:

The great thing about big data is that there’s still plenty of room for new blood, especially for companies that want to leave infrastructure in the rearview mirror. At this point, the data-infrastructure space, including Hadoop, is well-funded and nearly saturated, but it also needs help.

visual

Big data is hot, but infrastructure-level platforms such as Hadoop, which focus on storage and processing, still need help to take them into the mainstream. They need a killer app or two that will let companies analyze, visualize and act on all that data without hiring a team of Stanford Ph.Ds, or that will let developers write big-data apps without having to reinvent the wheel.

Here are five startups (in alphabetical order) either in stealth mode or just out of it that could help take Hadoop and its ilk to the promised land.

1. BloomReach

The stealth-mode BloomReach is taking a very targeted, very hands-free approach to big data for its customers. It’s offering a SaaS-based product that job listings say is for “helping leading online businesses uncover the highest quality, most relevant content sought by their consumers, when and where they want it.” Founded by a team with roots at Google, Cisco, Facebook and Yahoo, among other companies, BloomReach has, according to one estimate, about 160 customers — all of them among the top 10,000 websites, and most of them in the retail space. Among its core technologies and methods are Hadoop, Lucene, Monte Carlo simulations and large-scale image processing.

2. Continuuity

Continuuity, the just-launched stealth-mode startup by former Yahoo VP and chief cloud architect Todd Papaioannou, wants to make it easier to build applications that can leverage both cloud computing and big data technologies. As Papaioannou told me recently, most developers shouldn’t have to go through what Yahoo, Facebook and others did in order to write large-scale, data-driven applications. He also said “the data fabric is the next middleware” and noted that the company name is a play on “continuum.” You figure out what it’s up to.

3. Odiago

Odiago is the brainchild of Hadoop and analytics experts Christophe Bisciglia and Aaron Kimball, and aims to improve the state of web analytics. Its first product, Wibidata, which is in private beta, lets websites better analyze their user data to build more-targeted features. It’s built atop Hadoop and HBase, but also plugs into companies’ existing data-management and BI tools. Current customers include Wikipedia, RichRelevance, FoneDoktor and Atlassian (with whom it shares office space).

4. Platfora

Platfora, which launched in September with $5.7 million in funding, wants to make big data analytics accessible to the masses. Founder and CEO Ben Werther, formerly of Greenplum and NoSQL startup DataStax, told me when Platfora launched that its intuitive, visually stunning interface will make Hadoop-based analytics so easy even a history major could use it. Platfora’s product isn’t available yet, but the company is currently hiring, with an emphasis on frontend and user-experience skills.

5. SkyTree

Skytree is probably the stealthiest of the group, but it’s also is one of the more ambitious — because it’s trying to bring high-performance machine learning to mainstream companies. Machine learning is an impressive technique in which the system itself gets smarter as it digests more data, but it usually doesn’t find its way out of research environments or cutting-edge analytics teams. Skytree is putting together an impressive team, including co-founder Alexander Gray, who also teaches machine learning at Georgia Tech and spent six years at NASA’s Jet Propulsion Laboratory. The company will officially launch later this quarter.

We’ll be addressing many of the issues these companies are trying to resolve at our Structure: Data event that takes place March 21-22 in New York City. Founders from Continuuity, Odiago and Skytree will be speaking at the event, as will dozens of other data visionaries from companies such as IBM, Google, @WalmartLabs and Hortonworks.

Feature image courtesy of Flickr user jurvetson.

  1. Good Derrick but I can list 3 or 4 more low-profile startups that could change the face of big data right off top of my head and there’s probably another 25 since this space is sizzling red hot !

    Share
    1. Steve, what are your 3 fav of that list?

      Share
    2. Steve what are you two or three fav from that group?

      Share
    3. Steve, what are your three fav from that group?

      Share
      1. I’d be interested to hear, too. My list isn’t meant to be exhaustive, just five stealthy companies that I’ve spoken with or come across lately that have really impressed me. You’re right, though, it’s a white-hot space.

        Share
      2. Gotta give BloomReach credit with 160 customers but there’s many good players that know how to serve up relevant content so ergo their niche in retail space. Odiago looks most impressive with Wibidata and SkyTree most ambitious if they can bring high-performance machine learning to the mainstream. I see Platfora as more a optimized cloud computing play. Too early to tell with Continuuity and “data fabric is the next middleware” is way too me too and included the info mgmt 800 lb gorillas Google, IBM, Oracle, SAP, MSFT so LOL ;)

        Share
      3. Hey why all are asking favorite of Steve only ask me too…

        Share
    4. If you can, then do. Don’t talk about it, be about it.

      Share
  2. Reblogged this on quickgamer88.

    Share
  3. I keep wondering about the future…

    Share
  4. Reblogged this on indhulsheena.

    Share
  5. You should check out Aspera. Big digital asset file transfer protocol build from ground up.

    Share
    1. Derrick Harris Sunday, January 29, 2012

      I know Aspera well, actually, have written about its relevancy to big data. But it’s not exactly in stealth mode ;-)

      Share
  6. WoW! Very good info Derrick, Platfora has my attention now thanks to you.

    Share
  7. there would be more trust me, that may not even have started yet. There is a clear test, whether they would succeed in long run-1) They are solving a hot problem, which definitely same as Hot idea looking for a problem 2) it has to be the simplest way of doing it, and once it is not simplest would be obsoleted.
    it does not mean hell of a bean difference, whether it is IT or a nail cutter, or a blue jacket. both need technology to make. And as One of my professors said in late 70′s-today computers(Hardware and software) are far ahead of applications. Whatever you want done can be done today”. This statement is also true today. IMHO, the biggest gain would be in combine IT, other technology. E.G medical instrumentation and delivery devices. Problem Medical costs are very high and cannot be s of afforded. So combine modern method of rapid prototyping and manufacturing, software and problem and you got a recipe for success. IT is akin to electricity now.

    Share
  8. today its five but there is another 3-4 preparred to start..

    Share
  9. Dr. Ambrish Joshi Sunday, January 29, 2012

    Wonder why only 5; have to be more…

    Share
  10. I’d add ReportGrid to this list. http://www.reportgrid.com/

    Share
  11. Red Lambda’s neural foam technology does amazing things with big data (any and all data). Still stealthy but one to watch for sure

    Share
  12. I would add Fuzzy Logix, with many Fortune 100 customers, and reseller/OEM agreements with several $1-100B companies
    http://www.fuzzyl.com/products/
    Disclosure, I am partner at Fuzzy Logix.

    Share
  13. Jeremy Levine Monday, January 30, 2012

    I’m admittedly not unbiased, but Convertro definitely belongs on this list. They’re not in stealth, but they keep a very low profile despite many high profile customers.

    Share
  14. This ones under the radar also – HyperCloud memory trumps LRDIMMs memory for Romley rollout.

    How it relates to cloud computing – you need lots of RAM per server (as you increase cores) but memory slows down as you increase load. LRDIMM is the solution touted by Intel, but it has high latency issues (plus it requires a BIOS upgrade and non-interoperable with RDIMMs) – at 3 DIMMs per memory channel (3 DPC) it cannot even run at 1333MHz. Meanwhile Netlist’s HyperCloud requires no BIOS update, and is interoperable and can do 768GB at 1333MHz on a 2-socket server (i.e. can do 1333MHz at 3 DPC).

    Which means the LRDIMM space (20% of Romley server) could wind up being owned by HyperCloud.

    Share
  15. I would recommend ResearchShare, which has been recently released by JTE Multimedia: http://tinyurl.com/7r5poh3

    This app allows researchers to read, and collaborate on peer-reviewed medical articles in real time. It is a very interesting approach to helping medical researchers work together on big projects.

    Share
  16. What about Domo Technologies?

    Share
  17. Chris Zaharias Monday, January 30, 2012

    I run SearchQuant, the blog whose write-up of Bloomreach and enterprise SEO platforms you linked for this article. Since that Nov 15 2011 measurement, Bloomreach is up from 160 customers (as measured by BuiltWith Trends) to 228. Impressive [42.5%] growth in such a short period of time, but one of their primary competitors – BrightEdge has gone from 148 to 312. Decidedly, Bloomreach isn’t the only one blooming in the enterprise SEO space.

    Share
  18. Another great company in this space is StatHat (http://www.stathat.com). OKCupid uses them to build out their awesome statistics from their community.

    Share
  19. According to the facts page on BloomReach, it seems that they generate traffic to their customers. I wonder how do they generate that traffic, through ads on the content network?

    Share
  20. Nice one Derrick. Another company you wrote about recently, Space-Time Insight, is also one to watch in this space http://gigaom.com/cloud/how-california-uses-souped-up-google-maps-to-manage-its-power/

    Share
  21. Hey Derrick, I know you have written about Zettaset in the past, but they definitely should be on a list like this. They came out of stealth mode in the past year, and in the fall they announced a partnership with Fusion-io and Hyve Solutions to create what could be one of the world’s fastest Hadoop solutions. Definitely worth another look.

    Share
  22. Christian Sarkar Tuesday, January 31, 2012

    See also Neural ID in Redwood, CA >> http://www.neuralid.com

    Share
  23. Exciting list to be following. Launching a start up in stealth mode is crucial for initial success. Would love to use Odiago for our start up networks.

    Share
  24. I understand “stealth mode startup” to mean one in which nobody outside the company knows what they are doing unless bound by an NDA. Wikipedia agrees. How can these startups be “stealth mode” when you are proclaiming their purposes?

    Share
  25. William Le Ferrand Thursday, February 2, 2012

    I agree. I saw another startup bring a new business model to machine
    learning: No up-front licenses, no installation, pay-as-you-go. Check
    out http://eigendog.com.

    Share
  26. Richard Slade Friday, February 3, 2012

    You missed one !

    http://www.sinusiridum.com

    Share
  27. Domenic Denicola Friday, February 3, 2012
    Share
  28. Anything building energy management focused?

    Share

Comments have been disabled for this post