Hadoop-based analytics startup Tresata last week open sourced a set of machine learning libraries built on Scalding and designed to run in Hadoop and make use of the Apache Mahout project. Tresata is calling the project Ganita, and has also written a couple of explanatory blog posts about it, including how to do k-means clustering. The barriers to doing good work on big data just keep getting lower.
Subscriber content
?
Subscriber content comes from Gigaom Research, bridging the gap between breaking news and long-tail research. Visit any of our reports to learn more and subscribe.
Advertisement
Advertisement
Advertisement
Comments have been disabled for this post