DataRPM is one of a handful of companies trying to move business intelligence into its next generation by incorporating natural language processing and a search-like experience. InterWest Partners led its series A round. Read more »
Basho, a NoSQL startup whose Riak database competes against the likes of Cassandra in scale-out environments, has lost its CEO Greg Collins, CTO Justin Sheehy and Chief Architect Andy Gross. In an interview with the Register, Sheehy said the departures aren’t as bad as they look and that the company is in good hands. Perhaps, although whoever replaces Collins will be the company’s fourth CEO since it was founded in 2007, and neither of the company’s co-founders remain. Basho has raised more than $31 million in venture capital, with its last funding round of $11.1 million coming in July 2012.
A Major League Baseball team is reportedly the proud owner of a Cray Urika graph-processing appliance that helps the team make in-game decisions by analyzing lots and lots of data. It might be a first, but it’s where sports are headed. Read more »
Eucalyptus CEO Marten Mickos has been around the private cloud space since its inception. Nearly five years after the company’s launch, Mickos shares his thoughts on a market that rose fast, fell hard and appears to be on the rebound. Read more »
As it has been doing a lot of lately, the Facebook data science team released another study on Friday highlighting a particular facet of the social science treasure trove that is its collection of wall posts. It might be cool to see this kind of data in the hands of non-corporate researchers, but it’s still interesting to see things like how polarized political parties are or how much more positive women seem toward each other than men. Also, “Damn Canadians!”
Even Booz Allen Hamilton has dollar signs in its eyes when it thinks about sports data. The company is getting started on a new venture to apply its data science mastery to the piles of sensor and statistical data teams are generating. Read more »
Facebook is building its second data center in Luleå, Sweden, using “rapid deployment data center” techniques that will speed construction and simplify design by prebuilding certain parts and creating standardized kits for others. Read more »
As a new study about sex trafficking during the Super Bowl highlights, advances in data analysis are underpinning some powerful new ways of tackling very tough problems. Among all the stones hurled at the tech sector lately, this is an area in which it can take pride. Read more »
One blog post says, “Not only is Data Science not a science, it’s not even a good job prospect.” Another says, “[T]here will always be a place for those who excel at solving ambiguous technological & business problems. And they’ll cost more than $30/hr.” Who’s right? Read more »
This year’s Structure Data conference has a few new wrinkles, including a trivia night at a nearby pub and a series of Data Lab talks about using new types of data. Here are the details. Read more »
Streaming music service Spotify has acquired The Echo Nest and its graph of musical data spanning more than 35 million songs and 2 million artists. It’s an easy way for Spotify to match companies like Google and Pandora on the data science front. Read more »
A new study might help confirm that D-Wave Systems’ quantum computer chip might actually be what it claims to be. Conducted at the University of Southern California, where the D-Wave system owned by aerospace contractor Lockheed Martin is based, a team of scientists has concluded that the 128-qubit processor “behaved in a way that agrees with a model called ‘quantum Monte Carlo,’ yet disagreed with two candidate classical models.” In two weeks at our Structure Data conference, D-Wave CEO Vern Brownell will talk about what quantum computers can do and how they’ll be available as cloud services.
Tableau and Splunk, two of the more successful (and ubiquitous) data startups turned public companies over the past several years, have partnered on a new connector that lets Tableau users access Splunk as a data source within the analytics software. However, it’s not just the existence of a connector that’s valuable for users, but what it means — that they can now combine Splunk data with other data within Tableau to visually analyze all of it together. As Tableau grows more popular, partnering with it is becoming a popular move for everyone from large software vendors to small startups such as BigML.
A Huntsville, Ala., company is moving from the machine-to-machine world into cloud platforms and big data. Here’s how it did it and how it thinks its work could actually end up saving lives. Read more »
An e-commerce startup called Reflektion has raised an $8 million series B round of venture capital for its technology that helps retailers personalize the online shopping experience for consumers. Intel Capital led the round, and Nike and several private investors also pitched in. This seems like the latest thing in marketing — not just targeted advertising but entire tailored experiences for individual shoppers. It does add an Amazon-like recommendation experience, although one might fairly question whether most product catalogs are large enough to warrant it.
An Irish startup called Aylien is getting into the natural-language processing space with a set of APIs for text analysis. It’s not the first company to do this, but it might be the most unique. Read more »
Uber has published a blog post explaining the difference that median income makes on the company’s service in Chicago. Beauty might be in the eye of the beholder here, but the study itself reinforces how much today’s data-driven companies know about their businesses. Read more »
Sqrrl co-founder and VP of business development Ely Kahn came on the Structure Show this week to break down the state of cybersecurity and the cutting edge of data analysis within the Department of Defense. Read more »
Cloudera is working on an open source project called Oryx that aims bring machine learning to Hadoop in a way that previous attempts such as Apache Mahout could not. Read more »
Apache Spark, an in-memory data-processing framework, is now a top-level Apache project. That’s an important step for Spark’s stability as it increasingly replaces MapReduce in next-generation big data applications. Read more »
NoSQL startup DataStax announced on Wednesday that it has added an in-memory option to its commercial version of the Cassandra key-value database. Cassandra is seeing an uptick in adoption right now because of its scalability and ability to span data centers, and the ability to serve data from memory instead of disk will make it a lot faster, too. If the approaches of startups like DataStax, MemSQL and others are any indication, it looks like databases of the future will feature broad ranges of capabilities, data formats and storage options.
Machine learning algorithms can do a lot of things if they have enough data — recommend products, identify fraud and even help women get pregnant. Fertility startup Ovuline says its 60 million data points from users have helped 50,000 of them get pregnant. Read more »
If the thought of tea gets you thinking of British women, fine china and doilies, Tealet might take some getting used to. The Las Vegas startup has built its business so far on the backs of Reddit and Bitcoin, and it hopes to take on Starbucks. Read more »
IBM announced its Watson Mobile Developers Challenge on Wednesday. The company is pushing Watson as a cloud service hard because it knows it has its work cut out to win developers away from startups and large companies like Google also pushing AI via API. Read more »
Although techniques such as machine learning are taking off in the e-commerce and retail spaces as a way to display better recommendations or optimize product presentation, the smart money is still on humans getting the final say in what customers see. Read more »
RunKeeper tracked what its users were up to in Sochi during the Olympics and found they ran the equivalent of about 78 marathons. It’s an interesting nugget, but part of a much larger picture about learning how, when and where people exercise. Read more »
Website performance and security startup CloudFlare has acquired an anti-malware startup called StopTheHacker. The deal makes the popular CloudFlare that much more useful and also gives the company a new business to take advantage of the global infrastructure it’s building out. CEO Matthew Prince recently suggested it would get into the anti-malware space because it often has spare computing capacity that could be put to work scanning networks rather than sitting idle. Although it plans to integrate the two services more tightly, CloudFlare says it will continue operating and investing in the StopTheHacker service.
A company called Carrier IQ is trying to help mobile carriers serve their customers better by using machine learning algorithms to diagnose problems with their smartphone, such as poor battery performance or call quality. A smart use of the technology would be for carriers to get proactive in helping customers resolve their problems before they get annoyed enough to call customer service or, in an increasingly non-contractual industry, just go elsewhere without letting a carrier know they’re leaving. The holy grail of big data, after all, is to actually be able to be proactive.
IBM has acquired cloud-based database startup Cloudant. It’s a smart move in terms of getting a foothold in the cloud database space, but it also seemingly forces IBM to embrace cloud providers and technologies outside its current umbrella. Read more »
Structure Data has a great lineup of speakers, including a handful that will be talking about how to take advantage of new types of data. Here is a list of sessions anyone interested in sensors, location or artificial intelligence won’t want to miss. Read more »
A startup called BlueTalon officially launched on Friday with a platform for helping people easily share and collaborate on data stored in commercial databases. Read more »
On Thursday, Facebook announced via a post on its engineering page that it has revamped the Thrift framework it built in 2006 (which has since become an Apache project) and is re-releasing it as open source code via GitHub under the fbthrift moniker. Thrift was created as a tool for helping build distributed applications that need to call different services written in different languages. Although it has been very useful, the post’s author explains, Facebook and other Thrift users ran into performance issues and feature deficiencies that have been resolved with fbthrift.
Apache Mesos is the open source cluster-management software that automates operations at companies such as Twitter and Airbnb. Now, a startup called Mesosphere is building a business around taking it mainstream. Read more »
Expect Labs has unveiled the MindMeld API, a set of artificial intelligence capabilities delivered as a service. Developers can use it to create smart applications that know what types of content and search results to recommend, and when. Read more »
A security startup called Elastica came out of stealth mode on Tuesday, and brought with it $6.3 million in venture capital from the Mayfield Fund. Elastica tries to protect corporate data scattered across the dozens of cloud services companies might be using and, like so many other security startups, is touting its use of data science techniques to accomplish its goal. Elastica does have an impressive pedigree, though, both with the Ph.Ds. on its founding team and with advisers including Rayid Ghani (Obama for America, Edgeflip), Tom Reilly (ArcSight, Cloudera), M.C. Srivas (MapR) and Ion Stoica (UC Berkeley, Conviva, Databricks).
After weeks of voting, we’re pleased to announce the winners of the inaugural Structure Data Readers’ Choice awards. The eight winners represent some of the most innovative and promising startups that launched in 2013 and have made better data analysis their mission. Read more »
Analytics startup BeyondCore has raised $9 million for its technology that can analyze complex data sets and automatically highlight the strongest correlations. It’s a promising capability assuming companies are willing to open up analytics across the organization. Read more »
It didn’t take long for the Hadoop market to become a juggernaut, and it won’t take long for it to undergo some significant technological changes. Cloudera co-founder and chief strategy officer Mike Olson came on the Structure Show podcast to break it down. Read more »
Collecting student data digitally isn’t solely something for massive open online courses. Even university professors and their students can benefit from transforming the lecture experience into one designed to go anywhere and collect data all along the way. Read more »
Our Structure Data conference this year is about many things, but one big theme is the emergence of everything as data. Thanks to advances in sensors and machine learning, everything from soil to sounds can provide valuable data. Read more »