Science has a data problem, There’s been a rash of experiments that no one can reproduce and studies that have to be retracted, But there are some nascent efforts to address this credibility crisis by changing the way the data is handled. Read more »
University of Illinois researchers have created an app and a sensor-filled cradle that turn an iPhone into a mobile spectrophotometer. The combination of that mobile lab data and metadata such as location might prove very valuable. Read more »
photo: Christophe Bisciglia (left) and Aaron Kimball (right)
Startup WibiData has raised another $15 million and wants to turn the lessons it has learned in the field into generic software that can let anyone build predictive applications on Hadoop. Read more »
Cascading creator Concurrent has developed a new open source tool called Pattern for running machine learning models on Hadoop clusters. When combined with its SQL tool called Lingual, users can move data from one stage to another easily. Read more »
A group of researchers from Columbia and Stanford have created a method for turning complex cellular datasets into visualizations that map the similarities between tens of thousands of cells within a tissue sample. Read more »
Few companies today have the time or the analytics expertise to apply statistics and complex data modeling to regular or even daily business decisions and operations. Now, however, an ecosystem of companies is emerging to fill this need. Read more at GigaOM Pro »
Tableau had a successful IPO, closing the trading day up 64 percent and raking in $254 million. CEO Christian Chabot says the company is now set to make itself known around the world. Read more »
Database startup Drawn to Scale, creator of the SQL-on-Hadoop technology called Spire, is closing down. The company’s product, Spire, was one of the first SQL-on-Hadoop technologies. Read more »
Tableau’s initial public offering is on Friday, and expectations are high. The company has inspired much of the next-generation analytics space, and how it fares could be telling about just how powerful the data movement is. Read more »
When it comes to using big data technology effectively, there’s a lot to like about SaaS. When companies like BloomReach create and analyze massive web-wide data sets, they automate insights that almost no individual company could discover on its own. Read more »
Graph databases and graph-processing applications have been popping up all over the place lately, and now they’re starting to go commercial. On Tuesday, popular open source project GraphLab joined the ranks of graph startups. Read more »
With its acquisition of Lucky Sort, Twitter seems to be acknowledging that it’s a data company after all. The plan appears to be building a services that would do for Twitter equivalent to services such as Google Trends and Google Analytics. Read more »
If the big data era is really going to revolutionize our world, visualizations that let more people make sense of data will be critical. Here are six startups trying to change how we interact with and look at our data. Read more »
British privacy advocates have reacted with horror to the idea of EE and market research firm Ipsos Mori selling anonymized customer data. On balance, they shouldn’t worry so much. Read more »
Data-warehouse providers are quickly adding Hadoop distributions, or even their own versions of Hadoop, into their architecture, adding further cost advantages to collections of extremely large data sets. Finding the talent to manage this newly converged environment will not be easy, but it presents tremendous opportunity for companies willing to take some risk. Read more at GigaOM Pro »
Netflix CEO thought he could do a better job at developing a recommendation algorithm than his engineers. He failed – and the episode shaped the way the company has looked at data ever since. Read more »
Hadoop startup Mortar Data is offering to build recommendation systems for 10 companies, with help from Hilary Mason, Drew Conway and Max Shron. It’s part of a bigger plan to democratize the science behind online recommendations. Read more »
Teradata is trying to steal some thunder in the in-memory analytics space with a new technology called Intelligent Memory that places hot data in RAM while dispersing the rest across solid-state drives and disk. Read more »
EMC CTO John Roese has a tough, but important job trying to keep EMC, VMware and Pivotal all moving in the same direction. While the three are separate companies, their fates are also very much aligned. Read more »
MetLife is building new products on new technologies thanks to a $300 million investment in new technology and new skills. One of the first products is a MongoDB-based app that puts all of customers’ information in one place. Read more »
IBM’s entrant in the SQL-on-Hadoop competition has been flying under the radar, but is available as a technology preview. Called Big SQL, it’s a big deal if IBM wants to be a major player in the Hadoop space. Read more »
MailChimp wasn’t always a big data company, but 12 years into its existence the company is using its mountains of email data to do everything from modeling spam to connecting subscribers. Read more »
By Haowen Chan and Robin Morris, Guest Contributors
photo: pzAxe/Shutterstock
True believers may be guilty of hype, but there’s no denying that big data presents opportunities for businesses of every stripe. That potential is vulnerable to pollution from data bias, and so calls for preventative processes. Read more »
The confluence of better location data and audio-recognition could mean big changes to seemingly static industries such as retail and radio as they learn more about what customers really want. Read more »
MapR on Wednesday released its commercial version of HBase called M7, the first such product on the market, that the company claims is bigger, faster and better than the open source version. Read more »
Analytics startup Precog is on a mission to make analytics on unstructured data as simple as possible with a new line of targeted appliances. Read more »
The growing pains of big data were apparent at the Data 2.0 Summit on Tuesday in San Francisco. Here is a selection of visualization tools that came up at the meeting. Read more »
Machine learning startup Skytree has raised $18 million for its software that makes short work of pattern recognition across massive datasets. Read more »
In the tsunami of experimentation, investment, and deployment of systems that analyze big data, vendors have seemingly been trying approaches at two extremes—either embracing the Hadoop ecosystem or building increasingly sophisticated query capabilities into database management system (DBMS) engines.For some use cases, there appears to be room for a third approach that lies between the extremes and borrows from the best of each. Read more at GigaOM Pro »
Cloudera’s Impala engine for interactive SQL queries on Hadoop data is now generally available, and CEO Mike Olson gives his lay of the competitive landscape. Read more »
Artificial intelligence expert and Google Director of Research was elected to the American Academy of Arts and Sciences last week. He’s well known for a 2009 paper titled “The Unreasonable Effectiveness of Data.” Read more »
The advent of big data is affecting Ford Motor Co. in some significant ways, from how it analyzes its supply chain to the features it puts into its cars. Read more »
Analytic database vendor ParAccel has been acquired by a relatively quiet database company called Actian. ParAccel targets big data with its scale-out architecture, and it counts Amazon as both an investor and user. Read more »
Hadoop experts Qubole have just closed a Series A funding round for their service, which lets users run Hive data warehouse jobs in Amazon’s cloud. Read more »
“Social customer service” refers to those services that provide customer support via social media channels. Providing such services is no longer merely a niche or specialty sideline. Challengers, or disruptors who were early with the new technology, are working to expand and integrate their offerings into enterprise systems and processes. Read more at GigaOM Pro »
Gravity CTO Jim Benedetto knows his way around MySQL after managing a 600-instance cluster at MySpace, but he has found HBase religion as his real-time content-recommendation platform grew. And he’s not alone. Read more »
Guavus makes its living by helping telcos and mobile carriers make sense of what’s happening across their networks. To date it has raised $87 million and is looking to expand far and wide. Read more »
Less than year after hitting the 1 trillion object mark, Amazon S3 is now storing more than 2 trillion objects. That’s a lot any way you slice it and highlights AWS’s role as an underpinning of today’s web. Read more »