An MIT professor has created an algorithm he says can work in conjunction with rangefinders and adaptive cruise control systems to keep cars moving at the ideal speeds to limit traffic jams. Read more »
Cloudera has partnered with a startup called Databricks to integrate and support the Apache Spark data-processing platform within Cloudera’s Hadoop software. Spark, which is designed for speed and usability, is one of several technologies pushing Hadoop beyond MapReduce. Read more »
Rackspace is now doing Hadoop, Cloudera just announced a handful of partners — Hadoop is everywhere in the cloud these days. Here’s a quick breakdown of what cloud providers are offering which distributions of Hadoop as managed services. Read more »
As the Federal TradeCommission looks into privacy and data sharing rules for the internet of things, Google’s chief internet evangelist Vint Cerf will keynote the workshop. Read more »
After nearly 18 months in relative stealth mode, ClearStory Data is finally available. It’s a pretty novel way of doing business analytics that tries to let lay users do more by automating much of the hard work. Read more »
A new startup called Paxata wants to make business analysts’ lives easier by automating the process of going from raw data to something that an analytics product like Tableau can actually understand. Read more »
The weekend brought a spate of updates in the ongoing NSA saga. German media reported that Barack Obama had known about the tapping of Angela Merkel’s phone for years despite claiming he hadn’t, prompting fresh denials from Washington. Der Spiegel also published a detailed look at the American agency’s Berlin spying tactics.
Meanwhile El Mundo reported that the NSA had recorded phone call details of millions of Spaniards, and the Kyodo news agency said Japan had rebuffed U.S. requests in 2011 to tap fiberoptic cables going through Japan to China.
Rackspace has opened its Hortonworks-powered Hadoop service for early access customers, about a year after announcing it would be building the offering. It’s neither the first nor the last managed Hadoop service we’ll see this week. Read more »
Software isn’t just eating the world, it’s highly profitable compared with most other industries. Software and tobacco are bringing in high margins, and other businesses want in. Read more »
A study by IBM information scientists that looked at the aftermath of the Boston bombings found that Twitter was unreliable — but during such events almost all information sources are inherently unreliable Read more »
Backblaze CEO Gleb Budman came on the Structure this week to talk about everything from building open source storage pods to dealing with the CIA to how hard it easy it can be to waste $1 million marketing to the wrong people. Read more »
Cloudera CEO Tom Reilly says his company doesn’t really think of its peers Hortonworks and MapR as competitors, deciding instead to focus its efforts on winning bigger and broader deals. Read more »
The debate over whether journalists need to code is a lot like the argument over whether bloggers are journalists — a sign of how quickly the field is being disrupted, and of how much we still have left to learn Read more »
Technology buyers in some sectors drool over the promise of things like cloud computing and big data, but those words don’t mean a whole lot in places like warehouses or manufacturing plants, where how something works is far less important than that it works. Read more »
There’s a new line of lingerie out based on the fit data of 200,000 women. Read more »
Germany and Brazil are pushing forward with proposals for a global right to online privacy. It would have been nice if this action had begun in earnest when it was citizens being spied upon, and not only after Angela Merkel and Dilma Rousseff were revealed as targets. Read more »
Facebook’s decision to include status updates and wall posts in Graph Search could be called great or creepy depending on the user, but it wasn’t an inconsequential decision technologically. In fact, it put a significant strain on the feature’s database infrastructure. Read more »
The FTC is contemplating how it should regulate the internet of things. A recent legal opinion as well as a look at some of the comments filed ahead of the meeting offer a glimpse into the regulatory future. Read more »
I apologize if I’m late to the game on this, but someone just tweeted me about Apache Tajo, a potentially interesting new SQL query engine for Hadoop. I’m not sure how much traction it can possibly gain given the glut of other options out there (take a look at this now extremely outdated roundup from February), but I guess more options are better for users, to a point. SK Telecom, a Korean carrier, is already a big fan. Also, some of Tajo’s contributors’ employers are kind of interesting.
It’s product-announcement season in the big data and Hadoop world, and this week was full of them. Here are some of the more interesting items you might have missed if you blinked. Read more »
A new study by the Pew Center and the Knight Foundation shows that while many users come across news while they are on Facebook, most don’t go there specifically looking for it. Read more »
A new product code-named Rubicon aims to answer developers’ rapid-fire queries about all aspects of their applications’ performance in real time, said New Relic CEO Lew Cirne. Read more »
The Irish data protection chief turned down a request to investigate Facebook’s alleged complicity in the NSA’s PRISM scheme, but campaigners Europe v Facebook have won the right to a judicial review of the decision. Read more »
After growing concerns over companies that manage and store student data, a U.S. Senator is asking the Department of Education about how it scrutinizes data-centric third-party vendors. Read more »
Germany’s leader, Angela Merkel, has confronted U.S. president Barack Obama over the likely tapping of her communications. The White House has said the U.S. “is not monitoring and will not monitor” her communications, but has not denied doing so in the past. Read more »
Facebook has one of the largest, if not the largest, MySQL installations in the world, and has created a tool to keep that system online with as little human intervention as possible. It’s called MySQL Pool Scanner and, Facebook’s Shlomo Priymak wrote in a post on Monday describing it, it’s designed to automate ”nearly everything a conventional MySQL Database Administrator (DBA) might do so that the cluster can almost run itself.” Not only does MPS handle availability but, Priymak noted, it also lets administrators do things such as copy the entire Facebook dataset with a single command.
The European Parliament has asked the European Commission to suspend a deal with the U.S. that allowed the sharing of financial data, in order to track terrorist funding. However, that call appears to have been rebuffed. Read more »
A Boston-area startup called Nutonian is taking its machine learning software originally developed for scientific research into the mainstream. To the degree software can ever tell users how and why data are connected, Nutonian says its product is the best thing around. Read more »
The EMC-VMware spinoff integrates the GemFire in-memory database with its Hadoop offering. The goal? Faster feedback cycles. Read more »
Before today Automatic was selling its vehicle-data-gathering gadget solely on its website. Now it has the opportunity to put its quantified driving technology in front of millions of Apple customers. Read more »
Adoption by Google, Wikipedia and some of the top Linux distros is all good and fine, but SkySQL really wants to see the MariaDB database get picked up in the enterprise. $20 million should help that happen. Read more »
Streamy launched in 2007 and officially closed in 2010, but the social news reader application is back in a stripped-down version — not to make its founders boatloads of money on its own, but to prove the capabilities of their new Hadoop application platform. Read more »
Sales intelligence startup Infer is racking up customers and transactions for its software that analyzes thousands of data points in order to score potential leads on the likelihood they’ll convert. For some customers, the scores have become instrumental metrics. Read more »
As our devices multiply and our home broadband (and mobile) connections get faster the middle mile and backhaul networks have to keep up. That’s why Comcast’s test of a 1-terabit-per-second network matters. Read more »
IaaS? Check. PaaS? Check. Now big data? Check. Savvis fills in its cloud services portfolio with new big data services. Read more »
Cooladata has raised a $7.4 million series A round from Greylock IL and Carmel Ventures. The company has developed an interesting behavioral analytics service that’s powered by Google’s various cloud services. Read more »
Google’s new real-time map showing DDoS attacks across the world is both awesome and scary — especially if you’re running a website in the United States. Read more »
The European Parliament’s civil liberties committee has endorsed all of Green MEP Jan Phillip Albrecht’s suggestions for tougher privacy legislation, reversing much of the lobbying work done by technology firms and the U.S. government before the NSA scandal broke. Read more »
Teradata is expanding out of the appliance world by offering a fully managed version of its data warehouse software as a cloud service. The company is also dipping a toe into the NoSQL world — the internet of things — with support for JSON files. Read more »
Most hard drives store data using magnetic properties, but unfortunately those are only good for about 10 years. This might be great for your status updates, but if you’re storing photos or government documents, your best bet for a long-term future might be archival paper (or stone and a chisel). But MIT Tech Review reports that scientists have figured a way to etch data onto sheets of tungsten and silicon nitride in the form of QR codes to store data for the theoretical long term — like a million years. Whether or not we’ll have software to decode it then is another problem altogether.