A recent New York Times article casts some doubt on the economic impact of big data. Here’s why I think we haven’t seen anything yet when it comes to big data and the global economy. Read more »
Genomic-analysis startup Bina Technologies is trying to grow its footprint by giving away its appliances on a pay-per-use basis. It’s also expanding its capabilities to include analysis of exomes, a much smaller but very valuable component of human genes. Read more »
Is it illegal to visit Craigslist when the site tells you not to? In a new ruling on a closely watched case about data scraping, a federal judge suggested that start-up 3Taps violated an anti-hacking law by disguising its IP address. Read more »
The tool, which forms part of Recommind’s cloud-based Axcelerate On-Demand package, aims to give non-technical users a faster and more informative e-discovery process. Read more »
NewSQL player will use funding to pursue opportunities in e-commerce, gaming and advertising, says CEO Robin Purohit. Read more »
The four GigaOM podcasts covered a range of topics this week: From SDNs and VMware to BlackBerry’s past and present. We also discuss why the future of mobile banking will change due to connected devices, so tune in! Read more »
The White House’s Office of Science and Technology Policy published new guidelines and FAQs to ease adoption of President Obama’s open data policy. In May, the president signed an order mandating that agencies use machine-readable and open data formats when they collect or create information so it can be re-used efficiently.
Facebook has reportedly done away with its once-important EdgeRank system in favor of a system that considers about 100,000 factors in determining what content to show on users’ feeds. Read more »
Google researchers have developed new methods for analyzing language using deep learning techniques. They’ve also open sourced an implementation of their work so any researchers can experiment with it. It could be the first of many deep learning tools designed for mass consumption. Read more »
When it comes to data, soccer is the new baseball. The latest issue of the Economist has an article breaking down English Premier League soccer players using data, and a subsequent blog post includes an interactive tool from machine learning startup Ayasdi that lets readers explore the data. Earlier this week, Disney researchers presented their analysis of an entire year’s worth of ball-position data for a professional soccer league and how that can affect the outcome of games.
Todd Papaioannou is joining big data-focused venture capital firm Data Collective as an entrepreneur in residence. Papaioannou was most recently co-founder and CEO of Continuuity, and has held executive roles at companies including Yahoo and Teradata. Read more »
Documents leaked by Edward Snowden hint at the scale of human and system errors in the NSA’s surveillance apparatus that have led to many Americans’ phone calls and emails being intercepted. Read more »
A monthly look at where health tech investors are putting their money. Read more »
This week’s topics: Can VMware make a dent in cloud? What’s all this we keep hearing about YARN? Plus, a chat with Paul Santinelli. Read more »
Facebook has detailed its extensive improvements to the open source Apache Giraph graph-processing platform. The project, which is built on top of Hadoop, can now process trillions of connections between people, places and things in minutes. Read more »
Data scientists are in high demand, which is bound to lure some of them out of academia and into industry. Their biggest challenge won’t be finding a job, but finding the right one — and maybe opting for entrepreneurship over employment. Read more »
It’s pretty worrying when a hacker can take over a baby monitor and start yelling obscenities at an infant girl. But the incident provides a taste of the future if makers of connected devices don’t think security-first. Read more »
A Chicago-based startup called AvantCredit has raised a $20 million series B round for its personal loan service that uses machine learning algorithms to assess credit-worthiness. AvantCredit closed a $34 million Series A round earlier this year. It’s taking a page out of the ZestFinance playbook — lending to underserved markets at rates less usurious than traditional payday-loan providers — although that company now acts only as an underwriter rather than an actual lender.
The company has claimed in a legal document that Gmail users and their correspondents have “no legitimate expectation of privacy”. Only that’s not quite how Google sells its services. Read more »
A senior German politician says a U.S.-Germany “no-spying” deal will protect citizens, and the country’s top email providers say they’re instituting a meaningful security boost for customers. Both claims should be taken with a pinch of salt. Read more »
Looker, a Santa Cruz, Calif.-based business intelligence startup, has raised a $16 million series A round from Redpoint Ventures and First Round Capital. In an age of data tools targeting lay users, Looker is taking a different approach by trying to empower smart data analysts with its custom modeling language. The company closed a $1.7 million seed round in March.
A glowing, plastic polar bear out to save his real-world counterparts uses real-time data and electricity consumption sensors to change people’s habits. It’s a good example of how real-time data and ambient information could influence behavior. Read more »
Microsoft has developed a big data technology that sits on top of Hadoop’s new YARN resource manager. Called REEF, it’s designed to let users build jobs that can maintain state even after they’re done, and that can grab data from wherever they need it. Read more »
The local authority in London’s financial district has put the brakes on a startup’s pilot program, which used smart recycling bins to log the identifiers of passing mobile devices. Read more »
Hortonworks lost both a co-founder and a CTO this week: Who’s going to right the ship? GE supports industrial data in the cloud, but only to a point: Hear why the company thinks AWS won’t ever be the place for nuclear power plant data. Read more »
Machine-learning technologies can be met with resistance from humans skeptical of their effectiveness or fearful of their impact. But it doesn’t have to be that way at your company. Read more »
Zynga has open sourced a tool called zPerfmon that collects and serves all the performance data its engineers could ever need, and all from a single server. Read more »
Spend a few days hanging around Black Hat and DEF CON, and you’ll see some creepy hacks. If you wanna lose a little sleep, dwell on the fact it’s much easier to replicate the work or even to buy data-capturing devices. Read more »
A recent Slate article criticizes Max Levchin’s new fertility app as a too-intrusive tool from a data-loving technophile, but, in many ways, it’s an extension of what many women have been doing for decades. Read more »
It appears investors are buying into the adage that CMOs are the new CIOs. On Friday, a Portland-based startup called Lytics announced it has raised $2.2 million in seed funding from Rembrandt Venture Partners and Voyager Capital. The company compares its big-data-meets-marketing approach with that of Causata, which is good company to be in.
Ads in the Google Maps app are relevant to what you searched for, but can send you farther out of the way than the actual nearest result. Read more »
A London startup called Renew is using Wi-Fi-equipped recycling bins to track the smartphones that pass by on the street. The legality of this is questionable, particularly as there’s no opt-in. Read more »
Silent Circle, co-founded by email security guru Phil Zimmermann, has pulled out of the secure email business. It was a pre-emptive measure inspired by Lavabit’s self-shuttering, and a worrying sign for the U.S.-hosted secure communications industry. Read more »
Hadoop-in-the-cloud startup Qubole says its customers used more than 100,000 nodes to run more than 350,000 jobs and process more than a petabyte of data in July. Those aren’t Facebook numbers, but they seem to signal an appetite among smaller users. Read more »
Google has taken an important new step in its effort to carve out a space where cloud computing innovators can work without fear of being sued. Read more »
The average doctor has likely never heard of Fred Trotter, but he has some provocative ideas about using physician data to change how healthcare gets delivered. Read more »
In our second cloud-and-data podcast, we hear from Facebook’s top analytics guy about how the company deals with all that data; we discuss the drama at Hortonworks and IBM; oh, and why Infochimps and CSC may be a match made in heaven. Read more »
Glow, a data-driven fertility app launched by PayPal co-founder Max Levchin, is now live in Apple’s App Store. Read more »
There’s no guarantee of online privacy and security, but here’s a list of tools you can use to at least cover the basics. Ultimately, though, they will require a serious revamp if they’re going to see widespread use. Read more »
While some big data startups are thriving, others are shutting down or searching for buyers because it doesn’t look like a second round of venture capital is coming. Here are a few lessons I think I’ve gleaned from watching the space over the past few years. Read more »