This week’s topics: Can VMware make a dent in cloud? What’s all this we keep hearing about Yarn? Plus, a chat with Paul Santinelli. Read more »
Facebook has detailed its extensive improvements to the open source Apache Giraph graph-processing platform. The project, which is built on top of Hadoop, can now process trillions of connections between people, places and things in minutes. Read more »
Data scientists are in high demand, which is bound to lure some of them out of academia and into industry. Their biggest challenge won’t be finding a job, but finding the right one — and maybe opting for entrepreneurship over employment. Read more »
It’s pretty worrying when a hacker can take over a baby monitor and start yelling obscenities at an infant girl. But the incident provides a taste of the future if makers of connected devices don’t think security-first. Read more »
A Chicago-based startup called AvantCredit has raised a $20 million series B round for its personal loan service that uses machine learning algorithms to assess credit-worthiness. AvantCredit closed a $34 million Series A round earlier this year. It’s taking a page out of the ZestFinance playbook — lending to underserved markets at rates less usurious than traditional payday-loan providers — although that company now acts only as an underwriter rather than an actual lender.
The company has claimed in a legal document that Gmail users and their correspondents have “no legitimate expectation of privacy”. Only that’s not quite how Google sells its services. Read more »
A senior German politician says a U.S.-Germany “no-spying” deal will protect citizens, and the country’s top email providers say they’re instituting a meaningful security boost for customers. Both claims should be taken with a pinch of salt. Read more »
Looker, a Santa Cruz, Calif.-based business intelligence startup, has raised a $16 million series A round from Redpoint Ventures and First Round Capital. In an age of data tools targeting lay users, Looker is taking a different approach by trying to empower smart data analysts with its custom modeling language. The company closed a $1.7 million seed round in March.
A glowing, plastic polar bear out to save his real-world counterparts uses real-time data and electricity consumption sensors to change people’s habits. It’s a good example of how real-time data and ambient information could influence behavior. Read more »
Microsoft has developed a big data technology that sits on top of Hadoop’s new YARN resource manager. Called REEF, it’s designed to let users build jobs that can maintain state even after they’re done, and that can grab data from wherever they need it. Read more »
The local authority in London’s financial district has put the brakes on a startup’s pilot program, which used smart recycling bins to log the identifiers of passing mobile devices. Read more »
Hortonworks lost both a co-founder and a CTO this week: Who’s going to right the ship? GE supports industrial data in the cloud, but only to a point: Hear why the company thinks AWS won’t ever be place for nuclear power plant data. Read more »
Machine-learning technologies can be met with resistance from humans skeptical of their effectiveness or fearful of their impact. But it doesn’t have to be that way at your company. Read more »
Zynga has open sourced a tool called zPerfmon that collects and serves all the performance data its engineers could ever need, and all from a single server. Read more »
Spend a few days hanging around Black Hat and DEF CON, and you’ll see some creepy hacks. If you wanna lose a little sleep, dwell on the fact it’s much easier to replicate the work or even to buy data-capturing devices. Read more »
A recent Slate article criticizes Max Levchin’s new fertility app as a too-intrusive tool from a data-loving technophile, but, in many ways, it’s an extension of what many women have been doing for decades. Read more »
It appears investors are buying into the adage that CMOs are the new CIOs. On Friday, a Portland-based startup called Lytics announced it has raised $2.2 million in seed funding from Rembrandt Venture Partners and Voyager Capital. The company compares its big-data-meets-marketing approach with that of Causata, which is good company to be in.
Ads in the Google Maps app are relevant to what you searched for, but can send you farther out of the way than the actual nearest result. Read more »
A London startup called Renew is using Wi-Fi-equipped recycling bins to track the smartphones that pass by on the street. The legality of this is questionable, particularly as there’s no opt-in. Read more »
Silent Circle, co-founded by email security guru Phil Zimmermann, has pulled out of the secure email business. It was a pre-emptive measure inspired by Lavabit’s self-shuttering, and a worrying sign for the U.S.-hosted secure communications industry. Read more »
Hadoop-in-the-cloud startup Qubole says its customers used more than 100,000 nodes to run more than 350,000 jobs and process more than a petabyte of data in July. Those aren’t Facebook numbers, but they seem to signal an appetite among smaller users. Read more »
Google has taken an important new step in its effort to carve out a space where cloud computing innovators can work without fear of being sued. Read more »
The average doctor has likely never heard of Fred Trotter, but he has some provocative ideas about using physician data to change how healthcare gets delivered. Read more »
In our second cloud-and-data podcast, we hear from Facebook’s top analytics guy about how the company deals with all that data; we discuss the drama at Hortonworks and IBM oh, and why Infochimps and CSC may be a match made in heaven. Read more »
Glow, a data-driven fertility app launched by PayPal co-founder Max Levchin, is now live in Apple’s App Store. Read more »
There’s no guarantee of online privacy and security, but here’s a list of tools you can use to at least cover the basics. Ultimately, though, they will require a serious revamp if they’re going to see widespread use. Read more »
While some big data startups are thriving, others are shutting down or searching for buyers because it doesn’t look like a second round of venture capital is coming. Here are a few lessons I think I’ve gleaned from watching the space over the past few years. Read more »
Will Jeff Bezos leave the Washington Post alone once he takes control, or will he use it as a research lab for reinventing the newspaper? Here are a few things he should do if it’s the latter. Read more »
Predictive analytics specialist NICE has acquired Causata, a marketing analytics startup built around a core of big data and machine learning technologies. Causata should bolster NICE’s customer-engagement platform that helps companies better understand their customers. The four-year-old Causata has raised $23 million in venture capital, all from Accel Partners.
Another startup wants to save patients from a deluge of potentially irrelevant health information online with a service that customizes a regular feed of reliable research. Read more »
IT services and consulting specialist CSC has acquired Infochimps, a startup that sells a big data query and processing platform. Infochimps had raised about $5 million in equity and debt financing since launching in 2009. Read more »
Less than a year after launching it, a team including former Zynga employees and a longtime geneticist have sold their startup Arpeggi to genetic testing company Gene by Gene. Read more »
Curt Monash has some interesting data points on Hortonworks and the Hadoop market from its point of view — competitive landscape, cluster size, hardware setups, etc. Also word that Eric Baldeschwieler is doing “his own thing.”
Hortonworks CEO Rob Bearden has confirmed that co-founder and CTO Eric Baldeschwieler has left the company. No word as to why, but his departure is the latest event in a busy few months at Hortonworks. Read more »
Vodafone has turned on 4G services, starting in Amsterdam, The Hague, Rotterdam and Utrecht. It’s not the first carrier to do so in The Netherlands, but it is the first to start running 4G over 1800MHz spectrum. Read more »
10gen is announcing that energy demand specialist EnerNOC has rolled out MongoDB to help it analyze its power grid data in new ways. EnerNOC collects 1.5 billion data points every month, although it’s possible they won’t all find their way into the company’s MongoDB environment.
The industrial internet isn’t just some marketing speak thought up by GE. There are different considerations when handling jet engine data compared to a connected door lock. We discuss those on this week’s podcast. Read more »
Misfit’s Shine — the thumb-sized minimalistic activity tracker — is launching in Apple stores internationally. Read more »
If the corporate website is any indication, Hortonworks co-founder Eric Baldeschwieler is no longer with the company. The former Hadoop boss at Yahoo was Hortonworks’ first CEO and was most recently CTO. Read more »
It’s not so much a new brand as a new offering from Cox Communications, called Contour. People often find Netflix’s recommendations less than ideal, but that’s only $8 a month. I hope it’s the massive DVR and second-screen experience that are supposed to hook users.