10gen has added some new features to its MongoDB connector for Hadoop, including support for Hive and the ability to back up MongoDB files in HDFS. Read more »
Business intelligence and analytics startup Birst has raised a $38 million Series E round led by Sequoia Capital. Birst has been very busy in the past couple of years, moving from SaaS to on-prem software, rethinking the data warehouse and even launching a Hadoop-based service. It looks like Birst is positioned to test the IPO waters like QlikTech and Tableau before it.
Cleversafe, a Chicago-based provider of object-storage systems for housing massive amounts of data, has raised a $55 million series D round led by New Enterprise Associates. Apart from traditional storage workloads, Cleversafe has also made a name for itself as a replacement for HDFS in Hadoop environments. According to Crunchbase, the company has now raised $91.4 million since 2007.
While companies are starting to realize that cloud architects are vital to a well-designed cloud strategy and architecture, the problem is finding education programs that develop the design expertise that cloud architects require to ensure a smooth transition from physical to cloud to ITaaS environments. Read more »
Pamela “PJ” Jones, the proprietor of Groklaw, is shutting down operations in the wake of the Lavabit secure email service closure. Groklaw, which was originally set up to cover the long-running SCO v. Novell trial but went on to facilitate discussions around all sorts of open-source and patent issues, relied partly on anonymous user tips. Jones said email could no longer be trusted, and said she was personally trying to get off the internet as much as possible.
The details, which appear to be genuine, do not include passwords. They do include OAuth tokens, though, so Twitter users should probably revoke and re-establish access to connected third-party apps. Read more »
A recent New York Times article casts some doubt on the economic impact of big data. Here’s why I think we haven’t seen anything yet when it comes to big data and the global economy. Read more »
Genomic-analysis startup Bina Technologies is trying to grow its footprint by giving away its appliances on a pay-per-use basis. It’s also expanding its capabilities to include analysis of exomes, a much smaller but very valuable component of human genes. Read more »
Is it illegal to visit Craigslist when the site tells you not to? In a new ruling on a closely watched case about data scraping, a federal judge suggested that start-up 3Taps violated an anti-hacking law by disguising its IP address. Read more »
The tool, which forms part of Recommind’s cloud-based Axcelerate On-Demand package, aims to give non-technical users a faster and more informative e-discovery process. Read more »
NewSQL player will use funding to pursue opportunities in e-commerce, gaming and advertising, says CEO Robin Purohit. Read more »
The four GigaOM podcasts covered a range of topics this week, from SDNs and VMware to BlackBerry’s past and present. We also discuss why the future of mobile banking will change due to connected devices, so tune in! Read more »
The White House’s Office of Science and Technology Policy published new guidelines and FAQs to ease adoption of President Obama’s open data policy. In May, the president signed an order mandating that agencies use machine-readable and open data formats when they collect or create information so it can be re-used efficiently.
Facebook has reportedly done away with its once-important EdgeRank system in favor of a system that considers about 100,000 factors in determining what content to show on users’ feeds. Read more »
Google researchers have developed new methods for analyzing language using deep learning techniques. They’ve also open sourced an implementation of their work so any researchers can experiment with it. It could be the first of many deep learning tools designed for mass consumption. Read more »
When it comes to data, soccer is the new baseball. The latest issue of the Economist has an article breaking down English Premier League soccer players using data, and a subsequent blog post includes an interactive tool from machine learning startup Ayasdi that lets readers explore the data. Earlier this week, Disney researchers presented their analysis of an entire year’s worth of ball-position data for a professional soccer league and how that can affect the outcome of games.
Todd Papaioannou is joining big data-focused venture capital firm Data Collective as an entrepreneur in residence. Papaioannou was most recently co-founder and CEO of Continuuity, and has held executive roles at companies including Yahoo and Teradata. Read more »
Documents leaked by Edward Snowden hint at the scale of human and system errors in the NSA’s surveillance apparatus, which have led to many Americans’ phone calls and emails being intercepted. Read more »
A monthly look at where health tech investors are putting their money. Read more »
This week’s topics: Can VMware make a dent in cloud? What’s all this we keep hearing about Yarn? Plus, a chat with Paul Santinelli. Read more »
Facebook has detailed its extensive improvements to the open source Apache Giraph graph-processing platform. The project, which is built on top of Hadoop, can now process trillions of connections between people, places and things in minutes. Read more »
Data scientists are in high demand, which is bound to lure some of them out of academia and into industry. Their biggest challenge won’t be finding a job, but finding the right one — and maybe opting for entrepreneurship over employment. Read more »
It’s pretty worrying when a hacker can take over a baby monitor and start yelling obscenities at an infant girl. But the incident provides a taste of the future if makers of connected devices don’t think security-first. Read more »
A Chicago-based startup called AvantCredit has raised a $20 million series B round for its personal loan service that uses machine learning algorithms to assess credit-worthiness. AvantCredit closed a $34 million Series A round earlier this year. It’s taking a page out of the ZestFinance playbook — lending to underserved markets at rates less usurious than traditional payday-loan providers — although that company now acts only as an underwriter rather than an actual lender.
The company has claimed in a legal document that Gmail users and their correspondents have “no legitimate expectation of privacy”. Only that’s not quite how Google sells its services. Read more »
A senior German politician says a U.S.-Germany “no-spying” deal will protect citizens, and the country’s top email providers say they’re instituting a meaningful security boost for customers. Both claims should be taken with a pinch of salt. Read more »
Looker, a Santa Cruz, Calif.-based business intelligence startup, has raised a $16 million series A round from Redpoint Ventures and First Round Capital. In an age of data tools targeting lay users, Looker is taking a different approach by trying to empower smart data analysts with its custom modeling language. The company closed a $1.7 million seed round in March.
A glowing, plastic polar bear out to save his real-world counterparts uses real-time data and electricity consumption sensors to change people’s habits. It’s a good example of how real-time data and ambient information could influence behavior. Read more »
Microsoft has developed a big data technology that sits on top of Hadoop’s new YARN resource manager. Called REEF, it’s designed to let users build jobs that can maintain state even after they’re done, and that can grab data from wherever they need it. Read more »
The local authority in London’s financial district has put the brakes on a startup’s pilot program, which used smart recycling bins to log the identifiers of passing mobile devices. Read more »
Hortonworks lost both a co-founder and a CTO this week: Who’s going to right the ship? GE supports industrial data in the cloud, but only to a point: Hear why the company thinks AWS won’t ever be the place for nuclear power plant data. Read more »
Machine-learning technologies can be met with resistance from humans skeptical of their effectiveness or fearful of their impact. But it doesn’t have to be that way at your company. Read more »
Zynga has open sourced a tool called zPerfmon that collects and serves all the performance data its engineers could ever need, and all from a single server. Read more »
Spend a few days hanging around Black Hat and DEF CON, and you’ll see some creepy hacks. If you wanna lose a little sleep, dwell on the fact it’s much easier to replicate the work or even to buy data-capturing devices. Read more »
A recent Slate article criticizes Max Levchin’s new fertility app as a too-intrusive tool from a data-loving technophile, but, in many ways, it’s an extension of what many women have been doing for decades. Read more »
It appears investors are buying into the adage that CMOs are the new CIOs. On Friday, a Portland-based startup called Lytics announced it has raised $2.2 million in seed funding from Rembrandt Venture Partners and Voyager Capital. The company compares its big-data-meets-marketing approach with that of Causata, which is good company to be in.
Ads in the Google Maps app are relevant to what you searched for, but can send you farther out of the way than the actual nearest result. Read more »
A London startup called Renew is using Wi-Fi-equipped recycling bins to track the smartphones that pass by on the street. The legality of this is questionable, particularly as there’s no opt-in. Read more »
Silent Circle, co-founded by email security guru Phil Zimmermann, has pulled out of the secure email business. It was a pre-emptive measure inspired by Lavabit’s self-shuttering, and a worrying sign for the U.S.-hosted secure communications industry. Read more »
Hadoop-in-the-cloud startup Qubole says its customers used more than 100,000 nodes to run more than 350,000 jobs and process more than a petabyte of data in July. Those aren’t Facebook numbers, but they seem to signal an appetite among smaller users. Read more »