Dropbox is acquiring a data visualization startup called Parastructure, according to TechCrunch. It’s one of those deals where nobody is talking yet, but what little info is publicly available about Parastructure helps shed some light on Dropbox’s motivation. It’s hard to imagine Dropbox getting into the analytics software business that Parastructure was targeting, but it’s not hard to imagine Dropbox acquiring some talent that can help it scale onto, and query, new data technologies. And, like Box did with its dLoop acquisition, Dropbox might also be looking to improve search for its business users.
Hadoop is a complex technology, so it helps to have friends in high places when you’re trying to develop it and integrate webscale tooling into enterprise environments. For Hortonworks, that friend is Yahoo, with which it continues a deep engineering partnership. Read more »
Microsoft, which recently showed off its machine learning research with Skype Translate, is opening up those capabilities with a new cloud service called Azure Machine Learning. Read more »
Apple, Cisco and AT&T have joined Verizon and the Electronic Frontier Foundation in supporting Microsoft’s attempt to quash a U.S. search warrant seeking email data about an Irish customer stored on Irish servers. Read more »
On this weeks Structure Show podcast, Barb Darrow, Stacey Higginbotham and I talk about which speaker and topics have us mosted excited about this weeks Structure conference. Needless to say, theres a lot to like about it. Read more »
Netflix explained how it’s using data analysis to do more than recommend movies in a blog post this week. From optimizing bitrate to churning through user feedback, advanced algorithms are helping ensure that minimal issues affect the streaming experience. Read more »
Business intelligence startup SiSense has raised a $30 million third round of venture capital from DFJ Growth, as well as existing investors Battery Ventures, Genesis Partners and Opus Capital. The company has now raised $44 million since it launched in 2010. Like most analytics startups, SiSense promises nice visualizations and a user-friendly experience, but its major bragging point is fast data processing thanks to an architecture that takes full advantage of the processor’s cache rather than just DRAM or disk. The company appears to being growing impressively, too, claiming triple-digit customer growth and some big-name accounts.
Google open sourced a Docker-centric tool called Kubernetes that lets its cloud computing customers automate their resource management similar to how Google does it internally. It’s part of a sustained approach to prove Google’s chops as a cloud provider by pushing its vision of computing. Read more »
Professional social network Declara has expanded its series A round to include two new investors and an additional $9 million in funding. The company’s technology platform uses machine learning to connect people with co-workers and content that will help them learn. Read more »
A startup called Ersatz Labs is promising deep learning, delivered via appliance or the cloud, usable by pretty much anybody already familiar with machine learning. It’s the latest attempt to take the data-processing approach out of the lab and into the mainstream. Read more »
With a new tool for assigning personality profiles based on Facebook posts, a startup called Five is trying to demonstrate the types of inferences companies can make about consumers. The company hopes it will spur desire for a more-private alternative to today’s very public platforms. Read more »
Location-tracking startup Placed has raised a $10 million series B round for its smartphone app that knows which businesses consumers are visiting. It’s valuable information on its own, and even more powerful when combined with other data sources. Read more »
GoDaddy has filed for an IPO, and its S-1 report sheds some light on the company’s technological underpinnings. Here are the highlights. Read more »
DNA-SEQ and 1QBit, two startups working on cancer research and financial modeling, respectively, have partnered with quantum computer manufacturer D-Wave Systems to try and develop applications that can run on the latter’s system. Last month, a company called Aerospace Concepts announced its plans to develop quantum computing software targeting advanced systems design. Debate over D-Wave’s technology aside, attempts to build quantum software targeting specific industries are meaningful. If they’re eventually proven effective, such applications will presumably lower barriers to adoption (e.g., how to use them and when) that often accompany new technologies.
Inspired by Google’s famous approach to resource management, Apache Mesos is the open source software that manages the large pools of servers and cloud instances at companies such as Twitter and Airbnb. Mesosphere, a company trying to commercialize it, has raised $12.75 million since launching. Read more »
Deep learning is all the rage among the tech scene right now, and that’s more a result of its utility than because it sounds cool. Some questioned the feasibility of the Secret Service’s requested “sarcasm detector,” but deep learning could help there, too. Read more »
Amazon Web Services, Gnip and two Australian research institutions have teamed up to track the emotions of tweets in near real-time and offer the data to the public via visualizations, downloadable tables and an API. Read more »
Elasticsearch, the company behind a very popular open source suite for indexing, searching and visualizing JSON documents, has raised a $70 million series C round of venture capital. Just more than two years since being founded, the company has raised $104 million. Read more »
Big data startup Continuuity has teamed with AT&T Labs on an open source project called jetStream that pairs a high-throughput SQL database with a real-time data-processing engine. The goal is to underpin applications that can handle multiple levels of latency, consistency and analysis on streaming data. Read more »
IBM announced the three winners of its Watson Mobile Developer Challenge on Tuesday, but the company will likely many, many more developers on on the platform if Watson is going to become a focal point of the company’s future. Read more »
DataTorrent, a startup building a stream-processing engine for Hadoop that it claims can analyze more than 1 billion data events per second, announced on Tuesday that its flagship product generally available. Stream processing is becoming more important as we move into an era of connected devices, ubiquitous sensors and fast-paced web platforms such as Twitter. Data is flowing into systems faster than ever, and many companies would like to get some use out of it in real time; in some cases, even hours-old data could be considered stale. Other products and projects addressing stream processing on Hadoop include Apache Storm, Spark Streaming and Samza, and Amazon Kinesis.
Hadoop vendor Cloudera has acquired Gazzang, a startup specializing in encryption for Hadoop environments, as well as others next-generation data stores such as Cassandra and MongoDB. Read more »
Hadoop software company Cloudera has acquired Gazzang, a startup specializing in encryption software for big data environments. It’s Cloudera’s first significant acquisition (it bought machine learning startup Myrrix in 2012 in more of an “acquihire” situation) and it speaks to the importance of security as customers’ […] Read more »
Tableau Software has been on fire over the past couple years, but a big knock against the company has been that its popular analytics software isn’t available for the Mac operating system. The company has been promising a native Mac edition for a while, and it appears it will finally deliver in a few weeks, if a June 19 event called “Tableau 8.2 Roadshow + Mac Launch Party” is any indication. I suspect a lot of Windows virtual machines will be firing up a lot less frequently and a goodly amount of new users signing up.
Skymind is providing commercial support and services for an open source project called deeplearning4j. It’s a collection of of approaches to deep learning that mimic those developed by leading researchers, but tuned for enterprise adoption. Read more »
Big data startup Concurrent has raised a $10 million series B round of venture capital from Bain Capital Ventures, Rembrandt Ventures and True Ventures (Disclosure: True Ventures is also an investor in Gigaom). The company actually made its name building and supporting Cascading, a framework that many folks use to build Hadoop applications and data workflows at a higher level than writing MapReduce jobs, but is now also pushing application performance management with a product called Driven. Concurrent launched in 2007 and has raised nearly $15 million since first taking venture financing in 2011.
Gigaom has written a lot about artificial intelligence over the years. Here are three timelines tracking the rise of deep learning and other learning systems, IBM Watson and AI discussions at Gigaom conferences. Read more »
On this week’s Structure Show, we discussed how VMware handles automation both for its customers and in its own cloud, and how a startup called Mesosphere wants to turn parts of that approach on their head by taking inspiration from Google and Twitter. Read more »
The Global Database of Events, Languages, and Tones is a growing trove of information about meaningful events that have happened across the world in the past three decades. Now, it’s available to the public to access and analyze using Google’s cloud computing services. Read more »
A company called Fyusion is trying to redo the photo app with a 3-D imaging technology that captures the spatial features of objects in order to generate multi-dimensional images. Photos, however, are likely just a first step. Read more »
A new framework by Microsoft Research lets lawyers and privacy managers encode their policies using a language called Legalease, and check code for compliance across systems that store, process and analyze data. The goal is to speed development and put parties on the same page. Read more »
Apixio has raised a $13.5 million series C round of venture capital from Bain Capital Ventures and several angel investors. The company’s technology extracts data from numerous types of medical records, including patient charts and doctors’ notes, in order to help hospitals and insurance companies get a better sense of patients’ health for risk-assessment purposes. Apixio has raised more than $22 million since launching in 2009. It first launched publicly in 2011, then focusing on its semantic-analysis engine for identifying latent connections between diagnoses and symptoms.
Raymie Stata, current co-founder and CEO of Hadoop startup Altiscale and ex-Yahoo CTO, came on the Structure Show this week to talk about why Hadoop matters, why who’s building it matters and where it’s headed. Here are the highlights. Read more »
Researchers from Allen Institute for AI have built a computer system capable of teaching itself many facets of broad concepts by scouring and analyzing search engines using natural language processing and computer vision techniques. Read more »
Large web companies such as Facebook, Google and Amazon have been pushing the boundaries of IT for years, and now their problems — and solutions — are making their ways into the mainstream. Read more »
Expect Labs has revamped its MindMeld artificial intelligence API to support new languages, more devices and better speech recognition. The company is trying to power a new generation of apps that build on what Siri started. Read more »
A Palo Alto startup called Wit.AI wants to enable a world of intelligent dialog with the devices around us, but conquering voice commands is the first step on its mission. Read more »
Context Relevant, a Seattle-based startup that promises to create accurate predictive models in a hurry, even across large datasets, has raised a $21 million series B round of venture capital. Formation 8 led the round, which also included existing investors Madrona Venture Group, Bloomberg Beta and Vulcan Capital, and several angels and strategic investors. Context Relevant’s technology includes specialized algorithms for functions such as fraud detection and behavioral analysis, then automatically analyzes data against them and generates models that can added to applications without recoding.
Analytics startup DataPad emerged from stealth mode on Tuesday, along with $1.7 million in funding from a list of prominent investors. Its trying to provide simple, attractive analytics for the little guy, which is a promising but difficult space. Read more »
Facebook artificial intelligence director and NYU researcher Yann LeCun delved into the future of AI last week in an Ask Me Anything session on Reddit. Here are his thoughts on why Her is a long way off, but is on the right path. Read more »