It’s pretty clear mobile payments and the idea of delivering payment capabilities as an app are finally hitting their stride, but amid the details of how we’ll pay for things online, what it will mean for our relationship to money and our relationships with retailers? Read more »
It turns out that “big data” isn’t just a buzzword, but a legitimate concern for companies across the board. Their interest in the tools to take advantage of the opportunity for data analysis has sparked a land grab among software vendors centered around Hadoop. Read more »
Mapr, a stealth-mode start-up with about 30 employees is developing a version of Hadoop and plans to compete with the likes of Cloudera. The company is likely to launch later this year and has been funded by Lightspeed Venture Partners and NEA. Read more »
With so much data flowing over networks and with so much computing power needed to crunch that data, computing infrastructure needs to be as low power as possible to both make the era of “big data” economical and also more eco-friendly. Read more »
Underlying all the useful applications, like Hadoop, that have emerged out of the big data ecosystem, there’s a fundamental assumption: The data that companies want will be able to be accessed when companies want and need it, explained Michelle Munson, CEO and co-founder of Aspera. Read more »
At our Structure: Big Data conference, CA Technologies CTO Donald Ferguson suggested that big data might actually be a driving force behind the adoption of cloud computing because variable workloads are ideal for utility billing models. More of his thoughts here. Read more »
As organizations strive to analyze more data than ever and to do it faster than ever, the results they’re getting might actually be worse than those in the pre-big-data and real-time world — at least temporarily. Read more »
When it comes to social data, one of the biggest firehoses around is the one that comes from Twitter. Trying to make sense of 140 million tweets a day in something close to real-time is a significant challenge, says Tap11 chief technology officer Braxton Woodham. Read more »
Google may have more distributed data than any other company but it still takes user input to create smarter machines. Google’s Voice Search speech recognition, for example, began to improve when the service started to train itself and improve accuracy through the use of end-user data Read more »
During an afternoon panel entitled “The Many Faces of MapReduce — Hadoop and Beyond,” moderator Gary Orenstein compared the two primary Hadoop components — MapReduce and the Hadoop Distributed File System — to the meat and bread of a sandwich. Read more »
Mining terabytes of data isn’t just for service providers — media companies are also trying to make use of the oceans of information they have about their users to come up with better ways of recommending news to them, says Bloomberg Digital head Kevin Krim. Read more »
Dave Hitz — data storage pioneer and co-founder of NetApp — doesn’t like the term big data. But, if he must use the moniker, he acknowledges that big data and analytics are what are keeping the storage industry interesting and driving business right now. Read more »
NoSQL startup DataStax officially entered the pantheon of Hadoop providers today, introducing its own distribution called “Brisk.” Brisk utilizes the open source NoSQL database Cassandra as a replacement for Apache’s Hadoop Distributed File System, as well as Cassandra’s built-in MapReduce engine and Hive. Read more »
Data isn’t the solution to business problems. Pulling data into applications and using it to make decisions and improve the user experience is the way to solve business problems said Jim Baum, the CEO of Netezza, at Structure Big Data. Read more »
As the amount of captured data grows, how can businesses make more sense of it, use it for accurate predictions and better understand their customers? The answer may lie in the world of physics: the concept of space-time paired with data improves predictions through context. Read more »
Joyent Founder and Chief Scientist Jason Hoffman redefined the concept of big data in a panel on data science with bit.ly Chief Scientist Hilary Mason, Cloudscale Founder and CEO Bill McColl, Fluidinfo Founder and CEO Terry Jones, and nPario President and CEO Bassel Ojjeh. Read more »
Are you excited about NoSQL, Hadoop and big data analytics? Then you don’t want to miss GigaOM’s Structure Big Data conference, which is happening today in New York. Don’t have a ticket for the sold-out event? No worries, every session is streamed live online. Read more »
A Yale computer science project has turned into a company giving Hadoop the ability to perform analytics on both structured and unstructured data. Hadapt launched today with an undisclosed amount of funding and the goal of making Hadoop more broadly applicable for analytics. Read more »
Netflix is taking a bold step, licensing the first exclusive show to stream through its service before appearing on broadcast or cable TV. But is the move as risky as some might think? Thanks to a large amount of viewing data, Netflix doesn’t think so. Read more »
I met with a cool startup called DueDil, which is trying to provide a Lexis-Nexis-meets-Google service that aggregates public data on public and private companies from a variety of databases and uses that to create new financial metrics to determine success. Read more »
Using Hadoop to process data for targeted web advertising efforts is nothing new, but this week, two companies in the video advertising space also stepped forward to highlight how Hadoop is helping them deliver the right ads to the right viewers for their clients. Read more »
Consumer electronics recommendation engine Retrevo launched a new feature this morning that challenges the Amazon Marketplace. However, for Retrevo to meet its lofty goals of dethroning Amazon even in this single category, it will have to rely on the accuracy of its machine-learning algorithms. Read more »
Just over than a month after discontinuing its Hadoop distribution to focus on the flagship Apache Hadoop project, Yahoo is proposing some changes to the Hadoop MapReduce component that could significantly improve processing performance. The proposal illustrates just how beneficial Yahoo’s renewed focus could be. Read more »
Yesterday, HP CEO Leo Apotheker laid out his vision for the company’s cloud computing future, but given HP’s all-but-non-existent cloud strategy until this point, it’s difficult to believe the company can be a real competitor until it actually starts to deliver what Apotheker is promising. Read more »
IBM and Revolution Analytics have brought together SQL queries and predictive analytics by integrating R Enterprise statistical analysis software with IBM’s Netezza TwinFin data warehouse appliance. It’s part of a significant evolution in analytics strategies as big data becomes a big issue for all types organizations. Read more »
Couchbase, the company formed by the merger of Membase and CouchOne last month, has released the first version of its Couchbase Server database, as well as a board of advisors that reads like a who’s who of web infrastructure and big data. Read more »
Microsoft is developing a new big data tool called Dryad. Dryad and the associated programming model, DryadLINQ, simplify the process of running data-intensive applications across hundreds, or even thousands, of machines running Windows HPC Server. Dryad builds upon lessons learned from Hadoop, but differs in some significant ways. Read more »
ParAccel’s competition all got bought, leaving the company standing all but alone as an independent company dedicated to the cause of big data. But with a solid product and a steady business channel to boost a large vendor’s bottom line, it shouldn’t be alone for long. Read more »
We’re in the midst of a computing implosion: a re-centralization of resources driven by virtualization, many-core CPUs, GPU computing, flash memory, and high-speed networking. We have a lot to watch over the next few years: what I like to call the coming of the Super Server. Read more »
Facebook is working on a real-time analytics dashboard to let users determine which content is getting the most attention from visitors. As described in an educational session on Wednesday night in Facebook’s Seattle office, the service is built atop HBase and tracks about 100 metrics. Read more »
Once you finally get that new Mac in your hands, you’ll want to get up and running fast. Migrating all of your applications, preferences and data can be a daunting task, but there are options available to help make your transition as painless as possible. Read more »
Data warehousing giant Teradata today agreed to acquire Aster Data, a data analytics provider, proving that it’s no longer enough to be able to store and access a lot of data quickly, one must also be able to analyze it quickly. But now, who’s left. Read more »
Infochimps is attempting to build a data market, and in doing so, the company is wading into some of the messiest and most unstructured data around, attempting to clean it up and put it up for sale. I talk to co-founder Flip Kromer about the challenges. Read more »
After all the talk over the past few weeks about IBM’s Watson, it’s becoming clear that Watson is not HAL of big-screen notoriety. But when used in concert with predictive analytics software, technologies like Watson can become part of a very complete big-data architecture. Read more »
Big Data software company Acunu said it has closed £2.2 million ($3.6 million) in Series A financing. The startups software helps bridge the gap between expensive in-memory storage and cheap-but-slow hard drives, by offering a rewrite of the storage stack optimized for solid state drives. Read more »
Every attendee of SXSW Interactive is used to the yoga, the HTML5, the gaming, and the death of journalism panels, but for 2011, the conference has fastened onto two new trends: data as a double-edged sword and a lack of women in technology and startups. Read more »
Aaccording to one machine-learning expert, one key takeaway from Watson’s “Jeopardy!” victory is simple: humans are very smart. That a system such as Watson can understand natural language is a huge step forward, but it’s still only as good as its data and algorithms. Read more »
Two popular big data startups, Karmasphere and 10gen, made management changes this week, which might signal that the companies’ boards feel they’re poised to make runs at the big time and need seasoned leadership to take them to the next level. Read more »
Terracotta is trying to bring real-time analytics to the masses (of Java users, at least) by letting Ehcache users query data stored in the product’s in-memory cache. With Ehcache Search, customers can perform real-time queries against terabytes of data stored in their transactional caches. Read more »
In a new Forrester report, authors James Staten and Lauren E. Nelson advise infrastructure and operations (I&O) leaders to encourage their data analysts to get hip to cloud-based analytics tools and to consider making their organizational data available to the public as a cloud resource. Read more »