More hbase Stories
Subscriber Content

Companies are rushing to embrace the promise of big data to understand both their businesses and the ways in which customers interact with them. But effective data-based decisions are not made in response to simplistic data reporting; they are made in response to considered and ongoing data analysis. Read more at GigaOM Pro »

Kiji resides in the lower left section

HBase is a great option for developing big data applications, but it’s not necessarily easy to use. WibiData is addressing this by open sourcing a portion of its predictive analytics infrastructure that adds structure to data, followed eventually by a whole HBase development framework called Kiji. Read more »

Team Continuuity
photo: Continuuity

Hadoop is nothing without applications, and Continuuity aims to deliver those apps by making Hadoop something developers can work and innovate with. Its efforts haven’t gone unnoticed — the company just closed a $10 million Series A round from a who’s who of big data VCs. Read more »

Drawn to Scale’s Spire database is meant to be all things to all people — it combines Hadoop, HBase and SQL to provide a fast, scalable, robust experience — and now it has integrated with MapR’s Hadoop distribution. It’s no surprise the young company already claims big customers. Read more »

Subscriber Content

Organizations are coping with the challenge of processing unprecedented volumes of data. However, the processes involved with using a large cluster to run applications like Hadoop are error-prone. So IT managers are turning to cluster-management solutions to automate tasks associated with cluster creation, management and maintenance. Read more at GigaOM Pro »

For better or worse, Hadoop has become synonymous with big data. In just a few years it has gone from a fringe technology to the de facto standard. But is the enterprise buying into a technology whose best day has already passed? Read more »

Subscriber Content

There are now more than half a dozen commercial Hadoop distributions in the market, and almost every enterprise with big data challenges is tinkering with the Apache Foundation-licensed software. A new report examines the key disruptive trends shaping the Hadoop platform market. Read more at GigaOM Pro »

Aaron Kimball of WibiData at Structure:Data 2012

The problem for many companies is that user information is spread across hundreds or even thousands of different fields in various databases, and it’s difficult to compile it in real time. But doing that successfully is becoming increasingly important, says WibiData at Structure:Data. Read more »

Subscriber Content

Big data now touches everything from enterprises to smart-meter startups, while Hadoop is fast becoming the leading tool to analyze that data, and debates around privacy abound. GigaOM Pro analysts offer insights on what to consider when it comes to big data decisions for your business. Read more at GigaOM Pro »

Drawn to Scale, a two-year-old startup focused on making SQL ready for the world of big data by combining it with Hadoop, has raised an initial funding round of $925,000. Its product, Spire, utilizes Hadoop to increase scalability and reduce latency across large data sets. Read more »

Tumblr hits 500 million page views a day, deals with 40,000 requests per second and sends more than a terabyte of data into its Hadoop cluster. Here’s how it went from nothing to a startup that needed to serve 15 billion page views a month. Read more »

Hadoop features front and center in the discussion of how to implement a big data strategy, one of the biggest trends in IT. There’s just one problem that keeps cropping up: many people don’t seem to know exactly what it means when somebody says “Hadoop.” Read more »

For eBay, big data is serious business. Every day, the site stores and analyzes data from millions of users buying, selling and searching for hundreds of millions of products. It handles all this data with lots of Hadoop, although a good data warehouse doesn’t hurt either. Read more »

Although the first couple of years of commercial Hadoop attention have been characterized by an attitude of “Hadoop is great, but …”, the tone is changing as Hadoop vendors make the platform more palatable with each new iteration. No longer is Hadoop necessarily an epic undertaking rife with pitfalls. Read more »

Facebook held a Tech Talk on Monday night explaining how it built a MySQL environment capable of handling everything the company needs in terms of scale, performance and availability. Based on what I heard, it looks like critics of Facebook’s MySQL environment might be wrong. Read more »

Subscriber Content

Hadoop has been used by large web companies for applications such as search engines, but the reality is that the project is so much more. This report takes a closer look, examining what Hadoop is (and isn’t), who’s doing what to productize it and why we can expect to see the market pick up serious steam in 2011. We profile the growing number of companies — from startups like MapR to Cloudera, the arguable leader in the space — using Hadoop, the challenges still hindering widespread adoption and where potential users can expect the market to go as we move through 2011 and beyond. Companies mentioned in this report include Yahoo, Facebook, EMC, Teradata and Appistry. For a full list of companies, and to read the full report, sign up for a free trial. Read more at GigaOM Pro »

Trend Micro maintains web reputation databases that allow intelligent detection of spam, phishing, or suspicious web sites. It processes data accumulating at the rate of about four petabytes per year. Here’s why Trend Micro settled on Apache HBase as the core database of its new elastic infrastructure. Read more »

Hadoop, as a pivotal piece of the data mining renaissance, offers the ability to tackle large data sets in ways that weren’t previously feasible due to time and dollar constraints. But Hadoop can’t do everything quite yet, especially when it comes to real-time workflow. Fortunately, […] Read more »

Hadoop, Cassandra, HBase, Hypertable, Open Neptune… these are some of the open source projects that web technologists are pursuing in order to deal with the explosion of digital data in a post-terabyte world. The traditional way to deal with unstructured data isn’t working. What we need is a structured means of finding, accessing, and retrieving files and objects. Read more »