More data Stories

The Hadoop hoopla is generating increasing numbers of announcements from more and more vendors. From startups to large established players, new products and partnerships are emerging which confirm the emergence of a vibrant Apache Hadoop. Hall explains the three emerging layers in the “Hadoop stack.” Read more »

Commercial Hadoop startup Karmasphere today released the results of a survey of 102 Hadoop developers regarding adoption, use and future plans. The results provide some interesting insights into how Hadoop grows within organizations and underscore its status as an extremely valuable, but none-too-simple analytics tool. Read more »

Upcoming Events

Aster Data, a big data analytics software company is saying that it has received $30 million in new funds from existing investors and a new undisclosed strategic investor. David Cheriton who backed Google and VMWare as an angel investor is also investing in the company. Read more »

IBM says it will acquire Marlborough, Mass.,-based Netezza Corporation, a maker of data warehousing analytics for a whopping $1.7 billion in cash. IBM is offering $27 a share for the Neteeza and hopes that the smaller company would help IBM with its growing business analytics practice. Read more »

As Big Data gathers steam within the consumer web, Cloudera is making it possible for mainstream IT to tap into this trend through its distribution of Hadoop, suggested by the company’s customer growth. Lower costs and improved ease-of-use are making Hadoop a reality for enterprise. Read more »

Aster Data, a San Carlos, Calif.-based start-up that develops software for big data applications, says it’s replacing its founder-CEO, Mayank Bawa, with software industry veteran Quentin Gallivan, who previously worked for Pivotlink and Postini. Bawa will switch roles and will be company’s chief customer officer. Read more »

loading external resource

While settling on a standard big data stack is deeply important to the big data industry as a whole, I’m nonetheless questioning the operational and competitive consequences for companies who choose to buy into this standard without first considering the value of building a proprietary solution. Read more »

The online personal finance assistant Mint often mines user data for trends and interesting charts to feature on its popular corporate blog. Now the Intuit-owned company is preparing to release the data it’s collected on behalf of its 3 million users. Read more »

Hadoop, the big data analytics software is so hot right now. Heck anything big data is so hot right now. Today’s links offer insights to Hadoop alternatives, how to use Hadoop and an endorsement of Microsoft’s platform as a service strategy. Read more »

Image (3) file-backup-software-for-windows-and-mac_-remote-backup-and-file-sync-with-sugarsync.jpg for post 28872

Most website users prefer logging in with a Google sign-in, but Facebook is a close second, according to new data from Janrain. Close to 40 percent of users preferred to sign in with a Google ID, while 24 percent chose to login with their Facebook profile. Read more »

Hadoop, thanks to the growing importance of Big Data Analytics is gaining traction inside the enterprise. What’s been missing for Big Data Analytics has been a LAMP-like stack. Fortunately, a stack for Big Data aggregation, processing and analytics is on its way. Read more »

Commercial Hadoop champion Cloudera is building a connector to enable movement of data between Netezza’s data warehousing appliance and Cloudera’s Hadoop clusters. It’s just the latest instance of an analytics vendor integrating Hadoop support, and further evidence that Hadoop has legs as a commercial technology. Read more »

Twitter has scaled back its plans to store billions of tweets using Cassandra, but the interest in this news and NoSQL data stores in general goes beyond one company’s decision. It touches on the changing nature of the web and the software that underlies it. Read more »

Apple’s iconic iPod is credited for reviving the company and helping it dominate consumer mind share over the past decade, but of my greatest fears has started to come true where analysts are now throwing the notion out there that the iPod is dead or dying. Read more »

Arthur van Hoff helped architect Java at Sun, co-founded Marimba, and engineered the application platform at TiVo. Now he’s identifying trends in Twitter messages at The Ellerdale Project. He explains to us why real-time search is one of the most “intellectually challenging” things he’s ever done. Read more »

EMC realizes two simple facts: pure hardware is a commodity and the next industrial revolution is all about data. And that is why it is accelerating its investments in software. Today it acquired Greenplum, a 10-year-old data warehouse software company for north of $300 million. Read more »

A few months ago, I posited that additional funding for Cloudera and Karmasphere signifies a large market opportunity for solutions that utilize the open-source analytics tool Hadoop. From the news generated this week by Yahoo’s third annual Hadoop Summit, my beliefs of this have only been affirmed. Read more »

Hadoop creator and champion Yahoo is taking advantage of its annual Hadoop Summit today by rolling out some new features for its open-source Hadoop distribution. The new features tackle security and workflow management, which Yahoo hopes will help Hadoop continue its proliferation among mainstream users. Read more »


Now that AT&T, along with all the providers internationally, have scrapped unlimited data plans and introduced caps, you’ll need to keep an eye on how much data you’re using. Here are a few ways to make sure you don’t end up going over your monthly allowance. Read more »

DemandTec, a retail forecasting software provider, has convinced Target Corp. to hand over even more of its shopping data in order to better set prices and forecast demand. But DemandTec has needs of its own — partners that can help it filter unstructured social data. Read more »

Google revamped its search indexing methodology this week, which was quickly eclipsed by the chatter about background images on its home page. But those images were a red herring distracting us from technology changes that could influence those delivering the real-time web for years to come. Read more »

Want to know how Apple’s Genius song recommendation system for iTunes works? A post telling folks was deleted without explanation, but it’s worth reading since recommendation engines are the key to shoving the web onto devices like mobile phones and for creating a hyperpersonalized surfing experience. Read more »


Given the recent news about AT&T’s decision to shift from unlimited 3G data plans, we were curious how much data you actually use on your device? Taking a peak at my stats revealed that I’ve downloaded 4.1GB of data and uploaded nearly a gig. Read more »

Is there a business in providing intelligible data sets to information workers, application developers and analysts in a world where once expensive data such as turn-by-turn directions or real-time financial quotes are now free? Microsoft, with its Project Dallas, joins other firms hoping that there is. Read more »

We managed to create 800,000 petabytes of digital information last year, according to a study released today by IDC and EMC. The creation of digital data will increase to 1.2 million petabytes by the end of this year, which means we need fatter pipes. Read more »

The World Bank, which tracks everything from mortality rates to livestock production in hundreds of countries around the globe, said today it is opening up its data, including removing all of the pay walls around data that used to require a subscription fee to download. Read more »

Twitter today open-sourced the code that it used to build its database of users and manage their relationships to one another, called FlockDB. The move comes shortly after Twitter released its Gizzard framework, which it uses to send thousands of queries a second to FlockDB. Read more »

In some ways, the fact that Hadoop is mature enough to inspire commercial products — Cloudera and Karmasphere, e.g. — means it’s yesterday’s news. Which open-source, big-data-inspired product will be the next to launch a wave of startups and drive tens of millions in VC spending? Read more »

From a comparison of auto and PC industries to problems associated with the location-based advertising to tips & tricks of reading startup term sheets — here is a selection of five articles to read. And after you are done, check out Hitchhiker’s guide to financial regulation. Read more »

Appistry today added another element to its cloud-computing application platform, announcing the April availability of CloudIQ Storage. With it, St. Louis-based Appistry joins the growing ranks of companies seizing on demand cloud storage solutions that maintain performance in the face of rapidly growing data volumes. Read more »

Big data is on the tip of everyone’s tongues these days as more information is contributed to electronic records and more sources provide that information. We now have a river of data that we’re going to harness and use to make money and better decisions. Read more »

17374757677page 75 of 77

You're subscribed! If you like, you can update your settings