More data Stories

The open-source, data-processing tool Hadoop is already popular for a variety of use cases that can benefit from clusters of machines churning through unstructured data — such as search engines and social-media analysis — and now it’s turning its attention to security data. Read more »

Appistry, a St. Louis–based software company, has closed a $12 million Series D round for its family of distributed computing products. The company also appears to have changed its corporate messaging — from that of a cloud-computing vendor to that of a big-data vendor. Read more »

Nimbus Data Systems today rolled out its second-generation platform and announced online auction giant eBay as a major customer. In fact, eBay has deployed a non-trivial 100TB of Nimbus gear. Have we finally reached the inflection point for primary flash storage? Read more »

All iOS 5 and Mac OS X Lion users can get 5 GB of storage in iCloud for free, but those who need more storage can pay $20 per year for 15 GB, $40 per year for 25 GB, or $100 per year for 55 GB. Read more »

Already incredibly useful for helping us get directions, find the nearest grocery store and find out our state capital, Google Maps is now becoming the hot way to display enterprise or organizational data that’s tagged with location data. The timing of this trend isn’t surprising. Read more »

Subscriber Content

Web companies like Google and Facebook gain business advantage by analyzing large volumes of rapidly changing data about their users, but they are far from alone. A recent infographic from Get Satisfaction charts the volume of data stored in 17 key industry sectors, illustrating that most ... Read more at GigaOM Pro »

Big data has the potential to cut operating costs by nearly 50% across all sectors of manufacturing. Get Satisfaction makes several interesting claims about opportunities for big data in an infographic released this month. Market segments such as manufacturing are generating far more data (966 petabytes […] Read more »

For anyone who didn’t know, Facebook is a huge Hadoop user, and it does some very cool things to stretch the open source big data platform to meet Facebook’s unique needs. Today, it detailed how it migrated its 30-petabyte cluster from one data center to another. Read more »

Concurrent, the company providing the Cascading data workflow API, has raised a $900,000 seed round to capitalize on the newfound excitement around Hadoop. Cascading is an open-source API for creating and running data workflows atop Hadoop clusters. Read more »

OpenFlow may be one of the hotter buzzwords these days, but getting past the exuberance and down to brass tacks can be difficult because the technology can be applied many places. It also sprouts up in new contexts as the ecosystem around the technology expands. Read more »

All the speculation about how Yahoo’s Hadoop spinoff company, Hortonworks, will affect Cloudera and other companies providing Hadoop-based products might have been overblown. The company is still figuring out its strategy around offering a Hadoop distribution, which could be good news for competitors such as Cloudera. Read more »

The FinTech Innovation Lab, an accelerator program for financial tech startups, graduated its first class on Friday. This first batch of companies is bringing some impressive ideas to bear on data, analytics and payments and showing there’s room for new approaches in the financial sector. Read more »

[OpenStack] looks not only like an open-source alternative to Amazon Web Services and VMware vCloud in the public Infrastructure as a Service space, but also a democratizing force in the private-cloud software space. As my colleague Derrick Harris suggests, the open-source cloud-computing project OpenStack has come a […] Read more »

Monster.com is getting into the cloud-computing mix with a new “semantic search and analytics platform” service called SeeMore. Merging two hot capabilities — cloud-based delivery and analytics — makes a lot of sense for Monster, which no doubt supplies many companies with a lot of data. Read more »

ZestCash, a next-generation loan service for the underbanked led by former Google CIO Douglas Merrill, has raised $19 million to expand its data-driven approach to offering short-term loans. The company uses online data to help determine the creditworthiness of customers. Read more »

What’s Watched uses data from social media and mobile applications to provide media companies with a view into what shows are being watched and who’s watching them. That data is then used to target specific groups of users to increase ratings and grow TV audiences. Read more »

Yesterday, Google announced a new feature that alerts web surfers when their PCs might be infected with malware, but it’s hardly the only company using big data to fight cybercrime. We’ve covered a handful of them over the past couple of years. Read more »

Alpine Data Labs, a predictive analytics startup that incubated within Greenplum (now part of EMC), is expanding its support beyond the Greenplum Database and into Oracle’s Exadata appliance and the open-source Postgres database. Alpine tries to distinguish itself by running entirely within companies’ analytic databases. Read more »

Something I thought about a lot while writing about OpenStack yesterday is how much it democratized access to cloud computing in just a year. But OpenStack is just one example of how information technology, overall, is undergoing a period of arguably unprecedented democratization. Read more »

Talk about big data: The California Independent System Operator Corporation has installed an 80-foot by 6.5-foot screen in its control room to display real-time power-grid data from thousands of endpoints. Its system is powered by Space-Time Insight, whose software melds real-time geospatial data with unique visualizations. Read more »

Thanks to the rating systems in place on such popular websites as Yelp, Amazon and eBay, many people are comfortable evaluating things in absolute terms: a two-star restaurant, a B movie and so on. But new MIT research says this approach is fundamentally flawed. Read more »

Facebook has shut down a service from Open-Xchange that allowed users to export the email addresses of their contacts, which makes the German company the latest to run afoul of the social network’s ongoing attempts to maintain control over the information of its users. Read more »

Underneath Twitter’s fun and trendy public image, the data streamed through the microblogging service is apparently worth some big bucks. DataSift, one of only two companies authorized to re-syndicate Twitter’s content using its “firehose,” on Monday announced a $6 million venture capital funding round. Read more »

Big processors or little processors, scale-up or scale-out, on-premises or in the cloud: the answers might not be as easy as one would think. Web-style, scale-out architectures, low-power server processors and cloud computing are getting more attention by the day, but they have their limits. Read more »

According to database pioneer Michael Stonebraker, Facebook is operating a huge, complex MySQL implementation equivalent to “a fate worse than death.” It’s actually a predicament all too common among web startups, for which the solution might be a class of databases referred to as NewSQL. Read more »

The Internet and social networks such as Twitter are where many people go to research — or just talk about — medical issues. Can researchers discover any useful public-health information by looking at all this crowdsourced data? A new study from Johns Hopkins University suggests that they can. Read more »

Amazon.com is making what appears to be a big investment in analytic database startup ParAccel. ParAccel today announced the close of a Series E round led by Amazon, bringing the company’s total funding to $73 million. The amount of this round is undisclosed. Read more »

If you think people are over-sharing on the Internet today, brace yourself, says Facebook CEO Mark Zuckerberg. As the sharing booms, so will the online data. And Facebook plans to build more of its own data centers to deal with the coming data boom. Read more »

The size of Hadoop deployments appears to have tripled since October, according to statistics that Cloudera is sharing. If accurate, they help prove assumptions that Hadoop usage grows quickly once organizations wrap their heads around how it is used. Read more »

Israeli startup Personyze is linking with one of the web’s most controversial data collection companies, Rapleaf, to provide new tools for website owners. Can its attempt to help ordinary website owners turn information into actions really solve the big data puzzle? Read more »

Twitter announced Tuesday it has acquired BackType, an analytics platform aimed at helping companies and brands gauge their social media impact. The possible rationale for the deal is BackType’s Storm real-time big data processing platform, which could help Twitter offer well-defined analytics. Read more »

Big data — as in managing and analyzing large volumes of information — has come a long way in the past couple of years. Among the greatest innovations might be the advent of real-time analytics, which allow the processing of information in real time to enable instantaneous decision-making. Read more »

When done right, cloud computing actually can be a source of significant competitive advantage. So says Zynga, at least, which highlighted its unique cloud infrastructure, as well as its advanced analytics efforts, as part of its core strengths in the S-1 statement it filed this morning. Read more »
