More big-data Stories
Subscriber Content

story1

Big data is often talked about as a phenomenon that lets organizations create narratives from their volumes of data. That is an apt characterization when we are talking about connecting the dots among disparate and possibly disconnected data sets. However, when we are talking about anything ... Read more at GigaOM Pro »

hard_disk_head_on_platter

1010data says it now hosts more than 5 trillion records for its customers. If 1010data’s growth is a microcosm of the greater market, it’s no wonder there’s so much excitement around scalable data stores such as Hadoop, NoSQL databases and massively parallel analytic databases. Read more »

crowdsource

In the world of science, cloud computing provides an ideal platform for crowdsourcing scientific problems across the whole world of researchers, giving them access to data sets and the computing resources to analyze them. If big data is any indicator, scientific crowdsourcing should catch on. Read more »

loading external resource

inklingfeaturechef

Digital recipes and cookbooks need to emulate the world of digital music. By creating a standard recipe format similar to the MP3, we could overcome the artificial barriers between cooking Websites, apps and our bookshelves. Only then could we be build truly comprehensive digital cooking libraries. Read more »

delivery-framework

Mu Sigma has closed a $108 million investment round to expand its analytics-outsourcing business. Mu Sigma takes customers’ data and it turns it into business insights, meaning customers don’t have to built their own in-house big data expertise. It’s an already-profitable business that’s only getting bigger. Read more »

Subscriber Content

clouds

How does the cloud market for the next year look? There will be no silver linings or lemonade. This research note, part of GigaOM Pro’s outlook for 2012, examines startups’ chances to challenge traditional enterprise IT practices, cloud pricing models, the true state of federated clouds, and how IT jobs will change in the coming months. Expect plenty of defensive M&A, cloud lock-in, phantom data discovery, outages, availability headaches, pricing confusion and job displacement over the next 12 months. Read more at GigaOM Pro »

server farm

Google announced that it’s ending its Academic Cloud Computing Initiative, a joint program with IBM and the National Science Foundation that gave researchers access to a massive Hadoop cluster on which to run their data-intensive projects. The company says access to such resources is now common. Read more »

img-myhadoop-bigger4

Money has turned the Hadoop community, once united under the Apache banner and the cuddly stuffed-toy-elephant logo, into something resembling a frat house: Everyone’s under the same roof, but there’s plenty of machismo to go around. If it’s not good business; it is good theater. Read more »

Image

For those in a perpetual snit about personal data privacy, here’s bold proposal from Michael Driscoll: Donate your own medical data. And do it now — don’t wait till you’re dead. What better way to make big data truly relevant — and helpful — to real live people? Read more »

loading external resource

cake pops

Beyond Hadoop, there’s a lot more to think about when it comes to big data, ranging from where companies will actually find workers to how they’ll deal with an impending privacy-policy onslaught. The answers won’t be easy to come by, but they could be critical. Read more »

Subscriber Content

It is becoming abundantly apparent that, while there is still focus on install rates of physical smart meters, what will matter in the future is how well the data from these smart meters can be secured by utilities and how well utilities can use that data ... Read more at GigaOM Pro »

2978844608_a443a5e60e_z

Microsoft’s Windows Azure platform as a service (PaaS) now supports Node.js, the popular server-side JavaScript development framework. That could give Azure more traction beyond the Microsoft .NET faithful. Also new: a limited trial of Hadoop-based distribution for Azure. Read more »

shopping mall

IBM plans to acquire retail data analytics company DemandTec in an all-cash deal valued at $440 million, continuing Big Blue’s buys in big data businesses. DemandTec has two especially compelling elements for IBM: It adds to its big data expertise, and it’s vertically focused. Read more »

497364007_b28f03366a_z

LexisNexis is pressing MarkLogic’s technology into service for its just-launched Lexis Advance legal service. MarkLogic’s document storage, search and analytics technology replaces legacy home-built code as part of a platform modernization and big data push. Read more »

chorus line

Greenplum has announced its Unified Analytics Platform, a packaging of the Greenplum Database and Hadoop distribution along with its long-awaited Chorus software. Chorus is really what ties everything together, providing a platform to explore both types of data and to share interesting data sets and findings. Read more »

3745750194_a7e32a8505_z

Digital Reasoning, a somewhat shadowy specialist in big data analytics for the U.S. intelligence community, announced B Series funding and named industry vet John Brennan to its board. Funding came from CIA-backed In-Q-Tel, as well as some Silver Lake Sumeru partners and other unnamed investors. Read more »

rent

However it’s defined, the concept of big data ultimately boils down to money. But I think there’s a line we shouldn’t cross when it comes to using analytics to squeeze every last dollar out of an operation. It’s between things we want and things we need. Read more »

privacy

What exactly is Carrier IQ doing with your smartphone data? It claims that only its selling network performance metrics to operators. But relationships it has with media analytics firms and handset makers imply otherwise. With 150 million smartphones tracked Carrier IQ has big data goldmine. Read more »

appengine-2yrs

Silk, the browser for Amazon’s new Kindle Fire, utilizes Amazon’s cloud. But don’t think AWS is the Kindle Fire’s only cloud connection. In a post on Tuesday, Pulse’s Greg Bayer explained how his company’s news-reading app actually runs atop Google’s App Engine Platform-as-a-Service offering. Read more »

Subscriber Content

hospitalroom

By 2020 it is estimated that 20–50 billion devices will be connected to the Internet. Many of these devices will be collecting health data or will be connected to health and medical devices in the home, the hospital or the wider environment. The Internet of things (IoT), meanwhile, refers to the growth of sensors and things that connect to the Internet via RFID, Bluetooth, ZigBee and satellite. In health care, its growth is likely to open new disruptive business opportunities for services that add value to the data collected. This paper provides a preliminary overview of the landscape of opportunities and drivers in the current health and health care environments and highlights some of the challenges that remain. Companies mentioned in this report include IBM, Arrayent Health, Kaiser Permanente and Ford. For a full list of companies, and to read the full report, sign up for a free trial. Read more at GigaOM Pro »

Clouds-A3

HPCC Systems, the division of LexisNexis that’s pushing a big-data processing-and-delivery platform to compete with Hadoop, has tuned its software to run on Amazon’s cloud computing platform. Interested developers can now experiment with the open source software without having to wrangle physical servers. Read more »

289861766_8d34a5d1d9_z

So the big data backlash begins. The Hadoop framework does a lot, but some experts — including those who push non-Hadoop options — say it’s not enough for many specialized apps where a build-your-own Hadoop implementation costs too much to be a real contender. Read more »

6259499293_b577b94cfd_z

With its new Idol 10 software, Hewlett-Packard brings together key Vertica and Autonomy technologies acquired over the last year for an assault on knotty big data problems. HP says the software can collect and analyze unstructured data including video, audio and social net feeds. Read more »

No access

Today, I read a press release from Sourcefire touting its “big data” approach to security and the fact that its Immunet anti-malware-for-PCs product is now monitoring 2 million endpoints. I reached out to find out what’s under Immunet’s covers and, no surprise, found Hadoop. Read more »

mariah mac

It appears as if Apple users’ willingness to shell out a little more cash for a premium experience doesn’t stop at computers. Orbitz’s data-crunching has found that Mac users also spend about $20 more a night on hotels than do Windows users. Read more »

5151041232_676410f62d_z

DataSift, the British company that built its business filtering and sorting through reams of Twitter data in real time, has brought its act to the U.S., opening a San Francisco office. Businesses use DataSift to glean information about user impressions of their products and services. Read more »

Subscriber Content

motherboard

When it comes to the promise of data as the currency of the web, the current state of affairs has privacy advocates and many consumers up in arms. But it doesn’t have to be the one-sided affair it is today, in which companies have all the data and all the rights, and we shouldn’t have to be afraid of who’s doing what with our information. With laws, products, practices and education, data can become a far more valuable currency than cash ever was. Keeping that in mind, this research note examines five issues that must be addressed by policy makers and entrepreneurs so that they can deliver on our data-driven digital future. Companies mentioned in this report include Twitter, Facebook and Foursquare. For a full list of companies, and to read the full report, sign up for a free trial. Read more at GigaOM Pro »

Japan's K supercomputer is the fastest in the world.

In the past decade supercomputers were dressed-up versions of Intel’s x86 machines, but increasingly supercomputers are borrowing innovations (and silicon in the form of ARM-based chips or DSPs) from the mobile and big data realms to add speed without guzzling too much power. Read more »

braintorm

What happens when you place the equivalent of 1024 neurons in parallel on a chip? Well, you get a new form of computing for cloud computing and sensor networks as well as toys that can recognize cue cards, better artificial intelligence and pattern recognition. Read more »

brain

A new startup called CrowdControl is launching today, and it aims to bring order to the world of crowdsourcing by using artificial intelligence to judge workers’ accuracy. Think Amazon Mechanical Turk, only with a quality control mechanism in place to help ensure jobs get done right. Read more »

DiscGPS1

The Supreme Court of the United States heard oral arguments in a case that could decide how connected the concept of big data is to constitutional expectations of privacy. How much data is too much before allowable surveillance crosses the line into an invasion of privacy? Read more »

1121314151622page 14 of 22