More big-data Stories

shutterstock_14136973

According to physicist Geoffrey West, the world’s cities have what one might call a growing problem. As they grow bigger, their problems grow worse, which means it takes an ever-faster pace of innovation to keep things in check. Big data techniques might provide the answer. Read more »

loading external resource

Lew Tucker Cisco, Serban Simu Aspera, Haseeb Budhani Infineta Systems
photo: Pinar Ozger

As datasets get fatter and cumbersome, it’s becoming harder to move them around. Even the fattest pipes look like cocktail straws when you’re talking about petabyte databases. It’s getting more and more difficult to move these massive troves of data to the applications that use them. Read more »

Subscriber Content

gigaompromasterimagecloud

Discussions about the cloud now involve more than just the IT department. New developments in hardware architectures, more-energy-efficient data centers, regulatory concerns and simplifying analytics are all discussions currently circling through the industry. Here’s what to consider when thinking about your business in the cloud. Read more at GigaOM Pro »

dates_lg

When you mix a researcher, a massive online encyclopedia and a supercomputer, the result is a collection of insights and visualizations into what Wikipedia looks like mapped across time and space. It looks a lot like how our history books might look merged and graphed. Read more »

loading external resource

robocop_KellBailey

Now that businesses have collected and stored all of this data, how are they going to protect it? And most importantly, how are they going to use if safely and legitimately? ISF’s Steve Durbin outlines the five key issues surrounding big data and information security. Read more »

shutterstock_60414424

Now six years old, the Apache Hadoop platform for storing and processing huge amounts of data, perhaps the catalyst of the current big data movement, appears ready for its closeup. According to the companies leading the Hadoop charge, they’re already beating away customers with a stick. Read more »

avatarnode

It’s no secret that Facebook stores a lot of data in Hadoop, but how it keeps that data available whenever it needs it isn’t necessarily common knowledge. Today at the Hadoop Summit Facebook Engineer Andrew Ryan highlighted that solution, which Facebook calls AvatarNode. Read more »

shutterstock_102937379

Amazon Web Services already has a winner with its Elastic MapReduce Hadoop service, and now it’s turning up the heat by adding MapR’s Hadoop distribution as an option. Users can take advantage of MapR’s performance features while also having integration with AWS’s suite of cloud services. Read more »

serengeti

VMware is launching a new open source project, called “Serengeti,” that aims to let the Hadoop data-processing platform run on the virtualization leader’s vSphere hypervisor. VMware apparently smells a lucrative opportunity in Hadoop and isn’t about to miss out on getting a piece of the pie. Read more »

family tree

Online genealogy service Ancestry.com is trying to become like the Amazon or Netflix of family trees. Much like those companies use customer data to recommend products or movies customers might like, Ancestry.com is using machine learning to make learning about ancestors a lot less work. Read more »

infographics_V7

According to a new survey by the Economist Intelligence Unit (commissioned by IT consulting giant Capgemini), corporate executives are starting to figure out that big data matters and how to leverage it, even if they haven’t fully come around on the concept. Read more »

shutterstock_60822643

Amazon S3 now hosts more than a trillion objects for its cloud computing customers, proving once again that AWS is the king of the cloud. That’s 142 objects for every person on Planet Earth or 3.3 objects for every star in the galaxy. Read more »

shutterstock_81391894

One year after launching into the Hadoop market with much anticipation, Yahoo spinoff Hortonworks finally has a product available. The company announced version 1.0 of its flagship Hortonworks Data Platform on Tuesday, as well as a High Availability version designed with new partner VMware. Read more »

risk

Rice University researchers have built a web-based calculator that predicts the risks associated with hurricanes for an address. The tool uses historial and meteorological data to generate a risk profile for residents of Houston. This is what big data tools should do — offer users actionable intelligence. Read more »

download

Online bra retailer, True&Co, which launched last week to help women find the perfect bra, is having trouble delivering its products. As the Internet crosses over into the real world, not only websites must prepare for a launch, but the entire supply chain. Read more »

data brain

In a move to expand its utility beyond simply finding better answers to known statistical problems, hot data-science startup Kaggle is now letting its stable of expert data scientists compete to tell companies how they can improve their businesses using machine learning. Read more »

shutterstock_72069451

If you just pay attention to largest Hadoop users, you might think the platform is just a way of powering search engines or analyzing customer behavior for ad-serving. Of course that’s not the case, but finding those broader use cases can still be kind of difficult. Read more »

6259499293_b577b94cfd_z (1)

All right all you big data nerds — it’s time to suit up for the NIST’s Big Data workshop slated for next week. The event will focus on what state-of-the-art core technologies will drive big data and how to ensure accuracy of big data processes. Read more »

lock and key_hellogreenstar

Privacy was often an afterthought for small companies, but that’s changing in the era of big data. As TRUSTe’s CEO, Chris Babel has seen the impact privacy can have on a startup — for better or worse. Here, he offers tips on how to avoid the pitfalls. Read more »

Datahero_Graphic

A startup called Datahero launched on Thursday with a new cloud service that makes visualizing data as simple as a few mouse clicks, and $1 million in seed funding. The company’s ultimate goal is to make big data something you or I could do. Read more »

45709987_0d45810d94_z

The greater Boston area’s bid to be the go-to big data hub will get a boost as Intel and MIT will announce the Intel Science and Technology Center for Big Data as well as bigdata@CSAIL, a research group for MIT academics and industry researchers. Read more »

Junar CEO Diego May

As more “open data” comes online, finding the right data, managing its access and workflow, and fostering collaboration, is the problem startup Junar wants to attack with its new Open Data Platform. Customers can try it out for free, starting now. Read more »

martin o'leary

When a data scientist crunched enough numbers to predict that Sweden would win this weekend’s Eurovision Song Contest, he felt fairly confident. But he didn’t expect that the biggest noise would be the inaccurate prediction that Malta would do well — something he’s now apologized for. Read more »

piggybank

If you’ve ever wondered what big data means at an individual level, this realization about sums it up: “I could either keep dying my hair or retire a year earlier.” It’s those types of realizations Intuit hopes its heavy big data use will help uncover. Read more »

Capitol Hill Question Mark (Washington, DC)

NoSQL databases like MongoDB, Cassandra or CouchDB are a key foundation for web startups. But those companies might be better served using an old-fashioned relational database when it comes to their bread-and-butter transactions, according to Thrillist CTO Mark O’Neill. Read more »

shutterstock_54633895

According to some researchers, web companies such as Google, Amazon and Facebook are doing the research world a disservice because they won’t make their datasets available for peer review. These researchers have a point, but privacy concerns might always trump openness where openness matters at all. Read more »

1789101122page 9 of 22