More cassandra Stories
loading external resource

achilles heel

Hadoop is on its way to becomig the de facto platform for the next-generation of data-based applications, but it’s not without some flaws. Ironically, one of Hadoop’s biggest shortcomings right now is also one of its biggest strengths going forward — the Hadoop Distributed File System. Read more »

clocks

It was a rough weekend for the internet. While Friday’s problems with Amazon Web Services and other sites could be chalked up to some wicked thunderstorms, several sites went down Saturday for periods of time thanks to problems with the “leap second.” Read more »

Subscriber Content

gigaompromasterimagecloud

Discussions about the cloud now involve more than just the IT department. New developments in hardware architectures, more-energy-efficient data centers, regulatory concerns and simplifying analytics are all discussions currently circling through the industry. Here’s what to consider when thinking about your business in the cloud. Read more at GigaOM Pro »

Capitol Hill Question Mark (Washington, DC)

NoSQL databases like MongoDB, Cassandra or CouchDB are a key foundation for web startups. But those companies might be better served using an old-fashioned relational database when it comes to their bread-and-butter transactions, according to Thrillist CTO Mark O’Neill. Read more »

Subscriber Content

Ask a VC about big data and she will probably tell you about visualization of the user interface. We’re talking about intuitive UIs that let users visually work with data using charts and tools, not algorithms. It’s hard to do right, but the payoff could be huge. Read more at GigaOM Pro »

Subscriber Content

datacenter

Big data now touches everything from enterprises to smart-meter startups, while Hadoop is fast becoming the leading tool to analyze that data, and debates around privacy abound. GigaOM Pro analysts offer insights on what to consider when it comes to big data decisions for your business. Read more at GigaOM Pro »

loading external resource
Subscriber Content

Everyone is eager to name the new big data stack — a LAMP stack equivalent for all your big data needs. But with literally dozens of different open-source projects all taking a shot at solving a piece of the stack, settling on a standard architecture is ... Read more at GigaOM Pro »

Subscriber Content

audience_blue_marfis75

The emergence of the big data phenomenon is fundamentally changing everything from the way companies operate to the way people interact to how the world deals with outbreaks of infectious diseases. Here we highlight 10 case studies illustrating how big data is changing the world. Read more at GigaOM Pro »

now_open_to_the_public2_original

Twitter has been on a tear lately when it comes to open sourcing big-data tools. The latest two are Cassie, a client for managing Cassandra clusters, and Scalding, a MapReduce framework for simplifying the creation of Hadoop jobs. Big data won’t be black magic forever. Read more »

couchScreen Shot 2012-02-07 at 10.37.31 PM

Database professionals planning to take the NoSQL leap this year said the restrictive schemas in the RDBMS world drove their move. High latency, high cost and inability to scale out were also cited as reasons to move beyond SQL databases. Read more »

dark clouds

Executives at NoSQL startups are keeping a brave face in response to Amazon Web Services’ new DynamoDB offering. They cite the new product as a validation, while generally dismissing the competitive ramifications of having Amazon now playing in the same pool. But is that confidence justified? Read more »

Subscriber Content

motherboard

When it comes to the promise of data as the currency of the web, the current state of affairs has privacy advocates and many consumers up in arms. But it doesn’t have to be the one-sided affair it is today, in which companies have all the data and all the rights, and we shouldn’t have to be afraid of who’s doing what with our information. With laws, products, practices and education, data can become a far more valuable currency than cash ever was. Keeping that in mind, this research note examines five issues that must be addressed by policy makers and entrepreneurs so that they can deliver on our data-driven digital future. Companies mentioned in this report include Twitter, Facebook and Foursquare. For a full list of companies, and to read the full report, sign up for a free trial. Read more at GigaOM Pro »

hadoop logo

The recent excitement around Hadoop has culminated in five new Hadoop products today from EMC, NetApp, Mellanox, SnapLogic and DataStax. What’s interesting now is that we’re seeing large technology vendors with hardware expertise pushing gear optimized for Hadoop. Read more »

Subscriber Content

datacenter

Business and IT leaders now face significant opportunities and challenges with big data — that is data sets that are so large they are difficult to store, manage and analyze. This report explores the rapidly evolving big data business and technology ecosystem. It examines big data in the context of several different industries: financial services, health care, sports, travel and media. We explore the different big data technologies — from Hadoop and NoSQL derivatives to cloud-based collaboration tools — and their various benefits for enterprises. And we examine some of the existing challenges big data poses, and what enterprise IT leaders can do to overcome them. Companies mentioned in this report include Amazon Web Services, Google, Teradata, IBM and Cloudera. For a full list of companies, and to read the full report, sign up for a free trial. Read more at GigaOM Pro »

cassandrathumb

Openwave’s next generation platform must support geographic redundancy, massive scalability and high availability. It has to distribute databases redundantly across multiple data centers and handle large customer datasets – varying from hundreds of terabytes to petabytes, and supporting thousands of transactions per second from each customer. Read more »

Subscriber Content

gigaompromasterimagecloud

Some might call this past quarter in the infrastructure space transformative. The rise of ARM-based processing suggests the days of x86 dominance might be coming to an end, while the Amazon Web Services-WikiLeaks controversy cast new light on the legal aspects of cloud computing. Big data got bigger, meanwhile, as the Hadoop ecosystem expanded, and amid all these cutting-edge technologies, two archaic topics — Novell and Java — proved they aren’t going anywhere soon. Companies mentioned in this report include Intel, AMD, Amazon Web Services, IBM, Yahoo, Appistry, VMware, Joyent and Microsoft. For a full list of companies, and to read the full report, sign up for a free trial. Read more at GigaOM Pro »

question mark

Today’s links pose some good questions about both cloud computing and NoSQL. For cloud computing, the question is about what’s the right blend of old-school and new-school, and for NoSQL it’s whether the next year will bring consolidation, proliferation or both. Read more »

The U.S. government will be using NoSQL database technology Cassandra for a 400-node cluster for intelligence mining. In addition, Riptano, the Austin, Texas-based Cassandra-focussed startup says now the open source NoSQL software is ready to run on Amazon’s EC2 service. Read more »

Cassandra, the NoSQL software is being blamed for scaling problems being faced by Digg, which led to the yet-unconfirmed departure of Digg VP of Engineering John Quinn, a champion of Cassandra. Still, we hear the social news site isn’t giving up on the software – yet! Read more »

diggbroken

Cassandra, the NoSQL software is being blamed for scaling problems being faced by Digg, which led to the yet-unconfirmed departure of Digg VP of Engineering John Quinn, a champion of Cassandra. Still, we hear the social news site isn’t giving up on the software – yet! Read more »

Subscriber Content

infrastructure

The second quarter of 2010 belonged to the little guys and the new guys. Almost across the board, from processors to virtualization to cloud services, relatively small vendors and startups had the market cornered on innovation and mindshare. And where there’s tinder in the forms of customer demand, products, funding and a greater societal movement toward environmentalism, something is bound to catch fire. Read more at GigaOM Pro »

Riptano, an almost four-month-old startup building a business around the open-source Cassandra key value store, is so far seeing a lot of demand from enterprises eager to adopt the code. I spoke with Matt Pfeil, co-founder and CEO of Riptano, to learn more. Read more »

Twitter has scaled back its plans to store billions of tweets using Cassandra, but the interest in this news and NoSQL data stores in general goes beyond one company’s decision. It touches on the changing nature of the web and the software that underlies it. Read more »

Twitter today open-sourced the code that it used to build its database of users and manage their relationships to one another, called FlockDB. The move comes shortly after Twitter released its Gizzard framework, which it uses to send thousands of queries a second to FlockDB. Read more »

In some ways, the fact that Hadoop is mature enough to inspire commercial products — Cloudera and Karmasphere, e.g. — means it’s yesterday’s news. Which open-source, big-data-inspired product will be the next to launch a wave of startups and drive tens of millions in VC spending? Read more »

Twitter last night offered up the code for Gizzard, an open-source framework for accessing distributed, scalable data stores quickly. It could become an important component of building out a web-based business, much like Facebook’s Cassandra project has for webscale startups and even big companies. Read more »

When it comes to deploying databases web scale, many large sites opt to “go cheap, go custom or go home.” But might the resources spent developing open-source projects or building tools from scratch not become extraneous if companies could buy solutions that would work just fine? Read more »

Hadoop, Cassandra, HBase, Hypertable, Open Neptune… these are some open source projects that are being pursued by web technologists in order to deal with explosion of digital data in a post-terabyte world. The traditional way to deal with unstructured data isn’t working. What we need is a structured means of finding, accessing, and retrieving files and objects. Read more »