Just the ramblings of a LAMP DevOps

blog.robert.mcfrazier.com

Cassandra and Hadoop and Hive, Oh my!

Posted on March 7, 2013 by Robert McFrazier

Posted in Hadoop, Linux, NoSQL

At work we are installing a Cassandra cluster as an additional tool in our database toolbox. Because we are using the DataStax distribution of Cassandra, Hadoop and Hive come bundled with it. This gives us a nice platform for storing data and running Hadoop data mining jobs.

We will be using Cassandra in several different ways:

A database for information that does not need to be stored relationally.
A caching server for the output of data jobs run against databases that we do not want web traffic to query directly.
A data mining platform using Hadoop and Hive/Pig.

Being a .NET shop, we chose Fluent Cassandra as the Cassandra client library after a healthy Fluent Cassandra vs. Aquiles debate.

We are in the process of installing and configuring the cluster now, so I’ll post again after we have the cluster up in our development environment.



