Friday, February 26, 2010

The Apache Cassandra Project - say goodbyte to Relational Database

What is Cassandra?

http://incubator.apache.org/cassandra/

The Apache Cassandra Project develops a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model.

  • Proven Cassandra is in use at Rackspace, Digg, Facebook, Twitter, Cisco, Mahalo, Ooyala, and more companies that have large, active data sets. The largest production cluster has over 100 TB of data in over 150 machines.


  • Rich Data Model Allows efficient use for many applications beyond simple key/value.

  • Fault Tolerant Data is automatically replicated to multiple nodes for fault-tolerance. Replication across multiple data centers is supported. Failed nodes can be replaced with no downtime.


  • Highly Availabile Writes and reads offer a tunable ConsistencyLevel, all the way from "writes never fail" to "block for all replicas to be readable," with the quorum level in the middle.


  • Elastic Read and write throughput both increase linearly as new machines are added, with no downtime or interruption to applications.

  • Decentralized Every node in the cluster is identical. There are no network bottlenecks. There are no single points of failure.



No comments:

Post a Comment