Skip to main content

Difference Between NoSql Cassandra and Apache Hadoop

What is Cassandra?

Cassandra is the NoSql Database and it handles the more amount of data between multiple servers. It serves data from database to online transactional applications and business intelligence because cassandra is the open source database. Cassandra created by facebook and designed for peer to peer nodes. It partitions the data across hadoop cluster and counts the copy of data from the database.

What is Hadoop?

Hadoop is an open source framework which is used to store large amount of data sets.Hadoop is provided for data storage,data access,data processing and security operations. Many organizations are used hadoop for storage purpose because Hadoop storing large amount of data quickly.


Difference Between NoSql Cassandra and Apache Hadoop:




S.NoNoSql CassandraApache Hadoop
1Cassandra is the no NoSql database and mainly used for architecture and handle more amount of data between multiple servers.Hadoop is the open source framework and mainly used for stores the large amount of data. Works of the hadoop based on programming.
2It accepts the only structured data only.Hadoop accepts structured ,unstructured and semi structured data.
3Architecture of the cassandra contains peer to peer nodes and all nodes are same in cassandra.Architecture of hadoop contains master and slave nodes.In hadoop name node works master and data node works worker node.
4Cassandra works on backend of the online systems.Hadoop works on web,mobile and IOT applications.
5Cassandra used for online transactions.Hadoop used to analysis the data from the user input and database.
6In cassandra read and write data at many times.In hadoop write programming once but read programs at many time.
7It is the NoSql database means query language so command is used to accessing the data in cassandra.It is a programming language so mapreduce programming used to accessing the data in hadoop.
8Cassandra stores the data at array format and stores the data at indexed based.Storage format of hadoop is file systems means large data are spilted into small blocks.
9All nodes are same in cassandra so easily handles the data from database.When master node are down in hadoop cannot access any data.
10Uses Gossip protocol for communication between nodes.Uses TCP and UDP for communication between nodes.
11Indexing the data very easy in cassandra because data stores at index format.Indexing is very difficult in hadoop.
12In Cassandra data are not directly written in the disk.Data first stored in table format if table memory full data are stores in the disk.In Hadoop data are directly written in nodes.

Comments

Popular posts from this blog