Junaid Magdum / Pune Institute of Computer Technology
Rahul Barhate / Maharashtra Institute of Technology
The processing of streaming data has led to
major advancements in the fields of finance, security
analysis and social media analysis. For processing,
organizations must first store this data. Of the few data
storage options that satisfy the characteristics to reserve
streams of data, NoSQL databases provide a better
alternative because of their ability to stored schema-less
data which was not possible in RDBMS. Among these, two
databases in Elasticsearch and Cassandra have been
chosen to demonstrate their performance when tested
against a streaming application. The performance
comparison is carried out on the basis of Data
Manipulation Language operations on both NoSQL
datastores. Based on the experiment conducted, it is
concluded that higher scalability operations were
performed faster by Cassandra while Elasticsearch took
smaller execution time for applications that required
aggregations and modifications of data. The analysis done
thus helps organizations chose the appropriate NoSQL
Database for their own requirements.