Information Retrieval and Sentimental Analysis with Databricks

Authors

  • Saifuzzafar Jaweed Ahmed  Computer Engineering, Dhole Patil College of Engineering Pune, Pune, Maharashtra, India
  • Prof. Sayali Shivarkar  Computer Engineering, Dhole Patil College of Engineering Pune, Pune, Maharashtra, India

DOI:

https://doi.org/10.32628/CSEIT2172101

Keywords:

Big Data, Information Retrieval, Sentimental analysis, Databricks

Abstract

With the rapid development of cloud computing, the information increases rapidly. Cheap cloud storage and computing power accelerate the development of massive data, and make the large data information collection and knowledge retrieval become necessary. The percentage of unstructured big data is more than 50%, so it is stored in the form of a file for the most part. Big data is split into many blocks that are stored within the server with some corresponding metadata of storage on the master server. How to collect the big data and keywords and retrieve the information are discussed in this paper. The proposed methodology is the information retrieval and sentimental analysis mechanism to improve text information retrieval and opinion mining using databricks. Information retrieval and analysis become more popular research fields within the world. Big data is a collection of heterogeneous structured, unstructured and semi-unstructured data. The basic aim of this paper is to present a broad picture of massive Data and to point out how information are often retrieved using evolutionary computation techniques i.e. databricks that help in the information retrieval process in a better way compared to traditional retrieval techniques. This paper also covered the setup and configuration of databricks.

References

  1. Chen Jie, Chen Dongjie  “Research On Big Data Information Retrieval Based on Hadoop Architecture”, 2014 IEEE Workshop on Electronics, Computer and Applications .
  2. Doaa Marouk, Sherinr rady, Nagwa Badr,M.E Khalifa,” A survey on information retrieval system modeling using term dependencies and term weighting”, The 8th IEEE International Conference on Intelligent Computing and Information Systems (ICICIS2017).
  3. Wang Xiao-shu,Xie Yao,Luo Huan, “Cloud Computing Oriented Retrieval Technology based on Big Data” , 2015 Seventh International Conference on Measuring Technology and Mechatronics Automation
  4. Peiyun Zhang and Rongjian Xie , “Ontology-based Unstructured Information Organization and Retrieval”, World Congress on Software Engineering, DOI 10.1109/WCSE.2009.305978-0-7695-3570-8/09 $25.00 © 2009 IEEE
  5. Robert Ilijason, “Beginning  Apache Spark Using Azure Databricks”,  ISBN-13 : 978-1484257807
  6. Christopher D. Manning Stanford University, Prabhakar Raghavan Yahoo! Research, Hinrich Schütze University of Stuttgart, “Introduction To Information Retrieval”,  ISBN: 052186571
  7. Vipin Deep Kaur, “Sentimental Analysis of Book Reviews using Unsupervised Semantic Orientation and Supervised Machine Learning Approaches”, Second International Conference on Green Computing and Internet of Things (ICGCIoT)
  8. Israr Haneef, Ehsan Ullah Munir, Ghazia Qaiser, Hafiz Gulfam Ahmad Umar, “Big Data Retrieval: Taxonomy, Techniques and Feature Analysis”,IJCSNS International Journal of Computer Science and Network Security, VOL.18 No.11, November 2018.

Downloads

Published

2021-04-30

Issue

Section

Research Articles

How to Cite

[1]
Saifuzzafar Jaweed Ahmed, Prof. Sayali Shivarkar, " Information Retrieval and Sentimental Analysis with Databricks" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 7, Issue 2, pp.459-467, March-April-2021. Available at doi : https://doi.org/10.32628/CSEIT2172101