A Review on Hadoop Eco System for Big Data

Authors

  • Anushree Raj, M.Sc. Big Data Analytics Department, St Agnes College (Autonomous), Mangalore, Karnataka, India
  • Rio D'Souza, Computer Science and Engineering Department, St Joseph Engineering College, Mangalore, Karnataka, India

DOI:

https://doi.org/10.32628/CSEIT195172

Keywords:

Big Data, Hadoop architecture, Hadoop eco system components, HDFS, MapReduce.

Abstract

In the information age, a huge amount of data is generated every moment from various sources. This volume is beyond the capability of traditional data management systems to manage and analyse within a specified time span. Such data is referred to as Big Data. Big Data poses numerous challenges across data operations such as capture, analysis, searching, sharing, and filtering. Hadoop has shown many enterprises a way to manage big data, and it is applied in a variety of industry use cases. To master Apache Hadoop, one needs to understand the Hadoop ecosystem and the Hadoop architecture. In this paper we give a brief overview of the Hadoop architecture and the Hadoop ecosystem.

Published

2019-02-28

Section

Research Articles

How to Cite

[1]
Anushree Raj, Rio D'Souza, "A Review on Hadoop Eco System for Big Data," International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN: 2456-3307, Volume 5, Issue 1, pp. 343-348, January-February 2019. Available at doi: https://doi.org/10.32628/CSEIT195172