H2Hadoop : Metadata Centric BigData Analytics on Related Jobs Data Using Hadoop Pseudo Distributed Environment

Authors

  • K. Sridevi  PG Scholar (M.Tech), Department of Information Technology, Sagi Ramakrishnam Raju Engineering College, Bhimavaram, Andhra Pradesh, India
  • Dr. I Hema Latha  Associative Professor, Department of Information Technology, Sagi Ramakrishnam Raju Engineering College, Bhimavaram, Andhra Pradesh, India

Keywords:

Big Data, CJB Table, Hadoop, Hadoop Performance, Map Reduce,Sequential Data

Abstract

Hadoop contains a few impediments that could be created to have a higher execution in executing occupations. These restrictions are generally a result of information territory in the bunch, occupations and undertakings planning, CPU execution time, or asset designations in Hadoop. Information region and productive asset portion remains a test in cloud computing MapReduce platform. We propose an improved Hadoop design that lessens the calculation cost related with BigData investigation. In the meantime, the proposed engineering tends to the issue of asset distribution in local Hadoop. Improved Hadoop engineering influences on NameNode's capacity to relegate occupations to the TaskTrakers (DataNodes) inside the group. By adding controlling highlights to the NameNode, it can shrewdly immediate and dole out errands to the DataNodes that contain the required information. Proposed arrangement concentrate on removing highlights and building a metadata table that conveys data about the presence and the area of the information obstructs in the bunch. This empowers NameNode to guide the employments to particular DataNodes without experiencing the entire informational collections in the cluster.Comparing with local Hadoop, proposed Hadoop reduced CPU time, number of read operations, input data size, and another different factors.

References

  1. Jiong Xie,Shu Yin,Xiaojun Ruan,Zhiyang Ding, Yun Tian,James Majors, Adam Manzanares,and Xiao QinImproving MapReduce Performance through Data Placement in Heterogeneous Hadoop Clusters 2010 IEEE.
  2. Joe B.B uck Noah Watkins Jeff LeFevre Kleoni IoannidouSciHadoop:Array-based Query Processing in Hadoop SC11 November 1218,Seattle,WA,USA.
  3. Xiao Yu and Bo Hong Bi-Hadoop:Extending Hadoop To Improve Support For Binary-Input Applications 2013 IEEE.
  4. Balaji Palanisamy, Aameek Singh,Ling Liu Purlieus:Locality-aware Resource Allocation for MapReduce in a Cloud SC 11, November 12-18, 2011, Seattle, Washington,USA
  5. Rong Gua,Xiaoliang Yanga,Jinshuang Yana SHadoop:Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters.
  6. Nishanth S, Radhikaa B,Ragavendar T J,Chitra Babu, and Prabavathy BCoRadoop++:A Load Balanced Data Colocation in Radoop Distributed File System 2013 Fifth International Conference on Advanced Computing (ICoAC).
  7. Apache hadoop 2.0, http://hadoop.apache.org/docs/r2.0.0-alpha/.
  8. Hamoud Alshammari,Jeongkyu Lee and Hassan Bajwa H2Hadoop: Improving Hadoop Performance using the Metadata of Related Jobs IEEE TRANSACTIONS ON Cloud Computing, manuscript ID TCC-2015-11- 0399.
  9. Lohr,S.,The age of big data.New York Times,2012.
  10. Changqing,J., et al.Big Data Processing in Cloud Computing Environments.In Pervasive Systems,Algorithms and Networks(ISPAN), 2012 12th International Symposium on.2012.
  11. Chen,M., S.Mao,and Y.Liu,Big Data:A Survey.Mobile Networks and Applications, 2014.19(2):p.171-209.
  12. Mehul Nalin VoraHadoop-HBase for Large-Scale Data
  13. Xuhui Liu, Jizhong Han,Yunqin Zhong, Chengde Han Implementing WebGIS on Hadoop:A Case Study of Improving Small File I/O Performance on HDFS 2009 IEEE.
  14. Hamoud Alshammari,Hassan Bajwa and Jeongkyu Lee Hadoop Based Enhanced Cloud Architecture For Bioinformatic Algorithms.
  15. Marx,V.,Biology:The big challenges of big data.Nature,2013.498(7453):p.255-260.

Downloads

Published

2017-12-31

Issue

Section

Research Articles

How to Cite

[1]
K. Sridevi, Dr. I Hema Latha, " H2Hadoop : Metadata Centric BigData Analytics on Related Jobs Data Using Hadoop Pseudo Distributed Environment, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 6, pp.834-841, November-December-2017.