Data Encryption Strategy with Privacy-Preserving for Big Data in Mobile Cloud using H2Hadoop

Authors(1) :-

Cloud Computing leverages Hadoop framework for process Big Data in parallel. Hadoop has bound limitations that could be exploited to execute the duty efficiently. These limitations square measure principally thanks to data section within the cluster jobs and tasks scheduling, and resource allocations in Hadoop. Economical resource allocation remains a challenge in Cloud Computing MapReduce platforms. We propose H2Hadoop that is an enhanced Hadoop design that reduces the computation value related to Big Data analysis. The projected design also addresses the difficulty of resource allocation in native Hadoop. H2Hadoop provides a better resolution for “text data”, like finding DNA sequence and the motif of a dna sequence. Also, H2Hadoop provides an efficient Data Mining approach for Cloud Computing environments. H2Hadoop architecture leverages on Name Node’s ability to assign jobs to the TaskTrakers (Data Nodes) inside the cluster. By adding control options to the Name Node, H2Hadoop will intelligently direct and assign tasks to the Data Nodes that contain the desired knowledge while not causing the duty to the full cluster. Comparing with native Hadoop, H2Hadoop reduces cpu time, range of read operations, and another Hadoop factors.

Authors and Affiliations

Asst.Professor, Department of Computer Applications SVCET, Chittoor, Andhra Pradesh, India

Big Data, Cloud Computing, Hadoop, H2Hadoop, Hadoop Performance, MapReduce, Text Data.

  1. Ming, M., G. Jing, and C. Jun-jie. Blast-Parallel: The parallelizing implementation of sequence alignment algorithms based on Hadoop platform. in Biomedical Engineering and Informatics (BMEI), 2013 6th International Conference on. 2013.
  2. Schatz, M.C., B. Langmead, and S.L. Salzberg, Cloud computing and the DNA data race. Nature biotechnology, 2010. 28(7): p. 691.
  3. Schadt, E.E., et al., Computational solutions to large-scale data management and analysis. Nature Reviews Genetics, 2010. 11(9): p. 647-657.
  4. Farrahi, K. and D. Gatica-Perez, A probabilistic approach to mining mobile phone data sequences. Personal Ubiquitous Comput., 2014. 18(1): p. 223-238.
  5. Marx, V., Biology: The big challenges of big data. Nature, 2013. 498(7453): p. 255-260.
  6. Lohr, S., The age of big data. New York Times, 2012. 11.
  7. Changqing, J., et al. Big Data Processing in Cloud Computing Environments. in Pervasive Systems, Algorithms and Networks (ISPAN), 2012 12th International Symposium on. 2012.
  8. Chen, M., S. Mao, and Y. Liu, Big Data: A Survey. Mobile Networks and Applications, 2014. 19(2): p. 171-209.
  9. Jagadish, H., et al., Big data and its technical challenges. Communications of the ACM, 2014. 57(7): p. 86-94. 10. White, T., Hadoop: The definitive guide. 2012: "O’Reilly Media, Inc.".
  10. Patel, A.B., M. Birla, and U. Nair. Addressing big data problem using Hadoop and Map Reduce. in Engineering (NUiCONE), 2012 Nirma University International Conference on. 2012.
  11. Hammoud, M. and M.F. Sakr. Locality-Aware Reduce Task Scheduling for MapReduce. in Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on. 2011. 12Dean, J. and S. Ghemawat, MapReduce: simplified data processing on large clusters. Communications of the ACM, 2008. 51(1): p. 107-113.
  12. Xiaowen Feng, Hai Jin, Ran Zheng, Zhiyuan Shao, Lei Zhu," Implementing Smith-Waterman Algorithm with Two-dimensional Cache on GPUs " Second International Conference on Cloud and Green Computing, 2012.
  13. Novan Zulkarnain and Muhammad Anshari, "Big Data: Concept, Applications, & Challenges", International Conference on Information Management and Technology (ICIMTech),2016.
  14. J. Ramsingh and V.Bhuvaneswari, "Data Analytic on Diabetic awareness with Hadoop Streaming using Map Reduce in Python", IEEE International Conference on Advances in Computer Applications (ICACA), 2016.
  15. Ming Meng, Jing Gao*, Jun-jie Chen,"Blast-Parallel: The parallelization implementation of sequence alignment algorithm based on Hadoop platform ", 6th International Conference on Biomedical Engineering and Informatics (BMEI 2013) , 2013.
  16. Saad Khan Zahid, Laiq Hasan , Asif Ali Khan, Salim Ullah, "A Novel Structire of Smith-Waterman Algorithm for Efficient Sequence alignment", ISBN: 978-1-4799-6376-8/15/$31.00 ©2015 IEEE.
  17. Miss. Anju Ramesh Ekre," Genome Sequence Alignment tools: a Review", 978-1-4673-9745-2 ©2016 IEEE.
  18. Rohith K. Menon, Goutham P. Bhat and Michael C. Schatz," Rapid Parallel Genome Indexing with MapReduce", http://www.genome10k.
  19. Merina Maharjan, "Genome Analysis with MapReduce", ttp://

Publication Details

Published in : Volume 3 | Issue 4 | March-April 2018
Date of Publication : 2018-04-30
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 1181-1186
Manuscript Number : CSEIT1833579
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

, "Data Encryption Strategy with Privacy-Preserving for Big Data in Mobile Cloud using H2Hadoop", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 4, pp.1181-1186, March-April-2018.
Journal URL :

Article Preview