Data Encryption Strategy with Privacy-Preserving for Big Data in Mobile Cloud using H2Hadoop

Authors

  •   Asst.Professor, Department of Computer Applications SVCET, Chittoor, Andhra Pradesh, India

Keywords:

Big Data, Cloud Computing, Hadoop, H2Hadoop, Hadoop Performance, MapReduce, Text Data.

Abstract

Cloud Computing leverages Hadoop framework for process Big Data in parallel. Hadoop has bound limitations that could be exploited to execute the duty efficiently. These limitations square measure principally thanks to data section within the cluster jobs and tasks scheduling, and resource allocations in Hadoop. Economical resource allocation remains a challenge in Cloud Computing MapReduce platforms. We propose H2Hadoop that is an enhanced Hadoop design that reduces the computation value related to Big Data analysis. The projected design also addresses the difficulty of resource allocation in native Hadoop. H2Hadoop provides a better resolution for “text data”, like finding DNA sequence and the motif of a dna sequence. Also, H2Hadoop provides an efficient Data Mining approach for Cloud Computing environments. H2Hadoop architecture leverages on Name Node’s ability to assign jobs to the TaskTrakers (Data Nodes) inside the cluster. By adding control options to the Name Node, H2Hadoop will intelligently direct and assign tasks to the Data Nodes that contain the desired knowledge while not causing the duty to the full cluster. Comparing with native Hadoop, H2Hadoop reduces cpu time, range of read operations, and another Hadoop factors.

References

  1. Ming, M., G. Jing, and C. Jun-jie. Blast-Parallel: The parallelizing implementation of sequence alignment algorithms based on Hadoop platform. in Biomedical Engineering and Informatics (BMEI), 2013 6th International Conference on. 2013.
  2. Schatz, M.C., B. Langmead, and S.L. Salzberg, Cloud computing and the DNA data race. Nature biotechnology, 2010. 28(7): p. 691.
  3. Schadt, E.E., et al., Computational solutions to large-scale data management and analysis. Nature Reviews Genetics, 2010. 11(9): p. 647-657.
  4. Farrahi, K. and D. Gatica-Perez, A probabilistic approach to mining mobile phone data sequences. Personal Ubiquitous Comput., 2014. 18(1): p. 223-238.
  5. Marx, V., Biology: The big challenges of big data. Nature, 2013. 498(7453): p. 255-260.
  6. Lohr, S., The age of big data. New York Times, 2012. 11.
  7. Changqing, J., et al. Big Data Processing in Cloud Computing Environments. in Pervasive Systems, Algorithms and Networks (ISPAN), 2012 12th International Symposium on. 2012.
  8. Chen, M., S. Mao, and Y. Liu, Big Data: A Survey. Mobile Networks and Applications, 2014. 19(2): p. 171-209.
  9. Jagadish, H., et al., Big data and its technical challenges. Communications of the ACM, 2014. 57(7): p. 86-94. 10. White, T., Hadoop: The definitive guide. 2012: "O’Reilly Media, Inc.".
  10. Patel, A.B., M. Birla, and U. Nair. Addressing big data problem using Hadoop and Map Reduce. in Engineering (NUiCONE), 2012 Nirma University International Conference on. 2012.
  11. Hammoud, M. and M.F. Sakr. Locality-Aware Reduce Task Scheduling for MapReduce. in Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on. 2011. 12Dean, J. and S. Ghemawat, MapReduce: simplified data processing on large clusters. Communications of the ACM, 2008. 51(1): p. 107-113.
  12. Xiaowen Feng, Hai Jin, Ran Zheng, Zhiyuan Shao, Lei Zhu," Implementing Smith-Waterman Algorithm with Two-dimensional Cache on GPUs " Second International Conference on Cloud and Green Computing, 2012.
  13. Novan Zulkarnain and Muhammad Anshari, "Big Data: Concept, Applications, & Challenges", International Conference on Information Management and Technology (ICIMTech),2016.
  14. J. Ramsingh and V.Bhuvaneswari, "Data Analytic on Diabetic awareness with Hadoop Streaming using Map Reduce in Python", IEEE International Conference on Advances in Computer Applications (ICACA), 2016.
  15. Ming Meng, Jing Gao*, Jun-jie Chen,"Blast-Parallel: The parallelization implementation of sequence alignment algorithm based on Hadoop platform ", 6th International Conference on Biomedical Engineering and Informatics (BMEI 2013) , 2013.
  16. Saad Khan Zahid, Laiq Hasan , Asif Ali Khan, Salim Ullah, "A Novel Structire of Smith-Waterman Algorithm for Efficient Sequence alignment", ISBN: 978-1-4799-6376-8/15/$31.00 ©2015 IEEE.
  17. Miss. Anju Ramesh Ekre," Genome Sequence Alignment tools: a Review", 978-1-4673-9745-2 ©2016 IEEE.
  18. Rohith K. Menon, Goutham P. Bhat and Michael C. Schatz," Rapid Parallel Genome Indexing with MapReduce", http://www.genome10k.
  19. Merina Maharjan, "Genome Analysis with MapReduce", ttp://hadoop.apache.org/

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
, " Data Encryption Strategy with Privacy-Preserving for Big Data in Mobile Cloud using H2Hadoop, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 4, pp.1181-1186, March-April-2018.