An Enormous Inspection of MapReduce Technology

Authors(1) :-J. Rajesh Khanna

Since, the last three or four years, the field of “big data” has appeared as the new frontier in the wide spectrum of IT-enabled innovations and favorable time allowed by the information revolution. Today, there is a raise necessity to analyses very huge datasets, that have been coined big data, and in need of uniqueness storage and processing infrastructures. MapReduce is a programming model the goal of processing big data in a parallel and distributed manner. In MapReduce, the client describes a map function that processes a key/value pair to procreate a set of intermediate value pairs & key, and a reduce function that merges all intermediate values be associated with the same intermediate key. In this paper, we aimed to demonstrate a close-up view about MapReduce. The MapReduce is a famous framework for data-intensive distributed computing of batch jobs. This is over-simplify fault tolerance, many implementations of MapReduce materialize the overall output of every map and reduce task before it can be consumed. Finally, we also discuss the comparison between RDBMS and MapReduce, and famous scheduling algorithms in this field.

Authors and Affiliations

J. Rajesh Khanna
Assistant Professor, Department of CSE, BVRIT, Telangana, India

Big Data, MapReduce, Scheduling, Processing Layer, Indexing, Data Layout.

  1. Kim, G.-H., Trimi, S., & Chung, J.-H. (2014). Big-data applications in the government sector. Communicationsof the ACM, 57(3), pp 78–85.
  2. Dr. Yusuf Perwej, "An Experiential Study of the Big Data," for published in the International Transaction of Electrical and Computer Engineers System (ITECES), USA, ISSN (Print): 2373-1273 ISSN (Online): 2373-1281, Vol. 4, No. 1, page 14-25, March 2017, DOI:10.12691/iteces-4-1-3.
  3. R. Murugesh, I. Meenatchi, "A Study Using PI on: Sorting Structured Big Data In Distributed Environment Using Apache Hadoop MapReduce", International Journal of Computer Sciences and Engineering, Vol.2, Issue.8, pp.35-38, 2014.
  4. "Apache Hadoop," Apache. Online]. Available: http://hadoop.apache.org/. Accessed: 18-Feb-2015].
  5. M. Khan, P. M. Ashton, M. Li, G. A. Taylor, I. Pisica, and J. Liu, "Parallel Detrended Fluctuation Analysis for Fast Event Detection on Massive PMU Data," Smart Grid, IEEE Trans., vol. 6, no. 1, pp. 360–368, Jan. 2015.
  6. K. Parimala1 G. Rajkumar, A. Ruba, S. Vijayalakshmi, "Challenges and Opportunities with Big Data", International Journal of Scientific Research in Computer Science and Engineering, Vol.5, Issue.5, pp.16-20, 2017.
  7. Lee, D., Kim, J.-S., & Maeng, S. "Large-scale incremental processing with MapReduce", Future Generation Computer Systems, 36, pp 66–79, (2014), doi:10.1016/j.future.2013.09.010.
  8. M. Khan, M. Li, P. Ashton, G. Taylor, and J. Liu, "Big data analytics on PMU measurements," in Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on, 2014, pp. pp 715–719.
  9. Qi, C., Cheng, L., & Zhen, X. (2014). Improving mapreduce performance using smart speculative execution strategy. IEEE Transactions on Computers, Vol. 63(4), pp 954–967. Doi:10.1109/TC.2013.15.
  10. J. Kwon, K. Park, D. Lee, S. Lee, PSR: Pre-computing Solutions in RDBMS for Fast Web services Composition Search, in: Proceedings of the 2nd International Conference on Web Services, Salt Lake City, Utah, USA, ICWS 2007, pp. 808-815.

Publication Details

Published in : Volume 2 | Issue 6 | November-December 2017
Date of Publication : 2017-12-31
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 491-499
Manuscript Number : CSEIT1726154
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

J. Rajesh Khanna, "An Enormous Inspection of MapReduce Technology", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 6, pp.491-499, November-December-2017. |          | BibTeX | RIS | CSV

Article Preview