Query Engine Design and Performance Analysis: A Review

Authors(1) :-Chemwotie Kipkurui Brian

Distributed real-time computing has been the domain of practical system engineering for many decades. The development of a discipline of real-time programming would allow the construction of programs with analysable and variable timing properties. Such a discipline will need to be built on a well-integrated framework in which different methods are used where appropriate to obtain timing properties to which a high-level assurance can be attached. Time is everything in this current system, everything fast will gain the most valuable achievements especially in the business field. Whoever tries to get the first opportunity to fulfil the market needs will gain most profit. This paper looks into the various query engine models and frameworks that tries to improve on both the design and performance.

Authors and Affiliations

Chemwotie Kipkurui Brian
Department of Computing, Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya

Distributed, Real-Time Computation, Query Engine

  1. A. Toshniwal et al., "Storm @ Twitter," pp. 147–156, 2014.
  2. L. Neumeyer and B. Robbins, "S4 : Distributed Stream Computing Platform," IEEE Int. Conf. Database Syst.
  3. S. Chernyak et al., "MillWheel : Fault-Tolerant Stream Processing at Internet Scale," Proc. VLDB Endow., vol. 6, no. 11, 2013.
  4. J. Samosir, M. Indrawan-santiago, and P. D. Haghighi, "An Evaluation of Data Stream Processing Systems for Data Driven Applications 2 Real-time Data Processing of Big Data," Procedia - Procedia Comput. Sci., vol. 80, pp. 439–449, 2016.
  5. M. Zaharia, T. Das, H. Li, T. Hunter, S. Shenker, and I. Stoica, "Discretized Streams : Fault-Tolerant Streaming Computation at Scale," Proc. Twenty-Fourth ACM Symp. Oper. Syst. Princ., no. 1, pp. 423–438, 2013.
  6. F. Yang, E. Tschetter, G. Merlino, X. Léauté, N. Ray, and J. Bieber, "Druid A Real-time Analytical Data Store," Proc. 2014 ACM SIGMOD Int. Conf. Manag. Data, pp. 157–168, 2014.
  7. D. Kossmann and K. Stocker, "Iterative Dynamic Programming : A New Class of Query Optimization Algorithms," vol. 1, no. 212, 1999.
  8. Y. E. Ioannidis, "RANDOMIZED ALGORITHMS FOR OPTIMIZING LARGE JOIN QUERIES + ALGORITHMS," ACM, pp. 312–321, 1990.
  9. A. Hameurlain and F. Morvan, "Evolution of Query Optimization Methods," Trans. on Large-Scale Data- & Knowl.-Cent. Syst., vol. 33, no. 0, pp. 211–242, 2009.
  10. K. S. Beyer, R. Gemulla, A. Balmin, E. J. Shekita, C. Kanne, and F. Ozcan, "Jaql : A Scripting Language for Large Scale Semistructured Data Analysis," Proc. VLDB Endow., vol. 4, no. 12, pp. 1272–1283, 2011.
  11. C. Olston, B. Reed, R. Kumar, and A. Tomkins, "Pig Latin : A Not-So-Foreign Language for Data Processing," ACM SIGMOD, 2008.
  12. A. Thusoo et al., "Hive - A Warehousing Solution Over a Map-Reduce Framework," ACM VLDB Endow., 2009.
  13. Y. Yu et al., "DryadLINQ : A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language," USENIX Symp. Oper. Syst. Des. Implement., pp. 1–14, 2009.
  14. D. Willis, D. J. Pearce, and J. Noble, "Efficient Object Querying for Java," Eur. Conf. Object-Oriented Program., vol. 4067, pp. 28–49, 2006.
  15. M. Zaharia et al., "Resilient Distributed Datasets : A Fault-Tolerant Abstraction for In-Memory Cluster Computing," Proc. 9th USENIX Conf. Networked Syst. Des. Implement., 2012.
  16. K. Shvachko, H. Kuang, S. Radia, and R. Chansler, "The Hadoop Distributed File System," 26th Symp. Mass Storage Syst. Technol., pp. 1–10, 2010.
  17. M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, "Spark : Cluster Computing with Working Sets," Proc. 2nd USENIX Conf. Hot Top. cloud Comput., 2010.
  18. R. S. Xin et al., "Shark : SQL and Rich Analytics at Scale," Proc. 2013 ACM SIGMOD Int. Conf. Manag. Data, pp. 13–24, 2013.
  19. R. Ranjan, "Streaming Big Data Processing in Datacenter Clouds," IEEE Cloud Comput., 2015.
  20. E. Feller, L. Rilling, and C. Morin, "Snooze : A Scalable and Autonomic Virtual Machine Management Framework for Private Clouds," 12th IEEE/ACM Int. Symp. Clust. Cloud Grid Comput., 2012.
  21. P. Hunt, M. Konar, F. P. Junqueira, and B. Reed, "ZooKeeper : Wait-free coordination for Internet-scale systems," USENIX Annu. Tech. Conf., vol. 8, 2010.

Publication Details

Published in : Volume 2 | Issue 2 | March-April 2017
Date of Publication : 2017-04-30
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 784-788
Manuscript Number : CSEIT1722239
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

Chemwotie Kipkurui Brian, "Query Engine Design and Performance Analysis: A Review", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 2, pp.784-788, March-April-2017. |          | BibTeX | RIS | CSV

Article Preview