Architecture of Data Lake

Authors(2) :-Ajit Singh, Sultan Ahmad

Data can be traced from various consumer sources. Managing data is one of the most serious challenges faced by organizations today. Organizations are adopting the data lake models because lakes provide raw data that users can use for data experimentation and advanced analytics. A data lake could be a merging point of new and historic data, thereby drawing correlations across all data using advanced analytics. A data lake can support the self-service data practices. This can tap undiscovered business value from various new as well as existing data sources. My paper will present the overview of data lake, benefits and it’s architecture along with the opportunities laid down by data lake and advanced analytics, as well as, the challenges in integrating, mining and analyzing the data collected from these sources. It goes over the important characteristics of the data lake architecture and Data and Analytics as a Service (DAaaS) model.

Authors and Affiliations

Ajit Singh
Assistant Professor, Department of MCA, Patna Women's College, Patna, Bihar, India
Sultan Ahmad
Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, P. O. Box. 151, Alkharj 11942, Saudi Arabia

Data Lake, Overview, Benefits, Architecture, Underlying Models, Layers of Architecture

  1. https://tdwi.org/articles/2017/03/29/executive-summary-data-lakes.aspx
  2. Data Lake Development with Big Data by Beulah Salome Purra, Pradeep Pasupuleti
  3. http://www.datasciencecentral.com/profiles/blogs/9-key-benefits-of-data-lake
  4. https://www.blue-granite.com/blog/bid/402596/top-five-differences-between-data-lakes-and-data-warehouses

Publication Details

Published in : Volume 5 | Issue 2 | March-April 2019
Date of Publication : 2019-04-30
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 411-414
Manuscript Number : CSEIT1952121
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

Ajit Singh, Sultan Ahmad, "Architecture of Data Lake", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 5, Issue 2, pp.411-414, March-April-2019. Available at doi : https://doi.org/10.32628/CSEIT1952121
Journal URL : http://ijsrcseit.com/CSEIT1952121

Article Preview