Study of Machine Learning Techniques using Apache Spark

Authors (4): Soumya Manjunath Hegde, Shilpa .M, Soujanya .C .S, Urvashi Grover

The challenges in big data analysis are growing because of the huge volume of data collected daily from sources such as social media, weather forecasting, and mobile devices. This survey paper looks at different aspects of using Apache Spark: the framework, its libraries, and the related Spark technologies. The Spark platform provides various machine learning algorithms, and these can also be implemented on virtualization platforms such as VMware vSphere. Further, Spark is used on different platforms to achieve high performance, reduce latency, and improve efficiency. The papers studied here compare Hadoop and Spark and find Spark to be the better platform, reporting it to be as much as a hundred times faster and more efficient.

Authors and Affiliations

Soumya Manjunath Hegde
Eighth semester, Department of ISE, Vidyavardhaka College of Engineering, Mysuru, Karnataka, India
Shilpa .M

Soujanya .C .S
Eighth semester, Department of ISE, Vidyavardhaka College of Engineering, Mysuru, Karnataka, India
Urvashi Grover

Keywords: weather forecast, virtualization, Hadoop, Spark, latency

Publication Details

Published in : Volume 4 | Issue 6 | May-June 2018
Date of Publication : 2018-05-08
This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 23-29
Manuscript Number : CSEIT184606
Technoscience Academy

ISSN : 2456-3307

Cite This Article :

Soumya Manjunath Hegde, Shilpa .M, Soujanya .C .S, Urvashi Grover, "Study of Machine Learning Techniques using Apache Spark", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 4, Issue 6, pp. 23-29, May-June 2018

