Semantic similarity based clustering and modeling using Latent Dirichlet Allocation (LDA)

Authors(3) :-Anusha N, Soumya Bilagi, Raghava M S

Privacy has become a substantial issue once the applications of big data are dramatically growing in cloud computing. In recent years, we have a tendency to focus on privacy and propose a unique a novel approach that is termed Dynamic data encryption Strategy (D2ES). Our planned approach aims to by selection encipher knowledge and use privacy classification ways under temporal order constraints. This approach is intended to maximize the privacy protection scope by employing a selective coding strategy within the specified execution time necessities. During this paper, is intended victimization semantic similarity based mostly clustering and topic modeling victimization Latent Dirichlet Allocation (LDA) for summarizing the big text collection over Map reduce framework. The account task is performed in four stages and provides a standard implementation of multiple documents account. The conferred technique is evaluated in terms of quantifiability and varied text account parameters particularly, compression ratio, retention ratio, ROUGE and Pyramid score are measured. The benefits of Map scale back framework are clearly visible from the experiments and it's additionally incontestable that Map reduce provides a quicker implementation of summarizing giant text collections and may be a powerful tool in big Text data analysis.

Authors and Affiliations

Anusha N
Department of Computer Science and Engineering, Sambhram Institute Of Technology, M S Palya, Banglore, Karnataka, India
Soumya Bilagi
Department of Computer Science and Engineering, Sambhram Institute Of Technology, M S Palya, Banglore, Karnataka, India
Raghava M S

Map Reduce, Latent Dirichlet Allocation (LDA), encryption, clustering

Publication Details

Published in : Volume 3 | Issue 5 | May-June 2018
Date of Publication : 2018-06-30
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 1044-1048
Manuscript Number : CSEIT1835244
Publisher : Technoscience Academy

ISSN : 2456-3307

Cite This Article :

Anusha N, Soumya Bilagi, Raghava M S, "Semantic similarity based clustering and modeling using Latent Dirichlet Allocation (LDA) ", International Journal of Scientific Research in Computer Science, Engineering and Information Technology (IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 5, pp.1044-1048, May-June-2018.
Journal URL : http://ijsrcseit.com/CSEIT1835244

Follow Us

Contact Us