Sensing Identicals Files and Eliminating SHA Base Algorithm (SIFE-SBA) for WhatApps

Authors

  • Sona Sahu  Department of CSE, BITS, Bhopal, Madhya Pradesh, India
  • Prof. Adity Sinha  Department of CSE, BITS, Bhopal, Madhya Pradesh, India

Keywords:

Unique Documents, Detecting Duplicate, Copying, Database Storing, Data Consumptions, Whatsapp’s.

Abstract

An extreme transaction of the WhatsApp’s which is more duplicate or replicate files. where whole documents may be served in different formats like JPG,JPEG, PNG., GIF, PDF, Doc, DOCX and more. That Files may get copied to avoid delays/data consumption to provide fault acceptance. large databases of WhatsApp’s files store in the declassification effort with receiving folders. copies of the files are abundant in images, video, Audio and short text in the databases. In our examination on On-line Social Networks (WhatsApp’s) of images, more than 60% of total data are exact duplicate. due to forwarding by friends with linked each other. In this algorithms for detecting replicated files before store in local database by checking with updated hash index which are more critical in WhatsApp’s applications where data is received from many friends links sources or groups. The deduction of duplication files are very necessary, to reduce data consumption in runtime, to improve data storing capacity with higher accuracy.

References

  1. BRODER, A., GLASSMAN, S., MANASSE, S., AND ZWEIG, G. “Syntactic clustering of the web”, In Proceedings of the Sixth International World Wide Web Conference (WWW6’97) (Santa Clara, CA., April). 391–404.
  2. SHIVAKUMAR, N. AND GARICA-MOLINA, H. 1998. Finding near-replicas of documents on the web. In Proceedings of Workshop on Web Databases (WebDB’98) (Valencia, Spain, March). 204–212.
  3. McCain, M., and William C., 1998. Integrating Quality Assurance into the GIS Project Life Cycle, Proceedings of the 1998 ESRI Users Conference. http://www.dogcreek.com/html/documents.html
  4. Good child, M., and Gopal, S. (Eds.),, “Accuracy of Spatial Databases”, Taylor & Francis, London. 1989.
  5. Scott, L. “Identification of GIS Attribute Error Using Exploratory Data Analysis”, Professional Geographer 46(3), 378.386, 1994.
  6. FGDC Federal Geographic Data Committee, Content standard for digital geospatial metadata (revised June 1998), Federal Geographic Data Committee, Washington, D.C., http://www.fgdc.gov/metadata/contstan.html, 1998. FGDC-STD- 001-1998.
  7. Lavanya Pamulaparty Dr. C.V. Guru Rao Dr. M. Sreenivasa Rao, “LSBSM: A Novel Method for Identification of Near Duplicates in Web Documents”, International Journal of Computer Science and Information Security (IJCSIS), Vol. 15, No. 2, February 2017
  8. Mauro Conti, Radha Poovendran, Marco Secchiero, “FakeBook: Detecting Fake Profiles in On-line Social Networks”,ACM International Conference on Advances in Social Networks Analysis and Mining, IEEE 2012.
  9. Ramya R S, Venugopal K R, Iyengar S S & Patnaik L, “Feature Extraction and Duplicate Detection for Text Mining: A Survey”, Global Journal of Computer Science and Technology: C Software & Data Engineering, Volume 16 Issue 5 Version 1.0 Year 2016
  10. Georgios Kontaxis, Iasonas Polakis, Sotiris Ioannidis and Evangelos P. Markatos, “Detecting Social Network Profile Cloning”, 3rd International Workshop on Security and Social Networking, IEEE, 2011.
  11. The SHA-1 algorithm specified in this document is identical to the SHA-1 algorithm specified in FIPS 180-1.
  12. http://en.wikipedia.org/wiki/SHA-2.

Downloads

Published

2017-06-30

Issue

Section

Research Articles

How to Cite

[1]
Sona Sahu, Prof. Adity Sinha, " Sensing Identicals Files and Eliminating SHA Base Algorithm (SIFE-SBA) for WhatApps , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 3, pp.390-395, May-June-2017.