Survey on Inclusive Analysis of Incomplete Datasheets

Authors

  • Baswaraju Swathi  Information Science, Visvesvaraya Technological University, Bangalore, Karnataka, India
  • Supriya P  Information Science, Visvesvaraya Technological University, Bangalore, Karnataka, India
  • Suma M  Information Science, Visvesvaraya Technological University, Bangalore, Karnataka, India

Keywords:

inclusive analysis, incomplete data, indexing, Query processing techniques

Abstract

Analyzing and processing any dataset is very important for any organization as it helps in making key business decisions of an organization and also increases the profit of any business organization. However, these data sets also include incomplete data sets, which are often eliminated in the pre-processing techniques. Incompleteness is the common problem that most of the datasets suffer from. The incompleteness refers to any missing or uncertain data in the datasets. The missing data exists due to failure of data transmission devices, accidental loss of data or improper storage. Given a dataset of multi-dimensional objects and a query object, finding k closest objects to the query from the dataset without eliminating the missing value data object is a fundamental problem in data mining. This concept has a significant role in real time applications like image recognition, location based services, etc. In this paper, we study how to retrieve k-closest object to a given query from datasets with incomplete data. Further, we explain and discuss the latest techniques used to improve the accuracy of such data retrieval. We then analyze and compare the results obtained, efficiency and performance of all the techniques discussed.

References

  1. Nick Roussopoulos Stephen Kelly Fredrick Vincent: “Nearest Neighbor Queries”.
  2. Parisa Haghani, Sebastian Michel and Karl Aberer: “Evaluating Top-k Queries over Incomplete Data Streams”.
  3. Wei Cheng, Xiaoming Jin, Jian-Tao Sun, Xuemin Lin, Xiang Zhang, and Wei Wang: “Searching Dimension Incomplete Databases”, IEEE Transactions on Knowledge and Data  Engineering, Vol. 26, No.3, March 2014
  4. Guadalupe Canahuate, Michael Gibas, and Hakan Ferhatosmanoglu: “Indexing Incomplete Databases”.
  5. Yogitha. M kapse and Anthara Bhattacharya: “Searching dimension in incomplete database by using hybrid Indexing Method”, International Journal of advanced research in Computer Science and Management Studies, Volume 3,Issue 6, June 2015.
  6. Cheng Lu, Shiji Song and Cheng Wu: “𝐾-Nearest Neighbor Intervals Based AP Clustering Algorithm for Large Incomplete Data”, Hindawi Publishing Corporation, Mathematical Problems in Engineering, Volume 2015, Article ID 535932,
  7. Beng Chin Ooi, Cheng Hian Goh and Kian-Lee Tan: “Fast High-Dimensional Data Search in Incomplete Databases”.
  8. Reynold Cheng, Lei Chen, Jinchuan Chen and Xike Xie: “Evaluating Probability Threshold k-Nearest-Neighbor Queries over Uncertain Data”.

Downloads

Published

2017-04-30

Issue

Section

Research Articles

How to Cite

[1]
Baswaraju Swathi, Supriya P, Suma M, " Survey on Inclusive Analysis of Incomplete Datasheets , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 2, Issue 2, pp.966-972, March-April-2017.