Effective Clustering Approach to Discover Outliers in Voluminous Database using Clustering Approach

Authors

  • M. Veena  M.Phil (SSP) Research Scholar, Department of Computer Applications, Alagappa University, Karaikudi, Tamil Nadu, India
  • Dr. A. Nagarajan  Assistant Professor, Department of Computer Applications, Alagappa University, Karaikudi, Tamil Nadu, India

Keywords:

Web Mining, Sequential Pattern Algorithm, Browsing patterns, Server Log Files

Abstract

In recent advancements of the internet have changed the business scenario and plethora of data is available for the decision making and to improve business transactions. Mining interesting data related to server logs is an evolving area and web usage mining caters to the need of the website positioning and marketing strategies. The web server creates log files regarding details about the page, IP address of the user, browser, and operating system used and time/date stamp regarding browsing patterns and this data is mined to extract useful information using web usage mining. The primary objective of this research work is to discover the low hit pages of a website from the log files. The research work mainly focuses on a new technique to find the browsing patterns or the navigational behavior of the users after mining the content of the server log files using a hybrid data mining techniques. The proposed algorithm uses sequential frequent item set mining technique. The proposed methods are evaluated using voluminous benchmarked datasets and from the experimental results, it is founds that the proposed methods are superior with respect to the accuracy, time consumption, memory usage and precision than the existing state of the art algorithms.

References

  1. V. Kumar et al., "Clustering using modified harmony search algorithm", International journal of computational intelligence studies, 3(2): pp. 113-133, 2014.
  2. Du Y.J. and Li H.M., "Strategy for Mining Association Rules for Web Pages Based on Formal Concept Analysis", Applied Soft Computing, Vol. 10, No. 3, pp. 772-783, 2010.
  3. K. Kianmehr et al., "Association Rules Mining Based Approach for Web Usage Mining", In Proceedings of the IEEE International Conference on Information Reuse and Integration, IEEE Conference Publishing Services, pp. 166-171, August 03-05, 2011.
  4. Martinez De Pison et al., "Mining Association Rules From Time Seriesto Explain Failures In A Hot-Dip Galvaizing Steel Line", Computers and Industrial Engineering, Vol. 63, No. 1, pp. 22-36, 2012.
  5. P. Murugavel et al., "Improved Hybrid Clustering and Distance-based Technique for Outlier Removal", International Journal on Computer Science and Engineering (IJCSE).
  6. R. Pamula et al., "An Outlier Detection Method Based on Clustering". Emerging Applications of Information Technology (EAIT), pp. 253 – 256, 2011.
  7. Asieh Ghanbarpour and Behrooz Minaei, "EXDBSCAN: An extension of DBSCAN to detect clusters in multi-density datasets", IEEE Intelligent Systems (ICIS), 10.1109/IranianCIS.2014.6802561.
  8. Pranav Nerurka et al, "Empirical Analysis of Data Clustering Algorithms", 6th International Conference on Smart Computing and Communications, ICSCC 2017, 7-8 December 2017, Kurukshetra, India.
  9. Qinbao Song et al., "A Fast Clustering-Based Feature Subset Selection Algorithm for High Dimensional Data", IEEE Transactions On Knowledge And Data Engineering Vol:25 No:1, 2013.
  10. D. Sculley., "Web-scale k-means clustering", In Proceedings of the 19th international conference on World wide web, pp. 1177-1178, 2010.

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
M. Veena, Dr. A. Nagarajan, " Effective Clustering Approach to Discover Outliers in Voluminous Database using Clustering Approach , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 3, pp.410-415, March-April-2018.