An Implementations of Clustering Technique in Data Mining to Analyse Big Data in Finance

Authors

  • Dr. Rajesh Gargi  Principal of JCDM College of Engineering, Sirsa, Haryana, India
  • Sheenu Sachdeva  Assistant Professor, Department of CSE, JCDM College of Engineering, Sirsa, Haryana, India
  • Pooja Majoka  M.Tech. Scholar, Department of CSE, JCDM College of Engineering, Sirsa, Haryana, India

Keywords:

Kmean, Big Data, PSO

Abstract

This research work is proposed to manage Big Data in Finance using Clustering and optimization mechanism. There are different knowledge discovery techniques, applications and process models that are applicable to deal with big data. There are several researches related to Big Data in Finance. A review of existing researches is stated here. Different researches are proposed research considering K-different clustering mechanism. These algorithms have been applied to various real-life applications running in serial, parallel, and high-performing computing environments. But these researches and traditional data mining techniques have their own limitations. The aim of researches is to judge the efficiency of different data mining algorithms on dataset and determine the optimum clustering algorithm. The performance analysis depends on many factors encompassing test mode, distance function and parameters. There are several researches which used K-MEAN clustering and optimization mechanism. The issues related to Kmean clustering would be resolved in this research. Research has introduced the more effective cluster mechanism to classify the data set. Therefore it is essential to propose a data mining technique to deal with big data in Finance. This research work would be helpful to known about data mining of big data. Data related to Equity and Mutual fund is considered in proposed model. After getting data, it is classified in to different clusters. Same data is stored in a cluster. Each cluster has different type of data set but the data within a cluster have similarly. There are different factors such as face value, market cap, promoters holding , fi holding, domestic holding, 52 week low, 52 week high , dividend, P/E are considered to analyze the better result in future. Using this proposed module, it will be possible to determine the shares or funds that will provide more profit.

References

  1. T. Soni Madhulatha. AN OVERVIEW ON CLUSTERING METHODS. IOSR Journal of Engineering Apr. 2012, Vol. 2(4) pp: 719-725.
  2. W.Sarada, Dr.P.V.Kumar, A REVIEW ON CLUSTERING TECHNIQUES AND THEIR COMPARISON , International Journal of Advanced Research in Computer Engineering &Technology (IJARCET) Volume 2 Issue 11, November 2013
  3. Bhoj Raj Sharmaa.Clustering Algorithms: Study and Performance Evaluation Using Weka Tool.International Journal of Current Engineering and Technology ISSN 2277 - 4106 © 2013.
  4. Shweta Srivastava. Clustering Techniques Analysis for Microarray Data. International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 5, May 2014
  5. Muhammad Husain Zafar.A Clustering Based Study of Classification Algorithms.International Journal of Database Theory and Application Vol.8, No.1 (2015), pp.11-22.
  6. Asir Antony Gnana Singh. Performance Analysis on Clustering Approaches for Gene ExpressionData.International Journal of Advanced Research in Computer and Communication Engineering Vol. 5, Issue 2, February 2016 .
  7. Kalyani M Raval. Data Mining Techniques
  8. V.Ramesh, P.Parkavi, P.Yasodha. Performance Analysis of DM Techniques for Placement Chance Prediction .
  9. Neelamadhab Padhy1, Dr.Pragnyaban Mishra .Survey of Data Mining Applications and Feature Scope .June 2012.
  10. David Jensen and Jennifer Neville. Data Mining in Social Networks.
  11. Aarti Sharma, Rahul Sharma,Vivek Kr. Sharma,VishalShrivatava. Application of Data Mining – A Survey Paper.2014.
  12. Kun-Ming Yu, Che-Yu Lin, Wen Ouyang Jiayi Zhou. An OpenCL Candidate Slicing Frequent Pattern Mining algorithm on graphic processing units. 2011 IEEE International Conference on Systems, Man, and Cybernetics.
  13. Che-Yu Lin, Kun-Ming Yu, Wen Ouyang Jiayi Zhou. An OpenCL Candidate Slicing Frequent Pattern Mining algorithm on graphic processing units. 2011 IEEE International Conference on Systems, Man, and Cybernetics.
  14. C. Vorapongkitipun and N. Nupairoj, “Improving performance of small-file accessing in Hadoop,” 2014 11th Int. Jt. Conf. Comput. Sci. Softw. Eng. "Human Factors Comput. Sci. Softw. Eng. - e-Science High Perform. Comput. eHPC, JCSSE 2014, pp. 200–205, 2014.
  15. C. Verma and R. Pandey, “Big Data representation for grade analysis through Hadoop framework,” Proc. 2016 6th Int. Conf. - Cloud Syst. Big Data Eng. Conflu. 2016, pp. 312–315, 2016.
  16. A. Siretskiy and O. Spjuth, “HTSeq-Hadoop: Extending HTSeq for massively parallel sequencing data analysis using Hadoop,” Proc. - 2014 IEEE 10th Int. Conf. eScience, eScience 2014, vol. 1, pp. 317–323, 2014
  17. K. Rattanaopas and S. Kaewkeeree, “Improving Hadoop MapReduce performance with data compression: A study using wordcount job,” ECTI-CON 2017 - 2017 14th Int. Conf. Electr. Eng. Comput. Telecommun. Inf. Technol., pp. 564–567, 2017.
  18. P. R. Merla and Y. Liang, “Data analysis using hadoop MapReduce environment,” Proc. - 2017 IEEE Int. Conf. Big Data, Big Data 2017, vol. 2018–Janua, pp. 4783–4785, 2018.
  19. J. Kaur, K. Sachdeva, and G. Singh, “Image processing on multinode hadoop cluster,” 2017 Int. Conf. Electr. Electron. Commun. Comput. Optim. Tech., pp. 21–26, 2017.
  20. A. Bhardwaj, V. K. Singh, Vanraj, and Y. Narayan, “Analyzing BigData with Hadoop cluster in HDInsight azure Cloud,” 12th IEEE Int. Conf. Electron. Energy, Environ. Commun. Comput. Control (E3-C3), INDICON 2015, 2016.
  21. M. Bhandarkar, “MapReduce programming with apache Hadoop,” 2010 IEEE Int. Symp. Parallel Distrib. Process., p. 1, 2010.
  22. A. Q. Mohammed and R. Bharati, “An efficient technique to improve resources utilization for hadoop MapReduce in heterogeneous system,” ICCT 2017 - Int. Conf. Intell. Commun. Comput. Tech., vol. 2018–January, pp. 12–16, 2018.
  23. X. Qiao, D. Shi & F. Xu, "Optimal pricing strategy & economic effect of product sharing based on analysis of B2C sharing platform," 2019 16th International Conference on Service Systems & Service Management (ICSSSM), Shenzhen, China, 2019, pp. 1-6, doi: 10.1109/ICSSSM.2019.8887720.
  24. E. Turkedjiev, M. Angelova & K. Busawon, "Validation of Artificial Neural Network Model for Share Price UK Banking Sector Short-Term Trading," 2013 UKSim 15th International Conference on Computer Modelling & Simulation, Cambridge, 2013, pp. 6-11, doi: 10.1109/UKSim.2013.31.
  25. M. Wu & Q. Yu, "Empirical Analysis of Impact Factors of A Shares & H Shares of Price Differences," 2010 International Conference on E-Product E-Service & E-Entertainment, Henan, 2010, pp. 1-4, doi: 10.1109/ICEEE.2010.5661626.
  26. L. Guoyi & L. Renzhong, "An Analysis on Correlation of Factors Affecting Bank Share Prices," 2009 International Conference on Information Management, Innovation Management & Industrial Engineering, Xi'an, 2009, pp. 394-397, doi: 10.1109/ICIII.2009.404.
  27. X. Fang & T. Bai, "Share Price Prediction Using Wavelet Transform & Ant Colony Algorithm for Parameters Optimization in SVM," 2009 WRI Global Congress on Intelligent Systems, Xiamen, 2009, pp. 288-292, doi: 10.1109/GCIS.2009.85.
  28. F. Wang, L. Duan & J. Niu, "Optimal Pricing of User-Initiated Data-Plan Sharing in a Roaming Market," in IEEE Transactions on Wireless Communications, vol. 17, no. 9, pp. 5929-5944, Sept. 2018, doi: 10.1109/TWC.2018.2851578.
  29. N. Trivedi, P. Jangir, N. Jangir, S. A. Parmar, M. Bhoye & A. Kumar, "Voltage stability enhancement & voltage deviation minimization using multi-verse optimizer algorithm," 2016 International Conference on Circuit, Power & Computing Technologies (ICCPCT), Nagercoil, 2016, pp. 1-5, doi: 10.1109/ICCPCT.2016.7530136.
  30. M. Anbarasi, K. S. Sendhil Kumar, R. Balamurugan & Thejasswini, "Disease Prediction using Hybrid Optimization Methods based on Tuning Parameters," 2020 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 2020, pp. 643-648, doi: 10.1109/Confluence47617.2020.9058029.

Downloads

Published

2022-05-17

Issue

Section

Research Articles

How to Cite

[1]
Dr. Rajesh Gargi, Sheenu Sachdeva, Pooja Majoka, " An Implementations of Clustering Technique in Data Mining to Analyse Big Data in Finance, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 9, Issue 3, pp.448-454, May-June-2023.