Generating Fake News Detection Model Using a Two-Stage Evolutionary Approach

Authors

  • Cheni Sruneethi, Assistant Professor, Department of MCA, Annamacharya Institute of Technology and Sciences (AITS), Karakambadi, Tirupati, Andhra Pradesh, India
  • Moduru Mounika, Post Graduate, Department of MCA, Annamacharya Institute of Technology and Sciences (AITS), Karakambadi, Tirupati, Andhra Pradesh, India

DOI:

https://doi.org/10.32628/CSEIT251145

Keywords:

fake, news, dataset, model

Abstract

Although fake news is morally reprehensible, irresponsible parties deliberately spread it among vulnerable and targeted groups to achieve their goals. Machine learning techniques for fake news detection have been researched extensively, and evolutionary algorithms are now gaining popularity in the research community. In this study, a two-stage evolutionary approach is proposed to generate and optimize a mathematical equation for fake news detection. In the first stage, a tree-based Genetic Programming (GP) algorithm generates mathematical expressions that capture correlations among language-independent (Lang-IND) features extracted from Fake.my-COVID19, a newly curated fake news dataset in mixed Malay-English. The distinguishing feature of the proposed approach is that the expressions are built from basic arithmetic operators alone or extended with more complex ones; the operator set comprises addition, multiplication, subtraction, division, square, abs, log1p, sign, square root, and exponential, with the Lang-IND features as variables. Before the second stage, a sensitivity analysis is applied to shorten the best equation while maintaining its F1-score. In the second stage, Adaptive Differential Evolution (ADE) is used to fine-tune the mathematical model.
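
The second stage can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a fixed expression stands in for the equation produced by GP, that the tunable parts are three numeric constants, and that classification thresholds the expression at zero. The expression, the protected operators, and all function and parameter names are hypothetical. The optimizer is plain differential evolution (rand/1/bin) with fixed `F` and `CR`; the abstract does not say how the adaptive variant adjusts its control parameters, so that step is left out.

```python
import math
import random

# Protected operators: an assumption, since the abstract does not say how
# division by zero or negative arguments are handled.
def protected_div(a, b):
    return a / b if abs(b) > 1e-9 else 1.0

def protected_sqrt(a):
    return math.sqrt(abs(a))

def protected_log1p(a):
    return math.log1p(abs(a))

def f1_score(y_true, y_pred):
    """F1 for the positive class (label 1), computed from raw counts."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def expression(x, w):
    """Hypothetical stand-in for a stage-1 equation over two features."""
    return w[0] * protected_log1p(x[0]) - w[1] * protected_sqrt(x[1]) + w[2]

def predict(X, w):
    # Assumed decision rule: positive expression value means label 1.
    return [1 if expression(x, w) > 0 else 0 for x in X]

def fitness(w, X, y):
    return f1_score(y, predict(X, w))

def differential_evolution(X, y, dim=3, pop_size=20, generations=50,
                           F=0.5, CR=0.9, seed=0):
    """Plain DE/rand/1/bin that maximizes F1 over the constants w."""
    rng = random.Random(seed)
    pop = [[rng.uniform(-2, 2) for _ in range(dim)] for _ in range(pop_size)]
    scores = [fitness(w, X, y) for w in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct donors, none equal to the target vector i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees one mutated component
            trial = [
                pop[a][k] + F * (pop[b][k] - pop[c][k])
                if (rng.random() < CR or k == j_rand) else pop[i][k]
                for k in range(dim)
            ]
            s = fitness(trial, X, y)
            if s >= scores[i]:  # greedy selection keeps the better vector
                pop[i], scores[i] = trial, s
    best = max(range(pop_size), key=lambda i: scores[i])
    return pop[best], scores[best]
```

Calling `differential_evolution(X, y)` with `X` as a list of two-element feature tuples and `y` as a list of 0/1 labels returns the best constants found and their F1-score. Because selection is greedy, the best score in the population never decreases from one generation to the next.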

Published

19-05-2025

Section

Research Articles