Generating Fake News Detection Model Using a Two-Stage Evolutionary Approach
DOI:
https://doi.org/10.32628/CSEIT251145Keywords:
fake, news, dataset, modelAbstract
While fake news is morally reprehensible, irresponsible parties intentionally use it to achieve their goals by disseminating it to vulnerable and targeted groups. Machine learning techniques have been researched extensively to detect fake news. On the other hand, evolutionary-based algorithms are now gaining popularity in the research community. In this study, a two-stage evolutionary approach is proposed to generate and optimize a mathematical equation for fake news detection. In the first stage, tree-based Genetic Programming (GP) algorithm is used to generate mathematical expressions to detect correlations between the language-independent (Lang-IND) features, extracted from Fake.my-COVID19 dataset, the newly curated fake news dataset in a mixed Malay - English language. The uniqueness of the proposed approach is that the mathematical expressions are formed by basic arithmetic operators or to include complex arithmetic operators such as addition, multiplication, subtraction, division, square, abs, log1p, sign, square root, and exponential together with Lang-IND features as the variables. Prior to second stage of the evolutionary approach, a sensitivity analysis is applied to shorten the best equation while maintaining the F1-score performance. In the second stage, an Adaptive Differential Evolution (ADE), is used to fine-tune the mathematical model.
Downloads
References
Ruchansky, N., Seo, S., & Liu, Y. (2017). CSI: A hybrid deep model for fake news detection. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 797–806. https://doi.org/10.1145/3132847.3132877
Shu, K., Sliva, A., Wang, S., Tang, J., & Liu, H. (2017). Fake news detection on social media: A data mining perspective. ACM SIGKDD Explorations Newsletter, 19(1), 22–36. https://doi.org/10.1145/3137597.3137600
Wang, W. Y. (2017). "Liar, Liar Pants on Fire": A new benchmark dataset for fake news detection. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 422–426. https://doi.org/10.18653/v1/P17-2067
Castillo, C., Mendoza, M., & Poblete, B. (2011). Information credibility on Twitter. Proceedings of the 20th International Conference on World Wide Web, 675–684. https://doi.org/10.1145/1963405.1963500
Zhou, X., &Zafarani, R. (2020). A survey of fake news: Fundamental theories, detection methods, and opportunities. ACM Computing Surveys (CSUR), 53(5), 1–40. https://doi.org/10.1145/3395046
Ahmed, H., Traore, I., & Saad, S. (2018). Detecting opinion spams and fake news using text classification. Security and Privacy, 1(1), e9. https://doi.org/10.1002/spy2.9
Kumar, S., & Shah, N. (2018). False information on web and social media: A survey. arXiv preprint arXiv:1804.08559. https://arxiv.org/abs/1804.08559
Yang, X., & Rondeau, S. (2019). Fake news detection on social media: A data mining perspective. IEEE Access, 7, 153000–153016. https://doi.org/10.1109/ACCESS.2019.2942085
Zhang, J., & Ghorbani, A. A. (2020). An improved fake news detection model based on deep learning. Proceedings of the 18th Annual International Conference on Privacy, Security and Trust (PST), 1–6. https://doi.org/10.1109/PST49696.2020.00013
Al-Dhief, F. T., Jalab, H. A., & Mohd, B. J. (2020). A review of evolutionary algorithms used in machine learning. Journal of Theoretical and Applied Information Technology, 98(17), 3416–3430.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 International Journal of Scientific Research in Computer Science, Engineering and Information Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.