Comparative study of the Performance of Machine Learning Text Classifiers Applied to Afaan Oromo Text

Authors

  • Etana Fikadu  College of Engineering & Technology, Wollega University, Post Box No: 395, Ethiopia

DOI:

https://doi.org/10.32628/CSEIT20645

Keywords:

Afaan Oromo text categorization, classification algorithms, machine learning

Abstract

The aim of this study is to find the optimal method that can be used to classify Afaan Oromo text among different classifier by using the same number of text document. Automatic text classification has been needed in many fields for a long time. Many methods are used to classify text. The performance of this classifier we used in this study is measured in terms of recall, precision and F-measure. Finally we compare the efficiencies of the Bayesian Network, Naïve Bayesian, IBK and SMO to classify Afaan Oromo text. Experimental results on the same set of Afaan Oromo documents used before show that SMO slightly outperforms the other methods. Comparison reported in this paper shows that the SMO classifier exceeds the other four Machine learning classifier.

References

  1. A. Addis,"Study and Development of Novel Techniques for Hierarchical Text Categorization," PhD Thesis, University of Cagliari, 2010.
  2. F. SEBASTIANI, “Machine Learning in Automated Text Categorization”, vol. 34, 2002.
  3. R.S. Feldman, J., the Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press,USA, New York, 2006.
  4. Etana Fikadu and Ramesh Babu P, “Application of Data Mining Classification Algorithms for Afaan Oromo Media Text News Categorization”, published 7 - july 2019
  5. Kamal, et al “Afaan Oromo News Text Categorization using Decision Tree Classifier and Support Vector Machine: A Machine Learning Approach”, published May, 2017
  6. Nigam K, American Association for Artificial Intelligence (AAAI),. Inc., A Nonprofit California Corporation, Al Topics / Machine Learning, 2008.
  7. Abera Diriba, Automatic classification of Afaan Oromo news text: The case of radio fana, master’s thesis, (2009)
  8. Y. Yang, and X. Liu, (1999). A re-examination of text categorization methods.
  9. I.and Frank, "Data Mining: Practical Machine Learning Tools and Techniques. San Francisco, USA: ," Morgan Kaufmann, 2005.
  10. Y. Zhao, "Comparison of Agglomerative and Partitioning Document Clustering Algorithms.," Washington DC: ACM Press., (2002).
  11. S. Hassan, H. Ney, and J. Zaplo, (France, Friday 6 July 2001), Statistical Classification Methods for Arabic News Articles, ACL/EACL 2001, Germany.
  12. A. Bensaid, et al (2004). Automatic Arabic Document Categorization Based on the Naive Bayes Algorithm.
  13. Sakhr, Categorization, (2007). http://www.sakhr.com/
  14. R. and Ribeiro-Neto, B. Baeza-Yates, "Modern Information Retrieval. New," 1999.
  15. http://www.cs.waikato.ac.nz/ml/weka/ (Visited January 8th, 20120).

Downloads

Published

2020-08-30

Issue

Section

Research Articles

How to Cite

[1]
Etana Fikadu, " Comparative study of the Performance of Machine Learning Text Classifiers Applied to Afaan Oromo Text" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 6, Issue 4, pp.77-83, July-August-2020. Available at doi : https://doi.org/10.32628/CSEIT20645