Segregation of Live News Articles Based on Location Using Machine Learning

Authors

  • Heneil Tayade  Department of Computer Engineering, Datta Meghe College of Engineering, Navi Mumbai, Maharashtra, India
  • Chaitanya Shetty  Department of Computer Engineering, Datta Meghe College of Engineering, Navi Mumbai, Maharashtra, India
  • Ratika Jankar  Department of Computer Engineering, Datta Meghe College of Engineering, Navi Mumbai, Maharashtra, India
  • Dr. Amol Pande   Department of Computer Engineering, Datta Meghe College of Engineering, Navi Mumbai, Maharashtra, India

DOI:

https://doi.org/10.32628/CSEIT206380

Keywords:

Machine Learning, Random Forest, SVM, Naive Bayes, News classification.

Abstract

As we all know web contains an enormous amount of data which is gigantic and it is changing continuously for each minute and we also know during this hectic lifestyle it's very difficult to stay track of each news and article that's occurring. So, people are mostly focused on the news which goes into their nearby environment. During this paper, we consider displaying the news directing on the nearby cities and also displaying the required news articles supported by a few important cities. Here, we've a web crawler which is used to withdraw the content from the HTML pages of the articles. Random forest, Naïve Bayes and SVM classifiers are used to compute the precision and their accuracy is being calculated. Machine Learning is the well- known technique used for this type of news classification and displaying of the news articles

References

  1. B. Pendharkar, P. Ambekar, P. Godbole, S. Joshi, and S. Abhyankar.” Topic categorization of rss news feeds,” Group vol. 4, p. 1, 2007.
  2. Shen, Z. Chen, Q. Yang, H. Zeng, B. Zhang, Y. Lu, and W. Ma, Web-page classification through summarization, in Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2004, pp. 242249.
  3. Leo Breiman, Random forests, Machine Learning. vol. 45, no. 1, pp. 532, 2001
  4. Lewis,” Naive (bayes) at forty: The independence assumption in information retrieval,” Machine Learning: ECML-98, pp. 415, 1998.
  5. J. K. M. Han, Data Mining: Concepts and Techniques, 2nd ed. 2006.
  6. Yu,” SVM tutorial: Classification, regression, and ranking,” Handbook of Natural Computing 2009
  7. Rijsbergen, Information Retrieval,2nded London: Butterworths, 1979.
  8. Y. Baeza and B. R. Neto, Modern Information Retrieval. Boston, 1999.
  9. J. Davis and M. Goadrich, The relationship between precision- recall and ROC curves, in Proceedings of the 23rd International Conference on Machine Learning, ser. ICML06. New York, NY, USA: ACM, 2006, pp. 233240   [Online]Available: http://doi.acm.org/10.1145/1143844.1143 874.
  10. T. Landgrebe, P. Paclk, R. Duin, and A. Bradley, Precision- recall operating characteristic (P-ROC) curves in imprecise environments, in Proceedings of ICPR, 2006.
  11. Manning, C. and Schutze, H., Foundation of statistical natural language processing Cambridge, Mass: MIT press,1999.
  12. C. Ee and P. Lim, Automated online news classification with personalization.
  13. M. Kasthuri, Dr.S.Britto Ramesh Kumar, A Framework for Language In- dependent Stemmer Using Dynamic Programming, International Journal of Applied Engineering Research, ISSN 0973- 4562 Vol.10, pp 39000- 39004, Number.18,2015.

Downloads

Published

2020-06-30

Issue

Section

Research Articles

How to Cite

[1]
Heneil Tayade, Chaitanya Shetty, Ratika Jankar, Dr. Amol Pande , " Segregation of Live News Articles Based on Location Using Machine Learning" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 6, Issue 3, pp.505-512, May-June-2020. Available at doi : https://doi.org/10.32628/CSEIT206380