Comparative Analysis of Machine Learning Regression Algorithms on Air Pollution Dataset

Authors

  • Sumit Upadhyay  Department of Computer Science and Engineering, IMS Engineering College, Ghaziabad, Uttar Pradesh, India

DOI:

https://doi.org/10.32628/CSEIT206427

Keywords:

Machine Learning, Regression, Prediction, Air Quality Index, So2, PM2.5, Accuracy score, Support Vector Regression.

Abstract

Air pollution has both acute and chronic effects on human health, affecting a number of different systems and organs. Examining and protecting air quality has become one of the most essential activities for the government in many industrial and urban areas today. Air pollutants, such as carbon monoxide (CO), sulfur dioxide (SO(2)), nitrogen oxides (NOx), volatile organic compounds (VOCs), ozone (O(3)), heavy metals, and respirable particulate matter (PM2.5 and PM10), differ in their chemical composition, reaction properties, emission, time of disintegration and ability to diffuse in long or short distances. The main objective of this paper to build a model for predicting Air Quality Index(AQI) of the specific cities using various types of machine learning algorithms namely Multiple Linear Regression, K Nearest Neighbours(KNN), Support Vector Machine(SVM) and Decision Tree. And also evaluate and compare the performance of every algorithm based on their accuracy score and errors. Air Pollution dataset is publicly available on different government sites. The implementation phase dataset is divided as 80% for the training of different models and the rest of the dataset is used for testing the model.

References

  1. Dixian Zhu 1, Changjie Cai 2 , Tianbao Yang 1 and Xun Zhou 3 “A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization”.
  2. https://www.nrdc.org/stories/air-pollution-everything-you-need-know “Air Pollution : Everything You Need to Know”.
  3. an Kleine Deters, Rasa Zalakeviciute, Mario Gonzalez and Yves Rybarczyk 18 Jun 2017 “Modeling PM2.5 Urban Pollution Using Machine Learning and Selected Meteorological Parameters”.
  4. Pooja Bhalgat , Sejal Pitale and Sachin Bhoite “Air Quality Prediction using Machine Learning Algorithms”.
  5. Yves Rybarczyk and Rasa Zalakeviciute December 20th 2017 “Regression Model to predict Air Pollution from Affordable Data Collection”.
  6. Aditya C, Chandana R Deshmukh, Nayana D, Praveen Gandhi Vidyavastu 4 may 2018 “Detection and Prediction of Air Pollution using Machine Learning Models”.
  7. Mushtak Sayyed, Akshay Sarode, Adesh Salunke and Swaraj Desai march 2020 “A thorough Survey on prediction of Airpollution”.
  8. Mrs. A. Gnana Soundari, Mrs. J. Gnana and Akshaya A.C 2019 “INDIAN AIR QUALITY PREDICTION AND ANALYSIS USING MACHINE LEARNING”.

Downloads

Published

2020-08-30

Issue

Section

Research Articles

How to Cite

[1]
Sumit Upadhyay, " Comparative Analysis of Machine Learning Regression Algorithms on Air Pollution Dataset" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 6, Issue 4, pp.125-136, July-August-2020. Available at doi : https://doi.org/10.32628/CSEIT206427