Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme

Wei-Zhen Lu, Dong Wang

    Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

    68 Citations (Scopus)

    Abstract

    For ground-level ozone (O3) prediction, a predictive model, with reliable performance not only on non-polluted days but, more importantly, on polluted days, is favored by public authorities to issue alerts, so that concerned citizens and industrial organizations could take precautions to avoid exposure and reduce harmful emissions. However, the class imbalance problem, i.e., in some collected field data, number of O3 polluted days are much smaller than that of non-polluted days, will deteriorate the model performance on minority class-O3 polluted days. Despite support vector machine (SVM) obtaining promising results in air quality prediction, in this study, a cost-sensitive classification scheme is proposed for the standard support vector classification model (S-SVC) in order to investigate whether the class imbalance plagues S-SVC. The S-SVC with such scheme is named as CS-SVC. Experiments on imbalanced data sets collected from two air quality monitoring sites in Hong Kong show that 1) S-SVC is still sensitive to class imbalance problem; 2) compared with S-SVC, CS-SVC effectively avoids class imbalance problem with lower percentage of false negative on O3 polluted days but with higher percentage of false positive on non-polluted days; 3) compared with both S-SVC and CS-SVC, support vector regression model (SVR), after converting its output to binary one, only has similar performance with S-SVC, which indicates class imbalance problem also impairs the regressor model. From point of protecting public health, CS-SVC, which less likely misses to forecast O3 polluted days, is recommended here. © 2008 Elsevier B.V. All rights reserved.
    Original languageEnglish
    Pages (from-to)109-116
    JournalScience of the Total Environment
    Volume395
    Issue number2-3
    DOIs
    Publication statusPublished - 1 Jun 2008

    Research Keywords

    • Class imbalance problem
    • Cost-sensitive classification
    • Ozone
    • Support vector classification
    • Support vector regression

    Fingerprint

    Dive into the research topics of 'Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme'. Together they form a unique fingerprint.

    Cite this