Analysis of Run-Off-Road Accidents by Association Rule Mining and Geographic Information System Techniques on Imbalanced Datasets
Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal › peer-review
Author(s)
Jiang, Feifeng; Yuen, Kwok Kit Richard; Lee, Eric Wai Ming; Ma, Jun
Detail(s)
Original language | English |
---|---|
Article number | 4882 |
Journal / Publication | Sustainability |
Volume | 12 |
Issue number | 12 |
Online published | 15 Jun 2020 |
Publication status | Published - Jun 2020 |
Link(s)
DOI | DOI |
---|---|
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85087497839&origin=recordpage |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(e1b1aff4-0c75-4196-be35-726d73395f59).html |
Abstract
Run-off-road (ROR) accidents cause a large proportion of road fatalities. Identifying the key contributing factors is an effective way to reduce fatalities and improve the sustainability of road safety. However, current studies have two limitations: (1) ROR accident datasets are imbalanced, with far fewer fatal accident (FA) samples than non-fatal accident (NFA) samples, and applying data mining methods to such imbalanced datasets biases the results. (2) Few studies have conducted and visualized spatial analysis of ROR accidents. Therefore, this study proposes an association rule mining (ARM)-based framework to analyze ROR accidents on imbalanced datasets. A novel method is proposed to address the imbalance problem, and ARM is applied to analyze accident severity. A geographic information system (GIS) is adopted for spatial analysis of ROR accidents. The proposed framework is applied to ROR accidents in Victoria, Australia. Six FA factors and seven NFA factors are identified from two-item rules. The results of three-item rules indicate that factors acting in combination increase the likelihood of FA or NFA. Hot spots of ROR accidents are presented on GIS maps. Effective measures are accordingly proposed to improve road safety. Compared with traditional data-balancing methods, the proposed framework has been validated to provide more robust and reliable results on imbalanced datasets.
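The abstract does not spell out the balancing method, so the following is only a minimal, generic sketch of the ideas it names: draw class-balanced bootstrap samples, mine two-item rules (factor -> severity) on each sample, and keep the rules that recur across samples. It should not be read as the authors' algorithm. The function names, thresholds, and toy records are all hypothetical and chosen for illustration.

```python
import random
from collections import Counter


def balanced_bootstrap(records, label_key, rng):
    """Draw one bootstrap sample with equal counts per class.

    Each class is sampled with replacement down to the size of the
    smallest class (an assumption of this sketch).
    """
    by_class = {}
    for rec in records:
        by_class.setdefault(rec[label_key], []).append(rec)
    n = min(len(group) for group in by_class.values())
    sample = []
    for group in by_class.values():
        sample.extend(rng.choices(group, k=n))
    return sample


def two_item_rules(records, label_key, min_support, min_lift):
    """Return {(factor, label): (support, confidence, lift)} for factor -> label."""
    total = len(records)
    label_counts = Counter(rec[label_key] for rec in records)
    item_counts = Counter()
    pair_counts = Counter()
    for rec in records:
        for key, value in rec.items():
            if key == label_key:
                continue
            item = f"{key}={value}"
            item_counts[item] += 1
            pair_counts[(item, rec[label_key])] += 1
    rules = {}
    for (item, label), n_pair in pair_counts.items():
        support = n_pair / total                # P(factor and label)
        confidence = n_pair / item_counts[item]  # P(label | factor)
        lift = confidence / (label_counts[label] / total)
        if support >= min_support and lift >= min_lift:
            rules[(item, label)] = (support, confidence, lift)
    return rules


def stable_rules(records, label_key="severity", rounds=50,
                 min_support=0.05, min_lift=1.2, min_share=0.8, seed=0):
    """Keep rules that pass the thresholds in at least min_share of the rounds."""
    rng = random.Random(seed)
    hits = Counter()
    for _ in range(rounds):
        sample = balanced_bootstrap(records, label_key, rng)
        hits.update(two_item_rules(sample, label_key, min_support, min_lift).keys())
    return {rule for rule, n in hits.items() if n / rounds >= min_share}


# Hypothetical toy data (not from the study): 20 FA versus 200 NFA records.
records = (
    [{"light": "dark", "severity": "FA"}] * 16
    + [{"light": "day", "severity": "FA"}] * 4
    + [{"light": "dark", "severity": "NFA"}] * 40
    + [{"light": "day", "severity": "NFA"}] * 160
)
found = stable_rules(records)
for rule in sorted(found):
    print(rule)
```

Mining each balanced sample separately and voting across rounds is one simple way to keep a rule from depending on a single resampling draw; the vote threshold (`min_share`) is an arbitrary choice here.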
Research Area(s)
- run-off-road accidents, imbalanced dataset, bootstrap-resampling-data-balancing method, association rule mining, ensemble method, geographic information system
Citation Format(s)
Analysis of Run-Off-Road Accidents by Association Rule Mining and Geographic Information System Techniques on Imbalanced Datasets. / Jiang, Feifeng; Yuen, Kwok Kit Richard; Lee, Eric Wai Ming; Ma, Jun.
In: Sustainability, Vol. 12, No. 12, 4882, 06.2020.