TY - GEN
T1 - Assessing Accident Risk using Ordinal Regression and Multinomial Logistic Regression Data Generation
AU - Alicioglu, Gulsum
AU - Sun, Bo
AU - Ho, Shen Shyang
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/7
Y1 - 2020/7
N2 - Robust and accurate modeling of motor vehicle accident and injury severities have significant impact on transportation safety and economy. The capability to assess accident risk based on external driving conditions (e.g., weather, road condition, etc.) and driver behavior and characteristics can reduce accident occurrences by alerting drivers to alleviated risk. In this paper, we propose a novel accident risk assessment framework driven by ordinal regression. One challenge of the risk assessment problem is that non-accident data are not collected by any agency in their study of transportation safety. Hence, we also propose a realistic negative data generation scheme based on feature weighs derived from multinomial logistic regression to overcome this challenge. Experimental results on two different real-world datasets from the US National Highway Traffic Safety Administration and UK Transport for Greater Manchester are used to demonstrate the feasibility and robustness of our proposed ordinal regression framework. Performance on four ordinal regression algorithms, namely: logistic all-threshold, logistic immediate-threshold, ordinal ridge, and least absolute deviations are compared. In addition, for US dataset, we investigate the effect of random oversampling and undersampling on the proposed risk assessment framework. We empirically show that bagging with random oversampling using logistic all-threshold ordinal regression method has the best prediction performance among ordinal regression models.
AB - Robust and accurate modeling of motor vehicle accident and injury severities have significant impact on transportation safety and economy. The capability to assess accident risk based on external driving conditions (e.g., weather, road condition, etc.) and driver behavior and characteristics can reduce accident occurrences by alerting drivers to alleviated risk. In this paper, we propose a novel accident risk assessment framework driven by ordinal regression. One challenge of the risk assessment problem is that non-accident data are not collected by any agency in their study of transportation safety. Hence, we also propose a realistic negative data generation scheme based on feature weighs derived from multinomial logistic regression to overcome this challenge. Experimental results on two different real-world datasets from the US National Highway Traffic Safety Administration and UK Transport for Greater Manchester are used to demonstrate the feasibility and robustness of our proposed ordinal regression framework. Performance on four ordinal regression algorithms, namely: logistic all-threshold, logistic immediate-threshold, ordinal ridge, and least absolute deviations are compared. In addition, for US dataset, we investigate the effect of random oversampling and undersampling on the proposed risk assessment framework. We empirically show that bagging with random oversampling using logistic all-threshold ordinal regression method has the best prediction performance among ordinal regression models.
UR - http://www.scopus.com/inward/record.url?scp=85093848436&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093848436&partnerID=8YFLogxK
U2 - 10.1109/IJCNN48605.2020.9207105
DO - 10.1109/IJCNN48605.2020.9207105
M3 - Conference contribution
AN - SCOPUS:85093848436
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 International Joint Conference on Neural Networks, IJCNN 2020
Y2 - 19 July 2020 through 24 July 2020
ER -