Real-time crash prediction in an urban expressway using disaggregated data

The authors develop accident prediction models for a stretch of the urban expressway Autopista Central in Santiago, Chile, using disaggregate data captured by free-flow toll gates with Automatic Vehicle Identification (AVI) which, besides their low failure rate, have the advantage of providing disaggregated data per type of vehicle. The process includes a random forest procedure to identify the strongest precursors of accidents, and the calibration/estimation of two classification models, namely, Support Vector Machine and Logistic regression. The authors find that, for this stretch of the highway, vehicle composition does not play a first-order role. The authors' best model accurately predicts 67.89% of the accidents with a low false positive rate of 20.94%. These results are among the best in the literature even though, and as opposed to previous efforts, (i) the authors do not use only one partition of the data set for calibration and validation but conduct 300 repetitions of randomly selected partitions; (ii) the authors' models are validated on the original unbalanced data set (where accidents are quite rare events), rather than on artificially balanced data.


  • English

Media Info

Subject/Index Terms

Filing Info

  • Accession Number: 01660997
  • Record Type: Publication
  • Files: TRIS
  • Created Date: Feb 5 2018 11:30AM