Research Article

Identifying and Labeling Potentially Risky Driving: A Multistage Process Using Real-World Driving Data

Table 6

Cross-classifying potentially risky driving behaviors.

                        Dataset labeled
Random forest model     April 1 (%)   April 2 (%)   April 4 (%)   April 5 (%)   April 6 (%)   April 7 (%)

April 1, 2013           -             49.1          65.7          47.1          46.6          52.8
April 2, 2013           52.0          -             51.8          69.2          67.6          72.9
April 4, 2013           59.3          49.2          -             47.7          50.0          57.0
April 5, 2013           56.3          73.6          56.4          -             72.6          80.2
April 6, 2013           50.6          69.0          54.4          69.4          -             73.4
April 7, 2013           50.4          65.7          53.7          68.8          66.8          -

Percentages represent the proportion of the originally labeled observations (by the same-day model) that the cross-day model also identified. We note that all cross-classifications labeled a similar proportion of each dataset as potentially risky (∼5–10%).
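
As a rough illustration of the percentage described in the note above, the following sketch computes the share of observations flagged by a same-day model that a cross-day model also flags. It assumes each model's output is available as one boolean flag per observation; the function name `cross_day_agreement` and the sample flags are hypothetical and are not taken from the study.

```python
from typing import Sequence


def cross_day_agreement(same_day: Sequence[bool], cross_day: Sequence[bool]) -> float:
    """Percentage of same-day-flagged observations that the cross-day model also flags.

    Hypothetical helper: both sequences hold one flag per observation of the
    same dataset, in the same order (True = labeled potentially risky).
    """
    if len(same_day) != len(cross_day):
        raise ValueError("Both label sequences must cover the same observations.")

    # Denominator: observations originally labeled by the same-day model.
    originally_flagged = sum(1 for s in same_day if s)
    if originally_flagged == 0:
        raise ValueError("The same-day model flagged no observations.")

    # Numerator: observations flagged by both models.
    flagged_by_both = sum(1 for s, c in zip(same_day, cross_day) if s and c)

    return 100.0 * flagged_by_both / originally_flagged


# Made-up flags for eight observations (illustrative only, not study data).
same_day_flags = [True, True, False, True, False, False, True, False]
cross_day_flags = [True, False, False, True, True, False, False, False]

agreement = cross_day_agreement(same_day_flags, cross_day_flags)
print(f"{agreement:.1f}% of same-day labels were also identified cross-day")
```

Because the denominator counts only the observations flagged by the same-day model, this measure is not symmetric: swapping the two models generally gives a different value, which is consistent with the table not being symmetric about its diagonal.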