Labels | Weighted Precision | Weighted Recall (Accuracy) | Weighted F1 Score |
---|---|---|---|
Criterion standard labels | 0.908 | 0.905 | 0.905 |
Original protocol assignments | 0.872 | 0.841 | 0.850 |

a Precision, recall, and F1 score were calculated for each class, and the weighted average of each metric was then computed using class frequencies as weights.
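
The weighting described in the footnote can be illustrated with a minimal Python sketch. The function name `weighted_metrics` and the toy labels are hypothetical and are not taken from the study; the sketch assumes that "class frequency" means the number of true instances of each class, which is the same convention as scikit-learn's `average="weighted"` option. The source does not state which software was used.

```python
from collections import Counter


def weighted_metrics(y_true, y_pred):
    """Return (precision, recall, f1), each averaged over classes
    using the class frequency in y_true as the weight."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    support = Counter(y_true)    # number of true instances per class
    predicted = Counter(y_pred)  # number of predictions per class
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    n = len(y_true)

    precision = recall = f1 = 0.0
    for label, count in support.items():
        # Per-class metrics; a class that is never predicted gets precision 0.
        p = correct[label] / predicted[label] if predicted[label] else 0.0
        r = correct[label] / count
        f = 2 * p * r / (p + r) if (p + r) else 0.0

        # Weight each class by its share of the true labels.
        weight = count / n
        precision += weight * p
        recall += weight * r
        f1 += weight * f
    return precision, recall, f1


# Toy data with hypothetical class names (not the study's protocols).
y_true = ["A", "A", "A", "B", "B", "C"]
y_pred = ["A", "A", "B", "B", "B", "A"]
precision, recall, f1 = weighted_metrics(y_true, y_pred)
```

With these weights, weighted recall reduces to the fraction of correctly labeled cases, which is why the table header equates it with accuracy. In the toy data, 4 of 6 predictions are correct and the weighted recall is 4/6.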