Table 2:

Diagnostic performance of machine-learning classification in training and validation datasets

Tumor Type (Pathologic Diagnosis)Diagnostic Performance
SCCIP
Model prediction for training dataset
    SCC162Accuracy90.9%a
Sensitivity94.1%
    IP114Specificity87.5%
PPV88.9%
    Total1716NPV93.3%
Model prediction for validation dataset
    SCC61Accuracy84.6%a
Sensitivity85.7%
    IP15Specificity83.3%
PPV85.7%
    Total76NPV83.3%
Model prediction for entire cohort
    SCC223Accuracy89.1%
Sensitivity91.7%
    IP219Specificity86.4%
PPV88.0%
    Total2422NPV90.5%
  • Note:—NPV indicates negative predictive value; PPV, positive predictive value.

  • a With a 2-tailed test of population proportion, the accuracies for the training and validation datasets were not significantly different (P = .537).