Group | Threshold | Dice | Precision^{b} | Sensitivity | Correlation |
---|---|---|---|---|---|
I-A | MLV < 1 cm^{3} (n = 22) | 31.0 (0–50.0) | 29.6 (3.4–54.9) | 57.5 (0–90.0) | ρ = 0.09, P = .68 |
I-B | MLV ≥ 1 cm^{3} (n = 129) | 83.5^{c} (71.2–89.3) | 84.9^{c} (70.3–92.9) | 87.6^{d} (75.8–92.9) | ρ = 0.90, P < .001 |
II-A | MLV < 21 cm^{3} (n = 100) | 71.2 (45.8–84.8) | 71.6 (38.7–84.9) | 81.3 (59.8–92.5) | ρ = 0.79, P < .001 |
II-B | MLV ≥ 21 cm^{3} (n = 51) | 89.4^{c} (85.4–92.5) | 92.3^{c} (85.6–96.1) | 89.3^{e} (83.0–92.2) | ρ = 0.97, P < .001 |
III-A | MLV < 31 cm^{3} (n = 113) | 73.6 (48.0–85.8) | 77.2 (46.3–85.7) | 82.5 (62.8–92.1) | ρ = 0.83, P < .001 |
III-B | MLV ≥ 31 cm^{3} (n = 38) | 90.6^{c} (87.3–93.2) | 94.7^{c} (88.4–96.8) | 89.4^{e} (82.8–93.6) | ρ = 0.96, P < .001 |
IV-A | MLV < 51 cm^{3} (n = 124) | 75.0 (48.9–86.8) | 78.1 (49.2–86.5) | 83.3 (65.2–92.5) | ρ = 0.87, P < .001 |
IV-B | MLV ≥ 51 cm^{3} (n = 27) | 91.5^{c} (89.1–93.6) | 95.9^{c} (92.2–97.5) | 89.2 (83.5–92.2) | ρ = 0.92, P < .001 |
V-A | MLV < 70 cm^{3} (n = 131) | 77.2 (51.5–87.0) | 79.9 (54.2–87.0) | 84.0 (67.8–92.6) | ρ = 0.88, P < .001 |
V-B | MLV ≥ 70 cm^{3} (n = 20) | 91.8^{c} (89.4–93.9) | 96.0 (93.0–96.9) | 89.6 (85.0–92.0) | ρ = 0.83, P < .001 |

^{a} Performance metrics are reported as median (IQR), in percent. Results of E3 applied to the Evaluation Cohort are shown as a function of different volume thresholds.

^{b} Excludes 2 subjects in group A with automatically segmented lesion volumes of zero because precision is undefined in this circumstance.

^{c} *P* < .001. ^{d} *P* < .01. ^{e} *P* < .05 group A versus group B, where Group A is the group meeting the threshold criteria and Group B is the group not meeting the threshold criteria.
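
The exclusion noted for precision can be illustrated with a minimal sketch. It assumes the standard voxel-wise definitions of Dice, precision, and sensitivity, which the table itself does not spell out. The function name `overlap_metrics` and the flat 0/1 mask representation are hypothetical and are not taken from the study's pipeline. When the automatic mask is empty, the precision denominator (true positives plus false positives) is zero, so the sketch returns `None` rather than a number.

```python
from typing import Dict, Optional, Sequence


def overlap_metrics(manual: Sequence[int], auto: Sequence[int]) -> Dict[str, Optional[float]]:
    """Voxel-wise overlap metrics, in percent, for two flattened binary masks.

    Assumed (standard) definitions, not confirmed by the source:
      Dice        = 2*TP / (2*TP + FP + FN)
      precision   = TP / (TP + FP)
      sensitivity = TP / (TP + FN)
    A metric whose denominator is zero is returned as None (undefined).
    """
    if len(manual) != len(auto):
        raise ValueError("masks must have the same number of voxels")

    # Count agreement and disagreement voxel by voxel.
    tp = sum(1 for m, a in zip(manual, auto) if m and a)
    fp = sum(1 for m, a in zip(manual, auto) if not m and a)
    fn = sum(1 for m, a in zip(manual, auto) if m and not a)

    def pct(num: int, den: int) -> Optional[float]:
        # Undefined when the denominator is zero.
        return 100.0 * num / den if den else None

    return {
        "dice": pct(2 * tp, 2 * tp + fp + fn),
        "precision": pct(tp, tp + fp),
        "sensitivity": pct(tp, tp + fn),
    }


# Partial overlap: 3 true positives, 1 false positive, 1 false negative.
partial = overlap_metrics([1, 1, 1, 1, 0, 0], [1, 1, 1, 0, 1, 0])

# Empty automatic segmentation: precision has a zero denominator.
empty = overlap_metrics([1, 1, 0], [0, 0, 0])
```

In the empty-segmentation case, Dice and sensitivity are still defined and equal 0, while precision is not. This is consistent with the lower IQR bound of 0 for Dice and sensitivity in group I-A and with precision alone carrying the exclusion footnote.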