Original investigationRecent Developments in the Dorfman-Berbaum-Metz Procedure for Multireader ROC Study Analysis1
Section snippets
Original DBM Method
The DBM method is typically used with the test × reader × case factorial study design where each case (ie, patient) undergoes each of several diagnostic tests and the resulting images are interpreted once by each reader. Throughout this paper, we assume that the data have been collected using this factorial design. The competing modalities can be compared using the DBM method; in particular, the null hypothesis of no test effect can be tested and confidence intervals for test differences can be
Simulation Study
In a simulation study we examined the performance of the three DBM approaches—original DBM, new model simplification, and new model simplification plus ddfH—with respect to the empirical type I error rate for testing the null hypothesis of no test effect. The simulation model of Roe and Metz (2) provided continuous decision-variable outcomes generated from a conventional binormal model that treats both cases and readers as random. We used this simulation model to create discrete rating data by
Discussion
We have summarized recently proposed solutions for the various drawbacks of the original DBM method and examined the performance of these solutions in a simulation study. The solutions include using normalized pseudovalues that allow DBM results to be based on either the original or the jackknife accuracy estimates; using less data-based model reduction and ddfH to make DBM less conservative with a type I error rate much closer to the nominal level; and showing that the DBM model can be viewed
Acknowledgments
The authors thank Carolyn Van Dyke, MD, for sharing her data set.
References (24)
- et al.
Dorfman-Berbaum-Metz method for statistical analysis of multireader, multimodality receiver operating characteristic data: Validation with computer simulation
Acad Radiol
(1997) - et al.
Monte Carlo validation of a multireader method for receiver operating characteristic discrete rating data: Factorial experimental design
Acad Radiol
(1998) God, like the devil, is in the details
Acad Radiol
(2006)- et al.
Power estimation for the Dorfman-Berbaum-Metz method
Acad Radiol
(2004) - et al.
Maximum likelihood estimation of parameters of signal-detection theory and determination of confidence intervals: Rating method data
J Math Psychol
(1969) - et al.
Receiver operating characteristic rating analysis: Generalization to the population of readers and patients with the jackknife method
Investig Radiol
(1992) Approximate tests of correlation in time series
J R Stat Soc Ser B
(1949)Notes on bias in estimation
Biometrika
(1956)Bias and confidence in not quite large samples
Ann Math Stat
(1958)- et al.
A comparison of the Dorfman-Berbaum-Metz and Obuchowski-Rockette Methods for receiver operating characteristic (ROC) data
Stat Med
(2005)
Monte Carlo validation of the Dorfman-Berbaum-Metz method using normalized pseudovalues and less data-based model simplification
Acad Radiol
A comparison of denominator degrees of freedom methods for multiple observer ROC analysis
Stat Med
- 1
This research was supported by Grant R01EB000863 from the National Institutes of Health, Bethesda, MD. The views expressed in this article are those of the authors and do not necessarily represent the views of the Department of Veterans Affairs.