Post hoc choice of cut points introduced bias to diagnostic research

Ben Ewald

doi:10.1016/j.jclinepi.2005.11.025

J Clin Epidemiol. 2006 Aug;59(8):798-801. doi: 10.1016/j.jclinepi.2005.11.025. Epub 2006 May 26.

Author

Ben Ewald¹

Affiliation

¹ Centre for Clinical Epidemiology, University of Newcastle, Maddison Building, Level 3, NSW, Australia. Ben.Ewald@newcastle.edu.au [corrected]

PMID: 16828672
DOI: 10.1016/j.jclinepi.2005.11.025

Abstract

Background and objective: To examine the extent of bias introduced into diagnostic test validity research by post hoc, data-driven analysis that generates an optimal diagnostic cut point for each data set.

Methods: Analysis of simulated data sets of test results for diseased and nondiseased subjects, comparing data-driven with prespecified cut points across various sample sizes and disease prevalence levels.

Results: In studies of 100 subjects with 50% prevalence, a positive bias of five percentage points in sensitivity or specificity was found in 6 of 20 simulations. In studies of 250 subjects with 10% prevalence, a positive bias of five percentage points was observed in 4 of 20 simulations.

Conclusion: The use of data-driven cut points exaggerates test performance in many simulated data sets, and this bias probably affects many published diagnostic validity studies. Prespecified cut points, when available, would improve the validity of diagnostic test research in studies with fewer than 50 cases of disease.
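
The comparison described in the Methods can be illustrated with a minimal simulation sketch. It is not the author's code. The normal distributions, the prespecified cut point of 0.5, the use of the Youden index (sensitivity + specificity - 1) to pick the data-driven cut point, and all function names are assumptions made for illustration. "Optimism" here means the apparent Youden index in the sample minus its true population value at the same cut point.

```python
import random
from statistics import NormalDist

# Assumed distributions (not from the paper): test values are N(0, 1) in
# nondiseased subjects and N(1, 1) in diseased subjects.
NONDISEASED = NormalDist(0.0, 1.0)
DISEASED = NormalDist(1.0, 1.0)
PRESPECIFIED_CUT = 0.5  # assumed fixed cut point, chosen before seeing data


def youden(diseased, nondiseased, cut):
    """Apparent sensitivity + specificity - 1; 'positive' means value >= cut."""
    sens = sum(x >= cut for x in diseased) / len(diseased)
    spec = sum(x < cut for x in nondiseased) / len(nondiseased)
    return sens + spec - 1.0


def true_youden(cut):
    """Population value of sensitivity + specificity - 1 at a given cut."""
    sens = 1.0 - DISEASED.cdf(cut)
    spec = NONDISEASED.cdf(cut)
    return sens + spec - 1.0


def simulate_once(n, prevalence, rng):
    """Return (optimism at data-driven cut, optimism at prespecified cut)."""
    n_dis = max(1, round(n * prevalence))
    diseased = [rng.gauss(DISEASED.mean, DISEASED.stdev) for _ in range(n_dis)]
    nondiseased = [
        rng.gauss(NONDISEASED.mean, NONDISEASED.stdev) for _ in range(n - n_dis)
    ]
    # Data-driven choice: try every observed value as a cut point, keep the best.
    best_cut = max(
        diseased + nondiseased,
        key=lambda c: youden(diseased, nondiseased, c),
    )
    driven = youden(diseased, nondiseased, best_cut) - true_youden(best_cut)
    fixed = (
        youden(diseased, nondiseased, PRESPECIFIED_CUT)
        - true_youden(PRESPECIFIED_CUT)
    )
    return driven, fixed


def mean_optimism(n, prevalence, runs=200, seed=0):
    """Average both optimism values over repeated simulated studies."""
    rng = random.Random(seed)
    results = [simulate_once(n, prevalence, rng) for _ in range(runs)]
    driven = sum(r[0] for r in results) / runs
    fixed = sum(r[1] for r in results) / runs
    return driven, fixed
```

Calling `mean_optimism(100, 0.5)` or `mean_optimism(250, 0.10)` mirrors the two scenarios reported in the Results. Because the sketch measures optimism in the summed index rather than in sensitivity or specificity separately, its numbers are not directly comparable to the reported five-percentage-point figures.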

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Area Under Curve
Bias*
Diagnostic Tests, Routine / standards*
Epidemiologic Methods*
Humans
Reference Values
Reproducibility of Results
Sensitivity and Specificity*