Optimization of breast mass classification using sequential forward floating selection (SFFS) and a support vector machine (SVM) model

Int J Comput Assist Radiol Surg. 2014 Nov;9(6):1005-20. doi: 10.1007/s11548-014-0992-1. Epub 2014 Mar 25.

Abstract

Purpose: Improving radiologists' performance in classification between malignant and benign breast lesions is important to increase cancer detection sensitivity and reduce false-positive recalls. For this purpose, developing computer-aided diagnosis schemes has been attracting research interest in recent years. In this study, we investigated a new feature selection method for the task of breast mass classification.

Methods: We initially computed 181 image features based on mass shape, spiculation, contrast, presence of fat or calcifications, texture, isodensity, and other morphological features. From this large image feature pool, we used a sequential forward floating selection (SFFS)-based feature selection method to select relevant features and analyzed their performance using a support vector machine (SVM) model trained for the classification task. On a database of 600 benign and 600 malignant mass regions of interest, we performed the study using a tenfold cross-validation method. Feature selection and optimization of the SVM parameters were conducted on the training subsets only.

Results: The area under the receiver operating characteristic curve [Formula: see text] was obtained for the classification task. The results also showed that the most frequently selected features by the SFFS-based algorithm in tenfold iterations were those related to mass shape, isodensity, and presence of fat, which are consistent with the image features frequently used by radiologists in the clinical environment for mass classification. The study also indicated that accurately computing mass spiculation features from the projection mammograms was difficult, and failed to perform well for the mass classification task due to tissue overlap within the benign mass regions.

Conclusion: In conclusion, this comprehensive feature analysis study provided new and valuable information for optimizing computerized mass classification schemes that may have potential to be useful as a "second reader" in future clinical practice.

Keywords: Breast cancer; Computer-aided diagnosis of mammograms; Feature selection; Pattern classification.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Breast Neoplasms / diagnostic imaging*
  • Diagnosis, Computer-Assisted / methods*
  • Female
  • Humans
  • Mammography / methods*
  • Neoplasm Staging
  • Pattern Recognition, Automated
  • ROC Curve
  • Radiographic Image Interpretation, Computer-Assisted
  • Support Vector Machine*