Survey Paper
Machine learning and radiology

https://doi.org/10.1016/j.media.2012.02.005Get rights and content

Abstract

In this paper, we give a short introduction to machine learning and survey its applications in radiology. We focused on six categories of applications in radiology: medical image segmentation, registration, computer aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval systems for CT or MRI images, and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU). This survey shows that machine learning plays a key role in many radiology applications. Machine learning identifies complex patterns automatically and helps radiologists make intelligent decisions on radiology data such as conventional radiographs, CT, MRI, and PET images and radiology reports. In many applications, the performance of machine learning-based automatic detection and diagnosis systems has shown to be comparable to that of a well-trained and experienced radiologist. Technology development in machine learning and radiology will benefit from each other in the long run. Key contributions and common characteristics of machine learning techniques in radiology are discussed. We also discuss the problem of translating machine learning applications to the radiology clinical setting, including advantages and potential barriers.

Graphical abstract

In this paper, we give a short introduction to machine learning and survey its applications in radiology. We focused on six categories of applications in radiology: medical image segmentation, registration, computer aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval systems for CT or MRI images and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU). This survey shows that machine learning plays a key role in many radiology applications. Machine learning identifies complex patterns automatically and helps radiologists make intelligent decisions on radiology data such as conventional radiographs, CT, MRI, and PET images and radiology reports. In many applications, the performances of the machine learning-based automatic detection and diagnosis systems have shown to be comparable to that of a well-trained and experienced radiologist. Technology development in machine learning and radiology will benefit from each other in the long run. Key contributions and common characteristics of machine learning techniques in radiology are discussed. We also discuss the problem of translating machine learning applications to the radiology clinical setting, including advantages and potential barriers.

  1. Download : Download high-res image (200KB)
  2. Download : Download full-size image

Highlights

► Mainstream machine learning techniques relevant for radiology are introduced. ► Six major applications of machine learning in radiology are surveyed. ► Central themes of machine learning research in radiology are described. ► Factors impacting translation of machine learning to radiology are discussed.

Introduction

Radiologic imaging is of increasing importance in patient care. Both diagnostic and therapeutic indications for radiologic imaging are expanding rapidly (Bhargavan et al., 2009). The rapid expansion is a consequence of the need for more rapid, accurate, cost-effective, and less invasive treatment. Technologic advancements in radiologic imaging equipment have also fueled the utilization of imaging. Such technologic advancements include the capability to acquire higher and higher resolution images, enabling visualization of smaller anatomic structures and abnormalities. The higher resolution comes at the cost of an ever increasing average number of images per patient. Radiologists need to interpret these images and as the number of images increases, radiologists’ workload increases as well. The increasing number and complexity of the images threatens to overwhelm radiologists’ capacities to interpret them. In many real radiologic practices, automated and intelligent image analysis and understanding are becoming an essential part or procedure, such as image segmentation, registration, and computer-aided diagnosis and detection. In addition, in the area of cancer prognosis and treatment, automated and intelligent algorithms have a large market and are welcomed broadly, in areas such as radiation therapy planning or automatic identification of imaging biomarkers from radiological images of certain diseases, etc. Machine learning algorithms underpin the algorithms and software that make computer-aided diagnosis/prognosis/treatment possible.

Radiology is a branch of medical science which uses imaging technology and radiation to make diagnoses and treat disease. It has benefited greatly from the advances of physics, electronic engineering, and computer science. Based on different detection and imaging rationale, various modalities were developed in the past decades in the field of diagnostic radiology. Today, the mainstream modalities which are widely used in hospitals and medical centers include radiography, fluoroscopy, computed tomography (CT), ultrasound, magnetic resonance imaging (MRI), and positron emission tomography (PET).

In the daily practice of radiology, medical images from different modalities are read and interpreted by radiologists. Usually radiologists must analyze and evaluate these images comprehensively in a short time. But with the advances in modern medical technologies, the amount of imaging data is rapidly increasing. For example, CT examinations are being performed with thinner slices than in the past. The reading and interpretation time of radiologists will mount as the number of CT slices grows.

Machine learning provides an effective way to automate the analysis and diagnosis for medical images. It can potentially reduce the burden on radiologists in the practice of radiology. The applications of machine learning in radiology include medical image segmentation (e.g., brain, spine, lung, liver, kidney, colon); medical image registration (e.g., organ image registration from different modalities or time series); computer-aided detection and diagnosis systems for CT or MRI images (e.g., mammography, CT colonography, and CT lung nodule CAD); brain function or activity analysis and neurological disease diagnosis from fMR images; content based image retrieval systems for CT or MRI images; and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU).

Machine learning is the study of computer algorithms which can learn complex relationships or patterns from empirical data and make accurate decisions (Bishop, 2006, Duda et al., 2000, Mitchell, 1997). It is an interdisciplinary field that has close relationships with artificial intelligence, pattern recognition, data mining, statistics, probability theory, optimization, statistical physics, and theoretical computer science. Applications of machine learning include natural language processing, medical diagnosis, bioinformatics, video surveillance, and financial data analysis.

Machine learning algorithms can be organized into different categories based on different principles. For example, depending on the utilization of labels of training samples, they can be categorized into supervised learning, semi-supervised learning, and unsupervised learning algorithms.

In supervised learning, each sample contains two parts: one is input observations or features and the other is output observations or labels (Alpaydin, 2004, Hastie et al., 2009). Usually the input observations are causes and the output observations are effects. The purpose of supervised learning is to deduce a functional relationship from training data that generalizes well to testing data. The form of the relationship is a set of equations and numerical coefficients or weights. Examples of supervised learning include classification, regression, and reinforcement learning.

In unsupervised learning, we only have one set of observations and there is no label information for each sample (Hastie et al., 2009). Usually these observations or features are caused by a set of unobserved or latent variables. The main purpose of unsupervised learning is to discover relationships between samples or reveal the latent variables behind the observations. Examples of unsupervised learning include clustering, density estimation, and blind source separation.

Semi-supervised learning falls between supervised and unsupervised learning (Chapelle et al., 2006, Zhu, 2007). It utilizes both labeled data (usually a few) and unlabeled data (usually many) during the training process. Semi-supervised learning algorithms were developed mainly because the labeling of data is very expensive or impossible in some applications. Examples of semi-supervised learning include semi-supervised classification and information recommendation systems (Christakou et al., 2005).

Machine learning has many applications in real life. It is routinely used in banking (for detecting fraudulent transactions (Dorronsoro et al., 1997)), in finance (to predict stock prices (Huang et al., 2005a)), in marketing (to reveal patterns of consumer spending (Bose and Mahapatra, 2001)), and on the Internet (as part of search engines (Basili, 2003)). In biomedicine, MYCIN was proposed in the early 1970s at Stanford University. It is an expert system with about 600 rules designed to identify bacteria and recommend antibiotics (Swartout, 1985). Machine learning also showed capability in the field of drug design (Burbidge et al., 2001).You may not be aware of the existence of machine learning, but its applications are pervasive in our daily lives.

This review is structured as follows. In Section 2 we give a short introduction to machine learning and related algorithms. In Section 3 we describe six representative applications of machine learning in radiology. In Section 4 we discuss key contributions and common characteristics of machine learning techniques in radiology. In Section 5 we cover issues on translating machine learning techniques to clinical radiology practice. In Section 6 we review current research status and discuss future directions.

Section snippets

Overview of machine learning

Because of the rapid development of machine learning, it is hard to introduce every aspect of machine learning in one article. So in this section we will give a concise introduction to the most important topics of machine learning (Bishop, 2006). These topics include linear models, learning with kernels, probabilistic models, clustering analysis and dimensionality reduction. Through this introduction, we hope readers may have a general idea about the content of machine learning research, what

Applications of machine learning in radiology

In this section, we will introduce some typical applications of machine learning in radiology.

Key contributions and common characteristics of machine learning techniques in radiology

In the previous section, we showed that machine learning has many applications in radiology. These applications vary from each other and may have very different forms regarding problems to be solved, input data, output data, anatomical constraints, prior knowledge, and hidden variables. Also machine learning methods utilized to solve these problems seem complicated and many of them have strict assumptions. At first glance, it seems difficult for a researcher in radiology to decide whether to

The translation from machine learning to clinical practice

In Section 3 we illustrated six representative domains in radiology where machine learning could contribute. In this section, we will discuss advantages of utilizing machine learning in radiology and potential barriers which could hinder the deployment of machine learning in clinical practice.

The major advantages of applying machine learning in radiology will be labor saving and accurate diagnostic results. Many radiology practices are very time consuming for human labor. For example, in

Discussion and conclusion

In this paper, we have presented a short introduction to machine learning and surveyed its applications in radiology. We focused on six applications in radiology: image segmentation, image registration, computer-aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval, and text analysis of radiology reports using NLP/NLU. This survey shows that machine learning plays a key role in many radiology

Acknowledgments

We thank Andrew Dwyer, MD, for critical review of the manuscript. This manuscript was support by the Intramural Research Program of the National Institutes of Health Clinical Center.

References (235)

  • H.-P. Chan et al.

    Computer-aided diagnosis of lung cancer and pulmonary embolism in computed tomography—a review

    Academic Radiology

    (2008)
  • W.W. Chapman et al.

    Creating a text classifier to detect radiology reports describing mediastinal findings associated with inhalational anthrax and other disorders

    Journal of the American Medical Informatics Association

    (2003)
  • H.D. Cheng et al.

    Computer-aided detection and classification of microcalcifications in mammograms: a survey

    Pattern Recognition

    (2003)
  • H.L. Chui et al.

    A new point matching algorithm for non-rigid registration

    Computer Vision and Image Understanding

    (2003)
  • P. Comon

    Independent component analysis, a new concept

    Signal Process

    (1994)
  • T.F. Cootes et al.

    Active shape models – their training and application

    Computer Vision and Image Understanding

    (1995)
  • C. Davatzikos et al.

    Classifying spatial patterns of brain activity with machine learning methods: application to lie detection

    Neuroimage

    (2005)
  • C. Davatzikos et al.

    Detection of prodromal Alzheimer’s disease via pattern classification of magnetic resonance imaging

    Neurobiology of Aging

    (2008)
  • O. Demirci et al.

    A projection pursuit algorithm to classify individuals using fMRI data: application to schizophrenia

    Neuroimage

    (2008)
  • T.G. Dietterich et al.

    Solving the multiple instance problem with axis-parallel rectangles

    Artificial Intelligence

    (1997)
  • K. Doi

    Computer-aided diagnosis in medical imaging: historical review, current status and future potential

    Computerized Medical Imaging and Graphics

    (2007)
  • K. Duan et al.

    Evaluation of simple performance measures for tuning SVM hyperparameters

    Neurocomputing

    (2003)
  • Y. Fan et al.

    Classification of structural images via high-dimensional image warping, robust feature extraction, and SVM

    Medical Image Computing and Computer-Assisted Intervention – MICCAI 2005

    (2005)
  • T. Heimann et al.

    Statistical shape models for 3D medical image segmentation: a review

    Medical Image Analysis

    (2009)
  • C. Hinrichs et al.

    Spatially augmented LPboosting for AD classification with evaluations on the ADNI dataset

    Neuroimage

    (2009)
  • Y.S. Abu-Mostafa

    The Vapnik–Chervonenkis dimension: information versus complexity in learning

    Neural Computation

    (1989)
  • A.M. Ali et al.

    Automatic lung segmentation of volumetric low-dose CT scans using graph cuts

    Advances in Visual Computing

    (2008)
  • E. Alpaydin

    Introduction to Machine Learning

    (2004)
  • J.A. Baker et al.

    Breast-cancer – prediction with artificial neural-network-based on bi-rads standardized Lexicon

    Radiology

    (1995)
  • S. Bakken et al.

    A comparison of semantic categories of the ISO reference terminology models for nursing and the MedLEE natural language processing system

    Medinfo

    (2004)
  • A. Barto et al.

    Reinforcement Learning: An Introduction

    (1999)
  • R. Basili

    Learning to classify text using support vector machines: methods, theory, and algorithms

    Computational Linguistics

    (2003)
  • M. Bates

    Models of natural-language understanding

    Proceedings of the National Academy of Sciences of the United States of America

    (1995)
  • M. Belkin et al.

    Laplacian eigenmaps for dimensionality reduction and data representation

    Neural Computation

    (2003)
  • Y. Bengio

    Gradient-based optimization of hyperparameters

    Neural Computation

    (2000)
  • M. Bhargavan et al.

    Workload of radiologists in United States in 2006–2007 and trends since 1991–1992

    Radiology

    (2009)
  • J. Bi et al.

    An Improved Multi-task Learning Approach with Applications in Medical Diagnosis, Machine Learning and Knowledge Discovery in Databases

    (2008)
  • W. Bialek et al.

    Predictability, complexity, and learning

    Neural Computation

    (2001)
  • J. Bilmes

    A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models

    (1998)
  • C.M. Bishop

    Pattern Recognition and Machine Learning

    (2006)
  • S.Y. Bookheimer et al.

    Patterns of brain activation in people at risk for Alzheimer’s disease

    New England Journal of Medicine

    (2000)
  • F.L. Bookstein

    Thin-plate splines and the atlas problem for biomedical images

    Lecture Notes in Computer Science

    (1991)
  • Y. Boykov et al.

    Interactive organ segmentation using graph cuts

    Medical Image Computing and Computer-Assisted Intervention – MICCAI 2000

    (2000)
  • L. Breiman

    Bagging predictors

    Machine Learning

    (1996)
  • Brodley, C., Kak, A., Shyu, C., Dy, J., Broderick, L., Aisen, A., 1999. Content-based retrieval from medical image...
  • A.E. Bryson et al.

    Applied Optimal Control: Optimization, Estimation, and Control

    (1969)
  • C.J.C. Burges

    A tutorial on support vector machines for pattern recognition

    Data Mining and Knowledge Discovery

    (1998)
  • R. Caruana

    Multitask learning

    Machine Learning

    (1997)
  • H.P. Chan et al.

    Computer-aided detection of mammographic microcalcifications – pattern-recognition with an artificial neural-network

    Medical Physics

    (1995)
  • H.P. Chan et al.

    Computer-aided classification of mammographic masses and normal tissue – linear discriminant-analysis in texture feature space

    Physics in Medicine and Biology

    (1995)
  • Cited by (500)

    View all citing articles on Scopus
    View full text