Survey PaperMachine learning and radiology
Graphical abstract
In this paper, we give a short introduction to machine learning and survey its applications in radiology. We focused on six categories of applications in radiology: medical image segmentation, registration, computer aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval systems for CT or MRI images and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU). This survey shows that machine learning plays a key role in many radiology applications. Machine learning identifies complex patterns automatically and helps radiologists make intelligent decisions on radiology data such as conventional radiographs, CT, MRI, and PET images and radiology reports. In many applications, the performances of the machine learning-based automatic detection and diagnosis systems have shown to be comparable to that of a well-trained and experienced radiologist. Technology development in machine learning and radiology will benefit from each other in the long run. Key contributions and common characteristics of machine learning techniques in radiology are discussed. We also discuss the problem of translating machine learning applications to the radiology clinical setting, including advantages and potential barriers.
Highlights
► Mainstream machine learning techniques relevant for radiology are introduced. ► Six major applications of machine learning in radiology are surveyed. ► Central themes of machine learning research in radiology are described. ► Factors impacting translation of machine learning to radiology are discussed.
Introduction
Radiologic imaging is of increasing importance in patient care. Both diagnostic and therapeutic indications for radiologic imaging are expanding rapidly (Bhargavan et al., 2009). The rapid expansion is a consequence of the need for more rapid, accurate, cost-effective, and less invasive treatment. Technologic advancements in radiologic imaging equipment have also fueled the utilization of imaging. Such technologic advancements include the capability to acquire higher and higher resolution images, enabling visualization of smaller anatomic structures and abnormalities. The higher resolution comes at the cost of an ever increasing average number of images per patient. Radiologists need to interpret these images and as the number of images increases, radiologists’ workload increases as well. The increasing number and complexity of the images threatens to overwhelm radiologists’ capacities to interpret them. In many real radiologic practices, automated and intelligent image analysis and understanding are becoming an essential part or procedure, such as image segmentation, registration, and computer-aided diagnosis and detection. In addition, in the area of cancer prognosis and treatment, automated and intelligent algorithms have a large market and are welcomed broadly, in areas such as radiation therapy planning or automatic identification of imaging biomarkers from radiological images of certain diseases, etc. Machine learning algorithms underpin the algorithms and software that make computer-aided diagnosis/prognosis/treatment possible.
Radiology is a branch of medical science which uses imaging technology and radiation to make diagnoses and treat disease. It has benefited greatly from the advances of physics, electronic engineering, and computer science. Based on different detection and imaging rationale, various modalities were developed in the past decades in the field of diagnostic radiology. Today, the mainstream modalities which are widely used in hospitals and medical centers include radiography, fluoroscopy, computed tomography (CT), ultrasound, magnetic resonance imaging (MRI), and positron emission tomography (PET).
In the daily practice of radiology, medical images from different modalities are read and interpreted by radiologists. Usually radiologists must analyze and evaluate these images comprehensively in a short time. But with the advances in modern medical technologies, the amount of imaging data is rapidly increasing. For example, CT examinations are being performed with thinner slices than in the past. The reading and interpretation time of radiologists will mount as the number of CT slices grows.
Machine learning provides an effective way to automate the analysis and diagnosis for medical images. It can potentially reduce the burden on radiologists in the practice of radiology. The applications of machine learning in radiology include medical image segmentation (e.g., brain, spine, lung, liver, kidney, colon); medical image registration (e.g., organ image registration from different modalities or time series); computer-aided detection and diagnosis systems for CT or MRI images (e.g., mammography, CT colonography, and CT lung nodule CAD); brain function or activity analysis and neurological disease diagnosis from fMR images; content based image retrieval systems for CT or MRI images; and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU).
Machine learning is the study of computer algorithms which can learn complex relationships or patterns from empirical data and make accurate decisions (Bishop, 2006, Duda et al., 2000, Mitchell, 1997). It is an interdisciplinary field that has close relationships with artificial intelligence, pattern recognition, data mining, statistics, probability theory, optimization, statistical physics, and theoretical computer science. Applications of machine learning include natural language processing, medical diagnosis, bioinformatics, video surveillance, and financial data analysis.
Machine learning algorithms can be organized into different categories based on different principles. For example, depending on the utilization of labels of training samples, they can be categorized into supervised learning, semi-supervised learning, and unsupervised learning algorithms.
In supervised learning, each sample contains two parts: one is input observations or features and the other is output observations or labels (Alpaydin, 2004, Hastie et al., 2009). Usually the input observations are causes and the output observations are effects. The purpose of supervised learning is to deduce a functional relationship from training data that generalizes well to testing data. The form of the relationship is a set of equations and numerical coefficients or weights. Examples of supervised learning include classification, regression, and reinforcement learning.
In unsupervised learning, we only have one set of observations and there is no label information for each sample (Hastie et al., 2009). Usually these observations or features are caused by a set of unobserved or latent variables. The main purpose of unsupervised learning is to discover relationships between samples or reveal the latent variables behind the observations. Examples of unsupervised learning include clustering, density estimation, and blind source separation.
Semi-supervised learning falls between supervised and unsupervised learning (Chapelle et al., 2006, Zhu, 2007). It utilizes both labeled data (usually a few) and unlabeled data (usually many) during the training process. Semi-supervised learning algorithms were developed mainly because the labeling of data is very expensive or impossible in some applications. Examples of semi-supervised learning include semi-supervised classification and information recommendation systems (Christakou et al., 2005).
Machine learning has many applications in real life. It is routinely used in banking (for detecting fraudulent transactions (Dorronsoro et al., 1997)), in finance (to predict stock prices (Huang et al., 2005a)), in marketing (to reveal patterns of consumer spending (Bose and Mahapatra, 2001)), and on the Internet (as part of search engines (Basili, 2003)). In biomedicine, MYCIN was proposed in the early 1970s at Stanford University. It is an expert system with about 600 rules designed to identify bacteria and recommend antibiotics (Swartout, 1985). Machine learning also showed capability in the field of drug design (Burbidge et al., 2001).You may not be aware of the existence of machine learning, but its applications are pervasive in our daily lives.
This review is structured as follows. In Section 2 we give a short introduction to machine learning and related algorithms. In Section 3 we describe six representative applications of machine learning in radiology. In Section 4 we discuss key contributions and common characteristics of machine learning techniques in radiology. In Section 5 we cover issues on translating machine learning techniques to clinical radiology practice. In Section 6 we review current research status and discuss future directions.
Section snippets
Overview of machine learning
Because of the rapid development of machine learning, it is hard to introduce every aspect of machine learning in one article. So in this section we will give a concise introduction to the most important topics of machine learning (Bishop, 2006). These topics include linear models, learning with kernels, probabilistic models, clustering analysis and dimensionality reduction. Through this introduction, we hope readers may have a general idea about the content of machine learning research, what
Applications of machine learning in radiology
In this section, we will introduce some typical applications of machine learning in radiology.
Key contributions and common characteristics of machine learning techniques in radiology
In the previous section, we showed that machine learning has many applications in radiology. These applications vary from each other and may have very different forms regarding problems to be solved, input data, output data, anatomical constraints, prior knowledge, and hidden variables. Also machine learning methods utilized to solve these problems seem complicated and many of them have strict assumptions. At first glance, it seems difficult for a researcher in radiology to decide whether to
The translation from machine learning to clinical practice
In Section 3 we illustrated six representative domains in radiology where machine learning could contribute. In this section, we will discuss advantages of utilizing machine learning in radiology and potential barriers which could hinder the deployment of machine learning in clinical practice.
The major advantages of applying machine learning in radiology will be labor saving and accurate diagnostic results. Many radiology practices are very time consuming for human labor. For example, in
Discussion and conclusion
In this paper, we have presented a short introduction to machine learning and surveyed its applications in radiology. We focused on six applications in radiology: image segmentation, image registration, computer-aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval, and text analysis of radiology reports using NLP/NLU. This survey shows that machine learning plays a key role in many radiology
Acknowledgments
We thank Andrew Dwyer, MD, for critical review of the manuscript. This manuscript was support by the Intramural Research Program of the National Institutes of Health Clinical Center.
References (235)
- et al.
A learning algorithm for Boltzmann machines
Cognitive Science
(1985) Information geometry of the EM and EM algorithms for neural networks
Neural Networks
(1995)A fast diffeomorphic image registration algorithm
Neuroimage
(2007)- et al.
Incorporating prior knowledge into image registration
Neuroimage
(1997) - et al.
Symmetric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain
Medical Image Analysis
(2008) - et al.
Segmentation of brain 3D MR images using level sets and dense registration
Medical Image Analysis
(2001) - et al.
Business data mining – a machine learning perspective
Information & Management
(2001) - et al.
Bayesian and non-Bayesian probabilistic models for medical image analysis
Image and Vision Computing
(2003) - et al.
Drug design by machine learning: support vector machines for pharmaceutical data analysis
Computers & Chemistry
(2001) - et al.
Shaving diffusion tensor images in discriminant analysis: a study into schizophrenia
Medical Image Analysis
(2006)
Computer-aided diagnosis of lung cancer and pulmonary embolism in computed tomography—a review
Academic Radiology
Creating a text classifier to detect radiology reports describing mediastinal findings associated with inhalational anthrax and other disorders
Journal of the American Medical Informatics Association
Computer-aided detection and classification of microcalcifications in mammograms: a survey
Pattern Recognition
A new point matching algorithm for non-rigid registration
Computer Vision and Image Understanding
Independent component analysis, a new concept
Signal Process
Active shape models – their training and application
Computer Vision and Image Understanding
Classifying spatial patterns of brain activity with machine learning methods: application to lie detection
Neuroimage
Detection of prodromal Alzheimer’s disease via pattern classification of magnetic resonance imaging
Neurobiology of Aging
A projection pursuit algorithm to classify individuals using fMRI data: application to schizophrenia
Neuroimage
Solving the multiple instance problem with axis-parallel rectangles
Artificial Intelligence
Computer-aided diagnosis in medical imaging: historical review, current status and future potential
Computerized Medical Imaging and Graphics
Evaluation of simple performance measures for tuning SVM hyperparameters
Neurocomputing
Classification of structural images via high-dimensional image warping, robust feature extraction, and SVM
Medical Image Computing and Computer-Assisted Intervention – MICCAI 2005
Statistical shape models for 3D medical image segmentation: a review
Medical Image Analysis
Spatially augmented LPboosting for AD classification with evaluations on the ADNI dataset
Neuroimage
The Vapnik–Chervonenkis dimension: information versus complexity in learning
Neural Computation
Automatic lung segmentation of volumetric low-dose CT scans using graph cuts
Advances in Visual Computing
Introduction to Machine Learning
Breast-cancer – prediction with artificial neural-network-based on bi-rads standardized Lexicon
Radiology
A comparison of semantic categories of the ISO reference terminology models for nursing and the MedLEE natural language processing system
Medinfo
Reinforcement Learning: An Introduction
Learning to classify text using support vector machines: methods, theory, and algorithms
Computational Linguistics
Models of natural-language understanding
Proceedings of the National Academy of Sciences of the United States of America
Laplacian eigenmaps for dimensionality reduction and data representation
Neural Computation
Gradient-based optimization of hyperparameters
Neural Computation
Workload of radiologists in United States in 2006–2007 and trends since 1991–1992
Radiology
An Improved Multi-task Learning Approach with Applications in Medical Diagnosis, Machine Learning and Knowledge Discovery in Databases
Predictability, complexity, and learning
Neural Computation
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models
Pattern Recognition and Machine Learning
Patterns of brain activation in people at risk for Alzheimer’s disease
New England Journal of Medicine
Thin-plate splines and the atlas problem for biomedical images
Lecture Notes in Computer Science
Interactive organ segmentation using graph cuts
Medical Image Computing and Computer-Assisted Intervention – MICCAI 2000
Bagging predictors
Machine Learning
Applied Optimal Control: Optimization, Estimation, and Control
A tutorial on support vector machines for pattern recognition
Data Mining and Knowledge Discovery
Multitask learning
Machine Learning
Computer-aided detection of mammographic microcalcifications – pattern-recognition with an artificial neural-network
Medical Physics
Computer-aided classification of mammographic masses and normal tissue – linear discriminant-analysis in texture feature space
Physics in Medicine and Biology
Cited by (500)
Application of artificial intelligence radiomics in the diagnosis, treatment, and prognosis of hepatocellular carcinoma
2024, Computers in Biology and MedicineLongitudinal MRI analysis using a hybrid DenseNet-BiLSTM method for Alzheimer's disease prediction
2024, Behavioural Brain ResearchAutomated image label extraction from radiology reports — A review
2024, Artificial Intelligence in MedicineMachine learning-based evaluation technology of 3D spatial distribution of residual radioactivity in large-scale radioactive structures
2024, Nuclear Engineering and TechnologyDo it the transformer way: A comprehensive review of brain and vision transformers for autism spectrum disorder diagnosis and classification
2023, Computers in Biology and Medicine