Elsevier

Neural Networks

Volume 19, Issue 4, May 2006, Pages 408-415
Neural Networks

The use of artificial neural networks in decision support in cancer: A systematic review

https://doi.org/10.1016/j.neunet.2005.10.007Get rights and content

Abstract

Artificial neural networks have featured in a wide range of medical journals, often with promising results. This paper reports on a systematic review that was conducted to assess the benefit of artificial neural networks (ANNs) as decision making tools in the field of cancer. The number of clinical trials (CTs) and randomised controlled trials (RCTs) involving the use of ANNs in diagnosis and prognosis increased from 1 to 38 in the last decade. However, out of 396 studies involving the use of ANNs in cancer, only 27 were either CTs or RCTs. Out of these trials, 21 showed an increase in benefit to healthcare provision and 6 did not. None of these studies however showed a decrease in benefit. This paper reviews the clinical fields where neural network methods figure most prominently, the main algorithms featured, methodologies for model selection and the need for rigorous evaluation of results.

Introduction

In the last decade, the use of artificial intelligence (AI) has become widely accepted in medical applications. This is manifested by an increasing number of medical devices currently available on the market with embedded AI algorithms, together with an accelerating pace of publication in medical journals, with over 500 academic publications each year featuring Artificial Neural Networks (ANNs) (Gant, Rodway, & Wyatt 2001). Claimed advantages of neural network methods include:

  • Ease of optimisation, resulting in cost-effective and flexible non-linear modelling of large data sets.

  • Accuracy for predictive inference, with potential to support clinical decision making.

  • These models can make knowledge dissemination easier by providing explanation, for instance, using rule extraction or sensitivity analysis (Lisboa, 2002).

The published literature suggests that ANN models have been shown to be valuable tools in reducing the workload on the clinicians by detecting artefact and providing decision support, potentially with the ability to automatically re-estimate the model on-line. However, there are relatively few published clinical trials, and even fewer testing the clinical value of ANNs against established linear-in-the-parameters statistical methods (Lisboa, 2002).

There are two recurring concerns on ANNs. The first is the use of first principle statistical methods to control model complexity, which has been addressed by regularisation methods and with the use of cross-validation (Biganzoli et al., 1998, Lisboa et al., 2003, Ripley, 1996, Ripley and Ripley, 2001). The second key issue is transparency, i.e. explaining what influences the network predictions and how to resolve outcome predictions in terms of readily understood clinical statements. This is partly addressed by rule-extraction algorithms.

Notwithstanding these concerns, an interesting feature of neural network decision support in medicine is the routine clinical use of a range of systems, from the commercial-C.Net (Nabney, Evans, Tenner, & Gamlyn, 2001) and BioSleep (Tarassenko, McGrogan, & Braithwaite, 2002)—to research prototypes (Lisboa et al., 2000, Taktak et al., 2004) without listing in PubMed of supportive clinical trials. The situation is not specific to neural networks, but extends particularly to web-based decision support tools such as www.adjuvantonline.com, marking a departure from algorithms for clinical routine assessments, e.g. the Glasgow Coma Score for severity of illness in critical care and Nottingham Prognostic Index for breast cancer, both of which have undergone rigorous multi-centre clinical trials evidenced in the literature, if not altogether without controversy.

The use of unstructured approaches to clinical evaluation of new medical research is a trend, which has proved hard to change. Already in 1994 a paper entitled ‘the scandal of poor medical research’ (Altman, 1994) highlighted the need to proper study design bordering on the unethical typically through the application of such bad scientific methodology as to be sometimes called ‘torturing the data’ until they confess to the desired result (Mills, 1993).

Therefore, it is important to define and keep to a staged framework to design a sequence of studies each with a clear-cut purpose, ranging from the exploratory to the definitive, where the chief aim of each step in this chain is to support the next developmental step until a power calculation is possible which will determine the sample size, along with clinical protocol and study design for a multi-centre randomised clinical trial. Such a framework has been published (Campbell et al., 2000) and adapted for the development of intelligent decision support in an earlier review (Lisboa, 2002). This review will note the current trends in the studies that reach journals in the medical or medically related science literature, highlight points of good and poor practice, and draw conclusions for study design to improve the likelihood of studies being appropriately followed-up in the future.

Section snippets

Literature search

A systematic literature search was conducted using Pubmed for entries during the period 1994–2003 with the keywords ‘neural networks’. The search was limited to clinical trials and randomised controlled trials (RCTs). Results of the search are summarised in Fig. 1. The search was repeated using the keywords (neural networks) and (cancer) from 1994 to the current date. There were 396 hits in total with only 27 either CTs or RCTs and the abstracts of the resulting hits were analysed. The

Review of papers related to cancer listed in Pubmed

The majority of clinical trial studies benchmarked the ANNs performance against traditional screening methods. In prostate cancer, this involves the use of prostate specific antigen (PSA) serum marker, digital rectal examination, Gleason sum, age and race (Gamito et al., 2000, Remzi et al., 2003, Stephan et al., 2003, Tewari et al., 2001). Some studies have compared ANNs with statistical methods (Chan et al., 2003, Finne et al., 2000, Matsui et al., 2002, Remzi et al., 2003). Remzi demonstrated

Implications for study design

It is well documented that hundreds of papers are published in the medical literature, at a vast mean cost per published paper, yet few results find their way into improving healthcare practices in routine clinical use. There are reasons for this, partly the unavoidable result that not all interesting new methods turn out to fulfil their early promise. However, more often than not it is methodological shortcomings that mortally damage the future worth of the paper. Some of the reasons for this

Ethical and legal issues

A final consideration with particular implications for the evaluation of biomedical decision support systems concerns the legal and ethical foundation to judge whether the ‘duty of care’ has been breached. The principles involved hark back to the ‘Bolam test’ which refers to the skill of an ordinary competent practitioner. This test offers considerable latitude in the exercise of clinical discretion, a leniency founded on confidence in the doctor's training (Gant et al., 2001). Similar

Conclusions

A review of PubMed listed publications involving clinical trials of neural network systems identified trends in areas of clinical promise, specifically in the diagnosis, prognosis and therapeutic guidance for cancer, but also the need for more extensive application of rigorous methodologies. This has implications for study design, to address some of the more common pitfalls of empirical models for medical diagnosis, particularly those relying on generic non-linear function approximations, which

Acknowledgements

This work is supported by the BIOPATTERN EU Network of Excellence. EU Contract 508803.

References (57)

  • D.G. Altman

    Editorial: The scandal of poor medical research

    British Medical Journal

    (1994)
  • D.G. Altman et al.

    What do we mean by validating a prognostic model?

    Statistics in Medicine

    (2000)
  • E. Biganzoli et al.

    Feed forward neural networks for the analysis of censored survival data: A partial logistic regression approach

    Statistics in Medicine

    (1998)
  • C.M. Bishop

    Neural network for pattern recognition

    (1995)
  • M.E. Boon et al.

    Using artificial neural networks to screen cervical smears: How new technology enhances health care

  • Bryce, T.J., Dewhirst, M. W., Floyd, C.E., Jr., Hars, V., & Brizel, D.M., 1998, Artificial neural network model of...
  • M. Campbell et al.

    Framework for design and evaluation of complex interventions to improve health

    British Medical Journal

    (2000)
  • I. Chan et al.

    Detection of prostate cancer by integration of line-scan diffusion, T2-mapping and T2-weighted magnetic resonance imaging; a multichannel statistical classifier

    Medical Physics

    (2003)
  • Coppini, G., Diciotti, S., Falchini, M., Villari, N., & Valli, G., 2003, Neural networks for computer-aided diagnosis:...
  • J.E. Dayhoff et al.

    Artificial neural networks: Opening the black box

    Cancer

    (2001)
  • H. Doornewaard et al.

    The diagnostic value of computer-assisted primary cervical smear screening: A longitudinal cohort study

    Modern Pathology

    (1999)
  • R. Dua et al.

    Detection of basal cell carcinoma using electrical impedance and neural networks

    IEEE Transactions on Biomedical Engineering

    (2004)
  • FDA. (1998). Guidance for FDA reviewers and industry: Guidance for the content of premarket submissions for software...
  • Fujikawa, K., Matsui, Y., Kobayashi, T., Miura, K., Oka, H., Fukuzawa, S., et al., 2003, Predicting disease outcome of...
  • E.J. Gamito et al.

    Use of artificial neural networks in the clinical staging of prostate cancer: Implications for prostate brachytherapy

    Techniques in Urology

    (2000)
  • V. Gant et al.

    Artificial neural networks: Practical considerations for clinical applications

  • A. Gerger et al.

    Automated epiluminescence microscopy—Tissue counter analysis using CART and 1-NN in the diagnosis of melanoma

    Skin Research and Technology

    (2003)
  • M. Gletsos et al.

    A computer-aided diagnostic system to characterize CT focal liver lesions: Design and optimization of a neural network classifier

    IEEE Transaction on Information Technology in Biomedicine

    (2003)
  • Cited by (384)

    • Recent advancements in deep learning based lung cancer detection: A systematic review

      2022, Engineering Applications of Artificial Intelligence
    View all citing articles on Scopus
    View full text