PT - JOURNAL ARTICLE AU - G.I. Cassinelli Petersen AU - J. Shatalov AU - T. Verma AU - W.R. Brim AU - H. Subramanian AU - A. Brackett AU - R.C. Bahar AU - S. Merkaj AU - T. Zeevi AU - L.H. Staib AU - J. Cui AU - A. Omuro AU - R.A. Bronen AU - A. Malhotra AU - M.S. Aboian TI - Machine Learning in Differentiating Gliomas from Primary CNS Lymphomas: A Systematic Review, Reporting Quality, and Risk of Bias Assessment AID - 10.3174/ajnr.A7473 DP - 2022 Apr 01 TA - American Journal of Neuroradiology PG - 526--533 VI - 43 IP - 4 4099 - http://www.ajnr.org/content/43/4/526.short 4100 - http://www.ajnr.org/content/43/4/526.full SO - Am. J. Neuroradiol.2022 Apr 01; 43 AB - BACKGROUND: Differentiating gliomas and primary CNS lymphoma represents a diagnostic challenge with important therapeutic ramifications. Biopsy is the preferred method of diagnosis, while MR imaging in conjunction with machine learning has shown promising results in differentiating these tumors.PURPOSE: Our aim was to evaluate the quality of reporting and risk of bias, assess data bases with which the machine learning classification algorithms were developed, the algorithms themselves, and their performance.DATA SOURCES: Ovid EMBASE, Ovid MEDLINE, Cochrane Central Register of Controlled Trials, and the Web of Science Core Collection were searched according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines.STUDY SELECTION: From 11,727 studies, 23 peer-reviewed studies used machine learning to differentiate primary CNS lymphoma from gliomas in 2276 patients.DATA ANALYSIS: Characteristics of data sets and machine learning algorithms were extracted. A meta-analysis on a subset of studies was performed. Reporting quality and risk of bias were assessed using the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) and Prediction Model Study Risk Of Bias Assessment Tool.DATA SYNTHESIS: The highest area under the receiver operating characteristic curve (0.961) and accuracy (91.2%) in external validation were achieved by logistic regression and support vector machines models using conventional radiomic features. Meta-analysis of machine learning classifiers using these features yielded a mean area under the receiver operating characteristic curve of 0.944 (95% CI, 0.898–0.99). The median TRIPOD score was 51.7%. The risk of bias was high for 16 studies.LIMITATIONS: Exclusion of abstracts decreased the sensitivity in evaluating all published studies. Meta-analysis had high heterogeneity.CONCLUSIONS: Machine learning–based methods of differentiating primary CNS lymphoma from gliomas have shown great potential, but most studies lack large, balanced data sets and external validation. Assessment of the studies identified multiple deficiencies in reporting quality and risk of bias. These factors reduce the generalizability and reproducibility of the findings.AIartificial intelligenceAUCarea under the receiver operating characteristic curveCNNconvolutional neural networkMLmachine learningPCNSLprimary CNS lymphomaPRISMAPreferred Reporting Items for Systematic Reviews and Meta-AnalysesPROBASTPrediction model study Risk Of Bias Assessment ToolTRIPODTransparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis