Introduction

The International Federation of Gynecology and Obstetrics (FIGO) system is the most widely used for staging uterine cervical and endometrial cancer [1]. However, the FIGO staging system does not take into account lymph node (LN) metastasis (so-called N staging), despite its known adverse impact on survival in gynecological cancer [2, 3]. Although LN resection before radiotherapy results in improved survival in patients with macroscopically enlarged pelvic and para-aortic LNs, routine pre-surgical staging is not generally recommended, even though a lack of assessment of LN involvement may lead to suboptimal treatment [49].

Indeed, the gold standard for diagnosing LN metastases is currently surgical assessment [10], but this is a highly specialized, time-consuming, costly and invasive procedure that increases the patient’s risk of immediate and delayed complications. Hence, a non-invasive but accurate pre-surgical method of diagnosing LN metastases would be desirable, helping to prevent unnecessary lymphadenectomy and optimizing surgical interventions. In this context, computed tomography (CT) and magnetic resonance imaging (MRI) are widely used to assess LNs in patients with malignant tumors, including uterine cancer. As the state of the art progresses, various functional imaging techniques, including diffusion-weighted MRI (MRI-DWI), and combined positron emission tomography and CT (PET/CT) using 18F-fluoro-deoxy-glucose (FDG), have been proposed as a method of improving diagnostic performance.

To date, there have been three reports on apparent diffusion coefficient (ADC) obtained with DWI for the detection of LN metastasis [1113], and several on PET/CT [1420] for uterine cancer. PET/MRI image fusion is increasingly being used in clinical settings, although both clinical evaluation and technical optimization are still developing processes. Nevertheless, initial experience with this new imaging device proves promising for oncological applications [21]. Generally, there are two major image fusion techniques available, namely hardware based and retrospective software based. Hardware-based image fusion is performed by means of hybrid scanners, which enable the real-time acquisition and fusion of two different imaging modalities within a single device. Retrospective software-based image fusion, on the other hand, relies on dedicated software to fuse two separate imaging datasets, most often from CT or MRI and single-photon emission tomography (SPECT) or PET. This technique, called “image registration”, is used to align both sets of data so that each voxel corresponds to the same anatomical landmarks in both images [22].

As yet, however, there have been few studies [23, 24] on the ability of fused PET/MRI imaging to identify metastatic LNs in patients with uterine endometrial and cervical cancer. The purpose of this study was, therefore, to assess the accuracy of retrospective image fusion PET/MRI-DWI to PET/CT in detecting metastatic LNs in patients with newly diagnosed cervical and endometrial carcinoma, comparing PET/MRI-DWI findings with those of both PET/CT and MRI-DWI alone.

Materials and methods

Patients

This retrospective study involved 27 untreated female patients with either endometrial (n = 14) or cervical cancer (n = 13) (demographic and clinical data are reported in Tables 1, 2, 3) from October 2011 to December 2013. Exclusion criteria were general contraindications for MRI (such as cardiac pacemaker and claustrophobia). Each patient underwent MRI-DWI and PET/CT before undergoing subtotal hysterectomy (n = 1), total abdominal hysterectomy with (n = 8) or without (n = 18) bilateral salpingo-oophorectomy, and pelvic lymphadenectomy, with (n = 15) or without (n = 12) para-aortic lymphadenectomy, for histopathologically proven uterine cancer. Each pelvic LN was analyzed by a histopathologist. None of the patients had either contraindications to the surgical procedure or clinical evidence of distant metastases.

Table 1 Demographic and clinical characteristics of the study population
Table 2 Frequency and percentage of tumor histotype of the study population
Table 3 Frequency and percentage of FIGO classification of the study population

Prior to the surgery, four of the patients with cervical cancer underwent an abdominal CT scan in the emergency room. Three had bilateral (one case with severe renal failure) and one unilateral ureter infiltration by the tumor. The time interval between MRI scan and surgical treatment was 5–29 days (mean: 17 days). The time interval between PET/CT scan and surgical treatment was 3–33 days (mean: 18 days). The time interval between MRI scan and PET/CT scan was 0–33 days (mean: 10 days).

Subsequently, 11 patients received chemotherapy alone and 2 chemo-radiotherapy. Women were followed up according to the protocol of our institution. Of the treated patients, 19 achieved complete remission and 8 had recurrent disease (Table 1). Informed consent was obtained from all participants. As the study was retrospective, approval of the local ethics committee was not sought.

PET/CT imaging

Whole body PET/CT images were obtained using a PET/CT scanner Biograph 16 HI-REZ (Siemens, Hoffman Estates, IL). After an 8-h fast, patients received an intravenous injection of 250 MBq FDG, and were encouraged to rest. PET/CT scanning from the middle of the skull to the upper thigh was performed 60 min after the injection. Specifically, a low-dose spiral CT scan was performed using the following parameters: 120 kV, 33 mA, 1.5 s rotation time, 24 mm collimation and 48 mm table feed per rotation with arms raised, followed by PET image acquisition (2–4 min per bed position three-dimensional acquisition mode, according to the body max index). CT images were reconstructed onto a 512 × 512 matrix and converted into 511 kiloelectron volt (keV)-equivalent attenuation factors for attenuation correction. PET images were reconstructed onto a 128 × 128 matrix using the attenuation-weighted Fourier rebinning (FORE-OSEM) iterative reconstruction, with two iterations and eight subsets, and a post-reconstruction Gaussian volume filter with FWHM = 4 mm. PET, PET/CT, and CT images were analyzed using a dedicated Leonardo workstation (Siemens Medical Solutions). The processed images were displayed in coronal, transverse, and sagittal planes.

MR imaging

All patients were scanned using a 1.5-Tesla MRI scanner (Achieva Intera 1.5 T, Philips Medical Solutions, The Netherlands). Pelvic MR images were acquired using a body-sense four-channel phased array pelvic coil for signal reception, with a parallel factor of 1.2, as follows: axial T2-weighted turbo-spin-echo SPAIR images (TR/TE: 4760/100 ms, slice thickness/intersection gap: 3/1 mm, echo train length: 11, matrix: 560 × 560, FOV: 432 mm × 432 mm). Axial DWI was performed during free breathing using a Stejskal–Tanner spin-echo echo-planar imaging sequence and a body synergy four-channel coil with parallel factor of zero at the following parameters: TR/TE: 8160/73 ms; flip angle: 90°; NEX: 6; readout bandwidth: 2369.3 Hz/pixel; matrix: 384 × 384; FOV: 370 mm × 370 mm; and slice thickness/gap: 5/0 mm, b values: 0 and 800 s/mm2.

Image analysis

All PET and MRI series were retrospectively and manually fused using dedicated image-fusion software, available on the Leonardo multimodality workstation (Siemens Medical Solutions). MR and PET/CT images were stored in a shared database, and the MR images (T2 and DWI) were co-registered to CT images of PET using a semi-automatic voxel-based algorithm. After image registration, the co-registered images were reconstructed and visualized in the axial plane. Alignment was assessed by checking the body outline and the position of motionless metabolically active organs (bone and spine) in all three planes (axial, coronal, and sagittal). When accurate image fusion was not feasible, PET-MRI fusion was evaluated by assessing PET/MRI-DWI-fused images side-by-side with PET and MRI images. PET/CT images were interpreted by a nuclear physician (MS, 17 years experience) and MR images by an abdominal radiologist (AS, 15 years experience). Both readers were informed of patient history, but blinded to the presence or absence of LN metastases. On DWI images, LNs were classified as cancer positive in the presence of focally abnormal signal intensity higher than the signal intensity of the spinal cord in a location corresponding to the LN chains on T2-weighted images. As ADC value differentiation of LNs in uterine cancer is reportedly controversial [11, 12], LNs in our series were classified on the basis of visual DWI criteria, irrespective of either ADC value or size. On PET/CT and fused PET/MR images, LNs were classed as cancer positive in the presence of either focally appreciable metabolic activity above that of normal muscle, or asymmetric metabolic activity greater than that of normal-appearing LNs at the same level in the contralateral pelvis in a location corresponding to the LN chains on the CT or MR images, respectively [18]. In contrast, LNs with a central unenhanced area suggestive of a fatty hilum were considered benign. As in a previous study [14], LNs were grouped according to anatomical landmarks into eight regions: right common iliac, left common iliac, right external iliac, left external iliac, right internal iliac, left internal iliac, right obturator fossa, and left obturator fossa.

PET/CT and fused PET/MRI-DWI images were interpreted visually using a five-point scoring system as follows: 0 = negative findings, 1 = insignificant yet visible lesion, 2 = equivocal, needs further follow-up, 3 = probably metastasis, 4 = significant metastasis [25, 26]. We created a score sheet (Excel, Microsoft, US) for each patient to enable evaluation of LN region involvement by PET/CT and MRI exams, grids of N parameters, imaging techniques (MRI-DWI, PET/CT, PET/MRI), and histopathological data. Scores were given per-patient and per-node basis. Images were examined directly on a computer workstation screen.

Statistical analysis

Sensitivity, specificity, accuracy, and positive and negative predictive values were calculated for each diagnostic method on a per-patient basis and on a per-node basis, adopting histopathological and follow-up imaging (CT, MRI, and PET/CT) results as the gold standard. A receiver operating characteristic (ROC) curve was used to evaluate the diagnostic value of PET/CT, MRI-DWI and PET/MRI-DWI. A P value <0.05 was regarded as statistically significant. MedCalc software (MedCalc Software, Belgium) was used to perform all statistical analyses.

Results

LN metastases were identified by histopathology in 8 of 27 patients (29 %), specifically in 37 of the 216 LN regions (17 %) examined.

Findings on a per-patient basis

DWI was true positive for nodal metastases in 7/8 (87 %) of patients with LN metastases, and true negative for 13/19 (68 %) of patients without node metastases. Similarly, PET/CT and PET/MRI-DWI were true positive for nodal metastases in 7/8 (87 %) of patients with node metastasis, but true negative in 16/19 (84 %) of patients without node metastasis (Table 4). DWI alone showed 87.5 % sensitivity, 68.4 % specificity, 74 % diagnostic accuracy, 53.8 % positive predictive value (PPV), and 92.8 % negative predictive value (NPV). PET-CT and fused PET/MRI-DWI showed 87.5 % sensitivity, 84.2 % specificity, 85.1 % diagnostic accuracy, 70 % PPV, and 94.1 % NPV (Table 4).

Table 4 Parameters of diagnostic performance including FP, FN, TP, TN, sensitivity, specificity. Accuracy, PPV, and NPV of DWI, PET/CT and PET/MRI-DWI on a per-patient basis

The area under the ROC curve (AUC) of PET/CT was 0.859 (confidence interval; 0.710–1.000), while MRI-DWI and PET-MRI AUCs were, respectively, 0.780 (0.617–0.942) and 0.859 (0.710–1.000) (Fig. 1).

Fig. 1
figure 1

Graphs of ROC analysis “on a per patient basis”. Green line, PET/CT images; blue line, DWI-MRI images; orange line, fused PET/MRI-DWI images. The area under the ROC curve (AUC) of PET/CT was 0.859 (0.710–1.000), while DWI-MRI and PET-MRI AUCs were 0.780 (0.617–0.942), and 0.859 (0.710–1.000)

Findings on a per-node basis

The LN regions bearing metastases were the common iliac (n = 8), external iliac (n = 8), internal iliac (n = 8), and the obturator fossa (n = 13). DWI was true positive for 32 of the 37 (86 %) metastatic node groups, and true negative for 119 of the 179 (66 %) non-metastatic node groups. PET/CT was true positive for 26 of the 37 (70 %) metastatic node groups, and true negative for 162 of the 179 (90 %) non-metastatic node groups. PET/MRI-DWI fared better, being true positive for 33 of the 37 (89 %) metastatic node groups and true negative for 164 of the 179 (91 %) non-metastatic node groups.

Across node groups, DWI alone displayed 86.4 % sensitivity, 66.4 % specificity, 69.9 % diagnostic accuracy, 34.7 % PPV, and 95.9 % NPV, respectively.

The sensitivity, specificity, accuracy, PPV and NPV of PET-CT vs PET/MRI-DWI for the detection of pelvic metastatic LNs were 70.2 vs 89.1 %, 87 vs 91.6 %, 87 vs 91.2 %, 60.4 vs 68.7 %, and 93.6 vs 97.6 %, respectively (Table 5).

Table 5 Parameters of diagnostic performance including FP, FN, TP, TN, sensitivity, specificity. Accuracy, PPV and NPV of DWI, PET/CT, and PET/MRI-DWI on a per-node basis

The AUC of PET/CT was 0.933 (0.891–0.962), while MRI-DWI and PET/MRI AUCs were 0.929 (0.886–0.960) and 0.963 (0.928–0.984), respectively (Fig. 2). Comparison of the areas under the ROC curves for the detection of metastatic LNs revealed, however, that the difference between PET/CT and fused PET/MRI-DWI was not significant (p = 0.055).

Fig. 2
figure 2

Graphs of ROC analysis “on a per node basis”. Green line, PET/CT images; blue line, DWI-MRI images; orange line fused PET/MRI-DWI images. The AUC of PET/CT was 0.933 (0.891–0.962), while DWI-MRI and PET-MRI AUCs were 0.929 (0.886–0.960) and 0.963 (0.928–0.984), respectively

Discussion

A non-invasive technique that can accurately identify LN metastasis in malignant tumors would be beneficial for optimizing treatment management. The identification of metastatic LNs by both CT and MRI is based on measurements of node size, and the most widely accepted criterion for diagnosis of nodal involvement, a short-axis diameter greater than 8–10 mm, has a sensitivity rate for the detection of LN metastasis in endometrial cancer between 27 and 66 %, and a corresponding specificity rate between 73 and 99 % [2732]. In uterine cervical cancer, LN metastasis detection by these methods is between 30 and 73 % sensitive and between 44 and 93 % specific [14, 31, 3337]. However, PET/CT shows low sensitivity and high specificity for detecting metastatic LNs in patients with uterine cervical and endometrial cancer, whereas DWI shows high sensitivity and low specificity [38].

DWI is an MRI technique that depicts molecular diffusion, which is the Brownian motion of water protons in biological tissues. It has been used in oncological imaging for the depiction and characterization of tumors, as well as for differentiating benign from malignant lesions in various kinds of tumors, including uterine cancer [13, 39]. The extent of water diffusion can be related to microstructure, microcirculation, cell organization and density, thereby enabling DWI to provide information about the biophysical properties of tissues in vivo. Assuming that malignant tumors generally have higher cellularity than benign lesions, DWI might theoretically help in differentiating malignant from benign lesions. However, DWI not only visualizes pathological areas in malignant lesions but also benign pathologies with restricted diffusion [39].

Kitajima et al. [38] reported that DWI shows higher sensitivity and lower specificity than PET/CT in detecting LN metastasis in patients with uterine cancer (endometrial cancer and cervical cancer). However, the quantitative ADC value used is controversial for differentiating malignant from benign LNs in uterine cancer [1113], suggesting that ADC analysis is not acceptable for the preoperative evaluation of LN metastasis in patients with uterine cancer. This explains why we performed no ADC analyses in our study.

Studies documenting the accuracy of FDG-PET/CT for detecting LN metastasis in uterine cancer [1420] have reported that PET/CT tends to show low sensitivity and high specificity; this observation is confirmed in our series, and may be explained by the tendency of this technique to underestimate standardized uptake values in tiny LNs due to the partial volume effect. This makes the usual cutoff (2.5–3.0) for differentiating malignant from benign LNs unreliable, and many studies have failed to perform semi-quantitative analysis to determine a standardized value for FDG uptake in nodal lesions [1517]. FDG-PET/CT is also unable to detect microscopic metastasis, which is unsurprising as PET has a mean spatial resolution value of 0.5 cm (range 0.4–0.6 cm), making small lymph nodal metastases almost undetectable [1417].

To our knowledge, this is the first reported study to have investigated the validity of retrospectively fused DWI/T2-MRI to PET/CT images from different scanners for nodal staging of endometrial and uterine cervical cancer. All authors have previously focused on the evaluation of conventional MRI (without or with contrast) sequences fused to PET-CT images. For example, Kim et al. [23] showed that the sensitivity and specificity of PET/CT and fused MRI were 44.1 and 93.9 %, and 54.2 and 92.7 %, respectively. The ROC analysis demonstrated a higher diagnostic performance of fused PET/MRI compared to PET/CT alone in the detection of LN metastases (p = 0.02). Similarly, Kitajima et al. [24] proved that on a patient basis, the sensitivity, specificity and accuracy for detecting pelvic nodal metastasis were 100, 96.3, and 96.7 % for both fused PET/MRI and PET/contrast-enhanced CT, and 66.7, 100, and 96.7 % for MRI, respectively. The differences between these three parameters were not statistically significant (p = 1).

Likewise, we found no statistically significant differences between either PET/CT and MRI-DWI or PET/CT and PET/MRI-DWI on a per-patient basis. We did, however, find that PET/MRI-DWI is as accurate as the most specific and sensitive of the underlying two modalities. Nevertheless, in our sample there were three false-positives cases with DWI, but true negatives with PET/CT and PET/MRI-DWI. Therefore, DWI shows a lower specificity than PET/CT (68.4 vs 84.2 %). Nonetheless, as shown in Table 4, when the two modalities are fused, there is no difference with PET/CT alone (specificity of PET/DWI-MRI, 84.2 %). However, on a per-node basis, our sensitivity, specificity, PPV and NPV values for fused PET/MRI-DWI were greater than those of PET/CT. This is due to the high sensitivity of DWI (86.4 %) (Fig. 3) adding to the high specificity of PET/CT (90.5 %) (Figs. 4, 5).

Fig. 3
figure 3

32-year-old patient with cervical carcinoma. In the T2-weighted MR sequence a, b there is a slightly hyperintense pathologic tissue (circled in red) in the uterine cervix and bilateral obturator enlarged lymph nodes (circled in green). DWI c and the co-registered PET/DWI images d show the same findings

Fig. 4
figure 4

78-year-old patient with endometrioid carcinoma of uterus. In the THRIVE sequence a there is a contrast-enhanced pathological tissue at the uterine corpus-fundus (circled in red). DWI b shows a focal area of restriction of diffusivity in the left external iliac location (white arrow), scored as possibly metastasis (score 3). The co-registered PET/DWI images c do not show any significant uptake in that location (score 0). Surgery confirmed that it was a false positive

Fig. 5
figure 5

46-year-old patient with cervical cancer. The T2-weighted MR image a shows no pelvic lymph node enlargement. Axial DWI b shows a focal area of restriction of diffusivity in the right obturator fossa (red arrow), scored as equivocal (score 2). PET c and the co-registered PET/MRI-DWI images d do not show any significant uptake in that location (white arrows). Surgery confirmed that it was a false positive of DWI (bowel loops)

Staging practices vary widely, depending on the individual physician and/or institution. Nevertheless, since the establishment of surgical staging as the standard initial step in the management of most patients with endometrial and cervical cancer, pelvic and para-aortic lymphadenectomy as part of surgical staging has become more common, due to reports showing diagnostic and therapeutic advantages [4042]. Despite these recommendations, lymph node dissection as a part of endometrial and cervical cancer staging, especially in “low-risk” patients is still controversial. Indeed, nodal resection often does not confer a survival benefit in these patients [4347], and routine pelvic lymph node dissection may itself expose the patient to risks such as intraoperative vascular injury, as well as, serious complications such as lymphedema and lymphocyst in the long term. Although these risks decrease in the hands of an experienced surgeon [48, 49], the risk of lymphedema and lymphocysts appears to be related to the number of LNs removed [49, 50]. It follows, therefore, that a group of low-risk patients exists who may not need routine lymph node dissection.

In light of our findings, PET/MRI-DWI may prove to be an accurate and non-invasive tool for the pre-planning of surgical procedure and guidance of minimally invasive node resection in low-risk patients. Moreover, PET/MRI-DWI evaluation could be considered in a preoperative prediction model (comprising other parameters such as serum cancers markers, MRI to assess invasion, histological grade, and clinical stage) to determine which patients can avoid lymph node dissection, thereby obviating the need for a pathologist to be present during intraoperative assessment.

However, this study has several limitations. Specifically, our case series was relatively small and a larger number of patients will need to be studied to more accurately evaluate the role of PET/MRI-DWI in LN metastasis detection. Furthermore, in our fused sequences, considering pelvic MR images rather than whole body MRI, we could only assess pelvic, but not para-aortic LNs, whose involvement has nevertheless been assigned an important prognostic value [14, 15]. Moreover, scores were subjective and given by only two readers on the basis of visual interpretation of the images.

Conclusions

Although we found no statistically significant differences between PET/MRI-DWI and PET/CT either on a per-patient or per-node basis, indicating that they have similar diagnostic accuracy in N staging in uterine cancer, PET/MRI-DWI did show higher sensitivity, specificity, PPV and NPV than FDG-PET/CT on a node-by-node basis. Judging by our results, in the absence of other non-invasive N-staging methods for these types of cancer, the potential of imaging fusion techniques deserve further investigation.