Regional distribution of measurement error in diffusion tensor imaging

https://doi.org/10.1016/j.pscychresns.2006.01.008Get rights and content

Abstract

The characterization of measurement error is critical in assessing the significance of diffusion tensor imaging (DTI) findings in longitudinal and cohort studies of psychiatric disorders. We studied 20 healthy volunteers, each one scanned twice (average interval between scans of 51 ± 46.8 days) with a single shot echo planar DTI technique. Intersession variability for fractional anisotropy (FA) and Trace (D) was represented as absolute variation (standard deviation within subjects: SDw), percent coefficient of variation (CV) and intra-class correlation coefficient (ICC). The values from the two sessions were compared for statistical significance with repeated measures analysis of variance or a non-parametric equivalent of a paired t-test. The results showed good reproducibility for both FA and Trace (CVs below 10% and ICCs at or above 0.70 in most regions of interest) and evidence of systematic global changes in Trace between scans. The regional distribution of reproducibility described here has implications for the interpretation of regional findings and for rigorous pre-processing. The regional distribution of reproducibility measures was different for SDw, CV and ICC. Each one of these measures reveals complementary information that needs to be taken into consideration when performing statistical operations on groups of DT images.

Introduction

Diffusion tensor imaging (DTI) has been evolving rapidly and gaining popularity in psychiatric research. There has been a rapid increase in publications, particularly in the field of schizophrenia where eight original papers were published between 1998 and 2002 and 23 between 2003 and May 2005.

Despite this, the measurement error of this technique has not been fully characterized. The information regarding measurement error is critical in assessing the significance of DTI findings in longitudinal studies and in comparing patient groups. This is particularly true in the case of psychiatric disorders where differences from controls may be quite subtle.

Pfefferbaum et al. (2003) were the first to address reproducibility of FA and Trace images in detail. They studied normal controls three times with a minimum time interval between scans of 1 day. They reported coefficients of variation (CVs) between 1.23% and 2.35% for FA and of 0.84–3.73% for Trace. Their analysis was based either on large collections of voxels (such as all the voxels in the white matter of the supratentorium) or on a single large region of interest (ROI) placed over the entire corpus callosum on a midline slab of tissue with thickness of 5 mm. Thus, the estimates of reproducibility and measurement error derived from this study are likely to be quite liberal as compared with common approaches to DTI data analysis which would include either smaller ROIs or voxel-by-voxel approaches with programs such as statistical parametric mapping (SPM). Moreover, this study did not provide any information on the regional distribution of measurement error, which is important since reproducibility may not be equal across the whole image due to the complex statistical properties of the calculated DTI measures and many other factors. For example, recently claims were made regarding differences in anisotropy between patients with schizophrenia and normal controls in small ROIs of gray matter in the entorhinal cortex (Kalus et al., 2005) and the hippocampus (Kalus et al., 2004). How are we to judge the strength of these findings without knowing if the reproducibility of gray matter ROIs is similar to that found in the corpus callosum?

Also Kubicki et al. (2004) studied the reproducibility of single shot planar imaging (EPI) with ROIs applied to four scans (acquired in separate sessions, with unspecified time interval between sessions) of the same subject. The CV varied between 1.4% and 12% for various regions of white matter and reached 25% for a gray matter ROI. However, the EPI acquisition employed was highly susceptible to artifacts due to the use of a fairly long echo time and the lack of corrections for eddy current distortions. Moreover, the scans were acquired with gaps in between slices, and no effort to register scans acquired on different sessions was described. Hence, the estimates of measurement error presented in this study are likely to be conservative when compared with what would be obtained with more state-of-the-art acquisition and processing schemes.

The gap in knowledge left by these two studies provided the rationale for the current study, where we tested two methods commonly employed in statistical analysis of DT images (ROIs and SPM-based techniques) and described the regional variation in FA and Trace (D) (henceforth referred to as Trace) associated with each strategy.

Section snippets

Methods

We studied 20 healthy volunteers (ages 21–36, mean 26 ± 4.4 SD, three females), each one scanned twice, with an average interval between scans of 51 ± 46.8 days. All scans were performed with the head immobilized by a vacuum cushion. All subjects gave written informed consent according to procedures approved by the NIMH institutional review board.

DTI sessions were conducted on a 1.5 T GE Signa magnet (Waukesha, WI) and consisted of an axial single shot echo planar imaging (EPI) sequence with six

Results

Typical images of FA and Trace obtained during this study are shown in Fig. 2. SNR, SDw, CV and ICC for mean ROI values for FA and Trace are shown in Table 1. A notable regional variation in reproducibility was seen. For FA, the SDw was highest in the Peduncles, PLIC and CSO, and lowest in Put, GCC, FrG, Thal and Ins with values in the Peduncles being twice those of the Put. The CV was highest in the CblCtx and the FrG and lowest in the GCC and the SCC. Nine out of 14 ROIs had CVs below 10%.

Discussion

Characterizing regional variation in measurement error for DTI is important to the interpretation of the results of group comparisons and longitudinal studies. In this article, we present three different ways of looking at measurement error as they apply to ROI measurements and to voxel-by-voxel analysis. We used different measures of reproducibility because they reveal different and complementary information about the regional distribution of measurement error.

Acknowledgments

Some of the material in this article was presented at the 42nd Annual Meeting of the American College of Neuropsychopharmacology (ACNP) in San Juan, Puerto Rico, in 2003 and at the International Society of Magnetic Resonance in Medicine (ISMRM) “Workshop on Methods for Quantitative Diffusion MRI of Human Brain” in Lake Louise, Alberta, Canada, in 2005.

Gustavo K. Rohde is currently a National Research Council Research Associate at the Naval Research Laboratory, Washington, DC.

We thank Andreas

References (18)

There are more references available in the full text version of this article.

Cited by (67)

  • Scan-rescan repeatability of axonal imaging metrics using high-gradient diffusion MRI and statistical implications for study design

    2021, NeuroImage
    Citation Excerpt :

    Our findings are in line with previous smaller-scale studies of scan-rescan repeatability of axon diameter estimation using ActiveAx in the human and Vervet monkey brains (Alexander et al., 2010; Dyrby et al., 2013), in which broad trends in axon diameter index across the corpus callosum were shown to be quite reproducible, although individual voxel-wise estimates were shown to have greater variability. Of note, many previous studies have assessed the reproducibility of FA measures in the healthy population (Besseling et al., 2012; Boekel et al., 2017; Grech-Sollars et al., 2015; Koller et al., 2021; Lin et al., 2015; Marenco et al., 2006), with reported ICC scores that fall within a range of ~0.7 to 0.98. Our data also yielded ICC scores in the same range for FA, allowing us to compare our results to the existing literature and supporting the consistency of our data and analyses with prior studies.

  • Reproducibility, reliability and variability of FA and MD in the older healthy population: A test-retest multiparametric analysis

    2020, NeuroImage: Clinical
    Citation Excerpt :

    Interestingly, only the CVws for MD in overall grey matter is slightly higher (CVws = 4.42%) than previously reported values for this index. When looking at selected regions of interest, the reproducibility values for both FA and MD are consistently better (between 1% and 2.8%) than those reported by a previous study (Marenco et al., 2006) for FA (2.5–10.16%) and MD (2.49–6.20%) with data collected with 8 averages of 6 diffusion directions at 1100 s/mm2 b-values on a 1.5 Tesla system. In our study, the GCC appeared the least reproducible of all the regions for both FA and MD (FA CVws = 1.8%, MD CVws = 2.8%), showing a much higher CVws than the SCC (FA CVws = 1.0%, MD CVws = 2.5%) – the same effect observed by Veenith et al., al. (2013).

View all citing articles on Scopus
1

Currently at the Naval Research Laboratory, Washington, DC 20375.

View full text