Abstract
BACKGROUND AND PURPOSE: Preterm infants are at risk for overt and silent CNS injury, with developmental consequences that are difficult to predict. The novel Specific Test of Early Infant Motor Performance, administered in preterm infants at term age, is indicative of later developmental gross motor and cognitive scores at 12 months. Here, we assessed whether functional performance on this early assessment correlates with CNS integrity via MR spectroscopy or diffusional kurtosis imaging and whether these quantitative neuroimaging methods improve predictions for future 12-month developmental scores.
MATERIALS AND METHODS: MR spectroscopy and quantitative diffusion MR imaging data were acquired in preterm infants (n = 16) at term. Testing was performed at term and 3 months using the Specific Test of Early Infant Motor Performance and the Bayley Scales of Infant and Toddler Development, Third Edition, at 12 months. We modeled the relationship of MR spectroscopy and diffusion MR imaging data with both test scores via multiple linear regression.
RESULTS: MR spectroscopy NAA ratios at a TE of 270 ms in the frontal WM and basal ganglia and kurtosis metrics in major WM tracts correlated strongly with total Specific Test of Early Infant Motor Performance scores. The addition of MR spectroscopy and diffusion separately improved the functional predictions of 12-month outcomes.
CONCLUSIONS: Microstructural integrity of the major WM tracts and metabolism in the basal ganglia and frontal WM strongly correlate with early developmental performance, suggesting that the Specific Test of Early Infant Motor Performance reflects CNS integrity after preterm birth. This study demonstrates that combining quantitative neuroimaging and early functional movement improves the prediction of 12-month outcomes in premature infants.
ABBREVIATIONS:
- AIC
- Akaike Information Criterion
- adj-R2
- Adjusted R-squared
- Bayley-III
- Bayley Scales of Infant and Toddler Development, Third Edition
- BG
- basal ganglia
- DKI
- diffusional kurtosis imaging
- FA
- fractional anisotropy
- gCC
- genu of the corpus callosum
- IFOF
- inferior fronto-occipital fasciculus
- KFA
- kurtosis fractional anisotropy
- PLIC
- posterior limb of the internal capsule
- PTR
- posterior thalamic/optic radiations
- sCC
- splenium of the corpus callosum
- STEP
- Specific Test of Early Infant Motor Performance
Premature birth results in CNS dysmaturation characterized by altered microstructural WM and myelination not quantifiable on head sonography or qualitative MR imaging.1⇓-3 Interventions for developmental delays are typically not started until later in infancy on failure to sit or walk, squandering a period of high neuroplasticity. We developed the Specific Test of Early Infant Motor Performance (STEP) to address this need and have shown that STEP scores at term and 3 months can predict scores on the Bayley Scales of Infant and Toddler Development, Third Edition (Bayley-III) at 12 months.4⇓-6
A recent consensus statement emphasized combining neuroimaging with clinical assessment for improved diagnosis of cerebral palsy.7 In this study, we determined whether neuroimaging (MR spectroscopy or diffusion MR imaging) reflects STEP performance at term and improves the ability of STEP to predict later development. MR spectroscopy quantifies mobile intracellular metabolomics in the basal ganglia (BG) and frontal WM regions affected by neonatal injury that relate to outcome.8⇓⇓⇓-12 Both diffusional kurtosis imaging (DKI)13 and DTI14 are diffusion MR imaging methods that quantify tissue integrity through the random movement of water molecules. Kurtosis quantifies the water movement deviation away from and not represented by the strict Gaussian distribution that DTI imposes.13 DKI measures tissue heterogeneity (intra- or extracellular barriers)15 and is acutely sensitive to pathologic processes.16 While application of DKI to brain development is new,17⇓⇓⇓-21 studies in typically developing children show nonlinear relationships of kurtosis with age.19⇓-21 The kurtosis tensor provides higher-order CNS structural information on the organization of the microenvironment, which may directly impact development.
We hypothesized that lower metabolic or microstructural integrity identifies dysmaturation that manifests as functional developmental performance at term or 3 months by the STEP and at 12 months by the Bayley-III.4,12 This investigation provides evidence that quantitative neuroimaging paired with early developmental assessment improves the prediction of later-stage development.
MATERIALS AND METHODS
Sample
We enrolled infants (n = 16) born at 24–34 weeks gestational age with institutional review board approval and parental informed consent, in accordance with the Declaration of Helsinki.
Developmental Assessments
The STEP was administered at term (37–42 weeks) and at 3 months. The STEP is composed of 10 items with a maximum total score of 30. STEP cutoff scores for low and below normal at 12 months and Bayley-III scores are ≤16 (term) and ≤22 (3 months).6
We assessed neurodevelopment at 12 months by the Bayley-III (n = 15). Below normal or low Bayley-III gross motor and cognitive scaled scores are defined as <9 and 1 SD below the mean.22,23
Image Acquisition and Processing
We performed MRIs at 38–44 weeks on a 3T Tim Trio scanner (Siemens) on unsedated infants, as previously described.24 We acquired single-voxel 1H-MR spectra using point-resolved spectroscopy (TR = 2000 ms, TE = 30 and 270 ms, 128 signal averages). Data were acquired using an isotropic voxel (15 mm)3 in the left BG and right frontal WM (Online Supplemental Data) for TE = 30 ms and in the right frontal WM for TE = 270 ms. A single voxel (14.37 mm)3 was extracted from the left BG region of a multivoxel point-resolved spectroscopy chemical shift imaging scan (TR = 1700 ms, TE = 270 ms, 16 × 16 grid, and 1 signal average). All voxels were placed (or extracted) in regions that had no apparent lesions.
The LCModel25 (http://www.lcmodel.com/) quantified metabolite ratios to creatine using a simulated metabolite basis set26 generated with the VeSPA software package27 (https://github.com/vespa-mrs/vespa) with any additional metabolite ratios computed from these (Online Supplemental Data). Processed spectra were visually inspected for quality of fit, and the metabolite ratio inclusion criterion was the Cramer-Rao lower bound of <20%. For the TE = 30 ms spectra, the mean (SD) (median, minimum–maximum) of the full width at half maximum was 0.044 (0.033) (0.041, 0.014–0.114) with an SNR of 10 (3) (10, 4–14) for the BG and 0.055 (0.038) (0.043, 0.01–0.114) with an SNR of 10 (3) (9, 6–16) for the WM. For the TE = 270 ms spectra, the full width at half maximum was 0.069 (0.039) (0.060, 0.014–0.143) for the BG with an SNR of 8 (3) (8, 4–14) and 0.045 (0.032) (0.038, 0.010–0.124) for the WM with an SNR of 5 (2) (4, 2–9).
We obtained DKI data with (2.7 mm)3 isotropic voxels, with TE = 99 ms, TR = 4800 ms, and b = 0, 1000, and 2000 s/mm2 with 64 diffusion-encoding directions per nonzero b-value shell, 37 contiguous axial slices per image volume, and 10 additional non-diffusion-weighted images (b = 0 s/mm2).
After removing image volumes and gradient directions corrupted by excessive head motion (on average, 13 [12] volumes were removed), we used the DESIGNER pipeline28⇓⇓⇓⇓⇓-34 (https://pydesigner.readthedocs.io/en/latest/) to perform a DKI35 analysis, which calculated fractional anisotropy (FA), mean, axial, and radial diffusivities; and mean, axial, and radial kurtosis and kurtosis FA (KFA).13,36⇓⇓-39 DKI data were fit globally to obtain the best estimates for both the diffusion and kurtosis tensors along with their associated diffusion metrics (Fig 1).
An axial section from a representative subject showing rotationally invariant tensor quantities calculated from the DKI analysis. The upper row shows the mean, axial, and radial diffusivities and fractional anisotropy (MD, D‖, D⊥, and FA, respectively). The lower row shows the corresponding kurtosis metrics (MK, K‖, K⊥, and KFA). Diffusivity quantifies the degree of water mobility, while kurtosis is a measure of the complexity of water diffusion within the tissue microenvironment. Kurtosis provides more detail in the sparsely myelinated infant brain. The MD, D‖, and D⊥ have units of square micrometer/millisecond with the remaining metrics being unitless. MD indicates mean diffusivity; D‖, axial diffusivity; D⊥, radial diffusivity; MK, mean kurtosis; K‖, axial kurtosis; K⊥, radial kurtosis.
ROI Placement
Bilateral WM ROIs (n = 8) were manually drawn using each subject’s FA image for anatomic reference in the anterior limb and posterior limb of the internal capsule (PLIC), external capsule, uncinate fasciculus, inferior fronto-occipital fasciculus (IFOF), posterior thalamic/optic radiations (PTR), and the corpus callosum (genu/splenium [gCC]/[sCC], Online Supplemental Data). A clinical pediatric neuroradiologist verified correct ROI placement. Last, a total WM ROI was created by combining these individual ROIs.
Statistical Analysis
Multiple linear regression (SAS 3.8; SAS Institute) was performed with STEP at term or 3 months as the response variable and metabolite ratios from TE = 30 or 270 ms or DKI-derived metrics as predictor variables. Gestational age at birth and MR imaging were included in all models to account for degree of prematurity.
For the DKI metrics, STEP was modeled in two ways:
For a single ROI, each diffusion parameter was averaged separately over all voxels and included as an independent covariate (eg, STEPterm = FAsCC + KFAsCC). This approach yields a parsimonious model in which a single WM tract could be sampled with several different diffusion parameters to model prognosis in a clinical setting.
To assess whether global brain dysmaturity or injury is more important to functional outcome prediction, we used a single diffusion metric averaged within each ROI as the covariates in the models (eg, STEPterm = FAsCC + FAPTR + FAPLIC). This allowed us to determine whether an individual diffusivity or kurtosis parameter in major WM tracts could improve model predictions for parsimonious sampling compatible with clinical settings.
Model Selection
We selected models for the best goodness of fit via larger adjusted R-squared (adj-R2) and smaller Akaike Information Criterion (AIC). The difference in the AIC from full-versus-simpler models (ΔAIC) was calculated with a P value, P = exp(–ΔAIC/2).40 The model that minimized the AIC41 and maximized the ΔAIC between full and simple models with P < .05 was selected as the most parsimonious model.
RESULTS
Demographics
MR spectroscopy (n = 12 or 16) and DKI (n = 16) scans met our inclusion criteria. The mean age at MR imaging was 41.9 (1.5) weeks. Many infants had CNS conditions that may affect neurodevelopment (Online Supplemental Data).
MR Spectroscopy (NAA/Cho)BG and (NAA/Cr)WM at TE = 270 ms Strongly Correlate with STEP Performance
We assessed whether STEP performance represents concurrent MR spectroscopy metabolomics in the BG or frontal WM. After adjusting for gestational age at birth and MR imaging, (NAA/Cho)BG quantified at TE = 270 ms (n = 12) strongly correlated (P = .04) with term STEP: β = 5.83, adj-R2= 0.82, AIC = 11.32, Pmodel < .001; and (NAA/Cr)WM correlated (P = .04) with 3-month STEP: β = 9.76, adj-R2 = 0.82, AIC = 16.11, Pmodel < .001. A correlation was observed with the TE = 30 ms MR spectroscopy (n = 16) and STEP term (adj-R2 = 0.77, AIC = 16.76, Pmodel = .02), but unlike the compact TE = 270 ms models, it was composed of 5 WM and 2 BG metabolite ratios (Online Supplemental Data).
KFA and Inferior Fronto-Occipital Fasciculus Correlate Best with STEP Performance
We found a strong linear relationship between diffusivity or kurtosis parameters and STEP scores (Online Supplemental Data). The Online Supplemental Data provide details of the β-value estimates for each WM ROI in the model fits. Of note are FA and KFA (Fig 2): KFA showed a notable improvement over FA in predicting concurrent functional performance, though FA models included fewer ROIs (Fig 2 and Online Supplemental Data). The KFA also outperformed all the other diffusion metrics at both STEP time points. Most metric models resulted in models with adj-R2 > 0.70, indicating very strong correlations.
Plots showing measured STEP scores at term or 3 months versus modeled STEP scores using FA for diffusivity (left columns) or KFA for kurtosis (right columns) to predict STEP. KFA and FA models show the best fit to measured STEP scores primarily from combining metrics in the external capsule (yellow) and the PTR (purple). Goodness-of-fit metrics of adj-R2 and AIC are noted with each model. While both FA and KFA predict STEP function during 3 months of development, KFA performs better in the model than FA (higher R2 and lower AIC); for STEP at term: ΔAIC = 6.22, P < .05; and for STEP at 3 months: ΔAIC = 18.31, P < .001. All models are corrected for gestational age at birth and at MR imaging.
For the ROI-centered approach, we found that many individual WM tracts had moderately strong linear relationships with STEP scores (adj-R2 ≥ 0.60, Online Supplemental Data). IFOF modeled the STEP term (adj-R2 = 0.86, AIC = 8.86), and gCC modeled the STEP 3-month (adj-R2= 0.74, AIC = 25.38), similar to the total WM ROI (adj-R2 = 0.74, AIC = 25.47) across diffusion metrics (Online Supplemental Data). Again, kurtosis metrics contributed more than diffusivity to the models (Online Supplemental Data).
Both MR Spectroscopy and DKI Improve STEP Prediction of Bayley-III Scores
Combining MR spectroscopy with STEP scores predicted Bayley-III motor and cognitive scores markedly better than STEP scores alone (Online Supplemental Data). For TE = 270 ms data, improved motor score prediction models were found for term STEP (ΔAI C = 33.02, P < .001) and for 3-month STEP (ΔAIC = 22.63, P < .001). In modeling cognitive scores with TE = 270 ms, (NAA/Cho)BG improved predictions beyond STEP alone (term: ΔAIC = 17.09; 3 months: ΔAIC = 32.05; both P < .001). MR spectroscopy at TE = 30 ms improved STEP predictive models of cognitive scores (term: ΔAIC = 15.36; 3 months: ΔAIC = 27; both P < .001) but included multiple metabolite ratios that detracted from a parsimonious model, unlike the TE = 270 ms models, which included a single ratio. Model β-value estimates for each metabolite ratio included are detailed in the Online Supplemental Data.
We found that FA and radial and mean kurtosis were the best diffusion metrics to combine with STEP scores in predicting Bayley-III scores (Fig 3). For gross motor scores, FA improved the STEP-alone model at term and 3 months (both ΔAIC > 41, P < .001) as did radial kurtosis (both ΔAIC > 26, P < .001). For cognitive scores, FA improved the STEP-alone model at term and 3 months (both ΔAIC> 4, P < .001), as did mean kurtosis (both ΔAIC > 27, P < .001). Other DKI-derived metrics also predicted Bayley-III scores with STEP time points. (Online Supplemental Data). Included WM ROIs and β-value estimates for each are listed in the Online Supplemental Data.
Measured 12-month Bayley-III gross motor (left columns) and cognitive scores (right columns) compared against the modeled Bayley scores, using an individual diffusivity (upper rows) or kurtosis (lower rows) parameter across various WM ROIs. By means of this diffusion parameter approach, FA (left), MK (center), and K⊥ (right) combined with STEP strongly predict both motor and cognitive Bayley scores, adjusted for gestational age at birth and MR imaging. Other metrics also predicted future function when combined with STEP scores (see the Online Supplemental Data for details on the optimal combination of WM ROIs for each diffusion metric [β-values, highest adj-R2 and lowest AIC]). MK indicates mean kurtosis; K⊥, radial kurtosis.
For the ROI-centered approach, various combinations of kurtosis and diffusivity metrics within individual ROIs improved the predictions of Bayley-III gross motor and cognitive scores compared with the STEP-only models. For parsimony and simplicity, models using either PTR and IFOF with STEP scores had the lowest AICs for predicting Bayley Motor scores (Fig 4) and improved the model fit over the STEP alone (at term: ΔAIC > 30; P < .001; at 3 months: ΔAIC > 28; P < .001). The gCC and external capsule with STEP scores improved prediction in Bayley cognitive scores (Online Supplemental Data) over STEP alone (at term: ΔAIC > 18; P < .001; and at 3 months: ΔAIC > 14; P < .001). Model fits and β-value estimates for each diffusion metric covariate included in the selected models for each WM ROI are provided in the Online Supplemental Data.
Measured 12-month Bayley-III gross motor (left columns) and cognitive scores (right columns) plotted against the modeled Bayley scores using STEP at term (upper rows) or 3 months (lower rows) + diffusivity and kurtosis metrics within individual WM ROIs to predict function. By means of this ROI-centered approach, the PTR and IFOF metrics combined with the STEP score result in the best predictive models, adjusted for gestational age at birth and MR imaging. Compared with STEP-alone predictive models (Online Supplemental Data), diffusivity and kurtosis in individual WM ROIs greatly improve the STEP model fit for both Bayley-III gross motor and cognitive test scores at 12 months. β-value estimates are provided in the Online Supplemental Data.
DISCUSSION
During early life after preterm birth, brain injuries as well as alterations in gray and WM maturation occur that influence later development of motor and cognitive skills. Influencing neuroplasticity to improve outcomes requires understanding the neuroimaging representations of brain injuries and their relation to functional movements during early infancy. Using MR spectroscopy and DKI parameters at term-age equivalent, we show that metabolomics for healthy neurons and microstructural integrity correspond to concurrent performance on the STEP early developmental assessment and that neuroimaging combined with the STEP improves prediction of future motor and cognitive scores compared with the STEP alone. Better predictive models would allow testing interventions at a very early period to optimally harness neuroplasticity. Our data highlight the potential for prognostication using the STEP and term-age quantitative neuroimaging and serve as a guide for metric selection. Our data provide proof of concept that quantitative MR imaging of the developing brain is vitally important in prognosis.
MR spectroscopy at TE = 270 ms in either the frontal WM or BG strongly related to STEP and NAA, and Cho ratios improved STEP predictive models for both gross motor and cognitive Bayley-III scores (Online Supplemental Data). However, we observed no relation or benefit with NAA or other metabolite ratios obtained from point-resolved spectroscopy sequence spectra at TE = 30 ms. This discrepancy may be due to a more reliable quantification of NAA and Cho at a long TE, in which there are fewer overlapping metabolite peaks, as has previously been reported.26 As markers of neuronal metabolic health42 and myelination, NAA and Cho are both complementary and additive in our models.1,9⇓-11 Lower concentrations of NAA in the deep gray nuclei of the BG reflect metabolic impairment, lower neuronal volume or density, and possibly insufficient dendritic arborization in the thalamus, responsible for sensory-input integration and regulation of voluntary movement.43 In the WM, lower NAA is associated with decreased axonal integrity in neonatal WM disease10,44 and worse developmental outcomes in both preterm and term infants.9,11,12
ROIs (IFOF, PTR, and gCC) were also strongly correlated with STEP performance (Online Supplemental Data), indicating that visuomotor and sensorimotor integration pathways are critical in the early development of motor skills. IFOF and PTR with the STEP also predicted later motor function, while the gCC and the external capsule contributed strongly to STEP prediction for later cognitive function (Fig 4 and Online Supplemental Data). The best WM tracts for prognostication from our study (IFOF, PTR, and gCC) are readily identified, and ROIs are easily placed by clinical neuroradiologists on standard anatomic MR images or ROIs registered with an appropriate WM atlas.
Most quantitative clinical investigations use diffusion tensor–derived FA that measures the directional dependence of water diffusion in a Gaussian distribution. FA increases nonlinearly with axon myelination, packing, and maturation that plateaus around 2 years of age.19,21 However, the diffusion tensor neither accounts for non-Gaussian cellular factors nor fully characterizes microstructure across the spectrum of development. In estimating the kurtosis tensor, we captured the non-Gaussian spin displacement that corresponds to cellular and tissue heterogeneity within the WM microstructure, different from DTI.13,16,45 Kurtosis metrics show a nonlinear relationship with age19,21 and increase with intracellular complexity, cell density, and myelination due to further restrictions on water movement. Radial kurtosis, in particular, increases more quickly during development than its diffusivity counterpart because myelin restricts diffusion across the axon.21 KFA is mathematically analogous to FA, but the kurtosis tensor provides complementary, more detailed directional variation on anisotropy information.37,38 Thus, unlike FA, KFA measurements are sensitive to higher-order angular variations in WM fiber orientations,36,38 also found in fiber-crossing regions.37
In our study, KFA outperformed FA in representing early function via STEP performance (Figs 2 and Online Supplemental Data), suggesting that complex WM cellular barriers or fiber-crossings are important in the early ability to move with normal tone. Although not widely used, kurtosis metrics may be better markers of WM tract injury and health in the largely unmyelinated brain of infants than diffusivity metrics such as FA. When predicting future gross motor and cognitive performance (Online Supplemental Data), however, FA provided better results, followed closely by the KFA, radial kurtosis (gross motor), and mean kurtosis (cognitive). Taken together, our data support kurtosis information possibly significantly adding to conventional FA in understanding the complexity of WM development at term age.
As a standard protocol on clinical MR imaging machines, MR spectroscopy can yield high-quality results in neonatal studies due to high water content of brain tissue and low iron deposition. MR spectroscopy can be challenging in a clinical environment in unsedated neonates. Therefore, clinicians and MR imaging technicians should attempt to schedule MR spectroscopy scans after a feeding to encourage sleep and should be cognizant of when to repeat scans (ie, motion artifacts) or modify the imaging setup (eg, better swaddling/head immobilization). Furthermore, on the basis of our findings, long-TE spectral quantification is robust (ie, a more stable baseline and less confounding peaks) in quantifying a large concentration of metabolites (namely, NAA, Cho, and Cr). Conversely, shorter-TE MR spectroscopy will have a better SNR and more information on a small concentration of metabolites compared with long-TE sequences, with a trade-off in difficulties in quantification of overlapping peak shapes.
For more advanced diffusion imaging, DKI is easily performed within 4–6-minutes by adding the b-value 2000s/mm2 to standard DTI protocols with 30 directions per nonzero b-value. Numerous open-source platforms (eg, Diffusional Kurtosis Estimator,33 https://github.com/m-ama/DKE, DESIGNER,28/pyDesigner, and https://pydesigner.readthedocs.io/en/latest/) are available that provide semiautomated off-line DKI analysis.
Limitations of our study include our sample size, which precluded assessing whether brain dysmaturity or specific sites of injury were more important in neuroimaging models and any contributions of sex. Also, 12 months is a typical time for general developmental delays to manifest but is not optimal for diagnosing cerebral palsy. Another limitation is the combination of single-voxel and CSI data for the TE = 270 ms MR spectroscopy data modeling, which had slightly different TR values as well as differing spectral and shim quality. Last, MR spectroscopy voxel placement and WM ROI voxels were selected to reflect “apparently healthy” WM, but it is possible that lesions (Online Supplemental Data) may not have been readily visible during data gathering and analysis.
CONCLUSIONS
Our neuroimaging data provide proof of concept that the STEP reflects preterm brain dysfunction, dysmaturation, and/or injury. These data suggest that either MR spectroscopy or DKI can augment the STEP at term age in the prediction of long-term motor and cognitive development. If validated in a larger cohort, quantitative neuroimaging and STEP assessments may be used as surrogate end points in clinical trials and may facilitate earlier intervention for infants at high risk of developmental delays.
Footnotes
This research was supported by the South Carolina Clinical & Translational Research Institute at the Medical University of South Carolina, Clinical and Translational Science Awards National Institutes of Health/National Center for Advancing Translational Sciences UL1TR000062 (to P. Coker-Bolt), and National Institutes of Health/National Institute of Neurological Disorders and Stroke F31NS108623 (to H.G. Moss).
Disclosure forms provided by the authors are available with the full text and PDF of this article at www.ajnr.org.
Indicates open access to non-subscribers at www.ajnr.org
References
- Received June 17, 2021.
- Accepted after revision September 27, 2021.
- © 2022 by American Journal of Neuroradiology