Part 1: Relating noxious-response amplitude to resting-state activity
Subject information
We recruited healthy neonates from the postnatal ward at the John Radcliffe Hospital (Oxford University Hospital NHS Trust). Infants were considered healthy if they were inpatients on the postnatal ward that never required admission to the neonatal unit, had no history of congenital conditions or neurological problems, and were clinically stable at the time of study. Written informed consent was obtained from parents prior to the study. Ethical approval was obtained from an NHS Research Ethics Committee (National Research Ethics Service, REC reference: 12/SC/0447), and research was conducted in accordance with standards set by Good Clinical Practice guidelines and the Declaration of Helsinki. Demographic details of the 18-infant sample are displayed in Table 2. Definitions of the age and total brain volume variables are detailed below (see Clinical variables).
Experimental setup and design
Neonates were transported to the Wellcome Centre for Integrative Neuroimaging (Oxford, UK), then fed and swaddled prior to scanning. Infants were fitted with ear plugs, ear-muffs, and ear-defenders, and placed on a vacuum-positioning mattress with additional soft padding around the head to restrict motion. Heart rate and blood oxygen saturation were monitored throughout scanning. An event-related experimental design was used for the nociception paradigm 21. The mild non-skin-breaking nociceptive stimulus was a 128 mN sharp-touch pinprick (PinPrick Stimulator, MRC Systems). Ten trials of the stimulus were delivered to the dorsum of the left foot, each trial was 1 s, and the minimum inter-stimulus interval was 25 s. This long inter-stimulus interval was used to minimise the influence of motion at the time of stimulus delivery. The stimuli were applied when the infants were naturally still. For all other scan types, infants lay passively in the scanner. No sedatives were used at any stage of this study.
MRI data acquisition
All data were collected on a 3T Siemens Prisma with an adult 32 channel receive coil. The structural data acquisition was: T2-weighted, TSE (factor 11), 150° flip angle, TE = 89 ms, TR = 14,740 ms, parallel imaging GRAPPA 3, 192 × 192 in-plane matrix size, 126 slices, 1 mm isotropic voxels, and 2 mins 13 s acquisition time. The fieldmap data acquisition was: gradient echo, 2DFT readout, dual echo TE1/TE2 = 4.92/7.38 ms, TR = 550 ms, 46° flip angle, 90 × 90 in-plane matrix size, 56 slices, 2 mm isotropic voxels, and 1 min 40 s acquisition time. Both the resting-state and noxious-response fMRI data acquisitions were: T2* BOLD-weighted, gradient echo, EPI readout, 70° flip angle, TE = 50 ms 43, TR = 1,300 ms, multiband 4 50,51, 90 × 90 in-plane matrix size, 56 slices, 2 mm isotropic voxels, AP phase encode direction, and a single-band reference (SBref) image was acquired at the start. Resting-state acquisition time was 10 mins 50 s (500 volumes), and noxious-response mean acquisition time was approximately 6 min (approximately 277 volumes). The dMRI data acquisition was: T2 diffusion-weighted, spin echo, EPI readout, 90° flip angle, TE = 73 ms, TR = 2,900 ms, multiband 3, 102 × 102 in-plane matrix size, 60 slices, 1.75 mm isotropic voxels, AP phase encode direction, multishell (b = 500, 1000, 2000 s/mm2), a total of 140 directions uniformly distributed over the whole sphere, and approximately 8 mins acquisition time. Phase- reversed b0 images were collected to derived a spin-echo fieldmap for distortion correction of the diffusion data.
MRI data preprocessing
All MRI data were preprocessed using analysis pipelines developed as part of the Developing Human Connectome Project (dHCP) (http://www.developingconnectome.org). The T2 structural data were processed (brain extraction, bias field correction, and tissue segmentation) using the MIRTK Draw-EM neonatal pipeline 52, the tool forming the basis of the dHCP structural preprocessing pipeline 53. The GRE dual-echo fieldmap data were processed using a modified version of fsl_prepare_fieldmap.
Both the noxious-response and resting-state fMRI data were preprocessed using an extended version of the dHCP fMRI preprocessing pipeline 24,41. The functional data were corrected for motion and distortion using FSL’s EDDY 54,55, which included slice-to-volume motion correction 56 and susceptibility-by-movement distortion correction 57. Noxious-response fMRI data were high-pass temporally filtered at 0.01 Hz, and resting-state fMRI data at 0.005 Hz. Data were then denoised using FSL’s FIX 58,59, low-pass spatially filtered with a 3 mm FWHM filter using FSL’s SUSAN 60, and grand mean scaled to a global spatiotemporal median of 10,000. For spatial normalisation to standard space 61, the data were first registered from functional space to the infant’s T2 structural space, via the SBref, using 6 DoF rigid-body alignment, refined using BBR 62 with FSL’s FLIRT 63,64. The registration from structural space to the 40-week template 61, via an age-matched standard template, was performed using ANTs’s SyN 65.
The diffusion data were analysed using the dHCP dMRI preprocessing pipeline 27,66. The blip- up and blip-down b0 images were used to generate the fieldmap using FSL’s TOPUP 67,68. The diffusion data were simultaneously corrected for motion, distortion, and eddy currents using FSL’s EDDY, which included outlier detection and replacement 69 as well as the slice-to-volume motion correction and susceptibility-by-movement distortion correction used in the fMRI data correction. Spatial normalisation followed the same sequence of registrations as the functional data.
Noxious-response amplitudes
For each infant, a noxious-response map was generated using standard subject-level voxelwise GLM analysis in FSL’s FEAT 70, fitting the term-neonate double-gamma HRF 23,41. A group average t-statistic map was generated using the 18 infants’ noxious-response regression parameter maps. Individual infants’ regression parameter maps were used as the subject-level noxious-response maps; the group-average t-statistic map was used as the group-level noxious-response map. To summarise an individual infant’s noxious-response map to a single scalar measure of noxious-response amplitude, the group-level noxious- response map was regressed onto the infant’s noxious-response map. Thus, an infant’s noxious-response amplitude was defined as this spatial regression coefficient. Assessment of the potential influence of HRF goodness-of-fit on noxious-response amplitudes is detailed in Supplementary Information (Noxious-response HRF fit assessment and Supplementary Figure 1,3)
As detailed below, our prediction analyses examined associations between these noxious- response amplitudes and three sets of predictors (resting-state network amplitudes, resting- state imaging confounds, and clinical variables), and our structure-function analyses examined associations between noxious-response amplitudes and a dMRI model parameter (mean diffusivity). In all these analyses, the noxious-response amplitudes were adjusted for a set of three noxious-response imaging confounds extracted from each infant’s noxious- response fMRI data: mean head motion, stimulus-correlated head motion, and CSF signal amplitude. These three metrics were intended to capture cross-infant noxious-response variability due to subject motion (mean and stimulus-correlated head motion) and cardiac pulsatility (CSF amplitude). Mean head motion was defined as the mean framewise displacement across the entire noxious-response fMRI scan session. Stimulus-correlated head motion was estimated as a multiple correlation coefficient z-statistic (Fisher r-to-z transformation) between the predicted BOLD response (stimulus application timeseries convolved with the HRF) and the 24 head motion parameter timeseries (estimated by EDDY during motion correction). CSF amplitude was estimated from each infant’s noxious-response map as the mean regression coefficient within the CSF ROI. Details on the CSF ROI construction are provided in Supplementary Information (CSF and white matter regions-of- interest definition and Supplementary Figure 9).
Resting-state network amplitudes
To define a robust set of core resting-state networks in the infants’ resting-state fMRI data, resting-state networks identified in the 18-infant nociception-paradigm dataset were compared to those identified in a subset of the dHCP dataset, which had previously been produced as part of the dHCP 14. A robust set of core resting-state networks was defined as those replicated across datasets. Demonstrating replicability in the dHCP dataset confirmed the set of core networks were robust and not unique to our nociception-paradigm dataset. The dHCP data subset included 242 healthy term-aged infants: mean GA at birth = 38.6 weeks; mean PMA at scan = 40.4 weeks; 112 females and 124 males. The resting-state network analysis performed on the 18-infant nociception-paradigm dataset was closely matched to that described for the dHCP dataset 24. In brief, probabilistic functional mode (PFM) analysis using FSL’s PROFUMO 17,18 was run on both datasets with a pre-specified dimensionality of 25, and using the infant double-gamma HRF 23,41 as the temporal prior. PROFUMO’s Bayesian model complexity penalties can eliminate modes, thus returning a number of group-level modes that can be less than the pre-specified dimensionality. This is noted, as the data- determined dimensionality of the nociception-paradigm dataset was 11 despite a pre- specified dimensionality of 25. Due to the larger sample size, the data-determined dimensionality of the dHCP dataset was equal to the pre-specified dimensionality. The dHCP resting-state network maps had greater SNR due to the significantly larger sample size. Thus, the dHCP resting-state network maps forming the set of core resting-state networks were used as the template maps to extract resting-state network amplitudes from the resting-state fMRI data.
These resting-state network template maps were spatially regressed onto each infant’s resting-state functional data using multiple regression, resulting in network timeseries. While the timeseries standard deviation is the typical amplitude metric used and is the default in FSL’s FSLNets 71, the standard deviation is sensitive to outliers, which in this context, typically appear as head motion-related timeseries spikes. Resting-state network amplitudes were thus quantified using the median absolute deviation (MAD), due to the MAD’s increased robustness to outliers. This set of resting-state network amplitudes was directly tested for association with noxious-response amplitudes after adjusting for resting-state imaging confounds (defined below). Assessment of the potential influence of resting-state network timeseries outliers on noxious-response amplitudes is detailed in Supplementary Information (Resting-state network timeseries outlier assessment and Supplementary Figure 2,3).
Resting-state imaging confounds
A set of resting-state imaging confounds was directly tested for association with noxious- response amplitudes and used for confound-adjusting the resting-state network amplitudes. These confounds included three metrics extracted from each infant’s resting-state fMRI data: mean head motion, CSF amplitude, and white matter amplitude. These three metrics were intended to capture cross-infant resting-state variability due to subject motion (mean head motion), cardiac pulsatility (CSF amplitude), and global signal (white matter amplitude). Directly testing these resting-state imaging confounds assessed whether associations between resting-state network amplitudes and noxious-response amplitudes could be explained by undesirable artefactual features of the resting-state data. Mean head motion was defined as the mean framewise displacement across the entire resting-state fMRI scan session. Mean CSF and white matter timeseries were extracted from each infant’s resting- state data, and the timeseries amplitudes were defined as the MAD of these timeseries. Details on the CSF and white matter ROI construction are provided in Supplementary Information (CSF and white matter regions-of-interest definition and Supplementary Figure 9).
Clinical variables
A set of clinical variables was directly tested for association with noxious-response amplitudes and included the six variables in Table 2: postmenstrual age (PMA), gestational age (GA), postnatal age (PNA, also called chronological age), birth weight (BW), total brain volume (TBV), and sex. The three age variables are defined according to the American Academy of Paediatrics 72, and the TBV was calculated from the infants’ structural MRI data using the tissue segmentation outputs of the structural preprocessing pipeline. Testing the clinical variables assessed whether associations between resting-state network amplitudes and noxious-response amplitudes could be explained by biologically interesting underlying variables.
Predicting noxious-response amplitudes
For all prediction analyses, the responses to be predicted were the infants’ whole-brain noxious-response amplitudes (Figure 1 scalar values). Three sets of predictors were tested for predictive capacity: nine resting-state network amplitudes, six clinical variables, and three resting-state imaging confounds. For all three sets of predictors, a support vector regression (SVR) model with a linear kernel was used. Linear SVR was selected over a linear regression via ordinary least squares, due to the SVR cost function’s greater robustness to outliers. Out- of-sample predictions were generated using leave-one-out cross validation (LOO-CV). The noxious-response amplitudes were confound-adjusted for the three noxious-response imaging confounds (mean head motion, stimulus-correlated head motion, and CSF amplitude). When generating predictions using the resting-state network amplitudes, this set of predictors was confound-adjusted for the three resting-state imaging confounds (mean head motion, CSF amplitude, and white matter amplitude).
The linear SVR model was fit in Python using scikit-learn packages 73, with all steps performed in a LOO-CV manner. Confound adjustment of the resting-state network amplitudes and noxious-response amplitudes was performed using cross-validated confound regression, implemented using the publicly available code by Lukas Snoek (https://github.com/lukassnoek/MVCA), as described in the author’s article 74. Responses were z-scaled using training set means and standard deviations. The scikit-learn SVR parameters were: kernel = linear, loss function = epsilon insensitive, epsilon = 0.1, regularization = ridge, regularization strength = {0.001, 0.01, 0.1, 1}. Optimisation of the regularization strength parameter was performed using an initial LOO-CV grid search over this set of values. Regularisation tuning and SVR model training were optimised to minimise mean squared error.
The prediction accuracy was assessed using three summary metrics: root mean squared error (RMSE), sums-of-squares formulation of the coefficient of determination (R2), and Spearman’s rank correlation coefficient (RSp). The RMSE was selected as the primary metric of prediction accuracy, as it directly quantifies the error (the difference between predicted and actual observed values) and is in original units. The R2 was also reported, as its value is interpreted as the proportion of the total variation of the response (about its mean) that is accounted for by the fitted model, and is thus an intuitive metric to assess success of the predictions. The RSp between predicted and observed noxious-response amplitudes was also reported, as it may be valuable to know the model’s ability to correctly rank infants’ noxious- response amplitudes, on a relative scale, from lowest to highest. To test the statistical significance of the RMSE, R2, and RSp measures using null hypothesis testing, one-tailed significance tests were performed using permutation analysis, running 1,000 permutations through the full prediction pipeline. Assessment of the potential influence of an fMRI global signal confound (common to both resting-state and noxious-response data) on noxious- response amplitudes is detailed in Supplementary Information (Common fMRI global signal confound assessment and Supplementary Figure 4)
Part 2: Relating noxious-response amplitude to white matter microstructure
Structure-function analysis using an exploratory-confirmatory approach
The infants’ noxious-response amplitudes were assessed for structure-function associations by analysing white matter microstructure to better understand the biological basis for individual variability in noxious-response amplitude and to evaluate the temporal stability of the observed responses. Due to the insensitivity of white matter microstructure to wakefulness and emotional state, an observed structure-function relationship would suggest the infants’ noxious-response amplitudes were temporally stable trait effects. Temporal stability was assessed using this structure-function approach rather than looking at stability across multiple test occasions, as infants could only be tested on a single occasion. Due to the lack of knowledge regarding the brain’s structural basis for noxious responses in healthy newborn infants, an exploratory analysis was required. However, due to the small sample size of the nociception-paradigm dataset (n=17 with dMRI data, Table 2), the appropriate statistical multiple testing corrections to control the false positive rate would prohibit the identification of a true positive. To overcome this issue, we adopted an exploratory- confirmatory analysis approach. We used a large age-matched sample from the dHCP dataset (n=215, sample defined below) for the exploratory arm, in which a wide range of white matter tracts and dMRI model parameters were studied in order to identify candidate nociception- relevant microstructural features. Structure-function relationships identified in this exploratory arm facilitated the formulation of specific well-defined hypotheses. These were subsequently tested in the nociception-paradigm dataset (n=17) for validation, which constituted the confirmatory arm of the analysis.
Noxious-response amplitudes in the dHCP dataset
The dHCP fMRI data includes resting-state data only. To analyse nociception-relevant structure-function relationships in this dataset, the dHCP resting-state data were mapped to noxious-response amplitudes using the SVR prediction model described previously – see Predicting noxious-response amplitudes above. This prediction model was trained on the nociception-paradigm dataset (n=18) using the nine resting-state network amplitudes as predictors (adjusted for resting-state imaging confounds) and the noxious-response amplitudes as responses (adjusted for noxious-response imaging confounds). In a sample from the dHCP dataset (defined below), the nine resting-state network amplitudes and three resting-state imaging confounds were extracted in an identical manner to the analysis performed in the nociception-paradigm dataset – see Resting-state network amplitudes and Resting-state imaging confounds above. The resting-state network amplitudes were adjusted for the resting-state imaging confounds, and the adjusted amplitudes were used to generate predicted noxious-response amplitudes. Frequency distribution histograms of the predicted noxious-response amplitudes from the dHCP dataset and the observed noxious-response amplitudes from the nociception-paradigm dataset were qualitatively compared.
Sample selection in the dHCP dataset
Infants in the dHCP dataset were included in our sample if they satisfied three quality control (QC) criteria and two age criteria to ensure the sample data were of reasonable quality and were age-matched to the prediction model training set. The three QC criteria were: (i) both an infant’s fMRI and dMRI data had to pass basic dHCP QC pipelines 24,66, (ii) both scan sessions had to have completed fully (300 volumes for dMRI data; 2,300 volumes for fMRI data) to remove inter-subject variability due to data quantity related to scan length (all infants in the nociception-paradigm dataset satisfy this criterion), and (iii) the vertex of the cerebral cortex had to remain within the scan field of view (FOV) for at least 95% of scan session (all infants in the nociception-paradigm dataset satisfy this criterion). This last QC criterion excluded infants in which primary somatosensory and motor brain regions, demonstrated in the nociception-paradigm dataset to be of central importance to noxious stimulus processing (Supplementary Figure 4), would have unreliable data. The two age criteria were: (i) infants had to have both a gestational age and a postmenstrual age at time of scan between 36–42 weeks, and (ii) infants had to have been scanned within the first 10 days of postnatal life. These selection criteria resulted in a dHCP dataset sample size of n=215 infants.
White matter microstructural features
Analogous to the pre-existing dHCP resting-state network templates used for resting-state network amplitude feature extraction (see Resting-state network amplitudes above), our 215- infant dHCP sample was used to generate a set of 16 bilateral white matter tract regions-of- interest (ROIs). These tracts were generated using the “baby autoPtx” approach established as part of the dHCP dMRI preprocessing pipeline development 27. In brief, FSL’s probabilistic multi-shell ball and zeppelins model 75 is fit as part of the dHCP dMRI preprocessing pipeline. Probabilistic tractography using FSL’s PROBTRACKX 76,77 is run using pre-defined seed, target, and exclusion masks. At the time of analysis, masks for 29 white matter tracts were available, of which 13 were unilateral and three bilateral. To create bilateral white matter ROIs analogous to our bilateral resting-state networks, the unilateral tracts were fused, resulting in a total of 16 bilateral tracts. In our 215-infant dHCP sample, the normalised probability value results of each tract were group-averaged in standard space and thresholded at a probability of 0.01. As part of the dHCP preprocessing pipeline, FSL’s DTIFIT is used to generate mean diffusivity (MD), fractional anisotropy (FA), and mean kurtosis (MK) parameter maps for each infant. We thresholded each infant’s parameter maps to remove noisy voxels with values falling outside the expected theoretical range, which can happen in practice due to poor SNR or head motion: for MD, this included negative values; for FA, this included values outside the interval [0,1]; for MK, this included values outside the interval [0,3]. The 16 bilateral white matter ROIs were used to extract mean parameter values for each tract. These 48 values (16 tracts x 3 parameters) per subject constituted the white matter microstructural features for our structure-function analyses.
Identifying a valid structure-function association
Using the 215-infant dHCP sample, univariate correlations between predicted noxious- response amplitudes and each microstructural feature was assessed using permutation testing with FSL’s PALM 78. These correlations were adjusted for three dMRI imaging confounds: mean head motion (estimated by EDDY during preprocessing), number of noisy voxels falling outside the expected theoretical range (see White matter microstructural features above), and TBV (see Clinical variables above). Our dMRI parameters-of-interest are influenced by tissue density and partial voluming artefacts due to brain volume variance across infants, so adjustment for TBV was included to mitigate these global confounds. There is no need to adjust for fMRI imaging confounds, as the SVR prediction model maps to confound-adjusted noxious-response amplitudes. Statistical significance was assessed using two-tailed Pearson correlations with 10,000 permutations and FWER-corrected for multiple testing across all 48 tests 79. While the observed statistically significant negative correlations with the MD of five tracts (Figure 4 and Supplementary Figure 7) are statistically valid due to appropriate FWER-adjustment of false positive rate, these findings are tentative due to the use of predicted noxious-response amplitudes. The dHCP dataset has no noxious-response amplitude ground truth, so the results of this exploratory arm need confirmation in the nociception-paradigm dataset, for which ground truth observed noxious-response amplitudes exist.
In the confirmatory arm, the negative correlation between predicted noxious-response amplitudes and MD identified in the dHCP dataset was assessed in the nociception-paradigm dataset using two approaches. In both approaches, correlations were adjusted for both noxious-response imaging confounds (mean head motion, stimulus-correlated head motion, and CSF amplitude) and dMRI imaging confounds (mean head motion, number of noisy voxels, and TBV). First, the correlation polarities between observed noxious-response amplitudes and MD of the five statistically significant tracts were qualitatively compared across datasets. Thus, confirmatory arm question one was: “are the correlation coefficient polarities (positive or negative) between noxious-response amplitudes and MD consistent between datasets for these five tracts?” Second, in the dHCP dataset, principal component analysis was run across the MD values of the five tracts of all 215 infants. The first principal component (PC1) accounted for 84.5% of the total MD variance with a negative correlation with predicted noxious-response amplitudes (r = –0.25). Due to the double-dipping circularity of this analysis in the dHCP dataset 80, this negative correlation between MD PC1 and predicted noxious-response amplitude will be biased toward high statistical significance. However, the demonstration in the dHCP dataset that MD PC1 accounts for a major portion of the variance in these tracts and has a statistically significant negative correlation with predicted noxious-response amplitudes serves as a single straight-forward quantitative test that can be directly confirmed (or not) in the nociception-paradigm dataset in an unbiased and non-circular manner. Thus, confirmatory arm question two was: “is the statistically significant negative correlation between noxious-response amplitudes and MD PC1 (across these five tracts) consistent between datasets?”. Statistical significance was assessed in PALM using a one-tailed Pearson correlation with 10,000 permutations.