Functional geometry of auditory cortical resting state networks derived from intracranial electrophysiology

doi:10.21203/rs.3.rs-1386098/v1

Download PDF

Article

Functional geometry of auditory cortical resting state networks derived from intracranial electrophysiology

https://doi.org/10.21203/rs.3.rs-1386098/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Critical details remain unresolved about the organization of the human auditory cortical hierarchy and its relationship to higher order brain networks. We investigated this organization using diffusion map embedding (DME) applied to resting state intracranial electroencephalography (iEEG) obtained in neurosurgical patients. DME was applied to functional connectivity measured between regions of interest (ROIs). ROIs exhibited a hierarchical organization, symmetric between the two hemispheres and robust to the choice of iEEG frequency band, connectivity metric, and imaging modality. Tight clusters of canonical auditory and prefrontal ROIs were maximally segregated in embedding space. Clusters consistent with ventral and dorsal auditory processing streams were paralleled by a cluster suggestive of a third stream linking auditory and limbic structures. Portions of anterior temporal cortex were characterized as global hubs. This approach lays the foundation for identifying network changes during active speech and language processing and elucidating mechanisms underlying disorders of auditory processing.

Functional connectivity

hierarchy

fMRI

diffusion map embedding

electrocorticography

iEEG

ECoG

Extensive research has contributed to our understanding of the organization of the human auditory cortical hierarchy^1,2, yet fundamental questions remain unresolved. These include identification of secondary, tertiary, and higher order regions, hindering elucidation of the neural basis of speech and language comprehension. For example, the anterior portion of the superior temporal gyrus (STGA) and planum polare (PP) are adjacent to auditory cortex on Heschl’s gyrus, yet diverge from it functionally^3,4. Conversely, the posterior insula (InsP) has response properties similar to those of core auditory cortex in the posteromedial portion of Heschl’s gyrus (HGPM), yet is not considered a canonical auditory area⁵. Additionally, the upper and lower banks of the superior temporal sulcus (STSU, STSL) have distinct roles in speech and language processing^6,7, yet their functional relationships with other auditory areas are unclear.

Questions remain regarding broader organizational features as well. While the auditory hierarchy is posited to be organized along two processing streams^8–10, the specific brain regions involved and the functional relationships within each stream are vigorously debated^11–13. Furthermore, communication between auditory cortex and hippocampus, amygdala, and anterior insula (InsA)¹⁴ – areas involved in auditory working memory and processing of emotional aspects of auditory information^15–18 – suggests a third “limbic” auditory processing stream, complementary to the dorsal and ventral streams. Additionally, while hemispheric lateralization of speech and language processing is a widely accepted organizational feature^19,20, the degree to which it shapes the auditory hierarchy and is reflected in hemisphere-specific connectivity profiles is unknown^8,12,21−24.

This study examined the organization of the human auditory cortical hierarchy within the broader context of cortical networks using resting state (RS) intracranial electroencephalography (iEEG) in neurosurgical patients. Although previous studies have investigated connectivity within the auditory cortical hierarchy using magnetic resonance imaging (MRI)^6,25,26, iEEG offers superior spatio-temporal resolution and is free of methodological problems that affect MRI in key regions such as the anterior temporal lobe^27,28. Functional connectivity was measured as gamma band (30–70 Hz) power envelope correlations²⁹ between regions of interest (ROIs) based on an auditory-centric cortical parcellation scheme. Functional geometry of the network was defined based on the connectivity profiles of its constituent ROIs using diffusion map embedding (DME)^30,31. DME maps ROIs into a Euclidean space where proximity of two ROIs reflects similarity between their connectivity profiles. DME has been previously applied to functional MRI (fMRI) data to demonstrate the hierarchical structure of canonical RS networks³². Here, we applied DME to iEEG data for the first time to gain new insights into the functional organization of human auditory cortical networks.

DME applied to iEEG data

Intracranial electrodes densely sampled cortical structures involved in auditory processing in the temporal and parietal lobes, as well as prefrontal, sensorimotor, and other ROIs in 46 participants (a total of 6487 sites; Fig. 1, Supplementary Tables 1, 2). Electrode coverage was largely restricted to a single hemisphere in individual participants. On average, each participant contributed 141 ± 54 recording sites, representing 28 ± 7.9 ROIs (mean ± standard deviation) (see example in Fig. 2a). DME was applied to pairwise functional connectivity computed between recording sites in each participant. The functional connectivity matrix was normalized and thresholded to yield a transition probability matrix P with an apparent community structure along the horizontal and vertical dimensions (Fig. 2b). DME reveals the functional geometry of the sampled cortical sites by mapping the data of P into a low-dimensional embedding space where proximity between nodes represents similarity in their connectivity to the rest of the network (Fig. 2c; see Supplementary Fig. 1 for additional views).

DME exhibited superior signal-to noise characteristics compared to direct analysis of functional connectivity in 40 out of 46 participants (Supplementary Fig. 2). Sites that belonged to specific ROIs distributed to specific locations in embedding space. For example, in Fig. 2c, tight clusters of auditory cortical sites (red/orange/yellow) and sites in prefrontal cortex (blue) were maximally segregated along dimension 1 (see Fig. 1 and Supplementary Table 3 for the list of abbreviations). Other ROIs [e.g., posterior portion of middle temporal gyrus (MTGP)] had a more distributed representation, consistent with their functional heterogeneity.

Functional geometry of cortical networks

To pool data across participants with variable electrode coverage, P matrices were computed at the ROI level and averaged across participants (Fig. 3a). The eigenvalue spectrum |λ_i| of P featured an inflection point for i > 4 (Fig. 3a, inset), indicating that the first four dimensions of embedding space accounted for the community structure of the data. The data are plotted in the first four dimensions of embedding space in Fig. 3b (see also Supplementary Fig. 3 and Supplementary Movies 1 and 2), providing a graphical representation of the functional geometry of all sampled brain regions. Functionally related ROIs clustered together, and these clusters segregated within embedding space. For example, auditory cortical and prefrontal ROIs were at opposite ends of dimension 1, and parietal and limbic ROIs were at opposite ends of dimension 2. By contrast, some ROIs [e.g., STGA, anterior and middle portions of middle temporal gyrus (MTGA, MTGM)] were situated in the interior of the data cloud.

The connectivity metric employed here discards components exactly in phase between two brain regions, mitigating the influence of volume conduction²⁹. However, brain areas that are anatomically close to each other are often densely interconnected^33–37. Thus, anatomical proximity can contribute to the observed functional geometry. Auditory cortical ROIs followed this rule and clustered together with one notable exception. PP, located immediately anterior to anterolateral Heschl’s gyrus (HGAL), segregated from the rest of auditory cortical ROIs along dimension 2 in embedding space (Fig. 3b, upper panel, lower left corner). Separation in embedding space was also observed along specific dimensions between other anatomically adjacent ROIs, including STGA and STGM, temporal pole (TP) and the rest of the anterior temporal lobe (ATL), and InsA and InsP. Anatomical proximity explained 15% of the variance in embedding distance (mean adjusted r² = 0.15 for regressions between anatomical and embedding Euclidean distance, calculated separately for each ROI). Thus, the embedding representation elucidates organizational features of ROIs beyond anatomical proximity.

The grouping of canonical auditory ROIs is apparent in Fig. 3b, as PT, HGAL, and middle and posterior portions of the superior temporal gyrus (STGM, STGP) were all close to HGPM in embedding space. Interestingly, so was STSU, which was significantly closer to auditory cortex in embedding space compared to STSL (p = 0.002). This separation between STSL and STSU is consistent with differences in their response properties reported recently⁷. Particularly, responses in STSL, but not STSU, were predictive of performance in a semantic categorization task. This suggests that in embedding space, STSL would be closer to regions involved in semantic processing compared to STSU. Indeed, a permutation analysis revealed that STSL was closer to ROIs reported to contribute to semantic processing [inferior frontal gyrus (IFG) pars operculum/triangularis/orbitalis (IFGop, IFGtri, IFGor), TP, STGA, MTGA, MTGP, anterior and posterior portions of inferior temporal gyrus (ITGA, ITGP), anterior and posterior angular gyrus (AGA, AGP), supramarginal gyrus (SMG)]^38–40 compared to STSU (p = 0.010).

InsP responds robustly to acoustic stimuli⁵, suggesting that a portion of this area could be considered an auditory region⁴¹. It can track relatively fast (> 100 Hz) temporal modulations, similar to HGPM^5,42, possibly due to direct inputs from the auditory thalamus. However, InsP was functionally segregated from HGPM and was situated between auditory and limbic ROIs, consistent with the broader role of InsP in polysensory exteroceptive processing and interoception^43,44.

Figure 3b also characterizes the temporal and parietal ROIs outside auditory cortex that are nonetheless part of the extended auditory network, including components of the dorsal and ventral processing streams. These ‘auditory-related’ ROIs (shades of green in Fig. 3b), were distributed along a considerable extent of all four dimensions, consistent with functional heterogeneity of these regions and their involvement in multimodal integration⁴⁵.

So far, the results of analyses have been presented at a single spatial scale (t = 1), which emphasized local network structure. Analysis of data at a coarser spatial resolution (t = 5) showed that the dominant structure in the data was governed by dimension 1, with auditory ROIs at one end and prefrontal ROIs at the other (Supplementary Fig. 4). This indicates that the functional distance between these two groups of ROIs is the most prominent of all organizational features in the data.

Hierarchical clustering of ROIs

Hierarchical clustering applied to the first four dimensions of the embedded data revealed segregation of the ROIs into several well-delineated clusters (Fig. 4). Auditory cortical ROIs (excluding PP) formed an ‘Auditory’ cluster with STSU. Another major cluster (labeled ‘Limbic’) included ROIs traditionally considered part of the limbic system [parahippocampal gyrus (PHG), amygdala and hippocampus], ATL ROIs (TP, STGA, PP), and the insula. ROIs typically considered part of the ventral and dorsal auditory streams segregated into two clusters. Additional clusters included ROIs involved in executive and sensorimotor functions (labeled ‘Executive’ and ‘Action’, respectively in Fig. 4). Thus, the hierarchical clustering analysis revealed a segregation of ROIs in embedding space that aligned with known functional differentiation of brain regions. This analysis expands our understanding of functional relationships within the auditory cortical hierarchy (e.g., clustering of STSU with canonical auditory ROIs and segregation of PP and InsP from HGPM) and between auditory and higher order cortical areas (e.g., the proximity of ‘Auditory’ and ‘Limbic’ clusters).

Identification of network hubs

Identification of ‘global hubs’ within brain networks is critical for understanding their topology⁴⁶. These nodes integrate and regulate information flow in the network by virtue of their centrality and strong connectivity. DME can identify global hubs, as the closer an ROI is to the center of the data cloud in embedding space, the more equal is its connectivity to the rest of the network. This is illustrated in Fig. 5a, which depicts a simulated network of five ROIs, with one serving as a global hub (Fig. 5a, left panel, green). The network structure can also be represented as an adjacency matrix, wherein the hub ROI has strong connectivity with other ROIs (Fig. 5a, middle panel). In embedding space, this ROI occupies a central location, with the other four serving as spokes, i.e., nodes that interact with each other through the central hub (Fig. 5a, right panel).

However, a node’s proximity to the center of the data cloud reflects the homogeneity of its connectivity to the rest of the network. Thus, a node can appear at a central location if it is weakly but consistently connected to all other nodes. By contrast, a node strongly connected to a small number of nodes (‘local hub’) can have high mean connectivity but still be distant from the center of the data cloud. Combining distance from the center with mean connectivity (Fig. 5b) allows for classification of ROIs as global versus local hubs and for distinguishing hubs from spokes and central nodes that have relatively weak overall connectivity (‘bridge’). ATL ROIs (MTGA, STGA, but not TP) and MTGM lie in the upper left quadrant of the plot > 2 standard deviations from the center of the data cloud (outer dashed ellipse) and appear as global hubs. ITGA, posterior cingulate/precuneus (PCC/preCun), and STSL also exhibited hub-like properties, i.e., were located in the upper left quadrant of Fig. 5b.

STSU and orbitofrontal gyrus (OG) tended toward high connectivity but were located at a greater distance from the center and can thus be considered local hubs. Primary sensory and motor regions and canonical auditory cortex (except PP) tended to behave as spokes, occupying the lower right quadrant of the graph. Finally, the ROIs in the lower left quadrant can be considered bridges between isolated clusters of ROIs, characterized by more selective connectivity patterns. Thus, DME can identify topological features critical to information flow within cortical networks.

Comparisons across hemispheres

Speech and language networks are lateralized in the human brain, with nearly all right-handed and most left-handed individuals left hemisphere language-dominant⁴⁷. However, both hemispheres are activated during speech processing^9,22,48,49, and the extent to which lateralization is reflected in asymmetries in the organization of auditory networks is unclear. We addressed this issue by comparing the geometry of cortical networks derived from participants with electrode coverage in the language-dominant (N = 22) versus non-dominant (N = 21) hemisphere. ROIs exhibited a symmetric functional organization in embedding space (Supplementary Fig. 5). Pairwise inter-ROI distances in embedding space, calculated separately for dominant versus non-dominant hemisphere, were highly correlated (r = 0.77), with no obvious outliers (Fig. 6, left panel). Furthermore, this was true specifically for ROIs involved in speech and language comprehension and production [PT, PP, STSL, STGP, STGM, STGA, SMG, AGA, premotor cortex (PMC), precentral gyrus (PreCG), IFGop, IFGtr]^12,50,51 (r = 0.71; Fig. 6, right panel). Permutation analysis indicated that neither the positions of ROIs in embedding space nor the pairwise distances between ROIs were significantly different between dominant and non-dominant hemispheres (all p-values > 0.05). Thus, hemispheric asymmetry of functional organization of speech and language networks was not detectable in RS connectivity.

Stability of functional geometry across frequency band and connectivity measures

Embeddings presented so far have been derived from gamma-band power envelope correlations. Other bands (theta, alpha, beta, high gamma) and a different measure (debiased weighted phase lag index, wPLI⁵²) produced similar embeddings. For power envelope correlations, inter-ROI distances were highly similar for adjacent bands (r ≥ 0.83), and even for non-adjacent bands (r ≥ 0.72; Supplementary Fig. 6a). For wPLI, correlations were only slightly weaker (0.59–0.79). Correlations were strong even across the two connectivity measures (0.58–0.78). Thus, DME identified overall, rather than band- or measure-specific, organizational features of cortical networks.

However, a particular band or measure might be preferred if it produced narrower estimation margins in the functional geometry. An overall relative uncertainty was calculated as the variation of ROI position across bootstrapped samples relative to the average distance to other ROIs. Relative uncertainty was similar across bands and measures but was lowest for power envelope correlations in beta, gamma, and high gamma bands (Supplementary Fig. 6b). These analyses suggest that gamma power envelope correlations are a robust measure for exploring functional geometry.

Comparison to embeddings derived from RS-fMRI data

Intracranial recordings sample the brain non-uniformly and sparsely as dictated by clinical considerations. To examine the impact of this sampling, DME was applied to RS-fMRI data available in a subset of ten participants. We first verified the consistency of functional geometry derived from the two modalities in the same participants (Fig. 7). Connectivity matrices were constructed based on RS-fMRI data from voxels located at iEEG recording sites and grouped into the same ROIs as in Fig. 1. The iEEG and fMRI embeddings averaged across participants were qualitatively similar (Fig. 7a, b), and the overall organization derived from this subset was consistent with that observed in the full iEEG dataset (cf. Figure 3b). Inter-ROI distances in the fMRI and iEEG embedding spaces were strongly correlated (Fig. 7), with highest correlations for beta-, gamma- and high gamma-band envelopes (r > 0.5; Fig. 7d, line and symbols). Similar results were observed at the individual participant level as well, albeit with smaller r values (Fig. 7d, box and whiskers plot).

The analysis presented in Fig. 7 indicates that fMRI data can be used to address two questions regarding the effects of limited, non-uniform sampling. The first question is the effect of non-uniformly sampling only a subset of brain regions. We used a standard parcellation scheme developed for fMRI data (Schaefer-Yeo 400 ROIs;⁵³) rather than the iEEG parcellation scheme introduced in Fig. 1. For each participant, embeddings were derived from RS-fMRI connectivity matrices computed from all cortical ROIs (“Full fMRI”, Fig. 8a, leftmost column). A subset of these ROIs [“Full fMRI (iEEG subset)”, Fig. 8a, 2nd column] were retained for comparison to embeddings computed from the fMRI data corresponding to the ROIs sampled by iEEG [“Partial fMRI (ROI level)”, Fig. 8a, 3rd column]. The two embeddings were compared by computing the correlation between inter-ROI distances in the respective embedding spaces (Fig. 8b). Although the scale of the embeddings was different for the full fMRI versus partial fMRI data (because the number of dimensions was different), the two were highly correlated (r = 0.94; Fig. 8c). Thus, embeddings constructed from the portion of the brain sampled by iEEG were nearly identical to embeddings derived from the whole brain.

The second question is the effect of representing an entire ROI by sparse sampling with a limited number of electrodes. To address this question, embeddings were computed from the averages across entire ROIs in each participant [“Partial fMRI (ROI level)”, Fig. 8a, 3rd column] and from averages of the fMRI voxels located at iEEG recording sites [“Partial fMRI (site level)”, Fig. 8a, rightmost column]. ROI- and site-level embedding distances were strongly correlated (r > 0.5; Fig. 8c). Thus, sparse sampling within an ROI had a greater impact on estimates of functional geometry than limited sampling of the complete set of ROIs. Overall, ROIs were faithfully represented in embedding space even when DME was based on a small number of locations within ROIs. Taken together, these results indicate broad consistency between functional organization derived from iEEG and fMRI and the robustness of this approach to sparse sampling afforded by iEEG recordings.

Organization of auditory cortical networks

We have shown that DME applied to iEEG data can be used to characterize the organization of the human auditory cortical hierarchy. Previous work in macaque has defined over a dozen auditory cortical fields based on cytoarchitectonics, connectivity, and response properties⁵⁴. By contrast, there is no consensus on how auditory cortex is organized in humans, with multiple candidate parcellations based on cytoarchitectonics, tonotopy or myeloarchitecture ^55–58. Our results contribute to this body of knowledge by showing that superior temporal ROIs including core auditory cortex (HGPM) and putative auditory belt and parabelt areas (PT, HGAL, STGP, STGM^55,58) cluster together in embedding space, indicating functional similarity. We could also discern hierarchical relationships within this cluster by combining embedding analysis with mean connectivity. Specifically, despite the proximity of HGPM to other auditory ROIs in embedding space (Fig. 3b), it exhibited weaker mean connectivity, suggesting that it functions as a spoke rather than a local hub (Fig. 5b). This is consistent with the position of HGPM at the lowest level of the auditory cortical hierarchy. STGP had an intermediate position between HGPM and HGAL/PT/STGM in terms of its eccentricity and mean connectivity. This suggests that STGP is the locus of relatively early non-core auditory cortex^59,60, occupying a low-level position in the hierarchy⁶¹.

Functional differentiation between STSU and STSL

The superior temporal sulcus is a critical node in speech and language networks linking canonical auditory cortex with higher order temporal, parietal, and frontal areas^{6,48,50,62−64}. Previous studies have shown that STSU and STSL differ in cytoarchitecture⁶⁵ and have distinct responses to speech^23,66−68. A recent iEEG study demonstrated enhanced, shorter-latency, responses to speech syllables in STSU compared to STSL⁷. STSU is traditionally not considered part of canonical auditory cortex (but see ⁵⁶), yet it clustered with auditory cortical ROIs. STSL, by contrast, was closer in embedding space to semantic ROIs. This is consistent with iEEG evidence that responses in STSL, but not STSU, correlated with performance on a semantic categorization task⁷. The regions specifically involved in semantic processing is a current topic of debate, with multiple competing models^28,38−40, and we defined a list of semantic ROIs by combining across these models. Using a family of plausible candidate ROIs for this comparison dilutes the importance of including or excluding any particular ROI. Taken together, the results firmly place STSU and STSL at different levels of the auditory cortical hierarchy.

Functional and theoretical framework of a limbic auditory pathway

Multiple lines of evidence support a pathway linking auditory cortical and limbic structures^69–72 that subserves auditory memory^14,17,18 and affective sound processing⁷³. The data presented here contribute to our understanding of this pathway. Clustering analysis identified a set of ROIs (PP, InsP, STGA, InsA, TP) positioned between auditory and limbic cortex (Fig. 4). PP is anatomically close and connected to auditory cortex⁷⁴, yet it is unique in auditory-responsive cortex for its syntactic-level language processing³ and its preferential activation by music, which has a strong affective component⁴. The recently reported connectivity between Hippocampus and PP⁷⁵ is consistent with PP’s role as an intermediate node in this stream. Similarly, InsP, despite its anatomical proximity and overlapping response properties with HGPM, is likely involved in the transformation of auditory information in auditory cortex to affective representations in InsA⁵.

The ATL structures STGA and TP are involved in semantic processing^3,28 and auditory memory⁷⁶, in particular the representation and retrieval of memories for people, social language, and behaviors (‘social knowledge’)⁷⁷. Tight clustering of TP with limbic ROIs in embedding space is consistent with its previously reported functional association with limbic cortex^78,79, with which TP shares key features of laminar cytoarchitecture and strong connectivity⁷⁵. We suggest that the organization depicted in Figs. 3 and 4, combined with evidence for bidirectional information sharing between auditory cortex and limbic areas, merits the identification of a third auditory processing stream alongside the dorsal and ventral streams^8,80. This ‘limbic stream’ would underlie auditory contributions to affective and episodic memory processing.

Ventral and dorsal streams linking auditory and frontal cortex

Current models of speech and language processing posit the existence of ventral and dorsal processing streams linking non-core auditory cortex with PMC and inferior frontal gyrus via several distinct anatomical pathways encompassing temporal, parietal, and frontal cortex ^{8–10, 50}. Despite substantial experimental evidence supporting these models, there is a lack of consensus on the specific functions subserved by the two streams. For example, the dorsal stream has been envisioned to subserve spatial processing (“where”⁸), sensorimotor integration (“how”⁹), and syntactic processing¹⁰. There is a parallel debate about the specific cortical regions comprising the two streams.

As broadly predicted by these models, temporal and parietal ROIs segregated in embedding space in the analysis presented here (Fig. 3b, 4). We observed a cluster that included STSL, middle and inferior temporal gyrus ROIs, in conformity with the ventral auditory stream proposed by Hickok and Poeppel⁹ and Friederici¹⁰. By contrast, the cluster that included SMG, AGP, and AGA aligned with the dorsal processing stream as proposed by Rauschecker and Scott⁸. Association of FG and MOG with the ventral and dorsal clusters, respectively, likely represents the sharing of information across sensory modalities.

A previous fMRI-based DME study found that primary sensory and default mode ROIs segregated along the first dimension in embedding space³². Coverage of mesial cortex in our dataset was limited, precluding a direct comparison. However, the striking separation between auditory and prefrontal cortex in embedding space shown here, and its robustness to the choice of the parameter t, indicate that the current results align well with the previous report. This separation places auditory and frontal regions at opposite ends of the auditory processing hierarchy, linked by ventral and dorsal processing streams^8–10.

Network hubs

Hubs in brain networks play a critical role in integrating distributed neural activity⁴⁶. In the present analysis, global hubs were characterized by their central location within embedding space and high mean connectivity (Fig. 5). These hubs included STGA and MTGA, both components of the ATL. Previous reports indicate that ATL serves as a transmodal hub, transforming sensory domain-specific to domain-general representations^28,81,82 and playing a central role in semantic processing and social memory^28,77,83. MTGM also appears as a global hub, even though it is not formally part of the ATL. Interestingly, patients with semantic dementia have ATL degeneration^84,85, but the damage is often more widespread and can include MTGM⁸⁶.

Several other ROIs also exhibited hub-like properties. These included ITGA (part of ATL), the well-established global hub PCC/preCun³², and STSL. The latter observation is consistent with the diverse and multimodal functionality attributed to the STS previously^63,87.

Unlike other ATL structures, TP does not appear as a global hub in Fig. 5b. The close association of TP with limbic structures in embedding space suggests that TP mediates interactions between multimodal integration centers in the ATL and structures subserving memory functions. More broadly, the heterogeneity of ATL ROIs in terms of their global hub-like connectivity profiles conforms to the observation that the terminal fields of white matter tracts converging in the ATL only partially overlap^28,88,89.

Hemispheric lateralization

Although speech and language networks are classically described as highly lateralized, imaging studies have demonstrated widespread bilateral activation during speech and language tasks^90–92. We found no evidence for hemispheric differences in cortical functional organization based on analysis of all sampled brain regions, nor when comparison was restricted to speech and language ROIs (Fig. 6). These results are consistent with a recent fMRI study²⁶ showing similar RS connectivity patterns between left and right temporal lobe (but see⁶).

A recent study that applied DME to RS-fMRI data presented evidence for hemispheric asymmetry in semantic networks, which correlated with semantic task performance⁹³. The large dataset in that study and the availability of data from both hemispheres in every participant likely facilitated detection of those differences. Another possible reason for this divergence is that we investigated hemispheric differences based on all relevant dimensions of embedding space, whereas the previous study relied only on position along the first dimension. In addition, hemispheric asymmetry in network connectivity increases along the cortical hierarchy⁹⁴. Thus, ROIs involved in heteromodal semantic processing would likely show greater hemispheric differences compared to speech and language ROIs, which include lower order sensory regions. The present study suggests that neither anatomical nor task-specific functional hemispheric asymmetries necessarily translate into asymmetric network configurations during resting state. This does not exclude the possibility of asymmetries emerging during auditory tasks, for example reflecting hemispheric biases in spectral and temporal processing^9,12.

Caveats & limitations

A key concern regarding all human iEEG studies is that participants may not be representative of a healthy population. In the present study, results were consistent across participants despite differences in seizure disorder histories, medications, and seizure foci, and aligned with results obtained previously in healthy participants³². Another caveat is that our dataset, however extensive, did not sample the entire brain, and it was not possible to infer connectivity with unsampled regions. To address this, we applied DME analysis to fMRI data to establish that the organization of ROIs in embedding space was robust to the exclusion of unsampled ROIs. Although there was a greater effect of sparse, non-uniform sampling within an ROI, there was still considerable similarity in functional organization to embeddings derived from averages across the entire ROI.

While subcortical structures (e.g., thalamus) that link sensory and higher order networks⁹⁵ were not sampled, the functional organization presented here was likely influenced indirectly by thalamo-cortical pathways^61,96. Previous fMRI studies of RS networks focused exclusively on cortical ROIs and did not consider the role of the thalamus and other subcortical structures. Despite this limitation, these studies have yielded valuable insights into the functional organization of the human cortical networks^97,98.

Concluding remarks and future directions

This study extends the DME approach to characterize functional relationships between cortical regions investigated using iEEG recordings. These data help resolve several outstanding issues regarding the functional organization of human auditory cortical networks and stress the importance of a limbic pathway complementary to the dorsal and ventral streams. These results lay the foundation for future work investigating network organization during active speech and language processing. While the current work focused on auditory cortical networks, this approach can be readily generalized to advance our understanding of changes in brain organization during sleep and anesthesia, disorders of consciousness, as well as reorganization of cortical functional geometry secondary to lesions.

Participants

The study was carried out in 46 neurosurgical patients (20 females) diagnosed with medically refractory epilepsy. The patients were undergoing chronic invasive electrophysiological monitoring to identify seizure foci prior to resection surgery (Supplementary Table 1). Research protocols were approved by the University of Iowa Institutional Review Board and the National Institutes of Health, and written informed consent was obtained from all participants. Research participation did not interfere with acquisition of clinically necessary data, and participants could rescind consent for research without interrupting their clinical management.

All participants except two were native English speakers. The participants were predominantly right-handed (39 out of 46); six participants were left-handed, and one had bilateral handedness. The majority of participants (32 out of 46) were left language-dominant, as determined by Wada test. Two participants were right hemisphere-dominant, and one had bilateral language dominance. The remaining 11 participants were not evaluated for language dominance; 9 of them were right-handed and thus were assumed left language-dominant for the purposes of the analysis of lateralization (see below). The remaining two participants who did not undergo Wada test and who were left-handed were not included in that analysis.

All participants underwent audiological and neuropsychological assessment prior to electrode implantation, and none had auditory or cognitive deficits that would impact the results of this study. The participants were tapered off their antiepileptic drugs during chronic monitoring when RS data were collected.

Experimental procedures

Pre-implantation neuroimaging. All participants underwent whole-brain high-resolution T1-weighted structural MRI scans before electrode implantation. In a subset of ten participants (Supplementary Table 2), RS-fMRI data were used for estimates of functional connectivity. The scanner was a 3T GE Discovery MR750W with a 32-channel head coil. The pre-electrode implantation anatomical T1 scan (3D FSPGR BRAVO sequence) was obtained with the following parameters: FOV = 25.6 cm, flip angle = 12 deg., TR = 8.50 ms, TE = 3.29 ms, inversion time = 450 ms, voxel size = 1.0 × 1.0 × 0.8 mm. For RS-fMRI, 5 blocks of 5-minute gradient-echo EPI runs (650 volumes) were collected with the following parameters: FOV = 22.0 cm, TR = 2260 ms, TE = 30 ms, flip angle = 80 deg., voxel size = 3.45 × 3.45 × 4.0 mm. In some cases, fewer RS acquisition sequences were used in the final analysis due to movement artifact or because the full scanning session was not completed. For each participant, RS-fMRI runs were acquired in the same session but non-contiguously (dispersed within an imaging session to avoid habituation). Participants were asked to keep their eyes open, and a fixation cross was presented through a projector.

iEEG recordings. iEEG recordings were obtained using either subdural and depth electrodes, or depth electrodes alone, based on clinical indications. Electrode arrays were manufactured by Ad-Tech Medical (Racine, WI). Subdural arrays, implanted in 36 participants out of 46, consisted of platinum-iridium discs (2.3 mm diameter, 5–10 mm inter-electrode distance), embedded in a silicon membrane. Stereotactically implanted depth arrays included between 4 and 12 cylindrical contacts along the electrode shaft, with 5–10 mm inter-electrode distance. A subgaleal electrode, placed over the cranial vertex near midline, was used as a reference in all participants. All electrodes were placed solely on the basis of clinical requirements, as determined by the team of epileptologists and neurosurgeons⁹⁹.

No-task RS data were recorded in the dedicated, electrically shielded suite in The University of Iowa Clinical Research Unit while the participants lay in the hospital bed. RS data were collected 6.4 +/- 3.5 days (mean ± standard deviation; range 1.5–20.9) after electrode implantation surgery. In the first 15 participants (L275 through L362), data were recorded using a TDT RZ2 real-time processor (Tucker-Davis Technologies, Alachua, FL). In the remaining 31 participants (R369 through L585), data acquisition was performed using a Neuralynx Atlas System (Neuralynx Inc., Bozeman, MT). Recorded data were amplified, filtered (0.1–500 Hz bandpass, 5 dB/octave rolloff for TDT-recorded data; 0.7–800 Hz bandpass, 12 dB/octave rolloff for Neuralynx-recorded data) and digitized at a sampling rate of 2034.5 Hz (TDT) or 2000 Hz (Neuralynx). The durations of recordings were 13 +/- 11 min. In all but two participants, recording durations were between 10 and 22 min.; in one participant duration was 6 min., and in one participant the duration was 81 min.

Data analysis

Anatomical reconstruction and ROI parcellation. Localization of recording sites and their assignment to ROIs relied on post-implantation T1-weighted anatomical MRI and post-implantation computed tomography (CT). All images were initially aligned with pre-operative T1 scans using linear coregistration implemented in FSL (FLIRT)¹⁰⁰. Electrodes were identified in the post-implantation MRI as magnetic susceptibility artifacts and in the CT as metallic hyperdensities. Electrode locations were further refined within the space of the pre-operative MRI using three-dimensional non-linear thin-plate spline warping¹⁰¹, which corrected for post-operative brain shift and distortion. The warping was constrained with 50–100 control points, manually selected throughout the brain, which were visually aligned to landmarks in the pre- and post-implantation MRI.

To pool data across participants, the dimensionality of connectivity matrices was reduced by assigning electrodes to one of 58 ROIs organized into 6 ROI groups (see Fig. 1; Supplementary Table 2, 3) based upon anatomical reconstructions of electrode locations in each participant. For subdural arrays, ROI assignment was informed by automated parcellation of cortical gyri^102,103 as implemented in the FreeSurfer software package. For depth arrays, it was informed by MRI sections along sagittal, coronal, and axial planes. For recording sites in Heschl’s gyrus, delineation of the border between core auditory cortex adjacent non-core areas (HGPM and HGAL, respectively) was performed in each participant using physiological criteria^104,105. Specifically, recording sites were assigned to HGPM if they exhibited phase-locked (frequency-following) responses to 100 Hz click trains and if the averaged evoked potentials to these stimuli featured short-latency (< 20 ms) peaks. Such response features are characteristic for HGPM and are not present within HGAL¹⁰⁴. Additionally, correlation coefficients between average evoked potential waveforms recorded from adjacent sites were examined to identify discontinuities in response profiles along Heschl’s gyrus that could be interpreted as reflecting a transition from HGPM to HGAL. Superior temporal gyrus was subdivided into posterior and middle non-core auditory cortex ROIs (STGP and STGM), and auditory-related anterior ROI (STGA) using the transverse temporal sulcus and ascending ramus of the Sylvian fissure as macroanatomical boundaries. The insula was subdivided into posterior and anterior ROIs, with the former considered within the auditory-related ROI group⁵. Middle and inferior temporal gyrus were each divided into posterior, middle, and anterior ROIs by diving the gyrus into three approximately equal-length thirds. Angular gyrus was divided into posterior and anterior ROIs using the angular sulcus as a macroanatomical boundary. Anterior cingulate cortex was identified by automatic parcellation in FreeSurfer and was considered as part of the prefrontal ROI group, separately from the rest of the cingulate gyrus. Postcentral and precentral gyri were each divided into ventral and dorsal portions using the y_MNI coordinate (see below) of 40 mm as a boundary. Recording sites identified as seizure foci or characterized by excessive noise, and depth electrode contacts localized to the white matter or outside brain, were excluded from analyses and are not listed in Supplementary Table 2.

Preprocessing of fMRI data. Standard preprocessing was applied to the RS-fMRI data acquired in the pre-implantation scan using FSL’s FEAT pipeline, including spatial alignment and nuisance regression. White matter, cerebrospinal fluid and global ROIs were created using deep white matter, lateral ventricles and a whole brain mask, respectively. Regression was performed using the time series of these three nuisance ROIs as well as 6 motion parameters (3 rotations and 3 translations) and their derivatives, detrended with second order polynomials. Temporal bandpass filtering was 0.008–0.08 Hz. Spatial smoothing was applied with a Gaussian kernel (6 mm full-width at half maximum). The first two images from each run were discarded. Frame censoring was applied when the Euclidean norm of derivatives of motion parameters exceeded 0.5 mm¹⁰⁶. All runs were processed in native EPI space, then the residual data were transformed to MNI152 and concatenated.

Preprocessing of iEEG data. Analysis of iEEG data was performed using custom software written in MATLAB Version 2020a programming environment (MathWorks, Natick, MA, USA). After initial rejection of recording sites identified as seizure foci, several automated steps were taken to exclude recording channels and time intervals contaminated by noise. First, channels were excluded if average power in any frequency band (broadband, delta, theta, alpha, beta, gamma, or high gamma; see below) exceeded 3.5 standard deviations of the average power across all channels for that participant. Next, transient artifacts were detected by identifying voltage deflections exceeding 10 standard deviations on a given channel. A time window was identified extending before and after the detected artifact until the voltage returned to the zero-mean baseline plus an additional 100 ms buffer before and after. High-frequency artifacts were also removed by masking segments of data with high gamma power exceeding 5 standard deviations of the mean across all segments. Only time bins free of these artifact masks were considered in subsequent analyses. Artifact rejection was applied across all channels simultaneously so that all connectivity measures were derived from the same time windows. Occasionally, particular channels survived the initial average power criteria yet had frequent artifacts that led to loss of data across all the other channels. There is a tradeoff in rejecting artifacts (losing time across all channels) and rejecting channels (losing all data for that channel). If artifacts occur on many channels, there is little benefit to excluding any one channel. However, if frequent artifacts occur on one or simultaneously on up to a few channels, omitting these can save more data from other channels than those channels contribute at all other times. We chose to optimize the total data retained, channels × time windows, and omitted some channels when necessary. To remove shared signals unlikely to derive from brain activity, data from retained channels were high-pass filtered above 200 Hz, and a spatial filter was derived from the singular value decomposition omitting the first singular vector. This spatial filter was then applied to the broadband signal to remove this common signal.

Connectivity analysis. For RS-fMRI data, BOLD signals were averaged across voxel groupings and functional connectivity was calculated as Pearson correlation coefficients. Voxel groupings were either based on the Schaefer-Yeo 400 parcellation scheme⁵³ in MNI-152 space, or were based on iEEG electrode location in participant space (see Fig. 1). For the latter, fMRI voxels were chosen to represent comparable regions of the brain recorded by iEEG electrodes. For each electrode, the anatomical coordinates of the recording site were mapped to the closest valid MRI voxel, E, and a sphere of 25 voxels (25 mm³) centered on E used as the corresponding recording site. This process was repeated for all N electrodes in the same ROI, and a single time series computed as the average of the fMRI BOLD signal in these N×25 voxels. These averages were used to compute an ROI-by-ROI connectivity matrix for RS-fMRI data. For comparisons between iEEG and fMRI embeddings, voxels were processed in participant space and ROI labels from the parcellation scheme illustrated in Fig. 1 and Supplementary Table 2 were applied to the fMRI data. For comparisons between fMRI embeddings derived from all cortical ROIs versus fMRI embeddings derived from just ROIs sampled in the iEEG experiments, electrode locations were transformed from participant space to MNI-152 space, then assigned to ROIs within the Schaefer-Yeo 400 scheme.

For iEEG data, two measures of functional (non-directed) connectivity were used: the orthogonalized band power envelope correlation²⁹ and the debiased weighted phase lag index (wPLI;⁵²), a measure of phase synchronization. Both measures avoid artifacts due to volume conduction by discounting connectivity near zero phase lag. For both measures, data were divided into 60-second segments, pairwise connectivity estimated in each segment, and then connectivity estimates averaged across all segments for that participant.

Envelope correlations were estimated for each data segment and every recording site as in²⁹, except time-frequency decomposition was performed using the demodulated band transform¹⁰⁷, rather than wavelets. For each frequency band (theta: 4–8 Hz, alpha: 8–13 Hz, beta: 13–30 Hz, gamma: 30–70 Hz; high gamma: 70–120 Hz), the power at each time bin was calculated as the average (across frequencies) log of the squared amplitude. For each pair of signals X and Y, one was orthogonalized to the other by taking the magnitude of the imaginary component of the product of one signal with the normalized complex conjugate of the other:

$${Y}_{orth}= \left|\text{I}\text{m}\left\{Y\times {X}^{*}/\left|X\right|\right\}\right|$$

Both signals were band-pass filtered (0.2–1 Hz), and the Pearson correlation calculated between signals. The process was repeated by orthogonalizing in the other direction and the overall envelope correlation for a pair of recording sites was the average of the two Pearson correlations.

wPLI was estimated for each data segment and every recording site pair from the sign of the imaginary part of the cross-spectrum at each frequency and averaged across frequencies within each band of interest (theta: 4–8 Hz, alpha: 8–13 Hz, beta: 13–30 Hz). The cross spectrum was calculated from the demodulated band transform as described previously¹⁰⁸.

Prior to diffusion map embedding, connectivity matrices were thresholded by saving at least the top third (rounded up) connections for every row, as well as their corresponding columns (to preserve symmetry). We also included any connections making up the minimum spanning tree of the graph represented by the elementwise reciprocal of the connectivity matrix to ensure the graph is connected.

ROI-based connectivity analysis. Connectivity between ROIs was computed as the average envelope correlation or wPLI value between all pairs of recording sites in the two ROIs. For analyses in which connectivity was summarized across participants (Fig. 3–8), we used only a subset of ROIs such that every possible pair of included ROIs was represented in at least two participants (Supplementary Table 2). This list of ROIs was obtained by iteratively removing ROIs with the worst cross-coverage with other ROIs until every ROI remaining had sufficient coverage with all remaining ROIs.

Diffusion map embedding. See the Appendix for details about DME. In brief, the connectivity matrix K = [k(i,j)] is normalized by degree to yield a transition probability matrix P. DME consists of performing an eigendecomposition of P, with eigenvalues scaled according to the free parameter t, which sets the spatial scale of the analysis. If the recording sites are conceptualized as nodes on a graph with edges defined by K, then P can be understood as the transition probability matrix for a ‘random walk’ or a ‘diffusion’ on the graph (see Appendix;^30,31). The parameter t is the number of time steps in that random walk; larger values of t correspond to exploring the structure of the data at larger spatial scales. In the analyses presented here, k(i,j) is either the orthogonalized power envelope correlations, or the debiased wPLI. We note that in recent applications of DME to fMRI functional connectivity data, K was first transformed by applying cosine similarity³². However, this additional step has not been universally implemented (e.g.,¹⁰⁹), nor is it required within the mathematical framework of DME^30,31. In the case of the dataset presented here, applying cosine similarity to the data served to smear the data in embedding space, increasing variance, and decreasing separation between ROIs, but otherwise produced qualitatively similar results.

In two sets of analyses presented here, pairs of embeddings were compared to each other: in the analysis of lateralization of speech and language networks, and in the comparison between iEEG and fMRI data. To do that, we used a change of basis operator to map embeddings into a common embedding space using the method described in Coifman et al 2014³¹.

Dimensionality reduction via low rank approximations to P. Diffusion map embedding offers an opportunity to reduce the dimensionality of the underlying data by considering only those dimensions that contribute importantly to the structure of the data, as manifested in the structure of the transition probability matrix P. We used the eigenvalue spectrum of P to determine its ideal low rank approximation, balancing dimensionality reduction and information loss. Because P is real and symmetric, the magnitude of the eigenvalues is identical to the singular values of P. The singular values tell us about the fidelity of low rank approximations to P. Specifically, if P has a set of singular values σ₁≥ σ₁≥…≥ σ_n, then for any integer k ≥ 1,

where $\tilde{{\mathbf{P}}_{k}}$ is the rank-k approximation to P. Thus, the magnitude of the eigenvalues corresponds to the fidelity of the lower dimensional approximation, and the difference in the magnitude of successive eigenvalues represents the improvement in that approximation as the dimensionality increases. The spectrum of P invariably has an inflection point (“elbow”), separating two sets of eigenvalues λ_i: those whose magnitude decreases more quickly with increasing i, and those beyond the inflection point whose magnitude decreases more slowly with increasing i. The inflection point thus delineates the number of dimensions that are most important for approximating P. The inflection point k_infl was identified algorithmically¹¹⁰, and the number of dimensions retained set equal to k_infl – 1.

Comparing distances in embedding space. The relative distance between points in embedding space provides insight into the underlying functional geometry. In several analyses presented here, two embeddings of identical sets of ROIs were compared as ROI distances within the two embeddings. After mapping to a common space and reducing dimensionality as described above, the two embeddings A and B were used to create the pairwise distance matrices A` and B`. The Pearson correlation coefficient r was then computed between the upper triangles (excluding the diagonal) of the corresponding elements in the distance matrices.

Signal to noise (SNR) characteristics. To measure the robustness of the embedding analysis to variability over time, an SNR was computed as follows. For each participant, a channel × channel P matrix was calculated for each 60 s segment of data. For each segment, DME analysis was applied and a channel × channel distance matrix calculated. These distance matrices were averaged across segments. The ‘signal’ of interest was defined as the variability (standard deviation) of this averaged distance matrix (ignoring the diagonals). The ‘noise’ was defined as the variability across time, estimated for each element of the distance matrix as the standard deviation across segments, then averaged across the elements of the matrix. The SNR for functional connectivity itself was computed in an analogous manner, using the original channel × channel connectivity matrix rather than the matrix of embedding distances.

Estimating precision in position and distances in embedding space. To obtain error estimates for both ROI locations in embedding space and embedding distance between ROIs, average ROI × ROI adjacency matrices were calculated. This was done by drawing each edge from an averaged bootstrap sample across participants, obtaining 10,000 such adjacency matrices, and performing diffusion map embedding for each. For locations in embedding space, these embeddings were then mapped via the change of basis procedure described above to the original group average embedding space. For each ROI, the mapped bootstrap iterations produced a cloud of locations in embedding space which were summarized by the standard deviation in each dimension. For embedding distances, no change of basis was necessary because distances were preserved across bases.

To compare the positions of STSL versus STSU relative to canonical auditory cortical ROIs (HGPM, HGAL, PT, PP, STGP, and STGM) or ROIs involved in semantic processing (STGA, MTGA, MTGP, ITGA, ITGP, TP, AGA, AGP, SMG, IFGop, IFGtr, IFGor^28,38−40), we calculated the average pairwise distance from STSL or STSU to each such ROI. The difference between these averages was compared to a null distribution obtained by Monte Carlo sampling of the equivalent statistic obtained by randomly exchanging STSL/STSU labels by participant. The specific comparisons performed were chosen a priori to constrain the number of possible hypotheses to test; pairwise comparisons of all possible ROI pairs (let alone comparisons of all higher-order groupings) would not have had sufficient statistical power under appropriate corrections for multiple comparisons. Though different choices could have been made for inclusion in the “semantic processing” category, exchanging one or two of these ROIs would not strongly influence the average distance in a group of twelve ROIs.

Hierarchical clustering. Agglomerative hierarchical clustering was done using the linkage function in MATLAB, with Euclidean distance as the distance metric and Ward’s linkage (minimum variance algorithm) as the linkage method. The ordering of ROIs along the horizontal axis in the dendrogram was determined using the optimalleaforder function in MATLAB, with the optimization criterion set to ‘group’.

Comparing language dominant/non-dominant hemispheres. To test for differences in embedding location between language dominant and non-dominant hemispheres, two measures were considered: different location of individual ROIs in embedding space, and different pairwise distances between ROIs in embedding space. To calculate differences in location of individual ROIs, dominant/non-dominant average embeddings were mapped to a common space (from an embedding using the average across all participants regardless of language dominance) using the change of basis operator. The language-dominant location difference for a specific ROI was calculated as the Euclidean distance between the two locations of each ROI in this common space. For pairwise distances, the change of basis is irrelevant, so pairwise Euclidean distances were calculated in embedding space for each hemisphere and then subtracted to obtain a difference matrix. To determine whether the differences in location or pairwise distances were larger than expected by chance, random permutations of the dominant/non-dominant labels were used to generate empirical null distributions. Since this approach produces a p-value for every pair of connections, p-values were adjusted using false discovery rate (FDR) to account for multiple comparisons.

Analyses of fMRI connectivity in embedding space. Two sets of analyses were performed using fMRI data. First, iEEG and fMRI data were compared in embedding space. In this analysis, connectivity based on RS-fMRI data from voxels located at electrode recording sites was compare with the corresponding connectivity matrix derived from iEEG data. The embedding analysis was applied to the two connectivity matrices, all pairwise inter-ROI distances computed, and iEEG and fMRI data compared using the correlation of the pairwise ROI distances. The second analysis was to compare embeddings derived from all ROIs in the RS-fMRI scans to those derived from just ROIs sampled with iEEG electrodes. Here, ROI × ROI connectivity matrices were computed for all ROIs, then embeddings created from the full matrices or from matrices containing just rows and columns corresponding to the ROIs sampled with iEEG.

Acknowledgements

This work was supported by the National Institutes of Health (grant numbers R01-DC04290, R01-GM109086, S10OD025025, UL1-RR024979). We are grateful to Jess Banks, Alex Billig, Haiming Chen, Phillip Gander, Christopher Garcia, Matthew Howard, Ariane Rhone, and Matthew Sutterer for help with data collection, analysis, and comments on the manuscript.

Scott, S.K. The neurobiology of speech perception and production–can functional imaging tell us anything we did not already know? J Commun Disord 45, 419–425 (2012).
Woods, D.L. & Alain, C. Functional imaging of human auditory cortex. Curr Opin Otolaryngol Head Neck Surg 17, 407–411 (2009).
Friederici, A.D., Meyer, M. & von Cramon, D.Y. Auditory language comprehension: an event-related fMRI study on the processing of syntactic and lexical information. Brain Lang 75, 289–300 (2000).
Angulo-Perkins, A., et al. Music listening engages specific cortical regions within the temporal lobes: differences between musicians and non-musicians. Cortex 59, 126–137 (2014).
Zhang, Y., et al. The Roles of Subdivisions of Human Insula in Emotion Perception and Auditory Processing. Cereb Cortex 29, 517–528 (2019).
Abrams, D.A., Kochalka, J., Bhide, S., Ryali, S. & Menon, V. Intrinsic functional architecture of the human speech processing network. Cortex 129, 41–56 (2020).
Nourski, K.V., et al. Electrophysiology of the Human Superior Temporal Sulcus during Speech Processing. Cereb Cortex 31, 1131–1148 (2021).
Rauschecker, J.P. & Scott, S.K. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat Neurosci 12, 718–724 (2009).
Hickok, G. & Poeppel, D. The cortical organization of speech processing. Nat Rev Neurosci 8, 393–402 (2007).
Friederici, A.D. The cortical language circuit: from auditory perception to sentence comprehension. Trends Cogn Sci 16, 262–268 (2012).
Cloutman, L.L. Interaction between dorsal and ventral processing streams: where, when and how? Brain Lang 127, 251–263 (2013).
Hickok, G. & Poeppel, D. Neural basis of speech perception. Handb Clin Neurol 129, 149–160 (2015).
Rauschecker, J.P. Where, When, and How: Are they all sensorimotor? Towards a unified view of the dorsal pathway in vision and audition. Cortex 98, 262–268 (2018).
Munoz-Lopez, M.M., Mohedano-Moriano, A. & Insausti, R. Anatomical pathways for auditory memory in primates. Front Neuroanat 4, 129 (2010).
Kraus, K.S. & Canlon, B. Neuronal connectivity and interactions between the auditory and limbic systems. Effects of noise and tinnitus. Hear Res 288, 34–46 (2012).
Husain, F.T. & Schmidt, S.A. Using resting state functional connectivity to unravel networks of tinnitus. Hear Res 307, 153–162 (2014).
Kumar, S., et al. A Brain System for Auditory Working Memory. J Neurosci 36, 4492–4505 (2016).
Kumar, S., et al. Oscillatory correlates of auditory working memory examined with human electrocorticography. Neuropsychologia 150, 107691 (2021).
Geschwind, N. The organization of language and the brain. Science 170, 940–944 (1970).
Hagoort, P. The neurobiology of language beyond single-word processing. Science 366, 55–58 (2019).
McGettigan, C. & Scott, S.K. Cortical asymmetries in speech perception: what's wrong, what's right and what's left? Trends Cogn Sci 16, 269–276 (2012).
Turkeltaub, P.E. & Coslett, H.B. Localization of sublexical speech perception components. Brain Lang 114, 1–15 (2010).
Leaver, A.M. & Rauschecker, J.P. Cortical representation of natural complex sounds: effects of acoustic features and auditory object category. J Neurosci 30, 7604–7612 (2010).
Eisner, F., McGettigan, C., Faulkner, A., Rosen, S. & Scott, S.K. Inferior frontal gyrus activation predicts individual differences in perceptual learning of cochlear-implant simulations. J Neurosci 30, 7179–7186 (2010).
Saur, D., et al. Ventral and dorsal pathways for language. Proc Natl Acad Sci U S A 105, 18035–18040 (2008).
Jackson, R.L., Bajada, C.J., Rice, G.E., Cloutman, L.L. & Lambon Ralph, M.A. An emergent functional parcellation of the temporal cortex. Neuroimage 170, 385–399 (2018).
Visser, M., Jefferies, E. & Lambon Ralph, M.A. Semantic processing in the anterior temporal lobes: a meta-analysis of the functional neuroimaging literature. J Cogn Neurosci 22, 1083–1094 (2010).
Ralph, M.A., Jefferies, E., Patterson, K. & Rogers, T.T. The neural and computational bases of semantic cognition. Nat Rev Neurosci 18, 42–55 (2017).
Hipp, J.F., Hawellek, D.J., Corbetta, M., Siegel, M. & Engel, A.K. Large-scale cortical correlation structure of spontaneous oscillatory activity. Nat Neurosci 15, 884–890 (2012).
Coifman, R.R., et al. Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps. Proc Natl Acad Sci U S A 102, 7426–7431 (2005).
Coifman, R.R. & Hirn, M.J. Diffusion maps for changing data. Applied and Computational Harmonic Analysis 36, 79–107 (2014).
Margulies, D.S., et al. Situating the default-mode network along a principal gradient of macroscale cortical organization. Proc Natl Acad Sci U S A 113, 12574–12579 (2016).
Jones, E.G., Coulter, J.D. & Hendry, S.H. Intracortical connectivity of architectonic fields in the somatic sensory, motor and parietal cortex of monkeys. J Comp Neurol 181, 291–347 (1978).
Morel, A., Garraghty, P.E. & Kaas, J.H. Tonotopic organization, architectonic fields, and connections of auditory cortex in macaque monkeys. J Comp Neurol 335, 437–459 (1993).
Kaas, J.H. & Hackett, T.A. Subdivisions of auditory cortex and processing streams in primates. Proc Natl Acad Sci USA 97, 11793–11799 (2000).
Kaas, J.H. & Hackett, T.A. Subdivisions of auditory cortex and levels of processing in primates. Audiol Neurootol 3, 73–85 (1998).
Cavada, C., Company, T., Tejedor, J., Cruz-Rizzolo, R.J. & Reinoso-Suarez, F. The anatomical connections of the macaque monkey orbitofrontal cortex. A review. Cereb Cortex 10, 220–242 (2000).
Binder, J.R., Desai, R.H., Graves, W.W. & Conant, L.L. Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. Cereb Cortex 19, 2767–2796 (2009).
Humphreys, G.F., Hoffman, P., Visser, M., Binney, R.J. & Lambon Ralph, M.A. Establishing task- and modality-dependent dissociations between the semantic and default mode networks. Proc Natl Acad Sci U S A 112, 7857–7862 (2015).
Jackson, R.L., Hoffman, P., Pobric, G. & Lambon Ralph, M.A. The Semantic Network at Work and Rest: Differential Connectivity of Anterior Temporal Lobe Subregions. J Neurosci 36, 1490–1501 (2016).
Remedios, R., Logothetis, N.K. & Kayser, C. An auditory region in the primate insular cortex responding preferentially to vocal communication sounds. J Neurosci 29, 1034–1045 (2009).
Steinschneider, M., Nourski, K.V. & Fishman, Y.I. Representation of speech in human auditory cortex: is it special? Hear Res 305, 57–73 (2013).
Craig, A.D. Interoception: the sense of the physiological condition of the body. Curr Opin Neurobiol 13, 500–505 (2003).
Kuehn, E., Mueller, K., Lohmann, G. & Schuetz-Bosbach, S. Interoceptive awareness changes the posterior insula functional connectivity profile. Brain Struct Funct 221, 1555–1571 (2016).
Bernstein, L.E. & Liebenthal, E. Neural pathways for visual speech perception. Front Neurosci 8, 386 (2014).
Bullmore, E. & Sporns, O. Complex brain networks: graph theoretical analysis of structural and functional systems. Nat.Rev.Neurosci. 10, 186–198 (2009).
Knecht, S., et al. Handedness and hemispheric language dominance in healthy humans. Brain 123 Pt 12, 2512–2518 (2000).
Price, C.J. A review and synthesis of the first 20 years of PET and fMRI studies of heard speech, spoken language and reading. Neuroimage 62, 816–847 (2012).
Schirmer, A., Fox, P.M. & Grandjean, D. On the spatial organization of sound processing in the human temporal lobe: a meta-analysis. Neuroimage 63, 137–147 (2012).
Chang, E.F., Raygor, K.P. & Berger, M.S. Contemporary model of language organization: an overview for neurosurgeons. J Neurosurg 122, 250–261 (2015).
Ardila, A., Bernal, B. & Rosselli, M. How Localized are Language Brain Areas? A Review of Brodmann Areas Involvement in Oral Language. Arch Clin Neuropsychol 31, 112–122 (2016).
Vinck, M., Oostenveld, R., van Wingerden, M., Battaglia, F. & Pennartz, C.M. An improved index of phase-synchronization for electrophysiological data in the presence of volume-conduction, noise and sample-size bias. Neuroimage 55, 1548–1565 (2011).
Schaefer, A., et al. Local-Global Parcellation of the Human Cerebral Cortex from Intrinsic Functional Connectivity MRI. Cereb Cortex 28, 3095–3114 (2018).
Hackett, T.A., Preuss, T.M. & Kaas, J.H. Architectonic identification of the core region in auditory cortex of macaques, chimpanzees, and humans. J Comp Neurol. 441, 197–222 (2001).
Hackett, T.A. Anatomic organization of the auditory cortex. Handb Clin Neurol 129, 27–53 (2015).
Woods, D.L., et al. Functional properties of human auditory cortical fields. Front Syst Neurosci 4, 155 (2010).
Barton, B., Venezia, J.H., Saberi, K., Hickok, G. & Brewer, A.A. Orthogonal acoustic dimensions define auditory field maps in human cortex. Proc Natl Acad Sci U S A 109, 20738–20743 (2012).
Moerel, M., De Martino, F. & Formisano, E. An anatomical and functional topography of human auditory cortical areas. Front Neurosci 8, 225 (2014).
Howard, M.A., et al. Auditory cortex on the human posterior superior temporal gyrus. J Comp Neurol. 416, 79–92 (2000).
Nourski, K.V., et al. Spectral organization of the human lateral superior temporal gyrus revealed by intracranial recordings. Cereb Cortex 24, 340–352 (2014).
Hamilton, L.S., Oganian, Y., Hall, J. & Chang, E.F. Parallel and distributed encoding of speech across human auditory cortex. Cell 184, 4626–4639 e4613 (2021).
Hickok, G. The functional neuroanatomy of language. Phys Life Rev 6, 121–143 (2009).
Beauchamp, M.S. The social mysteries of the superior temporal sulcus. Trends Cogn Sci 19, 489–490 (2015).
Venezia, J.H., et al. Auditory, Visual and Audiovisual Speech Processing Streams in Superior Temporal Sulcus. Front Hum Neurosci 11, 174 (2017).
Zachlod, D., et al. Four new cytoarchitectonic areas surrounding the primary and early auditory cortex in human brains. Cortex 128, 1–21 (2020).
Belin, P., Zatorre, R.J., Lafaille, P., Ahad, P. & Pike, B. Voice-selective areas in human auditory cortex. Nature 403, 309–312 (2000).
Deen, B., Koldewyn, K., Kanwisher, N. & Saxe, R. Functional Organization of Social Perception and Cognition in the Superior Temporal Sulcus. Cereb Cortex 25, 4596–4609 (2015).
Wilson, S.M., Bautista, A. & McCarron, A. Convergence of spoken and written language processing in the superior temporal sulcus. Neuroimage 171, 62–74 (2018).
Kahn, I., Andrews-Hanna, J.R., Vincent, J.L., Snyder, A.Z. & Buckner, R.L. Distinct cortical anatomy linked to subregions of the medial temporal lobe revealed by intrinsic functional connectivity. J Neurophysiol 100, 129–139 (2008).
Wang, S.F., Ritchey, M., Libby, L.A. & Ranganath, C. Functional connectivity based parcellation of the human medial temporal lobe. Neurobiol Learn Mem 134 Pt A, 123–134 (2016).
Michelmann, S., et al. Moment-by-moment tracking of naturalistic learning and its underlying hippocampo-cortical interactions. Nat Commun 12, 5394 (2021).
Rocchi, F., et al. Common fronto-temporal effective connectivity in humans and monkeys. Neuron 109, 852–868 e858 (2021).
Fruhholz, S., Trost, W. & Kotz, S.A. The sound of emotions-Towards a unifying neural network perspective of affective sound processing. Neurosci Biobehav Rev 68, 96–110 (2016).
Upadhyay, J., et al. Effective and structural connectivity in the human auditory cortex. J Neurosci 28, 3341–3349 (2008).
Maller, J.J., et al. Revealing the Hippocampal Connectome through Super-Resolution 1150-Direction Diffusion MRI. Sci Rep 9, 2418 (2019).
Munoz-Lopez, M., Insausti, R., Mohedano-Moriano, A., Mishkin, M. & Saunders, R.C. Anatomical pathways for auditory memory II: information from rostral superior temporal gyrus to dorsolateral temporal pole and medial temporal cortex. Front Neurosci 9, 158 (2015).
Olson, I.R., McCoy, D., Klobusicky, E. & Ross, L.A. Social cognition and the anterior temporal lobes: a review and theoretical framework. Soc Cogn Affect Neurosci 8, 123–133 (2013).
Mesulam, M.M. Paralimbic (mesocortical) areas. in Principles of behavioral and cognitive neurology 49–54 (Oxford University Press, New York, NY, 2000).
Chanes, L. & Barrett, L.F. Redefining the Role of Limbic Areas in Cortical Processing. Trends Cogn Sci 20, 96–106 (2016).
Hickok, G. Computational neuroanatomy of speech production. Nat Rev Neurosci 13, 135–145 (2012).
Simmons, W.K. & Martin, A. The anterior temporal lobes and the functional architecture of semantic memory. J Int Neuropsychol Soc 15, 645–649 (2009).
Abel, T.J., et al. Direct physiologic evidence of a heteromodal convergence region for proper naming in human left anterior temporal lobe. J Neurosci 35, 1513–1520 (2015).
Patterson, K., Nestor, P.J. & Rogers, T.T. Where do you know what you know? The representation of semantic knowledge in the human brain. Nat Rev Neurosci 8, 976–987 (2007).
Scott, S.K., Blank, C.C., Rosen, S. & Wise, R.J. Identification of a pathway for intelligible speech in the left temporal lobe. Brain 123 Pt 12, 2400–2406 (2000).
Spitsyna, G., Warren, J.E., Scott, S.K., Turkheimer, F.E. & Wise, R.J. Converging language streams in the human temporal lobe. J Neurosci 26, 7328–7336 (2006).
Gorno-Tempini, M.L., et al. Cognition and anatomy in three variants of primary progressive aphasia. Ann Neurol 55, 335–346 (2004).
Hein, G. & Knight, R.T. Superior temporal sulcus–It's my area: or is it? J Cogn Neurosci 20, 2125–2136 (2008).
Makris, N., et al. Delineation of the middle longitudinal fascicle in humans: a quantitative, in vivo, DT-MRI study. Cereb Cortex 19, 777–785 (2009).
Binney, R.J., Parker, G.J. & Lambon Ralph, M.A. Convergent connectivity and graded specialization in the rostral human temporal lobe as revealed by diffusion-weighted imaging probabilistic tractography. J Cogn Neurosci 24, 1998–2014 (2012).
de Heer, W.A., Huth, A.G., Griffiths, T.L., Gallant, J.L. & Theunissen, F.E. The Hierarchical Cortical Organization of Human Speech Processing. J Neurosci 37, 6539–6557 (2017).
Binder, J.R., et al. Human temporal lobe activation by speech and nonspeech sounds. Cereb Cortex 10, 512–528 (2000).
Cogan, G.B., et al. Sensory-motor transformations for speech occur bilaterally. Nature 507, 94–98 (2014).
Gonzalez Alam, T., et al. A tale of two gradients: differences between the left and right hemispheres predict semantic cognition. Brain Struct Funct (2021).
Karolis, V.R., Corbetta, M. & Thiebaut de Schotten, M. The architecture of functional lateralisation and its relationship to callosal connectivity in the human brain. Nat Commun 10, 1417 (2019).
Sherman, S.M. & Guillery, R.W. Distinct functions for direct and transthalamic corticocortical connections. J Neurophysiol 106, 1068–1077 (2011).
Hu, B. Functional organization of lemniscal and nonlemniscal auditory thalamus. Exp Br Res 153, 543–549 (2003).
Biswal, B.B., et al. Toward discovery science of human brain function. Proc Natl Acad Sci U S A 107, 4734–4739 (2010).
Seitzman, B.A., Snyder, A.Z., Leuthardt, E.C. & Shimony, J.S. The State of Resting State Networks. Top Magn Reson Imaging 28, 189–196 (2019).
Nourski, K.V. & Howard, M.A., 3rd. Invasive recordings in the human auditory cortex. Handb Clin Neurol 129, 225–244 (2015).
Jenkinson, M., Bannister, P., Brady, M. & Smith, S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage 17, 825–841 (2002).
Rohr, K., et al. Landmark-based elastic registration using approximating thin-plate splines. IEEE Trans Med Imaging 20, 526–534 (2001).
Destrieux, C., Fischl, B., Dale, A. & Halgren, E. Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature. Neuroimage 53, 1–15 (2010).
Destrieux, C., et al. A practical guide for the identification of major sulcogyral structures of the human cortex. Brain Struct Funct 222, 2001–2015 (2017).
Brugge, J.F., et al. Coding of repetitive transients by auditory cortex on Heschl's gyrus. J Neurophysiol 102, 2358–2374 (2009).
Nourski, K.V., Steinschneider, M. & Rhone, A.E. Electrocorticographic Activation within Human Auditory Cortex during Dialog-Based Language and Cognitive Testing. Front Hum Neurosci 10, 202 (2016).
Power, J.D., Barnes, K.A., Snyder, A.Z., Schlaggar, B.L. & Petersen, S.E. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. Neuroimage 59, 2142–2154 (2012).
Kovach, C.K. & Gander, P.E. The demodulated band transform. J Neurosci Methods 261, 135–154 (2016).
Banks, M.I., et al. Cortical functional connectivity indexes arousal state during sleep and anesthesia. Neuroimage 211, 116627 (2020).
Langs, G., Golland, P., Tie, Y., Rigolo, L. & Golby, A.J. Functional Geometry Alignment and Localization of Brain Areas. Adv Neural Inf Process Syst 1, 1225–1233 (2010).
Satopaa, V., Albrecht, J., Irwin, D. & Raghavan, B. Finding a "Kneedle" in a Haystack: Detecting Knee Points in System Behavior. in 2011 31st International Conference on Distributed Computing Systems Workshops 166–171 (2011).

There is NO Competing Interest.

Appendix.docx
Appendix: Diffusion Map Embedding
FunctionalgeometryofcorticalRSNsderivedfromiEEGSupplementaryInformation20220222KN.docx
Supplementary Information
Supplmovie1RSgammaenvemb3Drotationdim24.mp4
Supplementary Movie 1
Supplmovie2RSgammaenvemb3Drotationdim35.mp4
Supplementary Movie 2

Download PDF

Version 1

posted

You are reading this latest preprint version

Functional geometry of auditory cortical resting state networks derived from intracranial electrophysiology

Status:

Version 1

Abstract

Figures

Introduction

Results

DME applied to iEEG data

Functional geometry of cortical networks

Hierarchical clustering of ROIs

Identification of network hubs

Comparisons across hemispheres

Stability of functional geometry across frequency band and connectivity measures

Comparison to embeddings derived from RS-fMRI data

Discussion

Organization of auditory cortical networks

Functional differentiation between STSU and STSL

Functional and theoretical framework of a limbic auditory pathway

Ventral and dorsal streams linking auditory and frontal cortex

Network hubs

Hemispheric lateralization

Caveats & limitations

Concluding remarks and future directions

Online Methods

Participants

Experimental procedures

Data analysis

Declarations

Acknowledgements

References

Additional Declarations

Supplementary Files

Status:

Version 1