Study population
Patients 16 years were recruited over 19 months (March 2021–Oct 2022) from 17 UK sites through the COVID-CNS, a case-control study within the National Institute of Health Research (NIHR) COVID-19 BioResource (REC reference 17/EE/0025; 22/EE/0230 (East of England—Cambridge Central Research Ethics Committee)). COVID-CNS included hospitalised patients with COVID-19 without a prior relevant neurological diagnosis, who have had a new acute neurological or psychiatric complication (NeuroCOVID) alongside COVID-19 controls without these diagnoses (COVID), matched on a group level by age, sex, ethnicity, clinical frailty status, COVID-19 severity, and epoch of admission during the pandemic (40, 41). Some neurological or psychiatric complications required secondary care input without hospitalisation, partially related to pandemic pressures and risk assessments, and a proportion of the COVID group were therefore recruited who attended the emergency department but were not admitted. This analysis contains a patient subset that completed cognitive testing. Case ascertainment varied by study site but patient identification was most frequently via inpatient and outpatient attendance, neurology referrals, and SARS-CoV-2 positive laboratory reports. Participants were assessed at a single post-acute appointment which took place 1-26 months post-discharge, including a computerised cognitive assessment (Cognitron), patient-reported measures, blood sampling via venepuncture, 3T MRI and a clinical examination. Self-reported measures included PCL-5, GAD-7, PHQ-9 and CFQ. Multimorbidity, defined as ≥2 comorbidities, and Anticholinergic Burden score (a measure of how many medications taken might cumulatively contribute to an anticholinergic effect) were collected from past medical history and medications (42). To create a normative community comparator group, we sampled ≥8 individuals for each COVID-CNS participant matched for age, sex, first language, and level of education who completed the same cognitive assessment from a large normative dataset (1, 2). The research team completed a Case Record Form to collect harmonised clinical data across sites regarding acute admission and neurological complications.
Eligibility criteria
Patients with significant pre-existing neurological or psychiatric disorders managed in secondary care or pre-existing cognitive impairment were excluded. In the case of doubt about eligibility, this was discussed on a case-by-case basis at a national multi-disciplinary case evaluation panel (full criteria, see Supplementary Table 1).
Cognitive outcome
The cognitive assessment included seven tasks from the Cognitron assessment battery completed once under supervised conditions and twice online during follow-up (Supplementary Information 2). We included patients within the COVID-CNS cohort who had completed at least the first supervised assessment. Cognitron is sensitive, specific, and valid in the general population and disease cohorts (1, 2, 43, 44). Cognitive tasks were selected to sample across five domains defined by the DSM-5 classification (45) - Executive Function; Learning and Memory; Complex Attention; Perceptual-Motor Control and Language. Accuracy and median RTs were extracted by task, comprising 13 measures. These data were transformed into Deviation from Expected (DfE) scores using established linear models trained on a large normative dataset (>400,000 individuals) designed to predict performance based upon demographics. In this analysis, GDfE, DfE accuracy and DfE RT represent how an individual performs compared to what would be expected based upon their age, sex, first language and level of education. Any cognitive impairment was defined as GDfE less than expected (<0). A technical correction was applied excluding those responding unfeasibly fast or slow based upon normative data. Follow-up 1 and 2 were completed three and six months following the post-acute assessment. Recovery of cognitive performance was calculated as GDfE at Follow-up 1 minus GDfE at post-acute appointment.
Brain injury marker measurement
Brain injury markers were measured in serum using a Quanterix Simoa kit run on an SR-X Analyser (Quanterix, Billerica, MA, USA, Neurology 4-Plex A Advantage Kit, cat#102153). We assessed Nfl-L, UCH-L1, Tau, and GFAP. Normative values were taken from n=60 healthy controls recruited to the NIHR BioResource, reflecting the median (IQR, range) age distribution of the cohort as a whole (50 [32-62, 20-79] years) (46).
Neuroimaging
3T MRI protocols were harmonised and the published standardised protocol was consistent across sites (39). Specific brain regions were selected based on extant literature a priori to analysis; the parahippocampal gyrus, entorhinal cortex, orbitofrontal cortex, anterior cingulate cortex, insula and superior temporal gyrus (7, 47, 48, 49, 50, 51). MRI data were processed with FSL and Freesurfer, using the established UK Biobank pipeline, (37, 39, 52) modified for COVID-CNS, in order to produce biologically relevant metrics of brain structure and function - IDPs. IDPs from T1 and T2-FLAIR weighted MRI were obtained for global brain regions and for cortical regions as defined by Desikan-Killiany parcellation. IDPs represent grey matter thickness, volume and surface area. Fifty-four of these IDPs were selected as representative of general brain structure and the a priori selected brain regions. Volume and surface IDPs were found to be collinear (Variance Inflation Factor >10) and so 38 IDPs representing volume and thickness were included in subsequent analysis (for a full list, see Supplementary Table 2). Individual IDPs were compared to the COVID-CNS population means and standard deviations in order to calculate z-scores such that IDPs from disparate regions could be analysed in unison. Z-scores were combined into 14 composite z-scores, representative of volume and thickness of a priori regions of interest.
Model development
Candidate variables for linear models were pre-defined clinically important variables; age, sex, COVID-19 severity, clinical diagnostic group, level of education, frailty, mental health (PHQ9, GAD7 and PCL5), Chalder fatigue scale (53), vaccination against COVID-19, acute treatment with steroids, acute serum inflammatory markers (C-reactive protein (CRP) and white cell count (WCC)), subjective cognitive impairment, and time since COVID-19. The four brain injury markers and fourteen neuroimaging composites were additional candidate variables. Collinearity was assessed using correlation matrices. Variables were selected for the final model based on explanation of variance, biological plausibility, and missingness (<20% missingness). Date of admission and days since admission were included in all models. Final models represent complete case analysis. Within the pre-registration, three sample size calculations were undertaken to determine adequate power (95%) at the 0.05 significance level for the cross-sectional analysis.
Statistical analysis
The full analysis plan was pre-registered prior to data access and is openly available via Open Science Framework (21). In summary, the primary outcome measure was GDfE on computerised cognitive assessment. DfE effect sizes are calculated comparing COVID-CNS participants to matched community controls. We used standard two-sided p<.05 criteria for determining statistical significance, with false discovery rate correction where applicable. There were minor deviations from the analysis plan: there were seven individuals in the overall COVID-CNS cohort who had non-COVID respiratory illness who were excluded from this analysis due to small numbers. Additionally, the community normative group was not stratified by COVID-19 status due to lack of data. We report multiple regression models for GDfE rather than accuracy and RT separately to improve clarity. We based models on complete case analysis rather than multiple imputation as existing data was deemed sufficient (<20% missingness). For MRI analysis, we report the analyses of a priori defined regions. Cortical volume and surface area were co-linear and therefore cortical volume only was included (Variance Inflation Factor >10). The statistical analysis plan was otherwise conducted as documented. Statistical analyses were performed in R (The R Foundation©, version 3.6.1 or later). Potential confounders were included as candidate variables in all multiple regression models. These variables were premorbid state including pre-existing cognitive impairment, age, education, fatigue, subjective cognitive impairment and mental health. The GDfE score represents performance compared to what would be expected by age, sex, level of education and first language and therefore reduces the risk of confounding from these variables. GDfE is based on linear models trained on normative data from >400,000 individuals. Fatigue, subjective cognitive impairment and mental health measures were found to be collinear and PHQ-9 score explained the most variance in GDfE. PHQ-9 was therefore included in both final multiple linear regression models in Table 2.
Patient and public involvement
The COVID-CNS Patient and Public Involvement and Engagement panel represents patients across the brain-mind spectrum and through its bimonthly meetings has influenced the study throughout. Specifically, to support this analysis, the panel trialled and provided feedback on the cognitive testing prior to use, supported actioning of participant feedback, and guided presentation of findings.