Common genetic variants contribute to heritability of age at onset of schizophrenia

doi:10.21203/rs.3.rs-2487478/v1

Download PDF

Article

Common genetic variants contribute to heritability of age at onset of schizophrenia

https://doi.org/10.21203/rs.3.rs-2487478/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 13 Jun, 2023

Read the published version in Translational Psychiatry →

You are reading this latest preprint version

Schizophrenia (SCZ) is a complex disorder that typically arises in late adolescence or early adulthood. Age at onset (AAO) of SCZ is associated with long-term outcomes of the disease. We explored the genetic architecture of AAO with a genome-wide association study (GWAS), heritability, polygenic risk score (PRS), and copy number variant (CNV) analyses in 4 740 subjects of European ancestry. Although no genome-wide significant locus was identified, SNP-based heritability of AAO was estimated to be between 17 and 21%, indicating a moderate contribution of common variants. We also performed cross-trait PRS analyses with a set of mental disorders and identified a negative association between AAO and common variants for Schizophrenia, childhood maltreatment and attention-deficit/hyperactivity disorder. In addition, we explored whether copy number variants (CNVs) previously associated with SCZ played a role in AAO and found that there was no association with earlier onset. To our knowledge, this is the largest GWAS of AAO of SCZ to date, and the first study to determine the involvement of common variants in the heritability of AAO. Finally, we evidenced the role played by higher SCZ load in determining AAO but discarded the role of pathogenic CNVs. Altogether, these results shed light on the genetic architecture of AAO, which needs to be confirmed with larger studies.

Health sciences/Diseases/Psychiatric disorders/Schizophrenia

Biological sciences/Genetics/Genomics

age at onset

schizophrenia

GWAS

polygenic risk score

SNP-based heritability

Schizophrenia (SCZ) is a complex disorder influenced by an intricate interplay of genetic and environmental factors. SCZ patients show substantial heterogeneity in clinical characteristics such as symptomatology, cognitive ability, course, overall functioning, and age at onset (AAO). AAO has been consistently included among the most important determinants of disease outcome and is widely accepted as a significant clinical and prognostic factor ^1,2. For instance, an earlier AAO is associated with a higher likelihood of having relatives with SCZ ^3,4, and has been correlated with an increased number of hospitalizations and illness episodes, more frequent negative symptoms, and poorer cognition, overall functioning, and global outcome ^5–7. In general, men have an earlier AAO, usually between 20 and 24 years of age, while in women, the onset occurs between 25 and 35 years of age ^8–10. In addition, women appear to have a secondary peak around menopause, between 50 and 54 years old ^7,11.

The genetic architecture of SCZ is complex. Heritability estimates from twin and population-based studies range between 64% and 81% ^12,13, in which common genetic variants account for a large proportion (24.4%) ¹⁴. There is also evidence that both rare single-nucleotide variants and rare copy number variants (CNVs) contribute to the risk of developing SCZ ^15–19. In fact, individuals with a pathogenic CNV represent more than 2% of the confirmed cases ²⁰. CNVs are highly penetrant and may cause early-onset forms of developmental delay or autism spectrum disorders. Thus, similarly, it has been suggested that the presence of CNVs may play an important role in the onset of SCZ, although their contribution is still unclear ²⁰.

The heritability of the AAO has been estimated in sibling pairs at 33% ²¹, indicating a moderate genetic basis. However, in contrast with the vast amount of information on the genetics of SCZ obtained from genome-wide association studies (GWAS) ^14,22,23, the genetic determinants underlying AAO remain largely unknown. To date, only three GWAS have been performed in relatively small cohorts (< 3000 individuals) and none of them identified any genomic loci associated with AAO at genome-wide significance ^24–26. On the other hand, recent GWAS carried out based on the age at onset of both Bipolar Disorder (BD) and Major Depression Disorder (MDD) with larger sample sizes have determined a significant SNP-based heritability and shared genetic risk with other psychiatric disorders ^27,28. Thus, further studies with larger sample sizes are required to estimate the contribution of common genetic variants to AAO and the heritability they may explain.

Ultimately, identifying and researching the genetic factors that influence the AAO of SCZ may improve our understanding of the development and progression of this disease, provide new targets for therapy, and facilitate the development of personalized therapeutic interventions and preventive measures. In this study, we aimed to explore the genetic architecture of AAO by performing 1) a GWAS meta-analysis of nearly 5 000 subjects of European ancestry, 2) heritability estimates based on common genetic variants, 3) polygenic risk score (PRS) analyses with a set of mental traits, and finally 4) an assessment of the influence of known CNVs on AAO.

Sample

Four different datasets (CIBERSAM, PsyCourse, GAIN, and nonGAIN) were obtained and combined to perform a GWAS meta-analysis comprising 4 740 patients of European ancestry (Table 1). In all datasets, subjects met the criteria for SCZ, schizoaffective disorder, schizophreniform disorder, delusional disorder, brief psychotic disorder, or psychotic disorder not otherwise specified, in the Diagnostic and Statistical Manual of Mental Disorders version IV (DSM-IV).

Table 1

Description of samples used in the study.
Dataset	Source	N	% Females	Mean AAO (SD)	Genotyping chip
CIBERSAM	Seven groups from the Biomedical Research Network in Mental Health (CIBERSAM)	1,704	32.98	25.32 (8.84)	Illumina Infinium PsychArray
PsyCourse	The PsyCourse study	499	38.88	26.26 (9.55)	Illumina Infinium PsychArray
GAIN	The Genome-Wide Association Study of Schizophrenia (dbGaP repository study accession: phs000021.v3.p2)	1,280	29.92	21.11 (6.77)	Affymetrix Genome-Wide Human SNP Array 6.0
nonGAIN	The Molecular Genetics of Schizophrenia—nonGAIN Sample (MGS_nonGAIN, dbGaP repository study accession: phs000167.v1.p1)	1,224	31.54	21.77 (7.24)	Affymetrix Genome-Wide Human SNP Array 6.0

Participants in the CIBERSAM (Biomedical Research Network in Mental Health) dataset were recruited from psychiatric in-patient units at seven different hospitals in Spain ²⁹. Participation was approved by the ethical committees at the hospitals involved in the recruitment. Finally, samples were genotyped using the Illumina Infinium PsychArray at the Broad Institute as part of the wave 3 meta-analysis GWAS of SCZ of the Psychiatric Genomics Consortium (PGC-SCZ wave 3)³⁰.

The PsyCourse samples were part of a multi-site German/Austrian longitudinal study (www.psycourse.de) that was conducted between 1 January 2012 and 31 December 2019. The study collected deep phenotypic, neuropsychologic, and omics data from patients with brief psychotic disorder, major depressive disorder (MDD), bipolar disorder (BD), SCZ, schizoaffective disorder, and healthy individuals. Adult participants were referred by the clinical staff or identified by querying patient registries. Study protocols were reviewed and approved by the ethics committees of the Medical Centers and Faculties involved in the recruitment, in accordance with the Declaration of Helsinki. All participants provided written informed consent. The phenotype information was gathered using the v4.1 version of the PsyCourse data release. These samples were genotyped using the Illumina Infinium PsychArray ³¹.

GAIN and nonGAIN datasets were both obtained from the dbGaP repository (accession numbers phs000021.v3.p2 and phs000167.v1.p1, respectively for each dataset), and genotyped with the Affymetrix Genome-Wide Human SNP Array 6.0 as described elsewhere ³².

Age at onset

In the CIBERSAM dataset, AAO was defined as the onset of the first psychotic symptoms. The patient (and/or family members) and the psychiatrist defined when the first psychological symptoms appeared. In cases where this information was not available, we used the first psychiatric contact due to a psychotic episode. In the PsyCourse dataset, AAO was collected as both the age at first outpatient and inpatient treatment. Subjects with information available for either of these data were included, and when both data were available, the earliest age was used. For the GAIN and nonGAIN datasets, AAO was defined as the most likely AAO of psychotic symptoms consistent with the onset of SCZ. A consensus diagnostician (PI or senior research clinician delegate) reviewed the diagnostic ratings made independently by two research diagnosticians (one of which could be the consensus diagnostician as well) and assigned a final diagnosis and AAO if the ratings were in agreement. The Kolmogorov-Smirnov test was used to determine pair-wise differences in AAO between datasets.

GWAS and functional analyses

Quality control (QC) was conducted for each dataset separately (CIBERSAM, PsyCourse, GAIN and nonGAIN) using PLINK 1.9 ³³, according to standard procedures for GWAS ³⁴. Briefly, genetic variants with missingness rate > 2%, minor allele frequency (MAF) < 5%, Hardy-Weinberg equilibrium (HWE) P-value < 1e-06, and those belonging to non-autosomal chromosomes were excluded from downstream analyses. Ambiguous and multiallelic variants were also removed. Subjects with a missingness rate > 2%, increased or decreased heterozygosity rates (defined as ± 3 standard deviations away from the sample mean), and relatedness > 12.5% (PI_HAT > 0.125) were excluded. Sex was imputed based on X chromosome heterozygosity/homozygosity rates, before removing sex chromosomes. Principal component analyses (PCA) were conducted using SMARTPCA from EIGENSOFT 6.1.4 ³⁵. To keep only subjects with European ancestry, we did not use those individuals who were beyond ± 3 standard deviations from the mean of the first two principal components (PCs) of the European cluster of the 1000 Genomes Project ³⁶ Phase I. We also removed four individuals who clustered in the Finnish subgroup of the European cluster. Before genotype imputation, a PCA was performed again on the remaining subjects and the top 10 PCs were kept for further analysis.

Genotype imputation was conducted for each resulting dataset independently using the TOPMed reference panel ³⁷. Imputed datasets were filtered according to an imputation quality score (r-squared) < 0.9 and converted to binary files using PLINK’s --vcf flag. Then, a post-imputation QC was conducted for each dataset separately, using PLINK. Only single-nucleotide polymorphisms (SNPs) were kept for further analyses. In addition, ambiguous and multiallelic SNPs, as well as SNPs with MAF < 1%, and an HWE P-value < 1e-06 were excluded. The resulting datasets were lifted over to genome build 19 using the UCSC liftOverPlink tool ³⁸. The CIBERSAM dataset included: 1 704 subjects and 4 962 031 SNPs; PsyCourse: 499 subjects and 5 338 835 SNPs; GAIN: 1,278 subjects and 6 042 664 SNPs; and nonGAIN: 1 259 subjects and 6 074 765 SNPs. A GWAS was performed by linear regression for each dataset in PLINK, using normalized AAO as outcome and sex and the top 10 PCs as covariates. A meta-analysis was then conducted using the tool METAL ³⁹ applying an inverse variance strategy. As a result we obtained information on 4 740 subjects and 6 540 522 SNPs. As an alternative method, individual-level imputed genotype data for each of the four separate datasets was merged and a GWAS was conducted in parallel for comparison purposes. From here on, this approach is called the merged approach (Supplementary Note).

Genomic loci showing suggestive associations with AAO (P-value < 1e-05) were identified and explored using the FUMA software ⁴⁰. Each genomic risk locus was represented by the top lead SNP which had the minimum P-value in the locus. Lead SNPs were defined as associated SNPs (P-value < 1e-05) that were independent of each other at LD r² < 0.1. Independent significant SNPs were defined as SNPs with a P-value < 1e-05, and independent of each other at a linkage disequilibrium (LD) threshold r² < 0.6. The genomic risk loci were mapped to protein-coding genes within a 10 kb window based on ANNOVAR ⁴¹ annotations. Finally, pathway enrichment analyses were conducted using KOBAS-i ⁴². In all the analyses, a 5% False Discovery Rate (FDR) was considered for multiple testing correction.

SNP-based heritability

The proportion of phenotypic variance explained by SNPs was estimated using two different methods. First, SNP-based heritability was estimated with the Linkage Disequilibrium Score Regression (LDSC) method ⁴³. To reduce the standard error given our relatively small sample size, the intercept was constrained to 1 after testing that it was not significantly higher than 1 ⁴⁴. The LDSC intercept has been widely employed to distinguish between inflation due to confounding factors (such as population stratification and cryptic relatedness) and inflation due to polygenicity. Deviation of the intercept from 1 is indicative of residual confounding, thus observing an intercept not significantly higher than 1 can be interpreted as there being minimal confounding bias ⁴³. Second, we also estimated heritability using individual-level genotypes with the Genome-based Restricted Maximum Likelihood (GREML) approach implemented in the Genome-wide Complex Trait Analysis (GCTA) tool ⁴⁵, adjusting for sex, the dataset and the top 10 PCs.

Cross-trait polygenic risk score

PRS analyses were conducted using the PRS-cs software ⁴⁶ between ten mental phenotypes and AAO. Specifically, we obtained summary statistics data on seven psychiatric disorders downloaded from the Psychiatric Genomics Consortium (https://www.med.unc.edu/pgc/) ⁴⁷, including SCZ, BD, MDD, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder (ASD), obsessive-compulsive disorder (OCD), and cannabis use disorder (CUD). In addition, we obtained GWAS data on three conditions previously associated with AAO: neuroticism ⁴⁸, educational attainment (EA) ¹¹, and childhood maltreatment ⁴⁹ (Table S1). The downloaded summary statistics were filtered to remove ambiguous, multiallelic and duplicated SNPs, and SNPs with an imputation score less than 0.9 (if the information was provided). The summary statistics of the mental phenotypes were used as base data, and the individual-level genotype dataset was used as target data. We calculated the PRS for each mental phenotype using the “auto” mode (the shrinkage parameter phi was determined from the data with a Bayesian approach). Then, the scores obtained for each individual in our dataset were regressed out against sex, age, and batch to obtain new adjusted-PRS scores. Linear regressions were performed with the normalized AAO values as outcome and the adjusted-PRSs as independent variables to evaluate the association of each PRS with AAO. Finally, p-values were corrected for multiple correction using FDR.

Copy number variation analysis

CNV analyses were conducted using signal intensity data from those individuals in the four cohorts from whom we obtained the intensity files (N = 4 630). The raw CNVs were obtained using PennCNV ^50,51. Briefly, quality control of CNV calls was based on a sample-level criterion that examined the relationship between the standard deviation of the logarithm R Ratio (LRR_SD) and the number of CNV calls (NumCNV). At the end of the process, adjacent calls were merged together into one single call. Thresholds were carefully chosen to include as many subjects as possible but reduce false positives. Thus, subjects with LRR_SD > 0.35, BAF > 0.01, WF > 0.05, and NumCNV > 150 were not included. In parallel, 12 CNVs previously described as significantly associated with SCZ (SCZ-CNV) were obtained from the literature ⁵². Then, the BEDTools intersect ⁵³ was used to look for overlaps between the described CNVs and our data. Briefly, we selected as CNV carriers only those individuals for whom at least 90% of the SCZ-CNV overlapped with a detected CNV (-f 0.9) and/or with at least 80% of reciprocal overlap (-r -f 0.8). To determine whether the presence of CNVs could be influencing AAO, Wilcoxon and Kruskal-Wallis tests were performed in R4.1.2 ⁵⁴, using non-normalized AAO values.

Age at onset

Mean AAO was 23.35 (SD = 8.26). Mean AAO varied across datasets, ranging from 21.11 to 26.26 (Table 1). In the whole sample 1 525 subjects (32.4%) were females (mean AAO = 25.09), and 3 182 subjects (67.6%) were males (mean AAO = 22.52). Significant differences were detected between the AAO of CIBERSAM and GAIN (P-value < 2.2e-16), CIBERSAM and nonGAIN (P-value < 2.2e-16), PsyCourse and GAIN (P-value = 2.2e-16), and PsyCourse and nonGAIN (P-value = 4.44e-16). However, differences were not detected between CIBERSAM and PsyCourse (P-value = 0.11), and neither between GAIN and nonGAIN (P-value = 0.33, Figure S1). Since the distribution of AAO was right-skewed (Figure S1), it was normalized using a rank-based inverse-normal transformation and used in all subsequent analyses.

GWAS and functional analyses

A total of 4 740 subjects of European ancestry and 6 540 522 SNPs were included in the GWAS meta-analysis. Although none of the analyzed SNPs reached the genome-wide significant threshold (P-value < 5e-08, Fig. 1), 25 lead SNPs were identified, corresponding to 22 genomic risk loci that were mapped to 183 genes (Table 2). Using the Linkage Disequilibrium Score Regression (LDSC) method ⁴³, an intercept of 1.03 (SE = 0.0083) was obtained. Mapped genes were enriched in categories such as transport of small molecules (FDR-adj P-value = 1.7e-02), vesicle-mediated transport (FDR-adj P-value = 1.8e-02), metabolism (FDR-adj P-value = 2.55e-02), and Asparagine N-linked glycosylation (FDR-adj P-value = 3.3e-02), among others (Table S2).

Table 2

Genomic risk loci for AAO identified in the genome-wide meta-analysis.
Genomic Locus	Position^a	Lead SNP	Lead SNP Position	P-value	nSNPs^b	Genes
1	1:181854925	rs185188889	1:181854925	6.735e-06	1	-
2	2:81911239:82893571	rs13383639	82802994	7,8040E-06	19	REG3G, REG1B, REG3A, CTNNA2, LRRTM1, SUCLG1
3	3:173494302:173563936	rs7652242	173553275	4,57E-06	59	NLGN1, NAALADL2
4	4:30196116:30196116	rs111289733	30196116	1,66E-07	1	RP11-180C1.1
5	4:38573143:38604331	rs4833071	38582859	4,79E-06	12	RELL1, TBC1D1, PTTG2, AC021860.1, KLF3, TLR10, RFC1, UGDH
6	4:40443966:40443966	rs10755175	40443966	4,49E-06	1	RHOH, RBM47, NSUN7
7	4:167064039:167096817	rs10018884	167092502	2,36E-06	65	MARCH1, MSMO1, SPOCK3, ANXA10, DDX60, PALLD
8	5:108260989:108520863	rs78438786	108260989	6,72E-07	6	FER, PJA2, MAN2A1, TMEM232, SLC25A46
9	5:120718453:120901978	rs2195409	120892594	6,73E-06	34	DMXL1, HSD17B4, FAM170A, FTMT, SRFBP1, LOX, SNCAIP
10	5:133457000:133774168	rs4431386	133557876	2,54E-06	65	SLC22A5, C5orf56, IL4, KIF3A, CCNI2, GDF9, UQCRQ, LEAP2, AFF4, ZCCHC10, HSPA4, C5orf15, VDAC1, TCF7, SKP1, CTD-2410N18.5, PPP2CA, CDKL3, UBE2B, CDKN2AIPNL, JADE2, SAR1B, SEC24A, DDX46, C5orf24, TXNDC15, PCBD2, PITX1
11	7:48751259:48840965	rs139864446	48831950	9,07E-06	7	AC004899.1, VWC2, C7orf72, IKZF1, DDC
12	7:106206611:106399401	rs111513327	106207969	1,75E-06	7	EFCAB10, ATXN7L1, SYPL1, NAMPT, CCDC71L, PIK3CG, PRKAR2B, HBP1, COG5, DUS4L, BCAP29, SLC26A4, CBLL1, SLC26A3
13	8:20289797:20370404	rs12550821	20315601	4,85E-06	32	CSGALNACT1, LPL, SLC18A1, ATP6V1B2, LZTS1, GFRA2, DOK2, XPO7, LGI3, SFTPC, BMP1
14	8:121006828:121262332	rs10808509	121227216	5,61E-06	77	TAF2, DSCC1, DEPTOR, COL14A1, MRPL13, MTBP, SNTB1
15	11:2192798:2192798	rs10840489	2192798	2,04E-06	1	KRTAP5-4, KRTAP5-5, KRTAP5-6, IFITM10, RP11-295K3.1, CTSD, SYT8, TNNI2, LSP1, C11orf89, TNNT3, TH, C11orf21
16	11:44644743:44870058	rs11038082	44644743	2,51E-06	28	C11orf96, EXT2, ALX4, CD82, TSPAN18, TP53I11, PRDM11, SYT13, SLC35C1, CRY2
17	12:32500207:32537185	rs144642024	32500207	1,58E-06	2	OVCH1, FAM60A, AC024940.1, DENND5B, METTL20, AMN1, KIAA1551, FGD4, DNM1L, YARS2, ALG10
18	13:61356260:61390956	rs9570366	61377708	1,69E-06	19	PCDH20
19	14:20479798:20546983	rs72667672	20492904	6,82E-06	82	OR4N2, OR4K2, OR4K5, OR4K1, OR4K14, OR4K13, OR4L1, OR4K17, OR4N5, TTC5, RNASE9, RNASE4, ANG, AL163636.6
20	16:71388416:71388416	rs142248381	71388416	2,45E-07	1	COG4, SF3B3, MTSS1L, VAC14, HYDIN, CMTR2, ZNF23, ZNF19, TAT, HP, HPR
21	16:84318421:84322649	rs11645140	84322546	5,17E-07	4	NECAB2, MBTPS1, HSDL1, DNAAF1, KCNG4, WFDC1, ATP2C2
22	17:51482920:52487345	rs146709267	52065109	1,48E-06	11	MBTD1, UTP18, CA10, AC102948.2, C17orf112, KIF2B, TOM1L1, COX11, STXBP4, HLF, MMD, ANKFN1, NOG
^a Positions are based on Human Genome version 19 (hg19), build 37. ^b Number of SNPs in the genomic locus (r² ≥ 0.6 with any of the independent significant SNPs).

SNP-based heritability

The SNP-based heritability (h²_SNP) was estimated using two methods that showed consistent results with moderate and significant SNP-based heritability. First, with LDSC, we obtained h²_SNP = 0.21 (SE = 0.07). SNP-based heritability was also estimated using individual-level genotypes with GCTA-GREML, adjusting for sex, dataset and the top 10 PCs, resulting in an estimate of h²_SNP = 0.17 (SE = 0.06; P-value = 3.33e-03). These results were consistent with the heritability estimates obtained from the GWAS merged approach (h²_SNP = 0.13, Supplementary Note).

Cross-trait polygenic risk score

Ten mental phenotypes were examined through a cross-trait PRS analysis. Only the PRSs calculated based on ADHD, SCZ and childhood maltreatment sumstats were associated with AAO in our dataset (FDR < 0.05). All adjusted PRS beta coefficients were negative, corresponding to the higher burden of disease/condition risk variants being associated with earlier onset of SCZ (Fig. 2 and Table S3). The variance explained by the adjusted-PRS of these three phenotypes was low but significant (1.2e-03; 1.3e-03 and 1.1e-03, respectively for ADHD, SCZ, and childhood maltreatment). Among the other mental phenotypes, only BD-PRS was nominally associated with AAO (P = 0.02).

CNV analysis

After quality control, a total of 3 965 individuals remained for downstream analyses. Among them, 117 individuals (3%) carried CNVs previously associated with SCZ, and 4 individuals carried two pathogenic CNVs. The rest, 3 848 individuals (N = 97%) were determined to be non-CNV-carriers. Among the CNV-carriers, we detected 36 (29.8%) carrying 15q11.2del, 20 (16.53%) 22q11.2del, 13 (10.74%) 16p12.1del, 12 (9.92%) 16p11.2dup, 11 (9.09%) 16p13.11dup, 10 (8.26%) 15q13.3del, 6 (4.96%) 3q29del, 4 (3.31%) carrying 1q21.1dup, 4 (3.31%) 15q11-q13dup, 3 (2.48%) 1q21.1del, and 2 (1.65%) 7q11.23dup individuals. The deletion corresponding to 2p16.3 was not present in our sample. In our dataset, the presence of pathogenic CNVs was not associated with an earlier AAO (Wilcoxon test; P-value = 0.73, Fig. 3A). In addition, no differences in the AAO were found across CNVs (Kruskal-Wallis test; P-value = 0.49, Fig. 3B).

This study explored the genetic architecture of AAO of SCZ. To this end, we performed a case-only GWAS of European ancestry in the largest sample collected to date. Although no genome-wide significant signals were detected, we successfully estimated for the first time the SNP-based heritability of AAO using two different approaches that showed consistent results of moderate heritability, ranging from 17–21%. We also provided evidence of negative genetic associations of cross-trait PRS derived from ADHD, SCZ, and childhood maltreatment with AAO. Finally, we determined that CNVs previously reported in SCZ were not associated with AAO in our dataset.

In our study, the strongest association signal was found in a genomic locus at chromosome 4 (lead SNP rs111289733 P-value = 1.66e-07), which harbored the long non-coding RNA RP11-180C1.1. The mapped genes belonging to the suggestive associations were enriched in transport of small molecules and vesicle-mediated transport, among others. These categories are promising candidates for further studies on pathways associated with AAO since the transport of molecules has been previously associated with SCZ ⁵⁵, and abnormalities of the vesicular transport mechanism might also participate in the pathogenesis of SCZ ^56,57.

Over the years, many studies have evaluated the role of genetics in AAO of SCZ, estimating a heritability of AAO ranging from approximately 20–58% ⁷. In our study, based on two different methodologies we estimated the SNP-based heritability of AAO to be between 17% and 21%. In fact, our estimates show consistent results suggesting a moderate but significant contribution of common variants to AAO. Interestingly, SNP-based heritability is slightly higher than that of AAO in BD and MDD, which has recently been estimated at 5% and 6% respectively, using larger sample sizes ²⁷. Further studies with larger sample sizes are needed to obtain more accurate estimates in SCZ AAO.

Cross-trait PRSs constructed with ADHD, SCZ, and childhood maltreatment were significantly associated with AAO; however, they explained a very small fraction of AAO variation ⁵⁸, showing that a higher risk of developing these conditions is associated with an earlier AAO in SCZ. Moreover, SCZ has been associated with a variety of comorbid psychiatric conditions, and previous studies have reported genetic correlations between SCZ and ADHD of 0.22. In fact, a study reported that ADHD was among the commonest comorbidities in children and adolescents with SCZ ⁵⁹, and it has been argued that the genetic architecture of ADHD has a large link with SCZ ⁶⁰. In this line, our results suggest that an earlier AAO may be related to a more severe neurodevelopmental impairment. In our sample, we also report a negative association between AAO and the PRS of both SCZ and childhood maltreatment. It has also been suggested that individuals with higher genetic loadings for SCZ are at a higher risk of early onset ⁶¹, and similarly for MDD ²⁸. Moreover, patients with histories of being abused as children show an earlier onset of symptoms ⁶². However, it is still unknown how the genetic architectures of these traits are linked.

A recent study reported that the prevalence of recurrent CNVs was higher in early onset psychosis than in the general population, as well as CNV pathogenicity ⁶¹. In addition, some of these CNVs cause earlier-onset disorders such as developmental delay or ASD, but not SCZ ²⁰. However, in our study, pathogenic CNV-carriers were not associated with AAO. Similarly, none of the specific CNVs were associated with earlier onset. Nevertheless, we detected a 1.9% prevalence of the pathogenic CNVs, which is close to the previously reported prevalence of 2.6% ²⁰. Thus, although a proportion of risk for SCZ can be explained by rare mutations ⁶³, pathogenic CNVs reported to date were not associated with an earlier onset of the disease.

We have detected some strengths and limitations in our study that deserve some discussion. Despite the lack of genome-wide significant associations at the SNP-level, which could be expected given the relatively small sample size of our study, we were able to estimate, for the first time, the SNP-based heritability of AAO and to determine relevant associations with the PRS of three mental phenotypes. This can help begin to uncover the genetic architecture of AAO. Our results may be biased by the fact that the AAO collected does not correspond to the real AAO but rather to the first visit. In the future, with more reliable estimates of AAO, the heritability explained by SNPs could increase. The rank-based transformation applied to the AAO values may have affected our GWAS and subsequent analyses; however, it is considered as one of the best approaches to use. Some studies have reported that for small sample sizes or genetic effects, there is an improvement in sensitivity for rank-based transformations that outweighs a slight increase in the false-positive rate ⁶⁴. In addition, a recent study has demonstrated that these transformation tests outperform the standard untransformed association test, both in terms of power and type I error rate control ⁶⁵. Moreover, we were not able to control for putative differences between the different recruitment sites. All the datasets included in the study might be multicenter; thus, heterogeneity within datasets could be considerable. Such phenotypic heterogeneity has been reported to affect genetic analyses ^27,66, which indicates that phenotype harmonization is as important as a larger sample size for improving the power to detect significant associations and avoiding a biased view of genetic architectures. Finally, we acknowledge that AAO in our study differed across some datasets, therefore we chose to perform a GWAS meta-analysis over other approaches to preclude any relevant bias due to heterogeneity in the phenotype.

In conclusion, we report on the largest GWAS of AAO in SCZ to date, providing the first SNP-based heritability estimate of AAO in individuals of European ancestry. Although no genome-wide significant SNP was detected, we provide evidence of a genetic background for AAO and a negative association with the PRS of ADHD, SCZ and childhood maltreatment. In addition, we demonstrate that SCZ load is associated with the AAO of the disease, but not the pathogenic CNVs reported to date. Larger sample sizes would be useful for determining the genetic architecture of AAO, which could help us understand further the pathogenesis of SCZ and contribute to the development of better strategies for the early detection of SCZ.

Acknowledgements

This work was supported by Instituto de Salud Carlos III (PI18/00514 and PI21/00612) and by the Catalan Agency of Research and Universities (AGAUR, 2017SGR-00444). The PsyCourse study was supported by DFG (SCHU 1603/4-1, 5-1, 7-1, FA241/16-1).

Conflicts of interest

The authors have no relevant financial or non-financial interests to disclose.

Delisi, L. E. The Significance of Age of Onset for Schizophrenia. Schizophrenia Bulletin 18, 209–215 (1992).
Öngür, D., Lin, L. & Cohen, B. M. Clinical characteristics influencing age at onset in psychotic disorders. Comprehensive Psychiatry 50, 13–19 (2009).
Kendler, K. S. & MacLean, C. J. Estimating familial effects on age at onset and liability to schizophrenia. I. Results of a large sample family study. Genetic Epidemiology 7, 409–417 (1990).
Sham, P. C. et al. Age at onset, sex, and familial psychiatric morbidity in schizophrenia. Camberwell collaborative psychosis study. British Journal of Psychiatry 165, 466–473 (1994).
Rajji, T. K., Ismail, Z. & Mulsant, B. H. Age at onset and cognition in schizophrenia: meta-analysis. The British Journal of Psychiatry 195, 286–293 (2009).
Immonen, J., Jääskeläinen, E., Korpela, H. & Miettunen, J. Age at onset and the outcomes of schizophrenia: a systematic review and meta-analysis: Age at onset and the outcomes of schizophrenia. (2017) doi:10.1111/eip.12412.
Musket, C. W. et al. Why does age of onset predict clinical severity in schizophrenia? A multiplex extended pedigree study. American Journal of Medical Genetics, Part B: Neuropsychiatric Genetics 183, 403–411 (2020).
Aleman, A., Kahn, R. S. & Selten, J. P. Sex Differences in the Risk of Schizophrenia: Evidence From Meta-analysis. Archives of General Psychiatry 60, 565–571 (2003).
Leung, M. Sex differences in schizophrenia, a review of the literature. Acta Psychiatrica Scandinavica 3–38 (2003).
Neill, E. et al. Examining which factors influence age of onset in males and females with schizophrenia. (2020) doi:10.1016/j.schres.2020.08.011.
Ochoa, S., Usall, J., Cobo, J., Labad, X. & Kulkarni, J. Gender Differences in Schizophrenia and First-Episode Psychosis: A Comprehensive Literature Review. Schizophrenia Research and Treatment 2012, 1–9 (2012).
Sullivan, P. F., Kendler, K. S. & Neale, M. C. Schizophrenia as a Complex Trait: Evidence From a Meta-analysis of Twin Studies. Archives of General Psychiatry 60, 1187–1192 (2003).
Lichtenstein, P. et al. Common genetic influences for schizophrenia and bipolar disorder: A population-based study of 2 million nuclear families. Lancet 373, 1–14 (2009).
Consortium, T. S. W. G. of the P. G., Ripke, S., Walters, J. T. & O’Donovan, M. C. Mapping genomic loci prioritises genes and implicates synaptic biology in schizophrenia. medRxiv 2020.09.12.20192922 (2020).
Levinson, D. F. et al. Copy number variants in schizophrenia: Confirmation of five previous finding sand new evidence for 3q29 microdeletions and VIPR2 duplications. American Journal of Psychiatry 168, 302–316 (2011).
Rees, E. et al. Analysis of copy number variations at 15 schizophrenia-associated loci. British Journal of Psychiatry 204, 108–114 (2014).
Rees, E. et al. Analysis of intellectual disability copy number variants for association with schizophrenia. JAMA Psychiatry 73, 963–969 (2016).
Marshall, C. R. et al. Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nature Genetics 2016 49:1 49, 27–35 (2016).
Halvorsen, M. et al. Increased burden of ultra-rare structural variants localizing to boundaries of topologically associated domains in schizophrenia. Nature Communications 2020 11:1 11, 1–13 (2020).
Kirov, G. et al. The penetrance of copy number variations for schizophrenia and developmental delay. Biological Psychiatry 75, 378–385 (2014).
Hare, E. et al. Heritability of age of onset of psychosis in schizophrenia. American Journal of Medical Genetics, Part B: Neuropsychiatric Genetics 153, 298–302 (2010).
Ripke, S. et al. Genome-wide association analysis identifies 13 new risk loci for schizophrenia. Nature Genetics 45, 1150–1159 (2013).
Working Group of the Psychiatric Genomics Consortium, S. Biological insights from 108 schizophrenia-associated genetic loci. (2014) doi:10.1038/nature13595.
Wang, K.-S., Liu, X., Zhang, Q., Aragam, N. & Pan, Y. Genome-wide association analysis of age at onset in schizophrenia in a European-American sample. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics 156, 671–680 (2011).
Bergen, S. E. et al. Genetic modifiers and subtypes in schizophrenia: Investigations of age at onset, severity, sex and family history. Schizophrenia Research 154, 48–53 (2014).
Woolston, A. L. et al. Genetic loci associated with an earlier age at onset in multiplex schizophrenia. Scientific Reports 7, 6486 (2017).
Kalman, J. L. et al. Characterisation of age and polarity at onset in bipolar disorder. The British Journal of Psychiatry 1–11 (2021) doi:10.1192/bjp.2021.102.
Harder, A. et al. Genetics of age-at-onset in major depression. Translational Psychiatry 2022 12:1 12, 1–7 (2022).
Salagre, E. et al. CIBERSAM: Ten years of collaborative translational research in mental disorders. Revista de Psiquiatría y Salud Mental (English Edition) 12, 1–8 (2019).
Trubetskoy, V. et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature 604, 502–508 (2022).
Budde, M. et al. A longitudinal approach to biological psychiatric research: The PsyCourse study. American Journal of Medical Genetics, Part B: Neuropsychiatric Genetics 180, 89–102 (2019).
Shi, J. et al. Common variants on chromosome 6p22.1 are associated with schizophrenia. Nature 460, 753–757 (2009).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81, 559–575 (2007).
Marees, A. T. et al. A tutorial on conducting genome-wide association studies: Quality control and statistical analysis. Int J Methods Psychiatr Res 27, e1608 (2018).
Patterson, N., Price, A. L. & Reich, D. Population Structure and Eigenanalysis. PLoS Genet 2, e190 (2006).
Genomes Project Consortium et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021).
Navarro Gonzalez, J. et al. The UCSC Genome Browser database: 2021 update. Nucleic Acids Res 49, D1046–D1057 (2021).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Watanabe, K., Taskesen, E., Van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nature Communications 8, 1–11 (2017).
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Research 38, 1–7 (2010).
Bu, D. et al. KOBAS-i: intelligent prioritization and exploratory visualization of biological functions for gene enrichment analysis. Nucleic Acids Research 49, (2021).
Bulik-Sullivan, B. et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature Genetics 47, 291–295 (2015).
Nievergelt, C. M. International meta-analysis of PTSD genome-wide association studies identifies sex-and ancestry-specific genetic risk loci. doi:10.1038/s41467-019-12576-w.
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: A tool for genome-wide complex trait analysis. American Journal of Human Genetics 88, 76–82 (2011).
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat Commun 10, 1776 (2019).
Sullivan, P. F. et al. Psychiatric Genomics: An Update and an Agenda. doi:10.1176/appi.ajp.2017.17030283.
Goodwin, R. D., Fergusson, D. M. & Horwood, L. J. Neuroticism in adolescence and psychotic symptoms in adulthood. Psychological Medicine 33, 1089–1097 (2003).
Varese, F. et al. Childhood adversities increase the risk of psychosis: A meta-analysis of patient-control, prospective-and cross-sectional cohort studies. Schizophrenia Bulletin 38, 661–671 (2012).
Wang, K. et al. PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. doi:10.1101/gr.6861907.
Sayers, E. W. & Karsch-mizrachi, I. Chapter 1 Using GenBank. 1374, 1–23.
Warland, A., Kendall, K. M., Rees, E., Kirov, G. & Caseras, • Xavier. Schizophrenia-associated genomic copy number variants and subcortical brain volumes in the UK Biobank. Molecular Psychiatry 25, 854–862 (2020).
Quinlan, A. R. & Hall, I. M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Team, R. C. R: A language and environment for statistical computing. Preprint at (2021).
Nakato, M. et al. ABCA13 dysfunction associated with psychiatric disorders causes impaired cholesterol trafficking. Journal of Biological Chemistry 296, 100166 (2021).
Egbujo, C. N., Sinclair, D. & Hahn, C.-G. Dysregulations of Synaptic Vesicle Trafficking in Schizophrenia. Curr Psychiatry Rep 18, 77 (2016).
Schubert, K. O., Föcking, M., Prehn, J. H. M. & Cotter, D. R. Hypothesis review: are clathrin-mediated endocytosis and clathrin-dependent membrane and protein trafficking core pathophysiological processes in schizophrenia and bipolar disorder? Mol Psychiatry 17, 669–681 (2012).
Anttila, V. et al. Analysis of shared heritability in common disorders of the brain. Science 360, (2018).
Ross, R. G., Heinlein, S. & Tregellas, H. High rates of comorbidity are found in childhood-onset schizophrenia. Schizophrenia Research 88, 90–95 (2006).
Hamshere, M. L. et al. Shared polygenic contribution between childhood attention-deficit hyperactivity disorder and adult schizophrenia. British Journal of Psychiatry 203, 107–111 (2013).
Bearden, C. C. et al. Prevalence of Rate of Deleterious Copy Number Vari- ants Similar in Early Onset Psychosis and Autism Spectrum Disorders: Implications for Clinical Practice. Biological Psychiatry 91, S56–S57 (2022).
Kaufman, J. & Torbey, S. Child maltreatment and psychosis. Neurobiology of Disease 131, 104378 (2019).
Malhotra, D. & Sebat, J. CNVs: Harbingers of a Rare Variant Revolution in Psychiatric Genetics. Cell 148, 1223–1241 (2012).
Goh, L. & Yap, V. B. Effects of normalization on quantitative traits in association test. BMC Bioinformatics 10, (2009).
McCaw, Z. R., Lane, J. M., Saxena, R., Redline, S. & Lin, X. Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies. Biometrics 76, 1262–1272 (2020).
Cai, N. et al. Minimal phenotyping yields genome-wide association signals of low specificity for major depression. Nature Genetics 52, 437–447 (2020).

The authors have declared there is NO conflict of interest to disclose

supplementarytablesv4.xlsx
Related Manuscript File
SupplementaryNotev4.docx

Download PDF

Journal Publication

published 13 Jun, 2023

Read the published version in Translational Psychiatry →

Editorial decision: revise
16 Feb, 2023
Review #1 received at journal
07 Feb, 2023
Review #2 received at journal
04 Feb, 2023
Reviewer #2 agreed at journal
23 Jan, 2023
Reviewer #1 agreed at journal
21 Jan, 2023
Reviewers invited by journal
19 Jan, 2023
Submission checks completed at journal
18 Jan, 2023
First submitted to journal
17 Jan, 2023
Unknown event
17 Jan, 2023
Editor assigned by journal
17 Jan, 2023

You are reading this latest preprint version

Common genetic variants contribute to heritability of age at onset of schizophrenia

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Material And Methods

Results

Discussion

Conclusions

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1