Genetic determinants of centenarian longevity, as quantified by the 'CentPGS' score, are associated with a lower risk of multiple age-related diseases and a longer healthspan.

doi:10.21203/rs.3.rs-3916561/v1

Download PDF

Article

Genetic determinants of centenarian longevity, as quantified by the 'CentPGS' score, are associated with a lower risk of multiple age-related diseases and a longer healthspan.

https://doi.org/10.21203/rs.3.rs-3916561/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Centenarians exhibit remarkable longevity, and exploring the genetic determinants of that longevity is crucial for understanding the mechanisms of human ageing. Although APOE4 is the most common implicated negative factor in longevity, other genetic factors and their associated phenotypes are not fully understood. We conducted a genome-wide association study (GWAS) of 964 Japanese centenarians (including 173 supercentenarians) and 7,306 controls to identify the genetic components of longevity and the correlated phenotypes. GWAS summary statistics revealed that the genetic components of longevity were negatively associated with the risk of multiple age-related diseases and biometrics. Survival analysis indicated that a polygenic score derived from these summary statistics, called CentPGS, was correlated with healthspan in a cohort of healthy older people. This association was independent of APOE4 genotype and sex, suggesting that CentPGS is a promising genetic indicator of healthspan and may be used in future investigations into healthy longevity.

Health sciences/Medical research/Genetics research

Health sciences/Health care/Public health/Epidemiology

Human longevity and healthy ageing are complicated phenotypes that are shaped not only by the absence of disease but also by genetic predisposition, lifestyle, and social environments¹. Although many biological mechanisms of ageing have been elucidated and proposed by studies in cells and animal models, it is still difficult to analyse the identified biological mechanisms in humans to determine whether they are relevant to human ageing pathways. Centenarians and supercentenarians (≥100 or ≥110 years old, respectively) generally avoid or experience a delay in the onset of age-related diseases²; maintain physical and cognitive independence at an extremely advanced age^3,4; retain organ reserves in the heart, kidneys and liver⁵; and possess heritable genetic components that may confer a survival advantage⁶. Understanding the biological mechanisms underlying the characteristics of supercentenarians might provide insight into how this trait may be extended to the general population.

Genome-wide association studies (GWASs) are powerful approaches for identifying key genes and genetic components associated with complex traits. Many GWASs have been performed for multiple age-related traits^7-10, healthy ageing¹¹; healthspan, which is defined as the period of life spent in good health free from chronic diseases and the disability of ageing¹²; parental lifespan¹³; and multivariate analyses of these ageing factors^14-16. While several loci have been identified in these GWASs, the APOE locus remains the most significant for age-associated traits^12-16. Actually, the allele frequency of APOE4 has been reported to decrease with age in older people^17,18.

To detect genetic differences reliably, the statistical power of GWASs can be generally improved by increasing the number of targets or study participants with highly heritable components. Therefore, studying centenarians, especially supercentenarians, who are more likely to have highly heritable traits, is important ¹⁹ for determining the genetic components underlying extreme longevity. Furthermore, recent technological advances have enabled us to analyse the genetic correlation between GWAS summary statistics²⁰ and polygenic risk scores (PRSs)²¹. Several studies have utilized PRSs to understand ageing and longevity^22-24; however, only a few observable phenotypes correlated with the longevity PRS have been reported²².

Here, we report a GWAS of 964 Japanese centenarians (including 173 supercentenarians) and 7,306 controls aiming to explore the genetic components underlying extreme human longevity. We subsequently applied GWAS summary statistics and PRS analyses and evaluated the associations of these parameters with observed traits. Furthermore, we also analysed healthy individuals aged 85-89 years since traits correlated with these genetic components are thought to be mainly observed in this slightly younger age group. We showed that a polygenic score derived from the centenarian GWAS summary statistics, called CentPGS, was correlated with healthspan in a cohort of healthy agers independent of APOE4 genotype and sex. These analyses comprising both centenarians and healthy agers reveal the genetic components of extreme human longevity, which might be shared with those of healthspan or other age-related traits in the general population.

Study populations

To identify the genetic components enriched in centenarians, we isolated genomic DNA (gDNA) from 967 Japanese centenarians (Cent) and 30,081 Japanese controls (Table 1, Extended Data Figure 1). The gDNA sequences of these samples were determined by whole-genome sequencing (WGS) or DNA microarray analysis with imputation. After removal of samples according to the exclusion criteria, 964 centenarians and 7,304 controls (Cont(GWAS)) were used for the GWAS, and 10,000 controls (Cont(PRS)) were used for PRS analysis (Supplementary Figure 1). The Japanese healthy agers (HA) was composed of individuals aged 85-89 years who maintained their independence (Extended Data Figure 2). The gDNA sequences of the HA were determined by DNA microarray with imputation, and 1,016 HAs were used for further analyses after removal of sequences similar to those of the Cent.

Genome-wide association study of Japanese centenarians

The GWAS was analysed with 21 covariates (sex and principal component (PC)1-20 in principal component analysis (PCA)), and GWAS statistics were combined using N-weighted multivariate GWAMAs²⁵ (Supplementary Figure 2). Four lead single-nucleotide variants (SNVs) were isolated in three genes (APOE, EYS and GRM7; Figure 1a, Supplementary Table 1). For the APOE locus, we found two groups of associated SNVs corresponding to the APOE4 missense mutation rs429358 and the APOE2 missense mutation rs7412 (Figure 1b). For both the EYS and GRM7 loci, the lead SNVs (rs75571981 (EYS) and rs73116078 (GRM7)) were located in the intron regions (Figure 1b).

Characteristics of lead SNVs in Japanese centenarian GWASs

Next, we compared the minor allele frequencies (MAFs) of these SNVs among the Cont(GWAS), HA, and Cent. The MAFs associated with the lead SNVs for APOE4 and rs73116078 (GRM7) gradually decreased with age in both the Cent and HA, whereas those associated with APOE2 and rs75571981 (EYS) increased only in the Cent (Figure 1c, Extended Data Figure 3). We then analysed the association of four SNVs with gene expression using the Genotype-Tissue Expression database²⁶. However, no expression data for rs75571981 (EYS) or rs73116078 (GRM7) were obtained. Therefore, we performed an additional analysis of these loci via the Enformer tool²⁷, which predicts gene expression from sequences by integrating long-range interactions. The APOE4 and APOE2 were shown to potentially affect gene expression, while rs75571981 (EYS) and rs73116078 (GRM7) were less likely to be involved.

Generalized gene-based analysis in the Japanese centenarian GWAS and overlap analysis with known age-related genes

To analyse genes including small effects on longevity, GWAS summary statistics of Japanese centenarians were also analysed via GWAS gene set analysis using the multimarker analysis of genomic annotation (MAGMA) method²⁸. This analysis identified three significant genes (APOE, NARS, and FAM188B) (P<0.05/17990) according to Bonferroni’s multiple comparison test; 13, 45, 284, and 1,099 genes were detected at significance values of P<1.0x10^-4, P<1.0x10^-3, P<0.01, and P<0.05, respectively (Extended Data Figure 4a and b). MAGMA tissue expression analysis revealed that genes with P<0.01 (termed “Cent.genes”) were significantly expressed in ten tissues (Supplementary Figure 3). To understand the features of the Cent.genes, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed. However, no significant GO or KEGG pathways were identified. We subsequently compared the overlap of our genes with known age-related gene lists, including a longevity map, a cell age map, a GenAge map in Human Ageing Genomic Resources²⁹, and KEGG longevity regulating pathway genes³⁰. We also analysed genes from the type 2 diabetes (T2D) risk locus³¹ as a reference. Among these, the highest percentage of overlapping genes was 4.55% for T2D vs. Cent.genes, indicating no obvious overlap between Cent.genes and known age-related gene lists (Extended Data Figure 4c and Supplementary Table 2).

SNP-based heritability, l_GC, and genetic correlation analyses using Japanese centenarian GWAS summary statistics

To estimate SNP-based heritability (SNPh2), l_GC, and genetic correlation, we analysed the GWAS summary statistics of Japanese centenarians²⁰. We also examined the summary statistics after removing the loci of APOE, EYS and GRM7 to evaluate the effects of SNVs in these three genes. The SNPh2 calculated from the Japanese centenarian GWAS data was 0.204, which was comparable to the SNPh2 calculated without the three genes and that of a European 90th percentile GWAS but lower than that of a European 99th percentile GWAS (Figure 1d)⁹. The l_GC of the centenarian GWAS was 1.04, which was similar to that of the centenarian GWAS without the three genes and the European 90th and 99th percentile GWASs, suggesting that no obvious founder effect influenced the genetic components of Japanese centenarians (Figure 1e). Genetic correlation analysis revealed that the summary statistics of the Japanese centenarian GWAS were positively correlated with those of the European 90th and 99th percentile groups, but the correlation coefficients were 0.331 and 0.191, respectively (Figure 1f). Negative genetic correlations with age-related diseases, including Alzheimer’s disease (AD); several cardiovascular diseases; T2D; and several biometrics, including blood pressure, glucose metabolism, and liver function were identified in the Japanese centenarian GWAS summary statistics (Figure 1f). Heritability enrichment analysis with the Japanese centenarian GWAS summary statistics revealed no specific heritability enrichment in any tissue (Extended Data Figure 5). Taken together, these results suggest that the genetic components of Japanese longevity are correlated with genetic components for multiple age-related diseases and biometrics.

Differences in the PRS distribution between centenarians and controls

To elucidate the differences in genetic components between centenarians and controls, we compared the distributions of PRSs calculated using 62 phenotypes from centenarians and an additional 10,000 controls (Cont(PRS)). The PRS was calculated with SNVs with a significance greater than P=0.01, and the output was converted to the sex-adjusted Z score of PRS least square means (lsmeans) and standardized by the PRS distribution of Cont(PRS). The analysis of the PRS lsmeans distribution revealed that distributions of 7 phenotypes (mean arterial pressure (MAP), systolic blood pressure (SBP), diastolic blood pressure (DBP), gamma-glutamyl transferase (GGT) level, alanine aminotransferase (ALT) level, congestive heart failure (CHF), and basophil (Baso) counts) were significantly different in both the Cent and HA compared with the Cont(PRS) (Supplementary Table 3). Moreover, the PRS lsmeans distributions for 7 phenotypes (C-reactive protein (CRP) level, blood sugar (BS) level, pulse pressure (PP), aspartate aminotransferase (AST) level, arrhythmia, T2D, and total cholesterol (TC) level) were significantly different between the Cent and the Cont(PRS). Finally, the PRS lsmeans distribution in patients with asthma significantly differed in the HA compared with the Cont(PRS) according to Bonferroni’s multiple comparison test (P<0.00079; Figure 2a, b).

Multivariate logistic regression analysis of genetic components among centenarians and controls

To estimate the differences in the genetic components between centenarians and controls, least absolute shrinkage and selection operator (LASSO) and multiple logistic regression analyses were performed using 62 PRSs and the genotypes of the four SNVs (APOE4, APOE2, SNV(EYS), and SNV(GRM7)). We identified 12 PRSs and three SNVs that significantly contributed to the difference in genetic components between centenarians and Cont(PRS) individuals (Figure 2c). Receiver operating characteristic curve analysis revealed that the area under the curve (AUC) for 12 PRSs and three SNVs was 0.694, and the AUC for 12 PRSs was greater than that for the three SNVs, suggesting that the polygenic components associated with Japanese longevity are greater than the genetic components derived from three SNVs, including APOE4 and APOE2 (Figure 2d, e).

Establishment of CentPGS, the polygenic score based on Japanese centenarian GWAS summary statistics

To isolate the phenotype associated with longevity, we attempted to develop "CentPGS", which is a standardized polygenic score based on 53,760 SNVs with p<0.01 from the Japanese centenarian GWAS summary statistics. The mean CentPGS of the HA was 0.292, which was significantly greater than that of the Cont(PRS) (P<0.001; Figure 2f). In contrast, the mean CentPGSs of the Cent and Cont(GWAS) were 3.413 and -0.374, respectively, which were considered to be inflated because these were used for the GWAS. To assess the reproducibility of CentPGS, we validated it in 60 Japanese centenarians enrolled in the 5-COOP project at age 100-101 years³² and 36 siblings of the Japanese centenarians enrolled at age 90-104 years, none of which were included in the GWAS. The mean CentPGS of the additional 60 centenarians was 1.053, which was significantly greater than that of the control and HA groups (P<0.001). In contrast, the mean CentPGS of the 36 siblings of the centenarians was 1.794, which was also expected to be affected by inflation (Supplementary Figure 4). These results indicated that CentPGS could be a novel polygenic score candidate for evaluating genetic components associated with longevity.

Multivariate logistic regression analysis of genetic components among the HA and control

Next, we evaluated the genetic differences between the HA and Cont(PRS) through a series of LASSO and multiple logistic regression analyses using CentPGS, 62 PRSs and genotypes of the four SNVs (APOE4, APOE2, SNV(EYS), and SNV(GRM7)). The distributions of the 8 PRSs (Baso count, eosinophil (Eosino) count, potassium (K), GGT, monocyte (Mono) count, CHF, asthma, and MAP), CentPGS and 1 SNV(APOE4) were significantly different between the HA and Cont(PRS) (Figure 2g). Among these genetic components, CentPGS had the highest effect (odds ratio (OR) (95% confidence interval(CI)): 1.33 (1.24-1.42)), indicating that CentPGS represented the largest known genetic component of HAs, rather than APOE4 (OR (95%CI): 0.84 (0.78-0.91)).

Survival analysis revealed that CentPGS is correlated with healthspan in HAs

The genetic components identified in the centenarian GWAS should be associated with survival later in life if they are indeed related to longevity. Therefore, a survival analysis was performed for the HAs. HAs were stratified into three groups according to their genotype (APOE, SNV(EYS), or SNV(GRM7)) and were also stratified into two groups based on CentPGS (low and high CentPGS). Survival analyses were performed with two endpoints (lifespan and healthspan) for men and women separately since sex was identified as a strong determinant of both endpoints (Extended Data Figures 6 and 7). Kaplan–Meier survival analysis for lifespan revealed a significantly greater survival probability in HA men with a high CentPGS than in men with a low CentPGS, but a similar result was not observed for HA women (Figure 3a). Kaplan–Meier survival analysis for lifespan revealed that the survival probability of HAs with a low CentPGS was significantly greater than that of HAs with a high CentPGS for both men and women (Figure 3b). Survival analysis via regularized multiple Cox regression revealed that CentPGS and sex were independently associated with lifespan and that APOE4, CentPGS and sex were independently associated with healthspan (Figure 3c). Taken together, these results suggest that CentPGS is correlated with healthspan and lifespan in a cohort of healthy older people.

CentPGS was not associated with duration-related or observable traits

Next, we evaluated whether CentPGS was associated with duration-related traits, such as survival days, or observable traits, such as the frailty index score at the time of enrolment, in HAs. For duration-related traits, regularized multiple Cox regression analyses showed that CentPGS and four SNVs were not significantly associated with either survival days or remaining days of healthspan in participants from the time of enrolment (Extended Data Figure 8a, b, Supplementary Figures 5 and 6). We then analysed the association between CentPGS and the lifespan-healthspan gap, which is the number of survival days from the end of the healthspan. Kaplan–Meier survival analysis revealed that sex was a significant genetic factor for the lifespan-healthspan gap, whereas regularized multiple Cox regression analysis revealed that sex and age at the end of healthspan were significantly associated with the lifespan-healthspan gap, but CentPGS and the four SNVs were not associated with the gap (Figure 3d, e).

We then analysed the association between genetic factors (CentPGS and four SNVs) and standardized observable traits using a generalized linear regression adjusted for sex and age at enrolment. We found known associations between the APOE4 genotype and history of dementia and between the APOE2 genotype and low-density lipoprotein cholesterol (LDLC) concentration, but no significant associations were observed between CentPGS and any observable traits, including known ageing-associated traits (Figure 3f). Taken together, these results indicated that CentPGS was correlated with healthspan but not with duration-related or observable traits in the HA.

Genetic components retained in the oldest human agers, those who survive more than the 99.99th percentile of the population

To understand the genetic factors associated with lifespan among centenarians, a survival analysis was performed using CentPGS, four SNVs, and sex. Survival analysis revealed that sex was the only genetic factor associated with age at death, with women having greater longevity (Figure 4ab, Extended Data Figure 9). In addition, these genetic factors were not associated with survival days in participants at the time of enrolment (Extended Data Figure 8c, Supplementary Figure 7). These results indicate that CentPGS represents genetic components are not associated with lifespan beyond 100 years of age. We then analysed the associations between genetic factors (CentPGS and four SNVs) and standardized observable traits in centenarians using generalized linear regression adjusted for sex and age at enrolment. We found known associations between the APOE4 genotype and extended clinical dementia grade (ex.CDR)³³ and APOE2 genotype and LDLC level, but no significant associations were observed between CentPGS and known age- or survival-related traits, with similar results to those of HAs (Extended Data Figure 10). To determine the genetic factors associated with the extreme longevity studied thus far, we compared the genetic components between two groups of centenarians stratified by lifespan. To increase the accuracy of this stratification, centenarians were first divided by sex and subsequently divided into those above and below the 99.99th percentile among those with the same year of birth (i.e., oldest human agers [OHAs] and normal centenarians [nCent]; Figure 4c). The standardized PRS lsmeans distributions for eight phenotypes (DBP, SBP, MAP, BS, T2D, PP, CRP, and Baso) in the OHA and nCent groups were both shown to be significantly different than those in the Cont(PRS) group; additionally, a significant difference was observed between the OHA and Cont(PRS) groups among those with the AST phenotype (P<0.00079, Figure 4d, e and Supplementary Table 4). Although the PRS lsmeans distribution for fibrinogen (Fbg) did not significantly differ between the nCent and Cont(PRS) groups or between the OHA and Cont(PRS) groups, we found a significant difference in Fbg between the nCent and OHA. A series of LASSO and multiple logistic regression analyses showed that 2 PRSs (Fbg and AST) and SNV(APOE4) that were significantly associated with the difference in the genetic components between the OHA and nCent (Figure 4f). These results indicate that CentPGS is not associated with lifespan after 100 years in the same way as it is in HAs, and OHAs, who have extreme longevity, have been shown to retain partly enhanced (APOE4 and AST) or additional genetic components (Fbg) compared to the genetic components in the nCent population.

Centenarians maintain their independence at an extremely advanced age; therefore, a genetic study of centenarians is an opportunity to study the genetic components of longevity. In this study, we conducted a GWAS of 964 Japanese centenarians and identified genetic components associated with extreme longevity. The Japanese centenarian GWAS has the following novel characteristics: 1) as a longevity GWAS, 77.7% of the study sample consisted of individuals surviving longer than 99.9% of the Japanese population; and 2) this study presented data on longevity in a non-European population, which allowed us to understand the genetic components of longevity commonly found in the European-Asian population^9,10. Genetic correlation and PRS analyses indicated that the genetic components of centenarians are correlated with those of multiple age-related diseases. Furthermore, the centenarian GWAS summary statistics allowed us to calculate the polygenic score "CentPGS", and the APOE4 genotype, CentPGS, and sex were independently associated with healthspan in a cohort of healthy older individuals. Moreover, the APOE4 genotype and genetic components associated with Fbg and AST were significantly more common in the OHA than in the nCent. Taken together, the results of this Japanese centenarian GWAS revealed that longevity has multiple genetic components, including a decreased susceptibility to multiple age-associated diseases; that the trait most associated with the genetic components of centenarians was healthspan; and that most centenarians have a longer, genetically encoded healthspan. Furthermore, the CentPGS findings in healthy older people suggest that healthy older people and centenarians share some or most of the genetic components for healthy longevity and that becoming a centenarian is an extension of healthy longevity.

Our GWAS showed that the most common distinguishing genetic components for longevity were three loci (APOE, EYS, and GRM7), with four leading SNVs and a decreasing allele frequency for APOE4 with age, which is consistent with the findings of previous studies in both European and Asian populations^9,17,18. APOE4 is the largest known genetic risk factor for Alzheimer's disease³⁴, suggesting that age-related dementia is a major risk factor associated with reduced longevity across populations. Recently, APOE4 has been reported to be involved in cerebral amyloid angiopathy³⁵ and blood‒brain barrier dysfunction, predicting cognitive decline³⁶, suggesting that the function of APOE4 in the context of longevity should be elucidated soon.

Among the other key loci identified in the present study, GRM7 encodes glutamate metabotropic receptor 7, which is responsible for neurodevelopmental disorders such as seizures, hypotonia, and brain imaging abnormalities (NEDSHBA)³⁷. Furthermore, loss of the metabotropic glutamate receptor in Drosophila has been observed to cause age-related sleep disruption and a short lifespan³⁸, suggesting that the GRM7 locus might affect human lifespan and/or healthspan through brain dysfunction.

Finally, EYS encodes the eyes-shut homologue, an orthologue of Drosophila eyes shut/spacemaker, and is mutated in autosomal recessive retinitis pigmentosa 25³⁹. The EYS homologue was also reported to be important for maintaining photoreceptor morphology and visual function in zebrafish⁴⁰. Although no association between EYS and lifespan has been reported, the EYS locus may be associated with visual function in older individuals. Although our data for the GRM7 and EYS genes as causal genes are inconclusive, these genes may be important candidates for identifying longevity-associated genes through further studies.

Genetic correlation analysis revealed that the GWAS summary statistics of Japanese centenarians were correlated with those of several age-related diseases and biometrics, such as T2D incidence, CVD incidence, blood pressure, and blood glucose. The current findings are consistent with the reported results of genetic correlation analyses with healthspan, lifespan and longevity in European populations^9,13,14,16, suggesting that the genetic components associated with these age-related phenotypes are likely to be conserved between European and Asian populations.

Another unique aspect of this study was the comparison between the OHA and nCent, in which the APOE4 genotype and the polygenic components of AST and Fbg were identified as unique genetic features distinguishing these groups. The polygenic component of AST has been identified in Japanese centenarians by genetic correlation and PRS analyses but not in European centenarians, suggesting that AST is not only unique to Asian centenarians but also effective at limiting the human lifespan. We also identified the polygenetic component of Fbg as a genetic factor for extreme longevity. For example, hypercoagulability, a significant increase in plasma fibrinogen, has been reported in healthy centenarians⁴¹. Although the clinical significance of fibrinogen in human longevity has yet to be elucidated, recent whole-genome analysis of plasma fibrinogen has identified polygenic components shared with liver enzymes and liver regulatory elements⁴², suggesting potential roles for liver homeostasis in achieving extreme longevity in humans.

This study had the following limitations: 1) our cohort of Japanese centenarians was relatively small compared with the sample sizes of typical GWASs. We considered that our additional analysis with CentPGS and trait data in HAs would partially compensate for this weakness. Replication and meta-GWAS analyses are needed to validate this study. 2) A sex-stratified analysis was not possible for the centenarian GWAS because only 15% of centenarians were men, which may have prevented the identification of sex-specific genetic components associated with longevity. 3) The significance of CentPGS in younger populations is unclear. The phenotyping of the genetic components of centenarians might be less effective in younger individuals; thus, a larger sample size is needed for further analysis. 4) Rare variant information for Japanese centenarians was not included. A recent whole-exome study of 515 Ashkenazi Jewish centenarians reported enrichment of rare coding variants in the insulin/IGF-1 and AMPK signalling pathways⁴³. Accordingly, we sequenced the whole genomes of 529 centenarians, and a rare variant analysis is underway. 5) The genetic components of the Japanese centenarian GWAS statistics are still not fully understood. To overcome these limitations, longitudinal analyses of multiage cohorts or cellular experiments involving induced pluripotent stem cells from centenarians are important.

In conclusion, we established CentPGS, which reflects the genetic components extracted from a centenarian GWAS that are associated with healthspan in older individuals, through combined analyses of genomic and trait data from both centenarians and healthy older people. We expect that the genetic components represented by CentPGS are also associated with resilience to age-related pathologies; it will be important for future research to analyse how genetic components, observable biomarkers, and environmental factors interact with each other and relate to dynamic changes in healthspan and health resilience. Furthermore, CentPGS is expected to also be used in gene–environmental interaction analysis to numerically evaluate the interaction effect of environmental or behavioural factors on healthspan, which may lead to the identification of preventive interventions to promote healthy ageing in the broader population. We believe that CentPGS, which numerically quantifies the genetic components of longevity in individuals, will play an important role in future investigations on healthy longevity.

Selection and recruitment of the Japanese Cent, HA, and Cont groups

For the centenarian group, we used data from two prospective cohort studies of the oldest individuals in Japan, the Tokyo Centenarian Study (TCS) and the Japan Semisupercentenarian Study (JSS)^3,44. The cut-off date for the data collected from the Cent group was May 31, 2022. Among the 967 centenarians from whom gDNA was obtained, 964 centenarians were included, with data from three centenarians who had mismatched sex information or were assigned to close relatives (PI HAT of 0.1875 or higher, which represents the half-way point between 2nd- and 3rd-degree relatives) excluded; this resulted in a study population comprising 144 men and 820 women (female-to-male ratio: 0.851) with a median age of 106.0 years [interquartile range (IQR): 103.9-107.1]. Among these individuals, there were 915 deaths (death rate: 0.969), with a median age at death of 107.5 years [IQR: 105.7-109.2] (Table 1, Extended Data Figure 1). To validate CentPGS, an additional 60 Japanese centenarians aged 100-101 years from the 5-COOP project³² and 36 siblings aged 90-104 years were enrolled in this study. These additional samples were not excluded because they met the same exclusion criteria as the centenarians, but the close relative criterion was not applied to the siblings of the centenarians.

For the HA group, data from the Kawasaki Ageing and Wellbeing Project (KAWP), a prospective cohort study of older adults aged between 85 and 89 years with no limitations in performing activities of daily living (ADLs) at baseline, were used (Extended Data Figure 3)^45,46. The cut-off date for the KAWP data was Sept. 30, 2022. Among the 1,026 participants in the KAWP, two individuals were excluded due to a lack of permission to determine their gDNA sequence, and eight individuals were excluded due to being close relatives; thus, 1,016 individuals were enrolled as healthy agers (HAs; 513 men and 503 women; female-to-male ratio: 0.495), with a median age of 86.8 years [IQR: 85.9-88.2], 168 deaths (death rate: 0.169), and a median age at death of 90.4 years [IQR: 89.0-91.8] (Table 1, Extended Data Figure 1).

The Tohoku Medical Megabank (TMM) project comprises community-based prospective cohort studies that include a population‐based adult cohort⁴⁷ and a three‐generation cohort⁴⁸. Among 30,081 participants, 11,519 individuals from the three-generation cohort were excluded because they were close relatives, 1,237 individuals were excluded due to lack of phenotype, and 9 individuals were excluded because the data were identified as PCA outliers from the Japanese cluster; thus, 17,306 individuals were enrolled as Japanese controls (Cont; 4,728 women (female-to-male ratio: 0.647) with a median age of 45 years [IQR: 32-63]) (Table 1, Extended Data Figure 1). Allele frequency data for ToMMo38k were downloaded from jMorp⁴⁹ [https://jmorp.megabank.tohoku.ac.jp]. Written informed consent was obtained from the participants. The ethics committee of Tohoku University approved the protocol for all of the cohort studies (ID: 2023-4-097, 2023-4-098), which have been previously described^47,48.

Genomic DNA extraction

For individuals in the Cent and HA groups, total gDNA was extracted from whole blood using a FlexGene DNA Kit (Qiagen, Hilden, Germany). We confirmed the quality of the gDNA by agarose gel electrophoresis and found that the gDNA was not degraded. For individuals in the Cont group, total gDNA was extracted from whole blood as previously described⁴⁹.

Whole-genome DNA sequencing and analysis

For the Cent and HA groups, whole-genome DNA sequencing was performed for 526 centenarians via the HiSeq2500, HiSeqX, or NovaSeq 6000 platforms as previously described⁵⁰. The whole-genome DNA sequence of 3,340 Japanese controls was determined using whole-genome DNA sequencing with HiSeq 2500 or NovaSeq 6000 as previously described⁴⁹. Resequencing analysis was performed as described by Tadaka et al. with minor modifications⁴⁹. In brief, a workflow known as the GATK Best Practices workflow, which is becoming the standard procedure globally for whole-genome resequencing analysis⁵¹, was used. Then, base quality score recalibration (BQSR) was applied to effectively reduce sequencer-specific bias.

Genotyping and imputation using a DNA microarray

The genotypes of 0.65 M SNVs of 441 centenarians, 60 additional centenarians, 36 siblings of centenarians, and 26,741 controls were determined using an Axiom Japonica Array NEO according to the manufacturer’s protocol. The genotypes of 0.65 M SNVs of 1,015 individuals in the KAWP were determined using an Infinium Asian Screening Array-24 v1.0 BeadChip Kit according to the manufacturer’s protocol. All DNA microarray images were analysed using previously described protocols ⁴⁹. Genotypic imputation was performed to estimate the genetic variants among the genotypes identified by a DNA microarray. Prior to genotypic imputation, prephasing was performed by SHAPEIT⁵² version v2.r387 with the duoHMM method and a window size of 5. After this prephasing step, genotypic imputation was carried out using IMPUTE2⁵³ version 2.2.2 with the ToMMo 3.5KJPNv2 haplotype reference panel. For IMPUTE2, we used the following options: prephased haplotypes (-use_prephased_g), effective population size (-Ne) 20000, and number of reference haplotypes (-k_hap) 7000. In the imputation process, we divided each autosome into 3 Mb chunks for entry. After the imputations for each chunk were completed, we concatenated these imputed chunks to reconstruct a contiguous autosome.

Outlier identification by PCA

To identify outliers in the Japanese population, gDNA sequence samples from the Cent, HA, and Cont groups were subjected to PCA along with 2,504 gDNA sequences determined from the 1000 Genomes Project⁵⁴, which contains data from 26 human races, including Asian, European, and African populations. Common SNVs between samples in the present study and the 1000 Genomes samples were extracted, SNVs were pruned using PLINK (version 1.90), and PCs 1-20 were computed using the "pca" command in PLINK ⁵⁵.

GWAS and meta-GWAS analysis

To identify relevant SNVs in Japanese centenarians, we compared gDNA sequences between the Cent and Cont groups via a GWAS. WGS and DNA microarray-imputation samples were subjected to GWAS analysis separately using PLINK 1.90⁵⁵ and merged as a meta-GWAS using N-weighted multivariate GWAMAs²⁵. For N-weighted multivariate GWAMA data, summary statistics for 5.98 M SNVs commonly found between WGS and DNA microarray-imputation samples were extracted. The cross-trait intercept between the WGS and DNA microarray-imputation samples was 0.0606, and the SNP heritability values of the WGS and DNA microarray-imputation samples were 0.3211 and 0.2757, respectively. A Manhattan plot was generated using the "qqman" package (version 0.1.8) in R⁵⁶. An enlarged view of a Manhattan plot with recombination rate information was generated using LocusZoom (version 1.3)⁵⁷.

Calculations of SNPh2, l_GC and genetic correlation using LDSC and GWAS summary statistics

SNP heritability, l_GC, and genetic correlations were calculated using LDSC (version 1.0.1)²⁰. For longevity-associated GWAS summary statistics, the European 90th/99th survival percentiles⁹ and parental lifespan (PLS)¹³ were used. Japanese disease and quantitative GWAS summary statistics were downloaded from the RIKEN JENGER server (http://jenger.riken.jp)⁵⁸, and Japanese GWAS summary statistics for Alzheimer’s disease were downloaded from NBDC (https://humandbs.biosciencedbc.jp/hum0237-v1)⁵⁹.

Transcript expression and promoter prediction analyses to identify relevant tissue-specific genes

The SNV-associated genes associated with e quantitative trait locus (eQTLs), sQTLs, ieQTLs, and isQTLs were analysed using the Genotype-Tissue Expression (GTEx) portal (V8, https://www.gtexportal.org/home/)²⁶. The promoter predictions were performed according to the Enformer promoter prediction method²⁷. Briefly, the Enformer architecture consists of three parts: (1) 7 convolutional blocks with pooling, (2) 11 transformer blocks, and (3) a cropping layer followed by final pointwise convolutions branching into two organism-specific network heads. Enformer takes a one-hot-encoded DNA sequence as input and predicts 5,313 genomic tracks for the human genome. Enformer generates a score summarizing the effect of a given variant as a single number per gene by comparing expression predictions for both variant and reference sequences. Finally, we extracted the data corresponding to the lead SNVs in the centenarian GWAS.

Generalized gene-based test for Japanese centenarians via MAGMA

A generalized gene-based test for Japanese centenarians in this GWAS was conducted using the FUMA web application (https://fuma.ctglab.nl/)⁶⁰ to estimate longevity-associated genes. Independent significant SNPs were identified according to their P values (P < 1.0 × 10^-2) and independence (r2 < 0.6 in the 1000 Genomes phase 3 ALL reference panel population) within a 250-kb window. The results of MAGMA gene-set analysis²⁸ were assessed with both GO and KEGG pathway enrichment analysis with the "clusterProfiler" package in R. MAGMA tissue-expression analysis²⁸ was conducted with eQTL data from the Genotype-Tissue Expression project (GTEx v8)²⁶.

The ageing-related gene lists for the longevity map (341 genes, build 3), cell age (866 genes, build 3), and GenAge (307 genes, build 21) were downloaded from the Human Ageing Genomic Resource (https://genomics.senescence.info)²⁹. The longevity-regulating pathway genes (89 genes), including those involved in the IGF/insulin, sirtuin, AMP-AMPK, and TOR pathways (hsa04211), were identified in the KEGG pathway database (https://www.genome.jp/)³⁰. The gene list for the Japanese T2D risk locus (286 genes) was obtained from Imamura et al.³¹. The gene symbols in the gene lists were converted to Entrez IDs using the "bitr" command in the "clusterProfiler" package in R, and the overlapping Entrez IDs were counted in R.

Heritability enrichment analysis against the tissue-specific expressed genes or enhancer regions using LDSC

Heritability enrichment analysis was performed according to the instructions for cell type-specific analyses on GitHub for LDSC (https://github.com/bulik/ldsc/)⁶¹. LD score data for the East Asian population, including 10 cell type group-specific annotations and 220 cell type-specific annotations, were downloaded from the RIKEN Jenner server (http://jenger.riken.jp)⁶².

PRS, Z score of PRS, and lsmean for PRS Z score

PRSs for Japanese centenarians, Japanese diseases, and Japanese quantitative GWAS summary statistics were calculated with PRS-CS (version 1.0.0)⁶³. The SNVs identified from the GWAS summary statistics were screened at a significance level of P < 0.01, the phi parameter was set to 0.01, and the LD reference panels were generated with the EAS reference, which was constructed using the 1000 Genomes Project phase 3 samples. The other parameters used were set to their defaults.

All PRSs were Z score standardized using the mean and standard deviation of the Cont(PRS) group. The average PRS in each group was calculated using the lsmeans method adjusted for sex to account for the different female-to-male ratios among the Cent, HA and Cont groups.

CentPGS

The SNVs identified from the Japanese centenarian GWAS summary statistics were screened at a significance level of P < 0.01. CentPGS was calculated with PRS-CS (version 1.0.0)⁶³ with the phi parameter set to 0.01, and the LD reference panels were generated with the EAS reference. CentPGS was Z score standardized using the mean and standard deviation of the Cont(PRS) group.

Multiple logistic regression analysis for PRS and ROC analysis

For multiple logistic regression analysis using genetic factors, 62 PRSs, the genotypes of 4 SNVs, and CentPGS (only for logistic regression analysis between the HA and Cont(PRS) groups) were evaluated by univariate logistic regression adjusted for sex and LASSO to select covariates for multiple logistic regression analyses among the Cent, HA, and Cont(PRS) groups. All phenotype abbreviations for the 62 PRSs are listed in Supplementary Table 3. After covariates were selected, differences in genetic components among the Cent, HA, and Cont(PRS) groups and in the contributions of PRSs and SNV genotypes were analysed via multiple logistic regression analysis. The ROC plot was generated, and the AUC was calculated using the "pROC" package in R with default parameters.

Kaplan‒Meier and Cox regression analyses and regularized multiple Cox regression survival analyses

For the univariate and multivariate survival analyses, Kaplan‒Meier or regularized multiple Cox regression survival analyses were performed using the "survival" package in R. For the survival analysis endpoints, a telephone survey was used to determine the age of individuals in the Cent group at death, whereas both the telephone survey and medical insurance claims were used to determine the age of those in the HA group at death. Long-term care insurance claims were used to determine age at the end of each individual’s healthspan; specifically, the age at which long-term care grade 2 or greater was confirmed or the age at death in the medical insurance claim was used. These claims data were provided by the local government with the consent of the participants.

Baseline examination for observed traits in the Cent and HA groups

The methods for obtaining all observed traits at the baseline examination except telomere length were described previously^3,44-46. Briefly, both Instrumental Activities of Daily Living (IADL) and Activities of Daily Living (ADL) are questionnaires used to assess the functions necessary for daily living. The timed up and GO (TUG) test, which measures the time required to rise from a chair, walk three metres, turn approximately 180 degrees, return to the chair, and sit down while turning 180 degrees, was used to assess mobility. BMI, SBP, and DBP were measured at the baseline health check. The Mini-Mental State Examination (MMSE; 0-30 points) is a questionnaire used to assess cognitive function⁶⁴. The Extended Clinical Dementia Rating Scale (ex.CDR) is an assessment method for later stages of dementia³³. The Geriatric Depression Scale (GDS) is a questionnaire used to assess levels of depression in older individuals⁶⁵. The frailty index was calculated using the deficit accumulation model proposed by Rockwood⁶⁶. Blood biomarker concentrations, including NTproBNP, cystatin C, and interleukin-6 (IL-6), were measured via ELISA. Blood tests for triglyceride (TG), high-density lipoprotein cholesterol (HDLC), LDLC, choline esterase (CHE), aspartate aminotransferase (AST), haemoglobin A1c (HbA1c), C-reactive protein (CRP), and albumin (ALB) were performed by SRL, a clinical laboratory in Japan. Medical history, including pneumonia, cancer, and dementia, was collected via interviews with a physician (Supplementary Table 5).

Phenotype-genotype correlations

To evaluate correlations between phenotype and genotype, all baseline observed traits were analysed using a generalized linear model with age at enrolment and sex as moderator variables. To compare the coefficients between observed traits, all scores of the observed traits were standardized.

Percentile analysis for Japanese centenarians

To correct for the variances in age that were due to differences in the number of births in each year, percentiles were calculated to map the age at death to the percentile of the population born in the same year. Demographic data for the number of births and the population by age were extracted from Japanese census data for the years 2000, 2005, 2010, 2015, and 2020 and were downloaded from e-Stat, a portal site for Japanese government statistics (https://www.e-stat.go.jp/en/). Finally, individuals exceeding the 90th, 99th, and 99.9th percentile ages were calculated for each birth year, and thresholds for the 90th, 99th, and 99.9th percentiles were determined. Among Japanese centenarians, 77.7% (749/964) were older than the 99.9th percentile.

Statistical analyses

The baseline characteristics, biomarkers, and medical history are expressed as the median and interquartile range or number with a percentage (Supplementary Table 5). Differences in baseline data were evaluated using the Wilcoxon rank-sum test, chi-square test, and Fisher’s exact test (Supplementary Table 5). All the statistical analyses were performed using R (version 4.2.2) with exactRankTests (wilcox.exact, Wilcoxon rank-sum test [version 0.8-31]), glmnet (LASSO and multivariate analyses [version 4.1-8]), survival (survival analysis [survfit, coxph, and cox.zph] [version 3.2-13]), lsmeans (lsmeans [version 2.30-0]), stats (glm [version 4.2.2]), pROC (roc, [version 1.18.4]) and default packages. The statistical significance threshold for multiple logistic regression, regularized multiple Cox regression, Kaplan‒Meier and Wilcoxon rank sum test analyses was set at 0.05, and the statistical significance threshold for the lsmean, multiple regression and genetic correlation with multiple testing was set at 0.05 with Bonferroni correction.

Data availability

The GWAS summary statistics for Japanese centenarians were deposited in the NBDC database and available on the ToMMo jMorp website (https://jmorp.megabank.tohoku.ac.jp). The genomic DNA sequences of Japanese centenarians are available via the intranet at ToMMo in collaboration with Keio University. The genomic DNA sequences of Japanese controls are available in the intranet environment at ToMMo in collaboration with both Keio University and ToMMo. The observed trait data with age for the Cent and HA groups have ethical and legal restrictions on public deposition to avoid personal identification and will be available upon request with an appropriate research arrangement with the approval of the Research Ethics Committee of Keio University School of Medicine for Clinical Research. For requests, please contact Takashi Sasaki (corresponding author) via e-mail: [email protected].

Acknowledgements

We thank Dr. Michiaki Kubo and Dr. Kohei M. Itoh for helping us collect the fundamental data for this Japanese centenarian study. We thank BioBank Japan for providing the Japanese GWAS summary statistics. We thank the staff of Kawasaki City for their help with the KAWP. We thank Ms. Mie Furuhashi for her contribution to the experiments. We thank Ms. Miho Shimura and Ms. Mitsuko Kasahara for their help in recruitment. This research used the supercomputer system and dbTMM provided by the Tohoku Medical Megabank Project. We thank all the participants and family members of the Japanese centenarians, healthy agers, and controls who participated in this study.

Funding:

This study was supported by grants from the Program for an Integrated Database of Clinical and Genomic Information from the Japan Agency for Medical Research and Development (No. 16kk0205009h001, 17jm0210051h0001, 19dk0207045h0001, 22zf0127007h0001); the medical-welfare-food-agriculture collaborative consortium project from the Japan Ministry of Agriculture, Forestry, and Fisheries; the Biobank Japan Program from the Ministry of Education, Culture, Sports, and Technology, the Ministry of Health, Welfare, and Labour for the Scientific Research Projects for Longevity; a Grant-in-Aid for Scientific Research (No. 21590775, 24590898, 15KT0009, 18H03055, 20K20409, 20K07792, 23H03337) from the Japan Society for the Promotion of Science; the Japan Science and Technology Agency (JST) Research Complex Program "Tonomachi Research Complex" Wellbeing Research Campus: Creating new values through technological and social innovation (JP15667051); the Keio University Global Research Institute (KGRI); the Ishii-Ishibashi Fund in Keio University; and the Kanagawa Institute of Industrial Science and Technology (KISTEC).

Corresponding authors

Please direct correspondence to Takashi Sasaki for gDNA data regarding Japanese centenarians and healthy agers. Please direct correspondence to Yasumichi Arai for epidemiological data regarding Japanese centenarians and healthy agers. Please direct correspondence to Kengo Kinoshita regarding inquiries related to the Japanese control data in ToMMo.

Ethical statements

All of the TCS, JSS, and KAWP were managed by the Center for Supercentenarian Medical Research, Keio University School of Medicine. Written informed consent was obtained either from the study participant or from a proxy if the participant lacked the capacity to provide consent. The ethics committee approved all cohort studies of the Keio University School of Medicine (ID: 20021020, 20022020, 20070047, and 20160297). The KAWP is also registered in the University Hospital Medical Information Network Clinical Trial Registry (ID: UMIN000040446 and UMIN000026053). The ethics committee of the Tohoku Medical Megabank Organization approved all cohort studies (2023-4-097, 2023-4-098, and 2017-4-046).

Competing interests

Hideyuki Okano received consulting fees from SanBio Co., Ltd., and K Pharma, Inc., and participated in the Advisory Board for both SanBio Co., Ltd., and K Pharma, Inc. The corresponding author is President of the Japanese Society for Regenerative Medicine and the Japanese Society for Neurochemistry. Yasumichi Arai has received a grant from the Cyclic Innovation for Clinical Empowerment (AMED EKID) and from DAIICHI SANKYO Co., Ltd.

The remaining authors have no competing interests to declare.

Partridge, L., Deelen, J. & Slagboom, P.E. Facing up to the global challenges of ageing. Nature 561, 45-56 (2018).
Terry, D.F., Sebastiani, P., Andersen, S.L. & Perls, T.T. Disentangling the roles of disability and morbidity in survival to exceptional old age. Arch Intern Med 168, 277-83 (2008).
Arai, Y. et al. Physical independence and mortality at the extreme limit of life span: supercentenarians study in Japan. J Gerontol A Biol Sci Med Sci 69, 486-94 (2014).
Andersen, S.L., Sebastiani, P., Dworkis, D.A., Feldman, L. & Perls, T.T. Health span approximates life span among many supercentenarians: compression of morbidity at the approximate limit of life span. J Gerontol A Biol Sci Med Sci 67, 395-405 (2012).
Hirata, T. et al. Associations of cardiovascular biomarkers and plasma albumin with exceptional survival to the highest ages. Nat Commun 11, 3820 (2020).
Perls, T.T. et al. Life-long sustained mortality advantage of siblings of centenarians. Proc Natl Acad Sci U S A 99, 8442-7 (2002).
Uffelmann, E.H., Q.Q.; Munung, N.S.; Vries, J.D.; Okada, Y.; Martin, A.R.; Martin, H.C., Lappalainen, T.; Posthuma, D. Genome-wide association studies. Nature Reviews Methods Primers 1(2021).
Melzer, D., Pilling, L.C. & Ferrucci, L. The genetics of human ageing. Nat Rev Genet 21, 88-101 (2020).
Deelen, J. et al. A meta-analysis of genome-wide association studies identifies multiple longevity genes. Nat Commun 10, 3669 (2019).
Bae, H. et al. A Genome-Wide Association Study of 2304 Extreme Longevity Cases Identifies Novel Longevity Variants. Int J Mol Sci 24(2022).
Erikson, G.A. et al. Whole-Genome Sequencing of a Healthy Aging Cohort. Cell 165, 1002-11 (2016).
Zenin, A. et al. Identification of 12 genetic loci associated with human healthspan. Commun Biol 2, 41 (2019).
Timmers, P.R. et al. Genomics of 1 million parent lifespans implicates novel pathways and common diseases and distinguishes survival chances. Elife 8(2019).
Timmers, P., Wilson, J.F., Joshi, P.K. & Deelen, J. Multivariate genomic scan implicates novel loci and haem metabolism in human ageing. Nat Commun 11, 3570 (2020).
Rosoff, D.B. et al. Multivariate genome-wide analysis of aging-related traits identifies novel loci and new drug targets for healthy aging. Nat Aging 3, 1020-1035 (2023).
Timmers, P. et al. Mendelian randomization of genetically independent aging phenotypes identifies LPA and VCAM1 as biological targets for human aging. Nat Aging 2, 19-30 (2022).
Sasaki, T. et al. Sex-Specific Effects of Apolipoprotein epsilon4 Allele on Mortality in Very Old and Centenarian Japanese Men. J Gerontol A Biol Sci Med Sci 75, 1874-1879 (2020).
Sebastiani, P. et al. APOE Alleles and Extreme Human Longevity. J Gerontol A Biol Sci Med Sci 74, 44-51 (2019).
Garagnani, P. et al. Whole-genome sequencing analysis of semi-supercentenarians. Elife 10(2021).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat Genet 47, 1236-41 (2015).
Choi, S.W., Mak, T.S. & O'Reilly, P.F. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc 15, 2759-2772 (2020).
Tesi, N. et al. Polygenic Risk Score of Longevity Predicts Longer Survival Across an Age Continuum. J Gerontol A Biol Sci Med Sci 76, 750-759 (2021).
Revelas, M. et al. High polygenic risk score for exceptional longevity is associated with a healthy metabolic profile. Geroscience 45, 399-413 (2023).
Gunn, S. et al. Distribution of 54 polygenic risk scores for common diseases in long lived individuals and their offspring. Geroscience 44, 719-729 (2022).
Baselmans, B.M.L. et al. Multivariate genome-wide analyses of the well-being spectrum. Nat Genet 51, 445-451 (2019).
Consortium, G.T. The Genotype-Tissue Expression (GTEx) project. Nat Genet 45, 580-5 (2013).
Avsec, Z. et al. Effective gene expression prediction from sequence by integrating long-range interactions. Nat Methods 18, 1196-1203 (2021).
de Leeuw, C.A., Mooij, J.M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol 11, e1004219 (2015).
Tacutu, R. et al. Human Ageing Genomic Resources: new and updated databases. Nucleic Acids Res 46, D1083-D1090 (2018).
Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M. & Ishiguro-Watanabe, M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res 51, D587-D592 (2023).
Imamura, M. et al. Genome-wide association studies in the Japanese population identify seven novel loci for type 2 diabetes. Nat Commun 7, 10531 (2016).
Herr, M. et al. Frailty and Associated Factors among Centenarians in the 5-COOP Countries. Gerontology 64, 521-531 (2018).
Heyman, A. et al. Early-onset Alzheimer's disease: clinical predictors of institutionalization and death. Neurology 37, 980-4 (1987).
Corder, E.H. et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer's disease in late onset families. Science 261, 921-3 (1993).
Blanchard, J.W. et al. Reconstruction of the human blood-brain barrier in vitro reveals a pathogenic mechanism of APOE4 in pericytes. Nat Med 26, 952-963 (2020).
Montagne, A. et al. APOE4 leads to blood-brain barrier dysfunction predicting cognitive decline. Nature 581, 71-76 (2020).
Charng, W.L. et al. Exome sequencing in mostly consanguineous Arab families with neurologic disease provides a high potential molecular diagnosis rate. BMC Med Genomics 9, 42 (2016).
Ly, S. & Naidoo, N. Loss of DmGluRA exacerbates age-related sleep disruption and reduces lifespan. Neurobiol Aging 80, 83-90 (2019).
Abd El-Aziz, M.M. et al. EYS, encoding an ortholog of Drosophila spacemaker, is mutated in autosomal recessive retinitis pigmentosa. Nat Genet 40, 1285-7 (2008).
Messchaert, M. et al. Eyes shut homolog is important for the maintenance of photoreceptor morphology and visual function in zebrafish. PLoS One 13, e0200789 (2018).
Mari, D. et al. Hypercoagulability in centenarians: the paradox of successful aging. Blood 85, 3144-9 (1995).
Huffman, J.E. et al. Whole genome analysis of plasma fibrinogen reveals population-differentiated genetic regulators with putative liver roles. medRxiv (2023).
Lin, J.R. et al. Rare genetic coding variants associated with human longevity and protection against age-related diseases. Nat Aging 1, 783-794 (2021).
Arai, Y. et al. Inflammation, But Not Telomere Length, Predicts Successful Ageing at Extreme Old Age: A Longitudinal Study of Semi-supercentenarians. EBioMedicine 2, 1549-58 (2015).
Arai, Y. et al. Behavioral changes and hygiene practices of older adults in Japan during the first wave of COVID-19 emergency. BMC Geriatr 21, 137 (2021).
Sasaki, T. et al. Status and physiological significance of circulating adiponectin in the very old and centenarians: an observational study. Elife 12(2023).
Hozawa, A. et al. Study Profile of the Tohoku Medical Megabank Community-Based Cohort Study. J Epidemiol 31, 65-76 (2021).
Kuriyama, S. et al. Cohort Profile: Tohoku Medical Megabank Project Birth and Three-Generation Cohort Study (TMM BirThree Cohort Study): rationale, progress and perspective. Int J Epidemiol 49, 18-19m (2020).
Tadaka, S. et al. jMorp updates in 2020: large enhancement of multi-omics data resources on the general Japanese population. Nucleic Acids Res 49, D536-D544 (2021).
Sasaki, T. et al. Association among extracellular superoxide dismutase genotype, plasma concentration, and comorbidity in the very old and centenarians. Sci Rep 11, 8539 (2021).
DePristo, M.A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43, 491-8 (2011).
O'Connell, J. et al. A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet 10, e1004234 (2014).
Howie, B.N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 5, e1000529 (2009).
Fairley, S., Lowy-Gallego, E., Perry, E. & Flicek, P. The International Genome Sample Resource (IGSR) collection of open human genomic variation resources. Nucleic Acids Res 48, D941-D947 (2020).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81, 559-75 (2007).
Turner, S.D. qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. Journal of Open Source Software 3, 1-2 (2018).
Pruim, R.J. et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336-7 (2010).
Ishigaki, K. et al. Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat Genet 52, 669-679 (2020).
Shigemizu, D. et al. Ethnic and trans-ethnic genome-wide association studies identify new loci influencing Japanese Alzheimer's disease risk. Transl Psychiatry 11, 151 (2021).
Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun 8, 1826 (2017).
Finucane, H.K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet 47, 1228-35 (2015).
Kanai, M. et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat Genet 50, 390-400 (2018).
Ge, T., Chen, C.Y., Ni, Y., Feng, Y.A. & Smoller, J.W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat Commun 10, 1776 (2019).
Folstein, M.F., Folstein, S.E. & McHugh, P.R. "Mini-mental state". A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res 12, 189-98 (1975).
Yesavage, J.A. et al. Development and validation of a geriatric depression screening scale: a preliminary report. J Psychiatr Res 17, 37-49 (1982).
Rockwood, K. & Mitnitski, A. Frailty in relation to the accumulation of deficits. J Gerontol A Biol Sci Med Sci 62, 722-7 (2007).
Bipolar, D., Schizophrenia Working Group of the Psychiatric Genomics Consortium. Electronic address, d.r.v.e., Bipolar, D. & Schizophrenia Working Group of the Psychiatric Genomics, C. Genomic Dissection of Bipolar Disorder and Schizophrenia, Including 28 Subphenotypes. Cell 173, 1705-1715 e16 (2018).
Marioni, R.E. et al. GWAS on family history of Alzheimer's disease. Transl Psychiatry 8, 99 (2018).

Table 1 is available in the Supplementary Files section.

There is NO Competing Interest.

Download PDF

Version 1

posted

You are reading this latest preprint version

Genetic determinants of centenarian longevity, as quantified by the 'CentPGS' score, are associated with a lower risk of multiple age-related diseases and a longer healthspan.

Status:

Version 1

Abstract

Figures

Introduction

Results

Discussion

Methods

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Version 1