Assessment of polygenic risk score performance in East Asian populations for ten common diseases: A Korean cohort study

doi:10.21203/rs.3.rs-4781909/v1

Download PDF

Article

Assessment of polygenic risk score performance in East Asian populations for ten common diseases: A Korean cohort study

https://doi.org/10.21203/rs.3.rs-4781909/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Polygenic risk score (PRS) uses genetic variants to assess disease susceptibility. While PRS performance is well-studied in Europeans, its accuracy in East Asians is less explored. This study compared East Asian PRS-continuous shrinkage (PRS-CS) from single-population genome-wide association studies (GWAS) with transferability PRS (PRS-CSx) integrating European and East Asian GWAS for ten common diseases in the Health Examinees (HEXA) cohort (n = 55,870) in Korea. PRS-CSx showed significant transferability, improving predictive metrics: likelihood ratio test (LRT) [1.31-fold], odds ratio per 1 standard deviation (perSD OR) [1.04-fold], and net reclassification improvement (NRI) [1.24-fold]. The difference in R² values between PRS-CS and PRS-CSx, analyzed using the r2redux method, was statistically significant across eight diseases, demonstrating an average increase of 0.35% in R² for PRS-CSx. Additionally, we compared the relative performance of these East Asian PRSs with their respective European PRSs for seven diseases, resulting in an average performance of 85.69%. Our findings indicate that while transferability enhances the performance of East Asian PRSs, large-scale East Asian GWAS data are essential to bridge the performance gap with European PRSs for effective disease prediction in East Asian populations.

Biological sciences/Genetics/Population genetics/Genetic variation

Health sciences/Diseases

Genome-wide association studies (GWASs) have revolutionized our understanding of complex traits by identifying a significant number of genetic variants associated with their expression^{1, 2, 3}. However, individual genetic variants often contribute modestly to phenotypic variation, even in highly heritable traits⁴. This emphasizes the polygenic nature of the most complex traits, in which numerous genetic variances with small effects collectively influence the trait variance⁵. Consequently, polygenic risk score (PRS) has emerged as a valuable predictive tool. The PRS aggregates risk information from numerous genetic variants and offers a cumulative measure of an individual’s genetic susceptibility to a disease⁶. This field is rapidly progressing with advances in methods⁷ and cataloging⁸.

The PRS demonstrates the potential to stratify individuals based on disease susceptibility in Europeans^{9, 10}. The PRS estimation revealed a significant increase in risk in the high-risk group. Specifically, individuals in the top 8.0% for coronary artery disease, 6.1% for atrial fibrillation, 3.5% for type 2 diabetes, 3.2% for inflammatory bowel disease, and 1.5% for breast cancer experienced a three-fold increased risk compared to the remaining group⁹. Additionally, significant differences in obesity prevalence (body mass index, [BMI] ≥ 30 kg/m²) were observed across the deciles of PRS for BMI¹⁰. However, studies on this PRS type have not been well explored beyond the European ethnicity.

Recently, large-scale GWAS have been expanded to include other ethnic groups¹¹. East Asian GWAS summary statistics were reported as the second highest in number, following the European GWAS Catalog database¹². GWASs were conducted on 220 traits using data from BioBank Japan (BBJ), comprising 170,000 Japanese, the largest sample of East Asians ever studied for GWAS¹³. However, a significant limitation in PRS prediction stems from the smaller sample size of East Asians than that of the Europeans¹⁴. A meta-analysis of standing height was performed using a sample size of 4,080,687 Europeans and 472,730 East Asians¹⁴. This variation in sample size affected the statistical power, resulting in a disparity in SNP heritability (50% for European height heritability vs. 35% for East Asian height heritability). SNP heritability is closely associated with PRS performance metrics, such as the correlation (R²) between PRS and trait^{6, 15}. Consequently, it may be challenging to achieve predictive accuracy like that observed in European studies.

Therefore, leveraging well-analyzed European GWAS data is crucial to achieve higher predictive performance of the PRS for East Asians¹⁶. Practical challenges arise because of variations in linkage disequilibrium (LD) patterns between East Asian and European populations, rendering the direct utilization of European GWAS data for PRS estimation in East Asians impractical¹⁷. One of the existing PRS transferability methods involves the meta-analysis of GWAS summary statistics across multiple populations using the inverse-variance method and subsequently constructing a PRS using the independent single nucleotide polymorphisms (SNPs) that exhibit statistical significance through the P + T method^{13, 16, 18, 19}. However, this approach lacks the incorporation of population-specific alleles, frequencies, and LD patterns. To address these limitations, a PRS-CSx method was developed for PRS transferability to non-Europeans¹⁶. This approach integrates GWAS data from various ethnic groups with large-scale GWAS data from Europeans to assess the PRS for non-Europeans. The transferability method of PRS-CSx employs a Bayesian technique to enhance the accuracy of PRS prediction by considering genetic effects and LD diversity across distinct ethnic groups¹⁶. By leveraging the relationships between genetic associations and LD patterns in distinct ethnic groups, the PRS-CSx effectively increases the effective sample size while accommodating specific genetic variations within each ethnic group¹⁶.

Recently, the PRSs in East Asians were assessed using the transferability methods of PRS^{20, 21}. The predictive performance of diverse PRSs for type 2 diabetes was assessed using diverse transferability methods, such as PRS-CSx, PRCS-meta, and Ldpred2-meta. Among these methods, PRS-CSx significantly increased the risk of type 2 diabetes in East Asians²¹. PRSs in East Asians were assessed using the PRS-CS and PRS-CSx in inflammatory bowel disease (IBD), Crohn's disease (CD), and ulcerative colitis (UC)²⁰. It was observed that the PRS-CSxs demonstrated risk for CD (8.0%), IBD (6.5%), and UC (5.5%) on a liability scale in the Chinese population. In contrast, PRS-CS, trained only with East Asian GWAS data, exhibited a lower risk of CD (6.4%), IBD (4.7%), and UC (3.2%). Compared with PRS-CS, PRS-CSx exhibited an average enhancement of 1.5% in risk prediction for these diseases²⁰. Therefore, it is essential to assess the transferability of PRSs for various common diseases in diverse East Asian ethnic groups to confirm their superiority over conventional PRSs.

In this study, we assessed the predictive performance of East Asian PRSs, including PRS-CS and PRS-CSx for ten common diseases in a Health Examinees (HEXA) East Asian cohort in Korea²², comprising a sample size of 58,700 Koreans. We assessed the predictive performance of PRSs using three distinct statistical methods: LRT to assess the PRS goodness of fit, perSD OR to quantify the risk increase associated with PRS, NRI to quantitatively assess individual risk prediction enhancement, and r2redux analysis to compare the R² difference between PRSs. Additionally, we assessed East Asian PRSs using follow-up data from the Korea Association Resource (KARE)²³, comprising a sample size of 8,840 through Cox regression analysis.

Basic characteristics

The baseline characteristics of participants in the HEXA cohort²² are listed in Supplementary Table 1. This study included 58,700 Korean individuals (65.43% female) with an average age of 53.80 years, a mean height of 160.72 cm, and an average BMI of 23.89 kg/m². We selected ten common diseases based on having a prevalence greater than 1% in the HEXA cohort (Table 1). The prevalence rates ranged from 1.18% (stroke) to 45.91% (hypertension). Additionally, data on the risk factors for each disease were included based on the Mayo Clinic guidelines (https://www.mayoclinic.org/). These basic characteristics of participants are summarized in Table 1. The BMI and age of all diseases were higher in the cases than in the controls. For coronary artery disease (CAD), osteoporosis, and stroke, family history frequencies were higher in the cases than in the controls. We also observed that the risk factors exhibited significant frequencies or values in an unfavorable direction. For example, compared to controls, cases exhibited significant risk factors for stroke, such as systolic blood pressure (122.40 ± 14.77 vs. 127.21 ± 15.05), diastolic blood pressure (75.75 ± 9.73 vs. 77.03 ± 9.66), high density lipoprotein (53.80 ± 13.15 vs. 49,75 ± 12.09), coronary artery disease (2.80% vs. 7.22%), and type 2 diabetes (8.57% vs. 21.65%).

Table 1

Basic characteristics for ten common diseases in HEXA cohort.
Disease (Prevalence%)	Demographic data and clinical data	Case	Control	P^a
Asthma (1.67%)	Sample size	977	57,644
	Age	55.42 ± 8.40	53.37 ± 8.01	1.59E-09
	Female (%)	71.03	65.34
	Body mass index	24.28 ± 3.24	23.88 ± 2.87	1.55E-04
	White blood cell	6.01 ± 1.71	5.69 ± 1.54	1.17E-06
	Exposure to secondhand smoke (%)	26.62	23.83
Cataract (3.53%)	Sample size	2,070	56,559
	Age	61.83 ± 6.35	53.50 ± 7.92	< 2.20E-16
	Female (%)	59.08	65.67
	Body mass index	24.27 ± 2.86	23.87 ± 2.88	1.17E-09
	Type 2 diabetes (%)	21.59	8.25
	Systolic blood pressure	125.83 ± 14.84	122.33 ± 4.76	< 2.00E-16
Cholelithiasis (3.04%)	Sample size	1,784	56,844
	Age	57.08 ± 7.51	53.70 ± 8.01	< 2.20E-16
	Female (%)	60.03	65.60
	Body mass index	24.40 ± 2.97	23.87 ± 2.87	2.22E-13
	Type 2 diabetes (%)	15.08	8.53
	Aspartate aminotransferase	25.08 ± 13.25	23.71 ± 22.70	3.66E-05
	Alanine aminotransferase	24.68 ± 20.29	23.32 ± 22.39	2.05E-06
	Alkaline Phosphatase	195.19 ± 96.43	180.10 ± 96.88	3.81E-08
Colon polyp (5.69%)	Sample size	3,336	55,276
	Age	57.08 ± 7.17	53.60 ± 8.02	< 2.20E-16
	Female (%)	49.28	66.42
	Body mass index	24.16 ± 2.78	23.87 ± 2.88	4.59E-09
Coronary artery disease (2.85%)	Sample size	1,671	56,954
Coronary artery disease (2.85%)	Age	59.88 ± 6.77	53.62 ± 7.98	< 2.20E-16
	Female (%)	47.94	65.95
	Body mass index	24.89 ± 2.94	23.86 ± 2.87	< 2.20E-16
	Family history of heart disease (%)	13.92	7.26
	Systolic blood pressure	124.65 ± 14.56	122.39 ± 14.78	5.06E-10
	High density lipoprotein	49.16 ± 11.96	53.89 ± 13.16	< 2.20E-16
	Triglycerides	129.88 ± 81.82	124.95 ± 85.59	1.55E-02
	Type 2 diabetes (%)	21.96	8.33
Hypertension (45.91%)	Sample size	17,073	20,112
	Age	57.36 ± 7.45	51.13 ± 7.53	< 2.20E-16
	Female (%)	56.52	76.47
	Body mass index	25.02 ± 2.94	22.86 ± 2.58	< 2.20E-16
	Systolic blood pressure	135.01 ± 14.63	108.07 ± 7.33	< 2.20E-16
	Diastolic blood pressure	83.12 ± 9.90	67.26 ± 5.96	< 2.20E-16
Obesity (32.20%)	Sample size	18,895	39,793
	Age	54.76 ± 7.97	53.35 ± 8.00	< 2.20E-16
	Female (%)	57.35%	69.26%
	Body mass index	27.13 ± 1.95	22.35 ± 1.75	< 2.20E-16
	Have you consistently engaged in vigorous exercise to the point of perspiration?	53.81%	55.07%
	Dietary energy (kcal) intake over a single day	1779.02 ± 561.62	1727.52 ± 545.82	< 2.20E-16
Osteoporosis (5.24%)	Sample size	3,074	55,537
	Age	59.48 ± 7.97	53.48 ± 7.97	< 2.00E-16
	Female (%)	95.67	63.77
	Body mass index	23.49 ± 2.84	23.91 ± 2.88
	Family history of osteoporosis (%)	9.56	4.67
	Height	155.26 ± 5.83	161.03 ± 7.92	< 2.00E-16
Stroke (1.18%)	Sample size	693	57,940
	Age	59.80 (6.91)	53.73 (8.00)	< 2.00E-16
	Female (%)	46.03	65.66
	Body mass index	24.57 (2.71)	23.88 (2.88)	6.84E-11
	Family history of stroke (%)	27.97	13.29
	Systolic blood pressure	127.21 ± 15.05	122.40 ± 14.77	3.22E-16
	Diastolic blood pressure	77.03 ± 9.66	75.75 ± 9.73	5.31E-04
	High density lipoprotein	49.75 ± 12.09	53.80 ± 13.15	< 2.00E-16
	Coronary artery disease (%)	7.22	2.80
	Type 2 diabetes (%)	21.65	8.57
Type 2 diabetes (10.44%)	Sample size	4,982	42,756
	Age	57.86 (7.37)	52.97 (7.96)	< 2.00E-16
	Female (%)	50.36	70.21
	Body mass index	24.99 (3.06)	23.58 (2.78)	< 2.00E-16
	Family history of type 2 diabetes (%)	64.44	16.97
	Fasting glucose	135.35 (40.48)	87.81 (6.86)	< 2.00E-16
	High density lipoprotein	48.97 (11.91)	54.70 (13.21)	< 2.00E-16
a. means that the statistical significance of analyzing the differences in each variable between the case and control groups using a t-test

Predictive performance of PRS calculated using PRS-CS in the HEXA cohort

We calculated the PRSs for ten common diseases using the East Asian GWASs and the PRS-CS method. The GWAS summary statistics for East Asians were obtained from the GWAS Catalog (https://www.ebi.ac.uk/gwas/), and the sample size of GWAS summary statistics is listed in Supplementary Table 2^{13, 24, 25}. The sample sizes of GWAS summary statistics ranged from 51,442 (CAD) to 341,204 (asthma). The majority of GWAS summary statistics, excluding those for asthma, were obtained from Japanese datasets^{13, 25}. For asthma, we used the Meta GWAS summary statistics provided by the Global BioBank Meta-analysis Initiative (GBMI) (n = 341,204)²⁴.

We assessed the performance of East Asian PRSs (PRS-CSs) in the HEXA cohort (n = 55,870). We used three different statistical methods for predictive performance: 1) LRT to assess the fit of the logistic regression model for PRSs; 2) perSD OR to quantify the effect size of the PRS; and 3) NRI to assess the enhancement of individual classification. The results are presented in Table 2.

Table 2

Predictive performance metrics for PRS-CS.
		LRT^a		perSD OR^b				NRI^c
Disease	Method	Deviance	P	perSD OR	BETA	SE^d	P	NRI	Lower CI^e (95%)	Upper CI^e (95%)	P
Asthma	PRS-CS	55.15	1.12E-13	1.27	0.24	0.03	1.10E-13	0.21	0.12	0.30	< 2.20E-16
Cataract		11.85	5.78E-04	1.08	0.08	0.02	5.81E-04	0.11	0.05	0.17	3.80E-04
Cholelithiasis		32.80	1.02E-08	1.15	0.14	0.02	1.05E-08	0.08	0.01	0.15	1.66E-02
Colon polyp		24.39	7.86E-07	1.09	0.09	0.02	7.95E-07	0.06	0.01	0.10	2.64E-02
Coronary artery disease		93.71	< 2.20E-16	1.28	0.24	0.03	< 2.20E-16	0.21	0.14	0.28	< 2.20E-16
Hypertension		2039.70	< 2.20E-16	1.71	0.54	0.01	< 2.20E-16	0.42	0.39	0.44	< 2.20E-16
Obesity		2644.60	< 2.20E-16	1.61	0.48	0.01	< 2.20E-16	0.36	0.34	0.39	< 2.20E-16
Osteoporosis		21.11	4.34E-06	1.09	0.09	0.02	4.53E-06	0.09	0.03	0.14	1.04E-03
Stroke		11.91	5.57E-04	1.14	0.13	0.04	5.61E-04	0.09	-0.02	0.20	9.48E-02
Type 2 diabetes		2069.50	< 2.20E-16	2.16	0.77	0.02	< 2.20E-16	0.55	0.51	0.59	< 2.20E-16
a. LRT: Likelihood ratio test;
b. perSD OR: Odds ratio per 1 SD PRS;
c. NRI: Net reclassification improvement;
d. SE: Standard error;
e. CI: confidence interval

In Table 2, the term “deviance” for LRT indicates the goodness of fit by comparing the models with and without PRS-CS. It represents how well a model fits a given dataset and is calculated as the difference in the log probability between the two models. All PRSs were statistically significant (P < 5.00E-03; 0.05/10). The PRS for obesity exhibited the highest deviance in LRT (2,644.60), whereas the PRS for cataracts exhibited the lowest deviance in LRT (11.85). All the PRSs were statistically significant for per SD OR (P < 5.00E-03; 0.05/10) (Table 2). The perSD OR was the highest for type 2 diabetes (2.03), whereas cataracts exhibited the lowest (1.08). For the NRI, PRSs for seven diseases, such as asthma, cataracts, CAD, hypertension, obesity, osteoporosis, and type 2 diabetes were statistically significant (P < 5.00E-03; 0.05/10), while the other three diseases, cholelithiasis, colon polyps, and stroke, exhibited marginal statistical significance (P < 5.00E-02) (Table 2).

Predictive performance of transferability PRS in the HEXA cohort

We assessed the PRS transferability of ten common diseases using PRS-CSx¹⁶, which re-estimates the SNP effect size from both East Asian and European GWAS using the Bayesian technique. The PRS-CSx method employs a shared “Shared continuous shrinkage prior” effect size to accommodate diverse genetic architectures. The GWAS summary statistics for both Europeans and East Asians were obtained from the GWAS Catalog (https://www.ebi.ac.uk/gwas/) for the transferability of PRS. The East Asian summary statistics used for PRS were the same as those used for the PRS-CS (Supplementary Table 2)^{13, 24, 25, 26, 27, 28, 29, 30}. The sample size of GWAS summary statistics for Europeans ranged from 184,481 (CAD) to 1,339,889 (type 2 diabetes). As anticipated, the sample sizes of the European GWAS summary statistics were higher than those in the East Asia for all diseases (Supplementary Table 2).

We assessed the predictive performance of PRS-CSx in the HEXA cohort (Table 3). All ten PRS-CSxs met the statistical significance of the LRT (P < 5.00E-03; 0.05/10). In the LRT, the PRS-CSx for obesity exhibited the highest deviance (2956.60), while the PRS-CSx for cataracts exhibited the lowest deviance (12.36). Additionally, all PRS-CSxs met the statistical significance of the perSD OR, with type 2 diabetes exhibiting the highest (2.10) and cataracts the lowest values (1.08). All PRS-CSxs satisfied the statistical significance of the NRI.

Table 3. Predictive performance for the transferability PRS (PRS-CSx)

		LRT^a		perSD OR^b				NRI^c
Disease	Method	Deviance	P	perSD OR	BETA	SE^d	P	NRI	Lower CI^e (95%)	Upper CI^e (95%)	P
Asthma	PRS-CSx	69.27	< 2.20E-16	1.31	0.27	0.03	< 2.20E-16	0.20	0.11	0.29	1.00E-05
Cataract		12.36	4.39E-04	1.08	0.08	0.02	4.41E-04	0.11	0.05	0.17	3.40E-04
Cholelithiasis		52.11	5.26E-13	1.19	0.17	0.02	5.62E-13	0.12	0.05	0.18	6.50E-04
Colon polyp		43.27	4.76E-11	1.13	0.12	0.02	5.02E-11	0.09	0.04	0.14	5.70E-04
Coronary artery disease		114.99	< 2.20E-16	1.31	0.27	0.03	< 2.20E-16	0.24	0.17	0.31	< 2.20E-16
Hypertension		2562.70	< 2.20E-16	1.84	0.61	0.01	< 2.20E-16	0.47	0.45	0.50	< 2.20E-16
Obesity		2956.60	< 2.20E-16	1.66	0.50	0.01	< 2.20E-16	0.39	0.36	0.41	< 2.20E-16
Osteoporosis		24.44	7.68E-07	1.14	0.13	0.02	6.43E-12	0.10	0.05	0.16	7.00E-05
Stroke		18.45	1.74E-05	1.18	0.16	0.04	1.74E-05	0.16	0.06	0.27	2.67E-03
Type 2 diabetes		2521.80	< 2.20E-16	2.26	0.81	0.02	< 2.20E-16	0.57	0.53	0.61	< 2.20E-16

a. LRT: Likelihood ratio test;

b. perSD OR: Odds ratio per 1 SD PRS;

c. NRI: Net reclassification improvement;

d. SE: Standard error;

e. CI: confidence interval

Comparison between the PRS-CS and PRS-CSx in the HEXA cohort

We assessed whether PRS-CSx enhanced the predictive performance compared to PRS-CS (Fig. 1 and Supplementary Table 3). We calculated the ratio of predictive performance metrics between the PRS-CSx and PRS-CS. The PRS-CSxs exhibited increased predictive performance for all diseases except asthma. Colonic polyps exhibited the highest LRT ratio (1.77), hypertension the highest perSD OR ratio (1.07), and stroke the highest NRI ratio (1.79). Additionally, the PRS-CSs did not achieve statistical significance for NRI based on Bonferroni correction for stroke, cholelithiasis, and colon polyps. However, the PRS-CSx values for these diseases were statistically significant.

To assess the statistical significance of the increased predictive performance of PRS-CSxs, we performed an r2redux analysis between PRS-CS and PRS-CSx in the HEXA cohort³¹. This analysis calculated the variance and covariance of R² for each PRS, thereby facilitating the estimation of the 95% confidence interval (CI) and P-value for the difference between PRS-CS and PRS-CSx.

Initially, we assessed the R² and variance of R² using r2redux for each PRS. R² ranged from 0.0023 (asthma) to 0.2150 (hypertension) for PRS-CS and from 0.0025 (asthma) to 0.2270 (hypertension) for PRS-CSx (Supplementary Table 4).

Subsequently, we calculated the difference in R² between the PRS-CS and PRS-CSx using r2redux method (Supplementary Table 5). Among the ten diseases, eight exhibited statistically significant differences in R² (P < 5.00E-03; 0.05/10), demonstrating a higher R² for PRS-CSxs than that for PRS-CSs. The highest difference in the R² value was observed for hypertension (0.01201), whereas the lowest difference was observed for stroke (0.00029). The average increases in the R² and perSD OR values for PRS-CSx were 0.28% and 1.04-fold, respectively, which were higher than those for PRS-CS.

Comparison between the PRS-CSx and European PRS

To compare the predictive performance of East Asian PRSs (PRS-CS and PRS-CSx) with that of the European PRS, we used the polygenic score (PGS) Catalog database (https://www.pgscatalog.org/) and previous studies (Table 4 and Supplementary Table 6)⁸. Among the ten diseases, the performance metrics and per SD OR of the European PRS results were available for only eight diseases in both the PGS Catalog and previous studies. The perSD OR results are presented in Table 4.

Among the eight diseases, four-asthma, cataract, coronary artery disease, and stroke-demonstrated that the perSD ORs of the East Asian PRSs were within the European PRS value range. Obesity and osteoporosis did not reach the European PRS value range, whereas stroke and type 2 diabetes demonstrated significant performance compared to the European PRSs (Table 4). We assessed the relative performance of East Asian PRS-CSxs compared with European PRSs by calculating the percentage ratio between the maximum perSD OR computed from European PRS and the per SD OR computed from PRS-CSx. This indicated that the average performance of East Asian PRSs, as measured by the perSD OR across all eight diseases, was equivalent to 85.69% of that of European PRSs.

Table 4

Results of comparison between the PRSs for perSD OR
Disease	PRS-CS^a	PRS-CSx^b	Reported results of European PRS^c
Asthma	1.27	1.31	1.16 ~ 1.73
Cataract	1.08	1.08	1.04 ~ 1.12
Coronary artery disease	1.28	1.31	1.26 ~ 2.14
Hypertension	1.71	1.84	1.50 ~ 1.94
Obesity	1.61	1.66	2.08 ~ 3.50
Osteoporosis	1.09	1.14	1.31
Stroke	1.14	1.18	1.07 ~ 1.15
Type 2 diabetes	2.16	2.26	1.44 ~ 1.88
a. The results of PRS-CS are summarized in Table 2;
b. The results of PRS-CSx are summarized in Table 3;
c. The reported results of European PRS are summarized in Supplementary Table 6

Predictive performance of East Asian PRSs (PRS-CS and PRS-CSx) in the follow-up data

We assessed the performance of PRSs over time using the follow-up data from the KARE cohort²³. Supplementary Table 1 presents the baseline characteristics of the participants in the KARE cohort, which comprised 8,840 Koreans, with 52.69% females. The participants were aged between 40–69 years (average; 52.22 years). Data collection in the KARE cohort commenced in 2001, and follow-up examinations were conducted every two years, totaling seven examinations over a span of 14 years²². The analysis of follow-up data every two years revealed a novel incidence of diseases (Methods and Supplementary Table 7). Owing to the variations in diseases collected through KARE from HEXA, we were able to assess the predictive performance of PRSs for seven diseases in the KARE follow-up data (Supplementary Table 7).

We assessed the predictive performance of the PRSs using a Cox regression model adjusted for age and sex in the follow-up data (Table 5). Both the PRS-CS and PRS-CSx exhibited statistical significance (P < 7.14E-03, 0.05/7) for asthma, hypertension, obesity, and type 2 diabetes. However, no distinct variation was observed in the performance between the PRS-CS and PRS-CSx groups in the follow-up data.

Furthermore, we compared the performance of the East Asian and European PRSs using follow-up data. Among the four diseases, such as asthma, hypertension, obesity, and type 2 diabetes, which displayed statistical significance in hazard ratio, European PRS performance metrics were available for only three of them: asthma, obesity, and type 2 diabetes. These were documented in the PGS Catalog and in previous studies. The comparison results are presented in Table 6. Specifically, the East Asian PRS for asthma exhibited superior performance, whereas those for obesity and type 2 diabetes were within the range of values observed for European PRSs.

Table 5. Comparative evaluation of three PRSs methods using Cox regression analysis in KARE.
	PRSCS					PRSCSX
Disease	Coefficients	HR^a	SE^b	P	Coefficients		HR^a	SE^b	P
Asthma	0.3230	1.3812	0.0910	3.86E-04	0.2490		1.2827	0.0912	6.34E-03
Coronary artery disease	0.1175	1.1247	0.0645	6.87E-02	0.0987		1.1037	0.0647	1.27E-01
Hypertension	0.2269	1.2547	0.0508	8.02E-06	0.2279		1.2559	0.0508	7.17E-06
Obesity	0.1892	1.2083	0.0350	6.30E-08	0.2058		1.2286	0.0352	5.10E-09
Osteoporosis	0.0605	1.0624	0.0624	3.32E-01	0.0200		1.0202	0.0613	7.44E-01
Stroke	0.1385	1.1485	0.6678	3.82E-02	0.1296		1.1383	0.0670	5.32E-02
Type 2 diabetes	0.3023	1.3530	0.0383	2.79E-15	0.3172		1.3733	0.0376	< 2.00E-16
a. HR: Hazard ratio; b. SE: Standard error

Table 6. Comparison between of hazard ratio of the European and East Asian PRSs　

Disease	PRS-CS^a	PRS-CSx^b	Reported results of European PRS^c
Asthma	1.38	1.28	1.12 ~ 1.17
Obesity	1.21	1.23	1.26 ~ 1.45
Type 2 diabetes	1.35	1.37	1.29 ~ 2.00
a. The results of PRS-CS are shown in Table 5;
b. The results of PRS-CSx are shown in Table 5; c. The reported results of European PRS are shown in Supplementary Table 6

We assessed and compared the predictive performance of East Asian PRSs, including the PRS-CSx. Using the HEXA Korean cohort (n = 55,870), we demonstrated that PRS-CSx enhanced the predictive performance compared with PRS-CS for most diseases. This demonstrated significant results for LRT (1.31-fold on average), perSD OR (1.03-fold on average), NRI (1.23-fold on average), and R² (0.35% increase on average) for most diseases. Among all analyzed diseases, hypertension and type 2 diabetes showed the most significant improvements in predictive performance. Additionally, our results showed that the performance of East Asian PRSs was similar to that of the European PRSs, achieving an average equivalence of 85.69%.

The most significant contributor to the predictive performance of PRS was the SNP heritability for traits^{6, 15}. To reveal significant heritability, a substantial number of cases are essential for a GWAS³². Despite East Asian GWASs having the second-highest sample number following European¹², limitations persist owing to the small sample size of East Asian PRSs^{13, 14, 26}. Leveraging large-scale GWAS data from Europeans, there is potential for the PRS transferability of East Asians to exhibit significant predictive performance compared to East Asian GWAS data-based PRS¹⁶. The enhanced predictive performance of PRS-CSx over PRS-CS, as demonstrated by Liu et al., was using only East Asian GWAS data. A modest enhancement of 1.5% on average in disease risk prediction based on the liability scale R² was observed in the Chinese population²⁰. Our findings also exhibited an increase, but to a lesser extent, under conditions similar to those of previous studies. On average, there was a 0.41% increase in Nagelkerke's R² for PRS-CSx compared with PRS-CS for ten common diseases (Supplementary Table 8). The relatively small enhancement in transferability observed in this study may be due to the differences in sample sizes of the East Asian GWASs used in both studies. In our case, we used the GWAS generated from the BBJ cohort (> 170,000), while Liu et al. utilized GWAS performed with a larger East Asian sample size (> 350,000)²⁰. Additionally, our results indicated that PRS-CSx modestly enhanced the predictive performance of LRT (1.31-fold on average), perSD OR (1.03-fold on average), NRI (1.23-fold on average), and R² difference estimated through r2redux (0.28% increase on average) compared with PRS-CS. Because Liu et al. did not furnish these metrics, we were unable to compare our degree of enhancement with that of their study. In the other study, Ge et al calculated the PRS for type 2 diabetes using the PRS-CSx from the Taiwan BioBank dataset²¹. The perSD OR in diverse PRSs for type 2 diabetes ranging from 2.01– 2.19 was assessed. Our study yielded similar results within this range, with 2.16 in the HEXA cohort.

The enhanced predictive performance of the East Asian PRS-CSx highlights its effectiveness in predicting the genetic risk of diseases in East Asian populations. We attempted to understand the relative performance of the East Asian PRS-CSxs by comparing them to the European PRS using the largest East Asian and European GWAS currently available. To calculate this, we compared the perSD ORs of East Asian PRS-CSxs with the maximum perSD ORs obtained from European PRS for each disease. The East Asian PRS-CSxs exhibited an average performance of 85.69% across eight the diseases for those of the European PRS (Tables 4 and 6). However, stroke and type 2 diabetes exhibited significant performance in the PRS-CSxs compared to the European PRSs. These findings indicate a limitation for the transferability of the increased performance of non-European ethnic PRS by leveraging GWAS from Europeans with a larger sample size. Moreover, they emphasized the requirement for larger-scale East Asian GWAS to bridge the performance gap between European and East Asian PRSs.

Recently, various approaches have been explored to leverage the PRS for clinical utility^{9, 10, 33}. Among these, the classification of high-risk groups using the PRS has been widely applied. Previous studies have assessed the OR between high- and normal-risk groups of PRS for CAD, type 2 diabetes, obesity, and hypertension in Europeans^{9, 10, 33}. The OR of PRS was assessed by comparing the disease prevalence between the high- (top 10% of PRS) and normal-risk groups (40–60% of PRS)³³ and provided the OR for diseases, such as CAD (3.52), hypertension (3.28), and type 2 diabetes (4.27). Similarly, we assessed the OR between high- (top 10% PRS-CSx) and the normal-risk groups (41–60% of PRS-CSx), as summarized in Supplementary Table 9. Our findings demonstrated that the OR for CAD (1.63), hypertension (2.66), and type 2 diabetes (4.25) exhibited less discrimination compared to European PRSs. Additionally, the PRS for BMI was calculated, and the OR between the high-risk group (top 10% of PRS) and the remaining group (1–90% of PRS) for extreme obesity (BMI ≥ 40) was estimated to be 4.22¹⁰. Our findings demonstrated that ORs between the high-and normal-risk groups were 2.47 for obesity (≥ 25 kg/m²), 3.88 for severe obesity (≥ 30 kg/m²), and 12.40 for extreme obesity (≥ 40 kg/m²), indicating that BMI PRS-CSx exhibited significant discrimination of the high-risk group.

Our study had several limitations. First, despite an enhancement in the predictive performance in the transferability of PRS-CSx, we did not explore the underlying reasons. Although the sample size of the GWAS was anticipated to be a primary factor for enhancement, we failed to confirm any correlation between the sample size of the European GWAS summary statistics integrated into the PRS-CSx and the increased performance metrics (Tables S10 and S11). Future research is required to identify the factors that enhance the transferability of PRS to develop a highly accurate transferable PRS. Additionally, KARE cohort's follow-up data had a limitation due to its small sample size. The largest group we analyzed was type 2 diabetes, with 693 patients and 5,090 controls, making a total of 5,783 people. The small sample size of this group suggests that the modest increase in performance metrics evaluated through transferability PRS could be due to the limited data scale. Therefore, there is a need to evaluate transferability PRS with a larger follow-up dataset. Another limitation is the small number of diseases assessed owing to the limited data on diseases in the Korean cohorts, such as HEXA and KARE. Additionally, it is essential to demonstrate the effectiveness of the transferability of PRS in other East Asian countries, including Japan. Finally, we did not assess the applicability of the diverse methods for the transferability of PRS. Specifically, the widely recognized PolyPred method requires a minimum of 50,000 individuals for PRS training using the LD reference panel for its application in addition to the assessment of the PRS³⁴.

In this study, we assessed the predictive performance of PRS-CS and PRS-CSx in East Asians for ten common diseases. We observed an enhancement in the prediction performance of PRS-CSxs for the majority of diseases by integrating large-scale European GWAS summary statistics. However, it appears that transferability has limitations in enhancing non-European PRS, emphasizing the need for increased sample sizes in East Asian GWAS to effectively predict disease risk in East Asian populations.

HEXA (Health Examines)

The HEXA was initiated in 2004 and 173,357 participants, aged over 40 years, were recruited from 38 health examination centers and training hospitals located in eight regions of South Korea²². Of these, 58,700 individuals with genotype data and passing sample quality control criteria were extracted. The sample quality control criteria for exclusion are as follows: a history of cancer, gender inconsistencies, cryptic relatedness, low genotype call rate (< 95%), and sample contamination, as previously described²². All participants were genotyped with the Korean Chip (K-CHIP), which was designed by the Center for Genome Science, Korea National Institute of Health (KNIH), based on the UK Biobank Axiom® Array, and manufactured by Affymetrix. The SNP imputation was carried out using IMPUTE v2³⁵ with 1000 Genomes Phase 3 data as a reference panel.

KARE (Korea Association Resource)

Participants of KARE cohort (n = 8,840) were recruited from two regions in South Korea (Ansan and Ansung) from 2009 to 2012 for the Korean Genome and Epidemiology Study²³. All study participants aged ≥ 40 years provided written informed consent, and approval was obtained from the institutional review board. The exclusion criteria were as follows: history of cancer, gender inconsistencies, cryptic relatedness, low genotype call rate (< 95%), and sample contamination ^{22, 23}. The KARE study utilized the Afymetrix Genome-Wide Human SNP Array GeneChip 5.0. SNP imputation was performed using IMPUTE v2 with the 1000 Genomes Project (haplotype phase 1)³⁵.

Ethics approval and consent to participate

This study was conducted with bioresources from the National Biobank of Korea, the Korea Disease Control and Prevention Agency, Republic of Korea (KBN‐2021‐051).

Disease selections

For hypertension, we selected cases meeting any of the following criteria: systolic blood pressure ≥ 140 mmHg, diastolic blood pressure ≥ 90 mmHg, use of antihypertensive medicines, diagnosis of hypertension, or undergoing treatment for hypertension. Controls were those with systolic blood pressure < 120 mmHg and diastolic blood pressure < 80 mmHg³⁶.

For Type 2 diabetes, cases were selected if they satisfied any of the following criteria: fasting glucose level ≥ 126 mg/dl, 2-hour oral glucose tolerance test (2-hour OGTT) ≥ 200 mg/dl, receiving treatment for type 2 diabetes, or taking medication for condition. Controls were identified as those with fasting glucose level < 100 mg/dl, 2-hour OGTT < 140 mg/dl, and no history of type 2 diabetes treatment and diagnosis³⁷.

For asthma, cataract, cholelithiasis, colon polyp, and stroke, cases were chosen if they met any of these criteria: a diagnosis of each respective disease, taking medication for the same, or undergoing treatment for it. Conversely, controls were selected from those without a diagnosis of any of these diseases.

For coronary artery disease, cases were selected based on the following criteria: a diagnosis of myocardial infarction or angina pectoris, medication for either condition or undergoing treatment for them. Controls were those not having a diagnosis of both myocardial infarction and angina pectoris.

For obesity, cases meeting the criterion of a body mass index ≥ 25 were selected. Controls were identified as those with a body mass index < 25^{38, 39}.

For osteoporosis in HEXA, cases were selected based on these criteria: diagnosis of osteoporosis, taking medication for osteoporosis, or receiving treatment for osteoporosis. Controls were selected based on the criterion of not having a diagnosis of osteoporosis. For osteoporosis in KARE, we selected cases that met the following criteria: for females, a diagnosis of osteoporosis, taking medication for osteoporosis, undergoing treatment for osteoporosis, or having a distal radius T score < -2.6 or midshaft tibia T score < -3.0⁴⁰; for males, a diagnosis of osteoporosis, taking medication for osteoporosis, undergoing treatment for osteoporosis, or having a distal radius T score < -2.5 or midshaft tibia T score < -2.5⁴¹. In contrast, controls for females were defined as having a distal radius T score greater than -1.4 and a midshaft tibia T score of -1.6⁴⁰, and controls for males were defined as having a distal radius and midshaft tibia T score greater than -1.0⁴¹.

PRS-CS

PRS-CS is a Bayesian regression framework that enables “Shared continuous shrinkage priors” on SNP effects to infer their posterior mean effects, which is robust to varying genetic architectures, provides substantial computational advantages, and enables multivariate modeling of local LD patterns⁴². PRS-CS will learn the phi parameter from the discovery GWAS without requiring post-hoc tuning as an auto model. We used the default settings for other parameters. Also, we used the 1000 Genomes reference panel provided by PRS-CS (https://github.com/getian107/PRScs).

PRS-CSx

We used PRS-CSx, a recently developed Bayesian polygenic modeling method, to construct the transferability PRS²¹. PRS-CSx jointly models the two GWAS summary statistics and couples genetic effects across populations using a shared continuous shrinkage prior, which enables more accurate effect size estimation by sharing information between summary statistics and leveraging LD diversity across discovery samples. The shared prior allows for correlated but varying effect size estimates across populations, retaining the flexibility of the modeling framework. In addition, PRS-CSx accounts for population-specific allele frequencies and LD patterns and inherits efficient and robust posterior inference algorithms from PRS-CS. We used pre-computed 1000 Genomes Project reference panels that matched the ancestry of each discovery GWAS, and a fully Bayesian algorithm for model fitting, which automatically learned all model parameters from the summary statistics without the need for hyper-parameter tuning. Also, the PRS-CSx used the 1,259,754 HapMap3 variants information to estimate the PRS. So, we used only HapMap 3 variants in the HEXA (~ 1,150,090 SNPs) and KARE cohort (~ 919,166 SNPs).

Statistical analysis

To investigate the LRT and per SD OR, we used a logistic regression model using R statistical package version 4.1.0, as follows:

Disease (coded as 1 or 0) ~ β₁PRS + β₂age + β₃sex

, where logit(Disease) is the log odds of binary outcome variable disease (coded as 1 for control or 2 for case), range of age is from 40 to 69 and sex is coded as 0 or 1 for female or male.

We assessed its prediction performance metric using the continuous NRI, employing the ‘PredictABEL’ package in R. The formula for calculating the censored NRI when comparing the null model against new model 1 and 2 is as follows:

NRI_i = P (up_{new model i}> null model | Case) – P (down _{new model i}< null model | Case) + P (down _{new model i}< null model | Control) – P (up _{new model i}> null model | Control), where i = 1or 2.

We generated NRI indices for both ‘null model vs. new model 1’ and ‘null model vs. new model 2’ and compared these indices to assess the relative predictive performances. For this analysis, we randomly divided the samples into two equal halves. In one half, we generated the model, while in the other half, we estimated the NRI values.

To statistically investigate incidence data, which involves events occurring over time, we conducted Cox regression analysis using the ‘survival’ package in R.

To investigate mean differences of quantitative variables between cases and controls, we used the student's t-test using R statistical package version 4.1.0.

We depicted the bar plot using ‘ggplot2’ version 3.3.6 in R.

Ethics approval and consent to participate

All participants provided written informed consent to participate in the study. The study was approved by the Institutional Review Board of Kyung Hee University (KHSIRB-21-371[EA]).

Consent for publication

Not applicable.

Availability of data and materials

The description of the GWAS summary statistics used for PRS calculation can be found in Supplementary Table 2. Fig. 1 was constructed using the data provided in Supplementary Table 3. All data supporting the findings of this study are available within the paper and its supplementary information files. This paper does not report custom code or software. All computational tools utilized in this publication have been mentioned in the methodology section and can be accessed through their respective publications.

Conflict of interest

Prof. Oh, Drs. Lim, Kang, and Mr. Jung are leaders of Mendel, a genomics healthcare company with an interest in the application of genetics to precision health. The authors declare no conflicts of interest.

Funding

This research was supported by the Bio & Medical Technology Development Program of the National Research Foundation (NRF) funded by the Korean government (MSIT) (2019M3E5D3073365).

Author contributions

JEL and BO drafted the research protocol. H-UJ, HJ, JEL, and BO designed the study. H-UJ analyzed the data and wrote the first draft of the manuscript. HJ performed the statistical analysis. JEL and BO revised the manuscript. SYK and JOK provided the technical support. JY designed figures. EJB prepared Supplementary Tables and Supplementary Figures. All authors contributed to the interpretation of the results and critical revision of the manuscript for important intellectual content and approved the final version of the manuscript.

Acknowledgments

This study was conducted with bioresources from the National Biobank of Korea, the Korea Disease Control and Prevention Agency, Republic of Korea (KBN‐2021‐051).

Authors' information

Department of Biomedical Science, Graduate School, Kyung Hee University, Seoul, Republic of Korea

Hae-Un Jung, Hyein Jung, Shin Young Kwon, and Bermseok Oh

Mendel Inc, Seoul, Republic of Korea

Eun Ju Baek, Jaeyoon You, and Bermseok Oh

Department of Biochemistry and Molecular Biology, School of Medicine, Kyung Hee University, Seoul, Republic of Korea

Ji-One Kang, Ji Eun Lim, and Bermseok Oh

McCarthy, M.I., et al.: Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat. Rev. Genet. 9, 356–369 (2008)
Visscher, P.M., et al.: 10 Years of GWAS Discovery: Biology, Function, and Translation. Am. J. Hum. Genet. 101, 5–22 (2017)
Visscher, P.M., Brown, M.A., McCarthy, M.I., Yang, J.: Five years of GWAS discovery. Am. J. Hum. Genet. 90, 7–24 (2012)
Yang, J., Lee, S.H., Goddard, M.E., Visscher, P.M.: GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011)
Gibson, G.: Rare and common variants: twenty arguments. Nat. Rev. Genet. 13, 135–145 (2012)
Choi, S.W., Mak, T.S., O'Reilly, P.F.: Tutorial: a guide to performing polygenic risk score analyses. Nat. Protoc. 15, 2759–2772 (2020)
Ma, Y., Zhou, X.: Genetic prediction of complex traits with polygenic scores: a statistical review. Trends Genet. 37, 995–1011 (2021)
Lambert, S.A., et al.: The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 53, 420–425 (2021)
Khera, A.V., et al.: Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018)
Khera, A.V., et al.: Polygenic Prediction of Weight and Obesity Trajectories from Birth to Adulthood. Cell. 177, 587–596e589 (2019)
Peterson, R.E., et al.: Genome-wide Association Studies in Ancestrally Diverse Populations: Opportunities, Methods, Pitfalls, and Recommendations. Cell. 179, 589–603 (2019)
Sirugo, G., Williams, S.M., Tishkoff, S.A.: The Missing Diversity in Human Genetic Studies. Cell. 177, 1080 (2019)
Sakaue, S., et al.: A cross-population atlas of genetic associations for 220 human phenotypes. Nat. Genet. 53, 1415–1424 (2021)
Yengo, L., et al.: A saturated map of common genetic variants associated with human height. Nature. 610, 704–712 (2022)
Tanigawa, Y., et al.: Significant sparse polygenic risk scores across 813 traits in UK Biobank. PLoS Genet. 18, e1010105 (2022)
Ruan, Y., et al.: Improving polygenic prediction in ancestrally diverse populations. Nat. Genet. 54, 573–580 (2022)
Ding, Y., et al.: Polygenic scoring accuracy varies across the genetic ancestry continuum. Nature. 618, 774–781 (2023)
Choi, S.W., O'Reilly, P.F.: PRSice-2: Polygenic Risk Score software for biobank-scale data. GigaScience 8, (2019)
Willer, C.J., Li, Y., Abecasis, G.R.: METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 26, 2190–2191 (2010)
Liu, Z., et al.: Genetic architecture of the inflammatory bowel diseases across East Asian and European ancestries. Nat. Genet. 55, 796–806 (2023)
Ge, T., et al.: Development and validation of a trans-ancestry polygenic risk score for type 2 diabetes in diverse populations. Genome Med. 14, 70 (2022)
Kim, Y., Han, B.G., Ko, G.E.S.: Cohort Profile: The Korean Genome and Epidemiology Study (KoGES) Consortium. Int. J. Epidemiol. 46, 1350 (2017)
Cho, Y.S., et al.: A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat. Genet. 41, 527–534 (2009)
Zhou, W., et al.: Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease. Cell. genomics. 2, 100192 (2022)
Matsunaga, H., et al.: Transethnic Meta-Analysis of Genome-Wide Association Studies Identifies Three New Loci and Characterizes Population-Specific Differences for Coronary Artery Disease. Circulation Genomic precision Med. 13, e002670 (2020)
Jiang, L., Zheng, Z., Fang, H., Yang, J.: A generalized linear mixed model association tool for biobank-scale data. Nat. Genet. 53, 1616–1621 (2021)
Nikpay, M., et al.: A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130 (2015)
Mahajan, A., et al.: Multi-ancestry genetic study of type 2 diabetes highlights the power of diverse populations for discovery and translation. Nat. Genet. 54, 560–572 (2022)
Evangelou, E., et al.: Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 50, 1412–1425 (2018)
Yengo, L., et al.: Meta-analysis of genome-wide association studies for height and body mass index in approximately 700000 individuals of European ancestry. Hum. Mol. Genet. 27, 3641–3649 (2018)
Momin, M.M., Lee, S., Wray, N.R., Lee, S.H.: Significance tests for R(2) of out-of-sample prediction using polygenic scores. Am. J. Hum. Genet. 110, 349–358 (2023)
O'Connor, L.J.: The distribution of common-variant effect sizes. Nat. Genet. 53, 1243–1249 (2021)
Thompson, D.J., et al.: UK Biobank release and systematic evaluation of optimised polygenic risk scores for 53 diseases and quantitative traits. medRxiv, (2022). 2022.2006.2016.22276246
Weissbrod, O., et al.: Leveraging fine-mapping and multipopulation training data to improve cross-population polygenic risk scores. Nat. Genet. 54, 450–458 (2022)
Howie, B.N., Donnelly, P., Marchini, J.: A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009)
Jung, H., Lee, G., Lim, K., Shin, S.: Association of milk consumption with management and incidence of hypertension among South Korean adults: A prospective analysis of the health examinees study cohort. Nutr. metabolism Cardiovasc. diseases: NMCD. 32, 2515–2525 (2022)
Lim, J.E., et al.: Gene-environment interaction in type 2 diabetes in Korean cohorts: Interaction of a type 2 diabetes polygenic risk score with triglyceride and cholesterol on fasting glucose levels. Genet. Epidemiol. 46, 285–302 (2022)
Organization, W.H.: The Asia-Pacific perspective: redefining obesity and its treatment. (2000)
Jung, H.U., et al.: Identification of genetic loci affecting body mass index through interaction with multiple environmental factors using structured linear mixed model. Sci. Rep. 11, 5001 (2021)
Knapp, K.M., Blake, G.M., Spector, T.D., Fogelman, I.: Can the WHO definition of osteoporosis be applied to multi-site axial transmission quantitative ultrasound? Osteoporosis international: a journal established as result of cooperation between the European Foundation for Osteoporosis and the National Osteoporosis Foundation of the USA 15, 367–374 (2004)
Gralow, J.R., et al.: NCCN Task Force Report: Bone Health In Cancer Care. J. Natl. Compr. Cancer Network: JNCCN. 11(3), S1–50 (2013). quiz S51
Ge, T., Chen, C.Y., Ni, Y., Feng, Y.A., Smoller, J.W.: Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019)

There is NO Competing Interest.

2TAHaeunJSupplementaryTable.docx
Supplementary Table 1. Basic characteristics of HEXA and KARE cohorts. Supplementary Table 2. List of GWAS summary statistics used in this study. Supplementary Table 3. Ratios of predictive performance metrics between PRR-CSx and PRS-CS. Supplementary Table 4. Results of R² calculated by r2redux. Supplementary Table 5. Results of differences between R² of two PRSs (PRS-CS and PRS-CSx) in the HEXA cohort. Supplementary Table 6. Summary of European PRS studies identified from the PGS Catalog ( href="https://www.pgscatalog.org/">https://www.pgscatalog.org/). Supplementary Table 7. Disease incidence in KARE cohort during the follow-up of 7 times over 14 years. Supplementary Table 8. Results of Nagelkarke's R² of PRS for ten diseases. Supplementary Table 9. Odds ratio of high-risk group (10%) to normal risk group (41~60%) through PRS-CSx. Supplementary Table 10. Summary of Supplementary Table 2 and Supplementary Table 3. Supplementary Table 11. Results of correlation between performance metric and increased sample size ratio.

Download PDF

Version 1

posted

You are reading this latest preprint version

Assessment of polygenic risk score performance in East Asian populations for ten common diseases: A Korean cohort study

Status:

Version 1

Abstract

Figures

Introduction

Results

Basic characteristics

Predictive performance of PRS calculated using PRS-CS in the HEXA cohort

Predictive performance of transferability PRS in the HEXA cohort

Comparison between the PRS-CS and PRS-CSx in the HEXA cohort

Comparison between the PRS-CSx and European PRS

Predictive performance of East Asian PRSs (PRS-CS and PRS-CSx) in the follow-up data

Discussion

Methods

HEXA (Health Examines)

KARE (Korea Association Resource)

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1