Characteristics of the prostate cancer cohorts
A comparison of patient characteristics (demographic, lifestyle, and clinical) is given in Supplementary results Table 1a. The Taiwan (TW1-advanced, and TW2 - localized prostate cancer groups respectively) cases showed a significantly lower median BMI (24.2 kg/m2 and 24.7 kg/m2 respectively) compared to other cases (27.0 kg/m2 for NZ, 27.7 kg/m2 for US-Africans (US-AA) and 27.5 kg/m2 for US-Europeans (US-EA) and P<0.001). The US-AA cases recorded a significantly higher percentage of ever-smokers compared to NZ and US-EA cases (72% vs 56% for NZ and 61% for US-EA and P< 0.00001). The percentage alcohol consumption among NZ cases were significantly lower than the US-AA and the US-EA cases (71% vs 85% for AA and 90% for US-EA and P< 0.00001). Lifestyle data related to tobacco smoking and alcohol consumption are not available for TW1 and TW2 cases.
Median age at prostate cancer diagnosis was significantly higher among TW1 cases compared to NZ, US-AA and US-EA cases (73 y vs 66 y each for NZ and TW2, 63 y for US-AA and 65 y for US-EA and P<0.001). Median PSA at prostate cancer diagnosis was higher among the TW1 and TW2 cases (41 ng/ml and 11 ng/ml respectively vs 8.6 ng/ml for NZ, 7.0 ng/ml for US-AA and 5.8 ng/ml for US-EA and P < 0.001). Median Gleason sum score was the lowest among US-EA cases (6 vs 7 for the NZ, TW1, TW2, and AA cases; P < 0.001). A significantly higher percentage of high-risk prostate cancer with a prognostic stage of >IIB (86%) was recorded among the TW1 cases compared to 29-66% among other cases (TW2, NZ, US-AA and US-EA) (P<0.00001).
Characteristics of the controls cohorts are given in Supplementary results Table 1b. The NZ controls were significantly younger (median 54 y) than the controls from the US (US-AA median 66 y and US-EA median 64 y, <0.001). Significantly, different BMI values were recorded between the three cohorts with US-AA recording the highest median of 29 kg/m2 and NZ and US-EA controls recording medians of 26 and 27.4 kg/m2 respectively. Significantly different proportions of ever-smokers were recorded between the three controls cohorts with only 34% among NZ controls while US-AA and US-EA controls recorded 61% and 59% respectively. Median PSA at recruitment was also significantly different between the three controls cohorts (NZ- 0.9 ng/ml, US-AA-0.4ng/ml and US-EA- 0.4ng/ml). Among NZ controls, 60.8% have recorded no medication intake for any health disorder, while 21% were taking medication for cardiovascular disease, 1% for diabetes, 5.8% for benign prostatic hyperplasia / lower urinary tract infection (BPH/LUT), 3.7% for mental illnesses, and 8.2% for other health disorders.
AKR1C3 rs12529 genetic polymorphism distribution among the cohorts
The AKR1C3 rs12529 genotype data for a total of 366, 202, 232, 618 and 643 of cases from NZ-non-MPEA, US-AA, US-EA, TW1 and TW2 cohorts respectively, 13 NZ-MPEA cases and 454 from NZ controls are presented in Supplementary results Table 2.
A. Multiple variable testing on log PSA outcomes
I. Testing the association of log PSA on multiple variables among pooled cases.
I.a. A summary of the association of log PSA with ethnicity, disease prognostic stage, Gleason sum score, age at diagnosis, BMI and genotype for cases cohorts analysed with or without lifestyle factors is given in Table 1. Log PSA showed a significant association with all tested factors except for the genotype when analysed without lifestyle factors. The US-AA, NZ-non-MPEA, TW1 and TW2 cases cohorts showed a higher log PSA compared to that of the US-EA cohort. BMI showed a significant negative association on log PSA while the other variables showed a positive association.
When lifestyle factors of tobacco smoking and alcohol consumption were included in the model, log PSA showed a significant association with ethnicity, disease prognostic stage, Gleason sum score, age at diagnosis and smoking status. In this analysis too, genotype showed no association on log PSA. Alcohol consumption also showed no significant association with log PSA outcomes. The log PSA association with BMI was not significant in this analysis compared to the analysis which included TW cases, but without inclusion of lifestyle data.
Table 1. Results summary of multiple linear regression analyses for testing impacts of demographic, genetic, lifestyle and clinical parameters on log PSA for all cases cohorts.
Without lifestyle* data
|
With lifestyle* data
|
Parameter
|
Parameter Est.
|
Pr > F
|
Parameter
|
Parameter Est.
|
Pr > F
|
Ethnicity (ref=European American)
|
|
<.0001
|
Ethnicity (ref=European American)
|
|
0.0002
|
African American
|
0.35
|
|
African American
|
0.36
|
|
NZ-non MPEA
|
0.29
|
|
NZ- non MPEA
|
0.24
|
|
Taiwanese TW1
|
1.69
|
|
|
|
|
Taiwanese TW2
|
0.51
|
|
|
|
|
Prognostic Stage (ref=<IIB)
|
|
<.0001
|
Prognostic Stage (ref=<IIB)
|
|
<.0001
|
>IIB
|
0.55
|
|
>IIB
|
0.33
|
|
Gleason sum score
|
0.18
|
<.0001
|
Gleason sum score
|
0.27
|
<.0001
|
Genotype (ref=CC)
|
|
0.678
|
Genotype (ref=CC)
|
|
0.653
|
G
|
-0.02
|
|
CG
|
-0.006
|
|
GG
|
0.04
|
|
GG
|
0.069
|
|
Age at diagnosis
|
0.01
|
0.0004
|
Age at diagnosis
|
0.01
|
0.003
|
BMI
|
-0.02
|
0.004
|
BMI
|
-0.01
|
0.136
|
|
|
|
Ever-smoker (ref=never smoker)
|
0.16
|
0.015
|
|
|
|
Alcohol consumer (ref= never alcohol consumer)
|
-0.07
|
0.370
|
Model
|
R^2=0.396, Pr>F <0.0001
|
Model
|
R^2=0.187, Pr>F <0.0001
|
*- Ever-smoker and alcohol consumer lifestyle data are not available for Taiwanese (TW) cohorts.
NZ- non MPEA cases – New Zealanders self-identified as European, or from the Indian sub-continent, Middle-East and others.
1.b. Interactive effects on log PSA
Multiple linear regression for the interactions between age at diagnosis, lifestyle, genetics and ethnicity in the log PSA outcomes were analysed. However, except for the age at diagnosis and ethnicity two-way interaction (Table 2), the interactions between ethnicity and ever-smoking status or ethnicity and alcohol consumption status or the three-way interaction between age at diagnosis, ethnicity and genotype were not significantly associated with log PSA (Supplementary results Tables 3). However, the age at diagnosis*ethnicity interaction remained significant even under the three-way model.
Table 2. Statistical outcomes in the interactive model with age at diagnosis and ethnicity on log PSA outcome for US-EA, US-AA and NZ- non- MPEA cases cohorts.
Source
|
DF
|
Type III SS
|
Mean Square
|
F Value
|
Pr > F
|
age at diagnosis*ethnicity interaction
|
Ethnic Group
|
2
|
5.38
|
2.69
|
3.33
|
0.036
|
Prognostic stage
|
1
|
17.43
|
17.43
|
21.56
|
<.0001
|
Gleason sum score
|
1
|
36.93
|
36.93
|
45.69
|
<.0001
|
Genotype
|
2
|
0.66
|
0.33
|
0.41
|
0.667
|
Age at diagnosis
|
1
|
7.51
|
7.51
|
9.28
|
0.002
|
BMI
|
1
|
1.85
|
1.85
|
2.29
|
0.130
|
Smoker
|
1
|
4.99
|
4.99
|
6.17
|
0.013
|
Alcohol
|
1
|
0.60
|
0.60
|
0.75
|
0.388
|
Age at diagnosis*Ethnic Group
|
2
|
8.08
|
4.04
|
5
|
0.007
|
NZ- non MPEA cases – New Zealanders self-identified as European, Indian sub-continent, Middle-East and others
II. The association of log PSA with multiple variables in independent cases cohorts
As cases data showed a significant interaction of age at diagnosis and ethnicity with log PSA outcomes, all cases cohorts were also analysed independently with multiple linear regression. Independent cases cohorts assessed with multiple linear regression analysis (Table 3) indicate that log PSA is significantly associated with Gleason sum score for US-EA cases; Gleason sum score and BMI for US-AA cases; prognostic stage, age at diagnosis and tobacco smoking for NZ-non-MPEA cases; prognostic stage and BMI for TW1 cases and prognostic stage and Gleason sum score for TW2 cases.
Table 3. The association of log PSA with age, BMI, clinical data, lifestyle and genotype for US-EA, US-AA and NZ-non-MPEA cases cohorts analysed independently.
|
European American
|
African American
|
NZ- non-MPEA
|
TW1
|
TW2
|
Parameter
|
Estimate
|
Pr > F
|
Estimate
|
Pr > F
|
Estimate
|
Pr > F
|
Estimate
|
Pr > F
|
Estimate
|
Pr > F
|
Prognostic Stage >IIB (ref=<IIB)
|
0.13
|
0.310
|
0.22
|
0.179
|
0.62
|
<.0001
|
1.79
|
<.0001
|
0.35
|
0.0003
|
Gleason sum score
|
0.26
|
0.001
|
0.62
|
<.0001
|
0.07
|
0.208
|
0.08
|
0.098
|
0.12
|
0.006
|
Genotype (ref=CC) for all except TW (ref=CG)
|
|
0.361
|
|
0.720
|
|
0.778
|
|
|
|
|
CG
|
0.08
|
|
-0.047
|
|
-0.07
|
|
0.00
|
0.117
|
0.00
|
0.666
|
GG
|
0.024
|
|
0.105
|
|
-0.01
|
|
-2.35
|
|
0.44
|
|
Age at diagnosis
|
0.01
|
0.491
|
0.02
|
0.073
|
0.02
|
0.0003
|
-0.02
|
0.894
|
0.01
|
0.186
|
BMI
|
-0.01
|
0.587
|
-0.04
|
0.012
|
0.01
|
0.259
|
-0.06
|
0.006
|
0.01
|
0.659
|
Ever-smoker (ref=never smoker)
|
0.16
|
0.697
|
0.13
|
0.697
|
0.22
|
0.018
|
|
|
|
|
Alcohol consumer (ref= never alcohol consumer)
|
-0.07
|
0.595
|
0.21
|
0.325
|
-0.15
|
0.139
|
|
|
|
|
Model
|
R^2=0.08, Pr>F
<0.013
|
R^2=0.336, Pr>F <0.0001
|
R^2=0.206, Pr>F <0.0001
|
R^2=0.23, Pr>F <0.0001
|
R^2=0.111, Pr>F <0.0001
|
NZ- non MPEA cases – New Zealanders self-identified as European or from the Indian sub-continent, Middle-East and others.
III. The association of log PSA with multiple variables among pooled controls.
Multiple regression analysis showed that log PSA is significantly associated with ethnicity, age, BMI and smoking status when all controls cohorts were considered together (Table 4).
Table 4. Results of multiple linear regression analysis for testing impacts of age, BMI, and lifestyle on log PSA for all controls cohorts.
Parameter
|
Parameter Est.
|
Pr > F
|
Ethnicity (ref= NZ-European)
|
|
<0.0001
|
African American
|
-1.27
|
|
European American
|
-1.41
|
|
Age
|
0.04
|
<.0001
|
BMI
|
-0.02
|
0.0002
|
Ever-smoker (ref=never- smoker)
|
-0.12
|
0.036
|
Ever-alcohol consumer (ref= never-alcohol consumer)
|
-0.003
|
0.965
|
Model
|
R^2=275, Pr>F <0.0001
|
IV. The association of log PSA with multiple variables in independent controls cohorts.
When the control cohorts were independently analysed with multiple linear regression analysis (Table 5), age was significantly associated with log PSA in US-EA, US-AA and NZ controls. However, in US-EA and US-AA controls, log PSA was significantly associated also with BMI, while among US-AA controls, tobacco smoking was also a significant association factor.
Table 5. Summary of multiple linear regression analysis for testing impacts of age, BMI, and lifestyle on log PSA for US-EA, US-AA and NZ-European controls analysed independently.
|
European American
|
African American
|
NZ-European
|
Parameter
|
Parameter Est.
|
Pr > F
|
Parameter Est.
|
Pr > F
|
Parameter Est.
|
Pr > F
|
Age
|
0.03
|
<.0001
|
0.05
|
<.0001
|
0.03
|
<.0001
|
BMI
|
-0.02
|
0.048
|
-0.03
|
0.008
|
-0.01
|
0.2039
|
Ever-smoker (ref=never- smoker)
|
-0.18
|
0.072
|
-0.25
|
0.034
|
0.04
|
0.6255
|
Ever alcohol consumer (ref= never alcohol consumer)
|
0.13
|
0.389
|
0.09
|
0.4812
|
-0.19
|
0.0785
|
Model
|
R^2=0.064, Pr>F <0.0001
|
|
R^2=0.13, Pr>F <0.0001
|
|
R^2= 0.282, Pr>F <0.0001
|
|
B. Univariate analyses on log PSA correlation with age
As ethnicity interacting with age was the most influential factor that produced an impact on log PSA, we further attempted univariate analyses to understand age dependent impacts on log PSA levels with and without stratification by genotype for independent case and control cohorts (Table 6). Overall, all controls (NZ, US-EA, US-AA) and all cases except for the US-EA cases showed significant correlation between age and log PSA. A reduction in correlation coefficient strength was observed among cases compared to controls overall. The NZ control cohort showed significant age and log PSA correlation despite stratification by genotype. However, NZ-non-MPEA cases showed significant age and log PSA correlation only among those carrying the AKR1C3 rs12529 CG and GG genotypes. For US-AA cases, significant age and log PSA correlation was restricted to those carrying the AKR1C3 rs12529 CC and CG genotypes. For TW1 and TW2 cases, this correlation was restricted to men carrying the AKR1C3 rs12529 GG genotype, while for the US-EA cases, none of the AKR1C3 rs12529 genotypes showed significant correlations.
Table 6. Spearman correlation statistics between Age (age at diagnosis for cases and age at recruitment for controls) and log PSA stratified by ethnicity, case, control status and the AKR1C3 rs12529 genotype.
|
|
All
|
CC
|
CG
|
GG
|
NZ European controls
|
r
|
0.556
|
0.517
|
0.519
|
0.616
|
p
|
2E-07
|
2E-07
|
2E-07
|
2.26E-09
|
n
|
498
|
181
|
202
|
71
|
NZ - non MPEA cases
|
r
|
0.303
|
0.129
|
0.287
|
0.426
|
p
|
6.67E-011
|
0.160
|
2.33E-04
|
7.43E-05
|
n
|
449
|
120
|
161
|
82
|
US-AA controls
|
r
|
0.344
|
|
|
|
p
|
1.09E-12
|
n
|
410
|
US-AA cases
|
r
|
0.243
|
0.312
|
0.239
|
0.153
|
p
|
4.98E-04
|
0.017
|
0.014
|
0.349
|
n
|
202
|
58
|
105
|
39
|
US-EA controls
|
r
|
0.213
|
|
|
|
p
|
2.89E-06
|
n
|
475
|
US-EA cases
|
r
|
0.0244
|
0.113
|
-0.063
|
0.110
|
p
|
0.711
|
0.352
|
0.504
|
0.457
|
n
|
232
|
69
|
115
|
48
|
TW1 cases
|
r
|
0.119
|
0.500
|
-0.00108
|
0.140
|
p
|
0.003
|
0.182
|
0.990
|
0.002
|
n
|
622
|
8
|
133
|
477
|
TW2 cases
|
r
|
0.113
|
0.0286
|
0.103
|
0.121
|
p
|
0.005
|
1.000
|
0.217
|
0.008
|
n
|
622
|
6
|
144
|
472
|
r = correlation coefficient; p = significance of probability; n= number of pairs tested
NZ- non MPEA cases – New Zealanders self-identified as European, or from the Indian sub-continent, Middle-East and others.