Screening performance in high-risk groups of breast cancer by integrating classical risk factors, mammographic density and polygenic risk

doi:10.21203/rs.3.rs-1467695/v1

Download PDF

Research Article

Screening performance in high-risk groups of breast cancer by integrating classical risk factors, mammographic density and polygenic risk

https://doi.org/10.21203/rs.3.rs-1467695/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background: Risk prediction models integrating classical risk factors (CRF), mammographic density (MD), and polygenic risk score (PRS) are increasingly developed to identify high-risk groups of breast cancer. Few studies investigate the screening performance in high-risk groups by these models.

Methods: Based on a median follow-up of 11.7 years of 7794 women from the Multi-modality Independent Screening Trial (MIST), four risk prediction models with CRF (Model_CRF), CRF and MD (Model_CRF+MD), CRF and PRS (Model_CRF+PRS), and all three components (Model_FULL) were developed to identify high-risk groups of breast cancer. The hazard ratio (HR) and 95% confidential interval (CI) of breast-cancer mortality for high-risk groups compared to low-risk groups was calculated to determine potential benefit of risk-reducing interventions. The detection rate (DR), accuracy and cancer-stage for clinical breast examination (CBE), breast ultrasonography (BUS), and mammography (MAM) were compared to determine the optimal screening method for high-risk groups.

Results: The areas under the curve of risk prediction model increased from 0.573 (95%CI: 0.532-0.614) for Model_CRF, to 0.587 (95%CI: 0.544-0.630) for Model_CRF+MD, 0.670 (95%CI: 0.622-0.717) for Model_CRF+PRS and 0.674 (95%CI: 0.623-0.725) for Model_FULL. The HRs of breast cancer mortality for high-risk groups compared to low-risk groups increased from 1.81 (95%CI: 1.17-2.81) for Model_CRF, to 2.22 (95%CI: 1.35-3.67) for Model_CRF+MD, 2.48 (95%CI: 1.43-4.29) for Model_CRF+PRS, and 3.68(95%CI: 1.94-6.99) for Model_FULL. Among high-risk groups by Model_CRF, the DR of BUS was similar to that of MAM (3.926/1,000 vs. 2.399/1,000, P=0.193), but significantly higher than that of CBE (1.091/1,000, P=0.024). Compared with MAM, BUS showed significantly lower sensitivity (50.0% vs. 81.8%, P=0.026), but comparable specificity (99.2% vs. 99.3%), positive prediction values (22.9% vs. 34.6%), and negative prediction values (99.8% vs. 99.9%). Further analyses showed no significant difference in the proportions of early-stage breast cancer detected between BUS and MAM (50.00% vs. 61.54%, P=0.673). Similar results were observed in high-risk groups by other models.

Conclusions: Accurate risk assessment integrating CRF, MD and PRS is needed to identify high-risk groups of breast cancer. The higher the risk, the greater the benefit of the intervention. BUS was comparable to MAM for screening breast cancer in high-risk groups.

Breast cancer

Mammographic density

Prediction model

Cohort study

According to the GLOBOCAN 2020 estimates, female breast cancer has surpassed lung cancer (both sexes combined) as the most commonly diagnosed cancer, with an estimated 2.3 million new cases (11.7%).[1] Due to the dramatic transformations in many social and economic conditions, along with increasing tobacco use, unhealthy diet, excess body weight, physical inactivity, and the harmful use of alcohol, several countries are facing growing disease and socioeconomic burden associated with breast cancer.[1–4] Timely interventions to address the increasing breast cancer burden is one of the major public health issues facing many countries, including China.[1, 2, 4]

Population-wide breast cancer screening programs and guidelines have been established in many countries to reduce breast cancer mortality through early detection and treatment.[5–9] Despite good evidence to support breast cancer screening,[10–12] several objective reasons, such as the large coverage of population, inaccessibility of cancer screening equipment, the lack of insurance coverage, and the lack of professional screening technicians, create barriers for low-resource countries to conduct population-wide breast cancer screening program.[2, 13] Screening for high-risk groups would be a more cost-effective and sustainable choice for those countries with limited resource, including China.

Risk prediction models integrating classical risk factors (CRF), mammographic density (MD), and polygenic risk score (PRS) are increasingly developed to identify high-risk groups of breast cancer.[14–17] Researchers also suggested that it’s necessary to improve the cost effectiveness and benefit-to-harm ratio of screening by adopting a high risk-targeted screening strategy using existing and evolving risk prediction models.[18, 19] However, most current cancer screening guidelines recommend routine screening using only age-based, family history-based, or genetic susceptibility-based screening strategy, but not a high risk-targeted screening strategy [5, 7, 20]. Moreover, few studies investigate the screening performance in high-risk groups by these risk prediction models.

Therefore, based on our previous low-cost easy-to-use screening model only with the number of 6 established risk factors of breast cancer, this study aims to develop long-time precision risk prediction models by integrating CRF, MD, and PRS, and to investigate the screening performance in high-risk groups by these models.

Study population

The Multi-modality Independent Screening Trial (MIST) of breast cancer was a trial that aimed to evaluate and compare the screening performances of clinical breast examination (CBE), breast ultrasonography (BUS), and mammography (MAM) among Chinese women. In briefly, a total of 33,234 asymptomatic women aged 45 to 65 years and lived in local communities for at least 3 years were initially recruited from 5 cities in China (Tianjin, Beijing, Nanchang, Shenyang, and Feicheng) to receive the first-round screening between July 2008 and December 2010. After consent informed and questionnaire interview, all women received CBE, BUS, and MAM concurrently. Baseline blood samples were collected from 10,852 women in Tianjin and Feicheng. All three screening modalities were performed followed unified screening protocols. Physicians performed and interpreted the screening results independently and blindly. Patients with suspicious malignancy and highly suggestive of breast cancer from any of three screening modalities were recommended for pathological examination. All breast cancers were confirmed with combinations of pathological examination, clinical diagnosis, and active or passive follow-up within one year after screenings. Detailed information of MIST referred to our previous studies.[13, 21]

All participants were invited to receive second-round screening from October 2013 to May 2015. Breast cancer incidence and mortality were linked to the local cancer registry and death registry. The diagnosis of cancer and the survival for women in Tianjin were further linked to the Tianjin Electronic Medical Record Information System (TEMRIS) until October 2021. The TEMRIS covered 95% hospitals in Tianjin, and the diagnosis from different hospital were encoded according to International Classification of Diseases (10th Revision). In order to develop long-time risk prediction model of breast cancer, women in Tianjin from MIST (MIST-TJ, N=7826) were included in the final analyses. This study was reviewed and approved by the institutional review board of Tianjin Medical University Cancer Institute and Hospital (TMUCIH).

Sociodemographic and epidemiologic information

After informed consent, all women received a face-to-face questionnaire-based interview conducted by trained investigators to collect information on sociodemographic (age at enrollment, race, marital status, education, family income, and insurance), family history of breast cancer in first- and second degree relatives, history of benign breast disease, diet and lifestyle factors, and female-specific factors (age at menarche, age at first birth, menopausal status, duration of breastfeeding, oral contraceptive use and hormone replacement therapy). Body weight (kg) and height (m) were measured by trained investigators, and the body mass index was calculated as the weight in kg divided by the square of height in meters (kg/m²). Women were classified into three BMI groups: underweight (<18.5 kg/m²), overweight (≥24 kg/m²), and obese (≥28 kg/m²). Family history of breast cancer referred to at least one of the) with breast cancer. Regular cigarette smoking was defined as smoking at least one cigarette per day for six months or more.

Screening methods and assessment of mammographic density

CBE, BUS, and MAM were performed by physicians with at least 5 years of work experience. Bilateral MAM was conducted with a full-field digital mammography system. Bilateral BUS was performed with color Doppler and high-resolution transducers with maximum frequency of at least 10 MHz. Results of CBE and BUS were classified into four groups: 1, normal; 2, abnormal benign; 3, suspicious malignancy; and 4, highly suggestive of a malignancy. Results of MAM were classified into six groups according to Breast Imaging Reporting and Data System (BIRADS) of the American College of Radiology (ACR): 0, additional imaging needed; 1, negative; 2, benign finding; 3, probably benign finding; 4, suspicious malignancy; and 5, highly suggestive of a malignancy. All assessments of MAM and BUS were double-checked at local screening sites. Disagreements in two MAM/BUS physicians were reassessed by another more experienced physician. During mammography screening, both craniocaudal and mediolateral oblique views were used to determine mammographic density according to the BI-RADS, Qualitative assessment of MD was classified into four groups: 1, fatty breast (< 25% glandular); 2, scattered fibro-glandular breast (25%–50% glandular); 3, heterogeneously dense breast (51%–75% glandular); and 4, extremely dense breast (>75% glandular). Detailed information referred to our previously published papers.[13, 21]

Single Nucleotide Polymorphism selection and genotyping

Until 2016, a total of 93 Single Nucleotide Polymorphisms (SNPs) achieved genome-wide significant associations with breast cancer in 34 published GWAS[22]. Among these GWAS-identified SNPs, 9 SNPs were initially identified in East Asians[23-27], while another 16 SNPs were initially identified in Europeans[28-32] and were further validated in large East-Asian populations.[28-32] Among 25 initially selected SNPs, 2 SNPs with high linkage disequilibrium with other SNPs (r²>0.8) and low risk allele frequency in East Asian populations were further excluded. Finally, 23 SNPs were selected for subsequent genotyping testing. A total of 5 ml ETDA-anticoagulated venous blood was collected from each participant. Leukocytes were separated from the collected plasma and stored in a cryotube at -80°C Celsius refrigerator for DNA extraction. The QIAGEN DNA Extraction Kit (QIAGEN Inc.) was used to extract genomic DNA and the Wafergen SmartChip platform was used to genotype the targeted 23 SNPs.[34, 35] In order to ensure the accuracy and reliability of the genotyping results, approximately 5% of the samples were randomly selected for retesting. Since rs6472903 was not successful genotyped in most samples, it was also excluded in the final analysis.

Statistical analysis

The logrank test based on KaplanMeier curve was used to compare the incidence risk of breast cancer between subgroups within each CRF, MD, and PRS. Due to limited CRF associated with the risk of breast cancer under the significant level of 0.05 in log-rank tests (Table 1), CRF with p value <0.20 in log-rank tests were included in the risk predication models. Four risk prediction models with CRF (Model_CRF), CRF and MD (Model_CRF+MD), CRF and PRS (Model_CRF+PRS), and all three components (Model_FULL) were developed to identify different high-risk groups of breast cancer and compare the screening performance between different high-risk groups. Relative risks were estimated using hazard ratios (HRs) and 95% confidence intervals (95%CI) with Cox regression model.

Table 1

Long-time risk of breast cancer by baseline characteristics
Characteristic^*	No. (%) of women	Follow-up, 1000 women years	No. of cancer	IR per 1000 women years	P value for K-M curve	Age-adjusted HR (95% CI)
Overall	7794 (100.0)	78	217	2.8
Age at enrollment
45–50, years	2514 (32.3)	24	60	2.5	0.644
51–60, years	4328 (55.5)	44	134	3.0
≥ 61, years	952 (12.2)	10	23	2.3
Age at menarche					0.607
≤ 12, years	1101 (14.2)	11	28	2.5		0.91 (0.61, 1.35)
> 12 years	6652 (85.8)	67	188	2.8		Ref.
Age at first birth					0.893
Nulliparous	143 (1.9)	1	3	3.0		0.80 (0.26, 2.51)
< 30, years	6004 (79.6)	60	465	7.8		Ref.
≥ 30 years	1395 (18.5)	14	42	3.0		1.04 (0.74, 1.47)
Breastfeeding					0.139
No	1532 (20.0)	15	51	3.4		Ref.
Yes	6143 (80.0)	62	161	2.6		0.79 (0.58, 1.08)
Menopausal status					0.005
Premenopausal	2332 (30.7)	24	75	3.1		1.85 (1.31, 2.61)
Postmenopausal	5264 (69.3)	53	136	2.5		Ref.
Family history of breast cancer					0.363
No	7545 (96.8)	76	207	2.7		Ref.
Yes	249 (3.2)	3	10	3.3		1.34 (0.71, 2.52)
History of breast benign disease					0.058
No	5562 (74.3)	55	143	2.6		Ref.
Yes	1924 (25.7)	20	67	3.4		1.32 (0.99, 1.77)
Hormone replacement therapy					0.251
No	6409 (96.3)	64	180	2.8		Ref.
Yes	246 (3.7)	3	5	1.7		0.59 (0.24, 1.45)
Oral contraceptives					0.595
No	6383 (88.3)	64	181	2.8		Ref.
Yes	843 (11.7)	8	24	3.0		0.90 (0.59, 1.38)
Body mass index					0.815
< 18.5	151 (1.9)	1	4	4.0		1.11 (0.41, 3.03)
18.5–23.9	3641 (47.0)	36	95	2.6		Ref.
24.0-27.9	3054 (39.4)	31	92	3.0		1.11 (0.83, 1.48)
≥ 28.0	898 (11.6)	9	25	2.5		0.91 (0.59, 1.42)
Ever smoking					0.575
No	6998 (94.0)	70	200	2.9		Ref.
Yes	444 (6.0)	4	11	2.8		0.84 (0.46, 1.55)
Negative events					0.140
No	6690 (89.3)	67	176	2.6		Ref.
Yes	804 (10.7)	8	31	3.9		1.33 (0.91, 1.96)
Mammographic density					0.002
Fatty	803 (11.7)	8	15	1.9		Ref.
Scattered	3005 (43.8)	30	82	2.7		1.76 (1.01, 3.07)
Heterogeneous	2943 (42.9)	30	98	3.3		2.69 (1.53, 4.74)
Dense	114 (1.7)	1	4	4.0		2.87 (0.93, 8.83)
22-locus PRS quartiles					< 0.001
1st quartile	897 (20.2)	9	12	1.3		Ref.
2nd quartile	1135 (25.6)	11	29	2.6		1.71 (0.87, 3.36)
3rd quartile	1377 (31.0)	14	43	3.1		2.23 (1.18, 4.24)
4th quartile	1026 (23.1)	10	65	6.5		4.75 (2.57, 8.81)
Note: *, unknown group in index variables were not shown; IR, incidence rate; HR (95%CI), hazard ratio (95% confidential interval); PRS polygenic risk score.

Polygenic risk score (PRS) was calculated to measure the cumulative effect of multiple genetic risk variants with the following formula:

where β_k is the per-allele log OR for breast cancer associated with SNP_kfrom univariate cox regression, x_k is the alleles dosage for SNP_k (0, 1, or 2), and n is the total number of SNPs included in the PRS.

Discrimination of risk prediction model was measured by the area under the receiver operating characteristic curve (AUC). Calibration of 10-year risk prediction model was assessed by comparing the observed and expected number of cases overall and within risk categories. [36, 37] CIs for expected-to-observed ratios (O/E) were calculated by assuming a Poisson distribution for the observed numbers of cases with the following formula:

O/E=1 would indicate perfect calibration. The 10-year risk prediction model for breast cancer were further visualized using nomograms. Due to the inconsistent missing data in CRF, MD, and PRS, sensitivity analyses in subgroup population with complete data were conducted to further compare discrimination of different risk models.

In order to compare with the 10-year breast cancer risk reported in previous studies,[38] the 10-year breast cancer risk in MIST-TJ were divided into the following five categories: below average risk (<0.40%), average risk (0.4% to <0.6%), above average risk (0.6% to <1.0%), moderately increased risk (1.0% to <2.0%) and high risk(≥2.0%). Moreover, the women in MIST-TJ was further simplified into high-risk and low-risk groups according to the optimal cut-off values under the receiver operating curve of different risk prediction models.

The relative risk measured by HR of breast-cancer mortality for high-risk groups compared to low-risk groups was calculated to determine potential benefit of risk-reducing interventions. The detection rate (DR), accuracy and cancer-stage for CBE, BUS, and MAM were compared to determine the optimal screening method for high-risk groups.

The analyses were conducted with R software (version 4.0.3) and SPSS software (version 24). All statistical tests were two-sided, and a P value equal to or less than 0.05 was considered statistically significant.

Long-time risk of breast cancer by baseline characteristics

During a median follow-up of 11.7 years (interquartile range [IQR], 9.8–12.7 years), a total of 217 breast cancer cases were identified, with an incident rate of 2.8 per 1000 person-year (Table 1). Ever experience of negative events, history of breast benign disease, premenopausal, never breastfeeding were associated with the increased risk of breast cancer under the significant level of 0.20 in log-rank tests (Table 1, Additional file 2). Compared with fatty breast, HRs increased from 1.76 (95%CI: 1.01–3.07) for scattered fibro-glandular breast, to 2.69 (95%CI: 1.53–4.74) for heterogeneously dense breast, and 2.87 (95%CI: 0.93–8.83) for extremely dense breast. Compared with the first IQR of 22-locus PRS, HRs increased from 1.71 (95%CI: 0.87–3.36) for the second IQR, to 2.23 (95%CI: 1.18–4.24) for the third IQR, and 4.75 (95%CI: 2.57–8.81) for the fourth IQR (Table 1).

Calibration And Discrimination Of Different Breast-cancer Risk Prediction Models

As shown in Table 2 and Fig. 1, the AUC of risk prediction model increased from 0.573(95%CI: 0.532–0.614) for Model_CRF, to 0.587(95%CI: 0.544–0.630) for Model_CRF+MD, 0.670(95%CI: 0.622–0.717) for Model_CRF+PRS, and 0.674(95%CI: 0.623–0.725) for Model_FULL. Sensitivity analyses of discrimination for different breast-cancer risk prediction models in the same subgroup population with complete data (N = 3398, case = 123) showed similar results (Additional file 1). Although the predicted risk of breast cancer in women with 10-year breast cancer risk > 1.0% with Model_CRF+PRS was significantly underestimated compared to the observed risk of breast cancer [O/E of 1.76 (95%CI: 1.24–2.51) for women with moderately increased risk and 2.78 (95%CI: 1.23–6.29) for those with high risk], the other subgroups within this model and all subgroups within other models showed good calibration, with overall O/E of 1.00 (95%CI: 0.87–1.15) for Model_CRF, 1.05(95%CI: 0.91–1.22) for Model_CRF+MD, 1.14(95%CI: 0.95–1.36) for Model_CRF+PRS, and 1.00(95%CI: 0.84–1.19) for Model_FULL (Table 2).

Table 2

Calibration and discrimination of different breast-cancer risk prediction models with classical risk factors (CRF), mammographic density (MD), and polygenic risk score (PRS)
Model by 10-years risk	No. (%) of women	Follow-up,1000 women years	No. of cancer		O/E (95%CI)	IR/1000 women years		HR (95%CI)	AUC (95%CI)
Model by 10-years risk	No. (%) of women	Follow-up,1000 women years	Observed	Expected	O/E (95%CI)	Observed	Expected	HR (95%CI)	AUC (95%CI)
Risk prediction model with CRF^*									0.573 (0.532, 0.614)
All	7087	71.3	199	198	1.00 (0.87, 1.15)	2.8	2.8
< 0.4%	526 (7.4)	5.3	7	9	0.76 (0.40, 1.44)	1.3	1.7	0.51 (0.24, 1.09)
0.4% to < 0.6%	4802 (67.8)	48.3	126	121	1.04 (0.87, 1.24)	2.6	2.5	Ref.
0.6% to < 1.0%	1701 (24.0)	17	59	65	0.91 (0.72, 1.16)	3.5	3.8	1.32 (0.97, 1.81)
1.0% to < 2.0%	58 (0.8)	0.6	7	4	1.74 (0.66, 4.64)	11.7	6.7	4.60 (2.05, 10.34)
Risk prediction model with CRF and MD									0.587 (0.544, 0.630)
All	6265	63.6	183	174	1.05 (0.91, 1.22)	2.9	2.7
< 0.4%	2139 (34.1)	21.7	43	51	0.84 (0.64, 1.10)	2.0	2.4	0.69 (0.47, 1.02)
0.4% to < 0.6%	2312 (36.9)	23.4	67	60	1.11 (0.87, 1.44)	2.9	2.6	Ref.
0.6% to < 1.0%	1633 (26.1)	16.6	58	52	1.11 (0.85, 1.46)	3.5	3.1	1.23 (0.86, 1.75)
1.0% to < 2.0%	181 (2.9)	1.8	15	9	1.59 (0.84, 3.02)	8.3	5.2	2.86 (1.60, 5.12)
Risk prediction model with CRF and PRS									0.670 (0.622, 0.717)
All	4029	40.5	136	120	1.14 (0.95, 1.36)	3.4	3.0
< 0.4%	715 (17.7)	7.1	12	16	0.76 (0.47, 1.25)	1.7	2.2	0.80 (0.41, 1.57)
0.4% to < 0.6%	1480 (36.7)	15	31	41	0.75 (0.55, 1.02)	2.1	2.8	Ref.
0.6% to < 1.0%	771 (19.1)	7.8	23	26	0.88 (0.60, 1.29)	2.9	3.4	1.42 (0.82, 2.46)
1.0% to < 2.0%	919 (22.8)	9.3	54	31	1.76 (1.24, 2.51)	5.8	3.3	2.80 (1.78, 4.38)
≥ 2.0%	144 (3.6)	1.4	16	6	2.78 (1.23, 6.29)	11.4	4.1	5.30 (2.83, 9.96)
Risk prediction model with CRF, MD and PRS									0.674 (0.623, 0.725)
All	3398	34.6	123	123	1.00 (0.84, 1.19)	3.6	3.6
< 0.4%	808 (23.8)	8.3	18	13	1.39 (0.81, 2.40)	2.2	1.6	1.40 (0.71, 2.76)
0.4% to < 0.6%	1004 (29.5)	10.2	16	23	0.69 (0.46, 1.04)	1.6	2.3	Ref.
0.6% to < 1.0%	735 (21.6)	7.5	26	25	1.04 (0.70, 1.54)	3.5	3.3	2.22 (1.18, 4.17)
1.0% to < 2.0%	691 (20.3)	6.9	45	44	1.02 (0.76, 1.37)	6.5	6.4	4.09 (2.29, 7.29)
≥ 2.0%	160 (4.7)	1.6	18	18	1.02 (0.64, 1.63)	11.3	11.0	7.06 (3.52, 14.16)
Note: O/E, Observed/Expected cases; IR, incidence rate; HR (95%CI), hazard ratio (95% confidential interval); AUC, area under the receiver operating characteristic curve; *, CRF included age at enrollment, breastfeeding, menopausal status, history of breast benign disease, and negative events.

Breast Cancer Mortality Of Different Risk Groups With Different Risk Prediction Models

As shown in Table 3, after risk reclassification according to the optimal cut-off values under the receiver operating curve of different risk prediction models, the HRs of breast cancer-specific mortality for high-risk groups compared to low-risk groups increased from 1.81(95%CI: 1.17–2.81) for Model_CRF, to 2.22(95%CI: 1.35–3.67) for Model_CRF+MD, 2.48(95%CI: 1.43–4.29) for Model_CRF+PRS, and 3.68(95%CI: 1.94–6.99) for Model_FULL.

Table 3

Breast cancer (BC) mortality of different risk groups with classical risk factors (CRF), mammographic density (MD), and polygenic risk score (PRS).
Risk groups	Participants N (%)	BC deaths N (%)	Follow-up, 1000 person years	BC mortality, 1/1000 person years	P value for Fine-Gray test	HR (95%CI)
Risk prediction model with CRF					0.007
Low risk	3758 (53.0)	32 (38.1)	37.25	0.86		Ref
High risk	3329 (47.0)	52 (61.9)	34.08	1.53		1.81 (1.17, 2.81)
Risk prediction model with CRF and MD					0.001
Low risk	2652 (42.3)	21 (27.6)	27	0.78		Ref
High risk	3613 (57.7)	55 (72.4)	36.61	1.50		2.22 (1.35, 3.67)
Risk prediction model with CRF and PRS					< 0.001
Low risk	3022 (75.0)	28 (54.9)	30.43	0.92		Ref
High risk	1007 (25.0)	23 (45.1)	10.09	2.28		2.48 (1.43, 4.29)
Risk prediction model with CRF, MD and PRS					< 0.001
Low risk	2019 (59.4)	13 (28.9)	20.56	0.63		Ref
High risk	1379 (40.6)	32 (71.1)	14	2.29		3.68 (1.94, 6.99)
Note: HR (95%CI), hazard ratio (95% confidential interval).

Screening Performances Of Different Screening Modalities In High-risk Groups With Different Risk Prediction Models

In high-risk groups by Model_CRF, after two-round screening, the detection rate of BUS was similar to that of MAM (3.926/1,000 vs. 2.399/1,000, P = 0.193), but significantly higher than that of CBE (1.091/1,000, P = 0.024) (Table 4). Compared with MAM, BUS showed significantly lower sensitivity (50.0% vs. 81.8%, P = 0.026), but comparable specificity (99.2% vs. 99.3%, P = 0.721), positive prediction values (22.9% vs. 34.6%, P = 0.198), and negative prediction values (99.8% vs. 99.9%, P = 0.071) (Table 4). Further analyses showed no significant difference in the proportions of early-stage breast cancer detected between BUS and MAM (50.00% vs. 61.54%, P = 0.673) (Table 5). Similar but few significant results were observed in high-risk groups by other models (Additional file 3, Additional file 4).

Table 4

Cancer detection rates and accuracy of different screening modalities in high-risk groups with classical risk factors.
Screening performances	CBE		BUS		MAM		P value^a	P value^b
Screening performances	No./total exams	Rate	No./total exams	Rate	No./total exams	Rate	P value^a	P value^b
Cancer detection Rate per 1000 exams
Round 1	4/2913	1.373	7/2913	2.403	11/2913	3.776	0.185	0.345
Round 2	1/1672	0.598	4/1672	2.392	7/1672	4.187	0.105	0.365
Round 1 + 2	5/4585	1.091	11/4585	2.399	18/4585	3.926	0.024	0.193
Sensitivity
Round 1	4/12	0.333	7/12	0.583	11/12	0.917	0.017	0.155
Round 2	1/10	0.100	4/10	0.400	7/10	0.700	0.029	0.370
Round 1 + 2	5/22	0.227	11/22	0.500	18/22	0.818	< 0.001	0.026
Specificity
Round 1	2867/2901	0.988	2867/2901	0.988	2876/2901	0.991	0.016	0.239
Round 2	1653/1662	0.995	1659/1662	0.998	1653/1662	0.995	0.179	0.083
Round 1 + 2	4520/4563	0.991	4526/4563	0.992	4529/4563	0.993	0.573	0.721
PPV
Round 1	4/38	0.105	7/41	0.171	11/36	0.306	0.083	0.163
Round 2	1/10	0.100	4/7	0.571	7/16	0.438	0.092	0.667
Round 1 + 2	5/48	0.104	11/48	0.229	18/52	0.346	0.016	0.198
NPV
Round 1	2867/2875	0.997	2867/2872	0.998	2876/2877	1.000	0.044	0.220
Round 2	1653/1662	0.995	1659/1665	0.996	1653/1656	0.998	0.224	0.510
Round 1 + 2	4520/4537	0.996	4526/4537	0.998	4529/4533	0.999	0.019	0.071
Note: CBE, clinical breast examination; BUS, breast ultrasonography; MAM, mammography; PPV/NPV, positive/negative predictive value; a, comparison between CBE, BUS and MAM; b, comparison between BUS and MAM.

Table 5

Cancer stage of different screening modalities in high-risk groups with classical risk factors.
TNM stage	CBE (N = 6)		BUS (N = 12)		MAM (N = 18)		P value^a	P value^b
TNM stage	n	%	n	%	n	%	P value^a	P value^b
Stage 0	0	0.00%	0	0.00%	1	7.69%	1.000	1.000
Stage I	2	50.00%	4	50.00%	7	53.85%
Stage II	2	50.00%	3	37.50%	4	30.77%
Stage III	0	0.00%	1	12.50%	1	7.69%
Stage 0-I	2	50.00%	4	50.00%	8	61.54%	0.871	0.673
Stage II-III	2	50.00%	4	50.00%	5	38.46%
Note: CBE, clinical breast examination; BUS, breast ultrasonography; MAM, mammography; a, comparison between CBE, BUS and MAM; b, comparison between BUS and MAM.

This was the first study to investigate screening performance in high-risk groups of breast cancer by integrating CRF, MD and PRS. This study not only once again support that risk prediction models integrating CRF, MD and PRS were very useful to identify high-risk groups of breast cancer, but also support that BUS was comparable to MAM for breast cancer screening except for the lower sensitivity in high-risk groups. Moreover, it is suggested that after improved risk stratification, the higher the risk, the greater the benefit of the intervention. Therefore, this study further supports that it is very necessary to adopt a high risk-targeted screening strategy rather than only age-based, family history-based, or genetic susceptibility-based screening strategy.

Previous several studies had developed and investigated the performances of breast cancer risk prediction models integrating CRF and PRS among Chinese women.[39–42] However, most of these studies only evaluated the discrimination and calibration of these models in case-control studies but not in cohort studies, and no study evaluate the improvement of MD in the prediction of breast cancer risk in Chinese women due to inaccessibility of mammography equipment in the general population-based studies. Although MD was identified as a strong risk factor for breast cancer since 2007,[43] no direct evidence support this result among Chinese women until now. Therefore, this was the first study to support the positive association between MD and risk of breast cancer among Chinese women. Moreover, the association between PRS and breast cancer in this study was significantly stronger than that reported in previous studies among Chinese women.[42, 44, 45] Based on CRF, the improvement on the accuracy of breast cancer risk prediction with PRS is also significantly better than previous studies. The major reasons would probably be that this study included more and Chinese-specific SNPs associated with breast cancer.

In addition to the above implications of breast cancer risk prediction for Chinese women, the most important implications of this study was that it provided four different models to identify high-risk groups of breast cancer for risk-reducing interventions in four different scenarios. In the first scenario when limited resources are available, the simplest model with CRF can be used to identify high-risk groups of breast cancer. Even though not all predictors are significantly associated with breast cancer risk, including these CRF predictors at a high significant level can also achieve relatively good discrimination. That was similar to the low-cost and easy-to-use model proposed in our previous study.[46] In the second scenario when women ever received a mammographic screening, it is very necessary to collect data of MD and develop model with CRF and MD to identify high-risk groups of breast cancer. In the third scenario when adequate resources are available but women do not ever receive mammographic screening, the combination of CRF and PRS can well detect high-risk groups of breast cancer, and may even be better than the combination of CRF and MD (Table 2 and Table 3). How to select target GWAS-identified SNPs for PRS is very important, and it is necessary to select as many population-specific SNPs as possible. However, with the increase of GWAS-identified SNPs associated with breast cancer, some researchers proposed 313-locus PRS or whole-genome-based PRS to identify high-risk groups.ADDIN These PRS based on so large number of SNP would be unaffordable or less cost-effective for population-based risk-reduction interventions. In the last scenario when adequate resources are available and women ever receive mammographic screening, the combination of CRF, MD and PRS will be the best way to identify high-risk groups of breast cancer.

Another important finding was that we reconfirmed that BUS was comparable to MAM for breast cancer screening in high-risk groups. Although this result was similar to previous studies, the difference in this study was that the sensitivity of BUS was significantly lower than that of MAM in the high-risk groups. This difference existed not only in high-risk groups, but also in the whole population (Additional file 5). Although MAM detected more breast cancers, we also found no significant difference in early-stage breast cancer between BUS and MAM in high-risk groups. This non-significant difference in early-stage breast cancer was likely due to the small sample size, however, the absolute difference in early-stage breast cancer between BUS and MAM (50.00% vs. 61.54%, Table 5) in high-risk populations cannot be ignored. In contrast, the absolute difference in early-stage breast cancer between BUS and MAM (61.90% vs. 60.61%, Additional file 6) in the whole population was not so obvious. These results suggested that although MAM can detect more breast cancers, the MAM-detected breast cancers are inclined to be small calcified cancers that progressed slowly at an early stage, while BUS-detected breast cancers are more inclined to relatively invasive and rapidly progressing cancers. These results had also been suggested in previous studies.[49, 50] More studies with more sophisticated design are needed in the future to validate the results.

Finally, some limitations can be found in this study. First, there is no independent population to validate the screening performances of breast cancer in the four types of high-risk groups. However, it is indeed very difficult to find the validation population who had collected CRF, MD, and PRS information, received screening of CBE, BUS, and MAM, and had been followed up for a long time. Second, because there was no unscreened control group, we can only assess the potential maximum benefit of risk-reduced intervention in high-risk groups, but cannot accurately assess the true effect of screening in these high-risk groups. Third, the sample size was relatively small, so we can only observe the non-significant but obvious difference in early-stage breast cancer between BUS and MAM in high-risk groups.

In conclusion, this study once again support that accurate risk assessment integrating CRF, MD and PRS is needed to identify high-risk groups of breast cancer. Moreover, the higher the risk, the greater the benefit of the intervention. BUS was comparable to MAM for screening breast cancer in high-risk groups. More studies with more sophisticated design are needed in the future to validate the results.

MIST Multi-modality Independent Screening Trial

BMI body mass index

SNP single nucleotide polymorphisms

GWAS genome-wide association studies

CRF classical risk factors

MD mammographic density

PRS polygenic risk score

IR incidence rate

HR hazards risk

95% CIs 95% confidence intervals

ROC receiver-operating characteristic

AUC area under the receiver operating characteristic curve

O/E Observed/Expected cases

Author Contribution

Prof. Song and Prof. Chen have full access to all data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Study concept and design: Y.H, F.S., K.C. Acquisition of data: Y.H., Z.W., Z.L., H.D., Y.Z., P.L., Y.Z. Analysis and interpretation of data: Y.H., Z.W., Z.L., H.D., Y.Z., P.L., Y.Z., F.S., K.C. Drafting of the manuscript: Y.H., Z.W., Z.L., F.S., K.C. Critical revision of the manuscript for important intellectual content: Y.H., Z.W., Z.L., H.D., Y.Z., P.L., Y.Z., F.S., K.C. Obtained funding: Y.H., F.S., K.C. Administrative, technical, or material support: P.L., Y.Z., F.S., K.C. Study supervision: F.S., K.C.

Funding

This work was supported by the Chinese National Key Research and Development Project (No. 2021YFC2500400); Tianjin Municipal Health Committee Foundation (No.TJWJ2021MS008), Tianjin Science and Technology Committee Foundation (No.18JCQNJC80300).

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Ethics approval and consent to participate

The written informed consent was acquired from each participant or their guardian in MIST, and the current study was reviewed and approved by the institutional review board of Tianjin Medical University Cancer Institute and Hospital (TMUCIH).

Conflict of Interest

The authors declare that they have no competing interests.

Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F: Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2021, 71(3):209–249.
Fan L, Strasser-Weippl K, Li J, St Louis J, Finkelstein DM, Yu K, Chen W, Shao Z, Goss PE: Breast cancer in China. The Lancet Oncology 2014, 15(7):e279-e289.
Cao W, Chen HD, Yu YW, Li N, Chen WQ: Changing profiles of cancer burden worldwide and in China: a secondary analysis of the global cancer statistics 2020. Chin Med J (Engl) 2021, 134(7):783–791.
Chen W, Xia C, Zheng R, Zhou M, Lin C, Zeng H, Zhang S, Wang L, Yang Z, Sun K et al: Disparities by province, age, and sex in site-specific cancer burden attributable to 23 potentially modifiable risk factors in China: a comparative risk assessment. LANCET GLOB HEALTH 2019, 7(2):e257-e269.
Siu AL: Screening for Breast Cancer: U.S. Preventive Services Task Force Recommendation Statement. ANN INTERN MED 2016, 164(4):279–296.
Tozaki M, Kuroki Y, Kikuchi M, Kojima Y, Kubota K, Nakahara H, Ito Y, Mukai H: The Japanese Breast Cancer Society clinical practice guidelines for screening and imaging diagnosis of breast cancer, 2015 edition. BREAST CANCER-TOKYO 2016, 23(3):357–366.
Oeffinger KC, Fontham ET, Etzioni R, Herzig A, Michaelson JS, Shih YC, Walter LC, Church TR, Flowers CR, LaMonte SJ et al: Breast Cancer Screening for Women at Average Risk: 2015 Guideline Update From the American Cancer Society. JAMA 2015, 314(15):1599–1614.
World Health Organization: WHO Postition Paper on Mammography Screening. Geneva, Switzerland: WHO Press; 2014.
Hao X, Tong Z, Chen K, Wang Y, Liu P, Gu L, Liu J, Yu J, Song F, Huang Y et al: Breast cancer screening guideline for Chinese Women. CANCER BIOL MED 2019, 4(16):822–824.
Nelson HD, Fu R, Cantor A, Pappas M, Daeges M, Humphrey L: Effectiveness of Breast Cancer Screening: Systematic Review and Meta-analysis to Update the 2009 U.S. Preventive Services Task Force Recommendation. ANN INTERN MED 2016, 164(4):244–255.
Myers ER, Moorman P, Gierisch JM, Havrilesky LJ, Grimm LJ, Ghate S, Davidson B, Mongtomery RC, Crowley MJ, McCrory DC et al: Benefits and Harms of Breast Cancer Screening: A Systematic Review. JAMA 2015, 314(15):1615–1634.
Independent UK Panel on Breast Cancer Screening: The benefits and harms of breast cancer screening: an independent review. The Lancet 2012, 380(9855):1778–1786.
Huang Y, Wang H, Lyv Z, Dai H, Liu P, Zhu Y, Song F, Chen K: Development and evaluation of the screening performance of a low-cost high-risk screening strategy for breast cancer. CANCER BIOL MED 2021.
Tice JA, Miglioretti DL, Li CS, Vachon CM, Gard CC, Kerlikowske K: Breast Density and Benign Breast Disease: Risk Assessment to Identify Women at High Risk of Breast Cancer. J CLIN ONCOL 2015, 33(28):3137–3143.
van Veen EM, Brentnall AR, Byers H, Harkness EF, Astley SM, Sampson S, Howell A, Newman WG, Cuzick J, Evans D: Use of Single-Nucleotide Polymorphisms and Mammographic Density Plus Classic Risk Factors for Breast Cancer Risk Prediction. JAMA ONCOL 2018, 4(4):476–482.
Brentnall AR, Cuzick J, Buist D, Bowles E: Long-term Accuracy of Breast Cancer Risk Assessment Combining Classic Risk Factors and Breast Density. JAMA ONCOL 2018, 4(9):e180174.
Lee A, Mavaddat N, Wilcox AN, Cunningham AP, Carver T, Hartley S, Babb DVC, Izquierdo A, Simard J, Schmidt MK et al: BOADICEA: a comprehensive breast cancer risk prediction model incorporating genetic and nongenetic risk factors. GENET MED 2019, 21(8):1708–1718.
Gierach GL, Choudhury PP, Garcia-Closas M: Toward Risk-Stratified Breast Cancer Screening: Considerations for Changes in Screening Guidelines. JAMA ONCOL 2020, 6(1):31–33.
Mukama T, Kharazmi E, Xu X, Sundquist K, Sundquist J, Brenner H, Fallah M: Risk-Adapted Starting Age of Screening for Relatives of Patients With Breast Cancer. JAMA ONCOL 2020, 6(1):68–74.
Saslow D, Boetes C, Burke W, Harms S, Leach MO, Lehman CD, Morris E, Pisano E, Schnall M, Sener S et al: American Cancer Society guidelines for breast screening with MRI as an adjunct to mammography. CA Cancer J Clin 2007, 57(2):75–89.
Dai H, Yan Y, Wang P, Liu P, Cao Y, Xiong L, Luo Y, Pan T, Ma X, Wang J et al: Distribution of mammographic density and its influential factors among Chinese women. INT J EPIDEMIOL 2014, 43(4):1240–1251.
Huang Y, Song F, Chen K: [Current status of genome-wide association studies (GWAS) on breast cancer and application values of single nucleotide polymorphisms identified from GWAS]. Zhonghua Liu Xing Bing Xue Za Zhi 2015, 36(10):1058–1061.
Cai Q, Zhang B, Sung H, Low SK, Kweon SS, Lu W, Shi J, Long J, Wen W, Choi JY et al: Genome-wide association analysis in East Asians identifies breast cancer susceptibility loci at 1q32.1, 5q14.3 and 15q26.1. NAT GENET 2014, 46(8):886–890.
Long J, Cai Q, Sung H, Shi J, Zhang B, Choi JY, Wen W, Delahanty RJ, Lu W, Gao YT et al: Genome-wide association study in east Asians identifies novel susceptibility loci for breast cancer. PLOS GENET 2012, 8(2):e1002532.
Cai Q, Long J, Lu W, Qu S, Wen W, Kang D, Lee JY, Chen K, Shen H, Shen CY et al: Genome-wide association study identifies breast cancer risk variant at 10q21.2: results from the Asia Breast Cancer Consortium. HUM MOL GENET 2011, 20(24):4991–4999.
Long J, Cai Q, Shu XO, Qu S, Li C, Zheng Y, Gu K, Wang W, Xiang YB, Cheng J et al: Identification of a functional genetic variant at 16q12.1 for breast cancer risk: results from the Asia Breast Cancer Consortium. PLOS GENET 2010, 6(6):e1001002.
Zheng W, Long J, Gao YT, Li C, Zheng Y, Xiang YB, Wen W, Levy S, Deming SL, Haines JL et al: Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1. NAT GENET 2009, 41(3):324–328.
Michailidou K, Hall P, Gonzalez-Neira A, Ghoussaini M, Dennis J, Milne RL, Schmidt MK, Chang-Claude J, Bojesen SE, Bolla MK et al: Large-scale genotyping identifies 41 new loci associated with breast cancer risk. NAT GENET 2013, 45(4):353–361, 361e.
Ghoussaini M, Fletcher O, Michailidou K, Turnbull C, Schmidt MK, Dicks E, Dennis J, Wang Q, Humphreys MK, Luccarini C et al: Genome-wide association analysis identifies three new breast cancer susceptibility loci. NAT GENET 2012, 44(3):312–318.
Ahmed S, Thomas G, Ghoussaini M, Healey CS, Humphreys MK, Platte R, Morrison J, Maranian M, Pooley KA, Luben R et al: Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2. NAT GENET 2009, 41(5):585–590.
Stacey SN, Manolescu A, Sulem P, Thorlacius S, Gudjonsson SA, Jonsson GF, Jakobsdottir M, Bergthorsson JT, Gudmundsson J, Aben KK et al: Common variants on chromosome 5p12 confer susceptibility to estrogen receptor-positive breast cancer. NAT GENET 2008, 40(6):703–706.
Hunter DJ, Kraft P, Jacobs KB, Cox DG, Yeager M, Hankinson SE, Wacholder S, Wang Z, Welch R, Hutchinson A et al: A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. NAT GENET 2007, 39(7):870–874.
Zheng W, Zhang B, Cai Q, Sung H, Michailidou K, Shi J, Choi JY, Long J, Dennis J, Humphreys MK et al: Common genetic determinants of breast-cancer risk in East Asian women: a collaborative study of 23 637 breast cancer cases and 25 579 controls. HUM MOL GENET 2013, 22(12):2539–2550.
Zhang L, Han L, Huang Y, Feng Z, Wang X, Li H, Song F, Liu L, Li J, Zheng H et al: SNPs within microRNA binding sites and the prognosis of breast cancer. Aging (Albany NY) 2021, 13(5):7465–7480.
Cui P, Zhao Y, Chu X, He N, Zheng H, Han J, Song F, Chen K: SNP rs2071095 in LincRNA H19 is associated with breast cancer risk. Breast Cancer Res Treat 2018, 171(1):161–171.
Pal CP, Wilcox AN, Brook MN, Zhang Y, Ahearn T, Orr N, Coulson P, Schoemaker MJ, Jones ME, Gail MH et al: Comparative Validation of Breast Cancer Risk Prediction Models and Projections for Future Risk Stratification. J Natl Cancer Inst 2020, 112(3):278–285.
Schonfeld SJ, Pee D, Greenlee RT, Hartge P, Lacey JJ, Park Y, Schatzkin A, Visvanathan K, Pfeiffer RM: Effect of changing breast cancer incidence rates on the calibration of the Gail model. J CLIN ONCOL 2010, 28(14):2411–2417.
Brentnall AR, Harkness EF, Astley SM, Donnelly LS, Stavrinos P, Sampson S, Fox L, Sergeant JC, Harvie MN, Wilson M et al: Mammographic density adds accuracy to both the Tyrer-Cuzick and Gail breast cancer risk models in a prospective UK screening cohort. BREAST CANCER RES 2015, 17(1):147.
Zheng W, Wen W, Gao YT, Shyr Y, Zheng Y, Long J, Li G, Li C, Gu K, Cai Q et al: Genetic and clinical predictors for breast cancer risk assessment and stratification among Chinese women. J Natl Cancer Inst 2010, 102(13):972–981.
Dai J, Hu Z, Jiang Y, Shen H, Dong J, Ma H, Shen H: Breast cancer risk assessment with five independent genetic variants and two risk factors in Chinese women. BREAST CANCER RES 2012, 14(1):R17.
Han Y, Lv J, Yu C, Guo Y, Bian Z, Hu Y, Yang L, Chen Y, Du H, Zhao F et al: Development and external validation of a breast cancer absolute risk prediction model in Chinese population. BREAST CANCER RES 2021, 23(1):62.
Wen W, Shu XO, Guo X, Cai Q, Long J, Bolla MK, Michailidou K, Dennis J, Wang Q, Gao YT et al: Prediction of breast cancer risk based on common genetic variants in women of East Asian ancestry. BREAST CANCER RES 2016, 18(1):124.
Boyd NF, Martin LJ, Bronskill M, Yaffe MJ, Duric N, Minkin S: Breast tissue composition and susceptibility to breast cancer. J Natl Cancer Inst 2010, 102(16):1224–1237.
Zheng W, Wen W, Gao YT, Shyr Y, Zheng Y, Long J, Li G, Li C, Gu K, Cai Q et al: Genetic and clinical predictors for breast cancer risk assessment and stratification among Chinese women. J Natl Cancer Inst 2010, 102(13):972–981.
Dai J, Hu Z, Jiang Y, Shen H, Dong J, Ma H, Shen H: Breast cancer risk assessment with five independent genetic variants and two risk factors in Chinese women. BREAST CANCER RES 2012, 14(1):R17.
Huang Y, Wang H, Lyu Z, Dai H, Liu P, Zhu Y, Song F, Chen K: Development and evaluation of the screening performance of a low-cost high-risk screening strategy for breast cancer. CANCER BIOL MED 2021.
Hurson AN, Pal CP, Gao C, Husing A, Eriksson M, Shi M, Jones ME, Evans D, Milne RL, Gaudet MM et al: Prospective evaluation of a breast-cancer risk model integrating classical risk factors and polygenic risk in 15 cohorts from six countries. INT J EPIDEMIOL 2022, 50(6):1897–1911.
Mars N, Koskela JT, Ripatti P, Kiiskinen T, Havulinna AS, Lindbohm JV, Ahola-Olli A, Kurki M, Karjalainen J, Palta P et al: Polygenic and clinical risk scores and their impact on age at onset and prediction of cardiometabolic diseases and common cancers. NAT MED 2020, 26(4):549–557.
Berg WA, Bandos AI, Mendelson EB, Lehrer D, Jong RA, Pisano ED: Ultrasound as the Primary Screening Test for Breast Cancer: Analysis From ACRIN 6666. J Natl Cancer Inst 2016, 108(4).
Shen S, Zhou Y, Xu Y, Zhang B, Duan X, Huang R, Li B, Shi Y, Shao Z, Liao H et al: A multi-centre randomised trial comparing ultrasound vs mammography for screening breast cancer in high-risk Chinese women. Br J Cancer 2015, 112(6):998–1004.

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Screening performance in high-risk groups of breast cancer by integrating classical risk factors, mammographic density and polygenic risk

Status:

Version 1

Abstract

Figures

Introduction

Methods

Results

Long-time risk of breast cancer by baseline characteristics

Calibration And Discrimination Of Different Breast-cancer Risk Prediction Models

Breast Cancer Mortality Of Different Risk Groups With Different Risk Prediction Models

Screening Performances Of Different Screening Modalities In High-risk Groups With Different Risk Prediction Models

Discussion

Abbreviations

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1