Characteristics of the Participants
There was a total of 14604 subjects, comprising 7124 lung cancer cases and 7480 healthy controls included in the study. Males accounted 56.48% (4103 cases for the case group, and 4145 cases for the control group) and females 43.52% (3021 cases for the case group, and 3335 cases for the control group) of total participants. In age group, the proportion of cases and control was higher in 60-70 years old than other groups.
Participants who were younger than 40 years old had a lower probability of developing lung cancer (OR, 0.397; 95% CI, 0.344-0.459). For sex, male had a higher probability of developing lung cancer (OR, 1.093; 95% CI, 1.024-1.167) than female. Unmarried cut their risk for lung cancer (OR, 0.377; 95% CI, 0.288-0.492) while married increased their risk for lung cancer (OR, 1.397; 95% CI, 1.172-1.664). Lung related diseases enabled the reduction of lung cancer risk (OR, 0.351;95% CI, 0.311-0.396). About smoking status, the result showed that participants who were smoking (OR, 1.843; 95% CI, 1.710-1.986) and smoking quit (OR, 2.584; 95% CI, 2.230-2.995) had a higher risk of lung cancer than never smoking participants. As for alcohol consumption, the findings showed that participants who had been drinking consistently had a 79.4% increased risk of lung cancer (OR, 1.794; 95% CI, 1.654-1.947) compared with never drinking. A family history of cancer multiplied their risk for lung cancer (OR, 2.824; 95% CI, 2.562-3.113) visibly. Corporate personnel and production/service workers were at lower risk of developing lung cancer than agriculture/fishing workers. (Table 1)
Table 1 General characteristics of lung cancer case group and control group
|
Factor
|
Cases N(%)
|
Control N(%)
|
P-value
|
OR
|
95%CI
|
Age
|
|
|
0.000
|
|
|
|
<40
|
349(4.9)
|
879(11.8)
|
0.000
|
0.397
|
0.344
|
0.459
|
40-50
|
1696(23.8)
|
1569(21.0)
|
0.132
|
1.081
|
0.977
|
1.196
|
50-60
|
1786(25.1)
|
1739(23.2)
|
0.600
|
1.027
|
0.930
|
1.135
|
60-70
|
1909(26.8)
|
1909(25.5)
|
1.000
|
1.000
|
0.907
|
1.103
|
≥70
|
1384(19.4)
|
1384(18.5)
|
|
1.00
|
reference
|
|
Sex
|
|
|
0.000
|
|
|
|
Male
|
4103(57.6)
|
4145(55.4)
|
|
1.093
|
1.024
|
1.167
|
Female
|
3021(42.4)
|
3335(44.6)
|
|
1.00
|
reference
|
|
Marriage
|
|
|
0.000
|
|
|
|
Unmarried
|
115(1.7)
|
424(5.7)
|
0.000
|
0.377
|
0.288
|
0.492
|
Married
|
6785(95.2)
|
6745(90.2)
|
0.000
|
1.397
|
1.172
|
1.664
|
Widowed/Divorce
|
224(3.1)
|
311(4.1)
|
|
1
|
reference
|
|
Related disease
|
|
|
0.000
|
|
|
|
Yes
|
394(5.6)
|
1070(14.3)
|
|
0.351
|
0.311
|
0.396
|
No
|
6730(94.4)
|
6410(85.7)
|
|
1.00
|
reference
|
|
Smoking situation
|
|
|
0.000
|
|
|
|
Smoking
|
2312(32.5)
|
1641(21.9)
|
0.000
|
1.843
|
1.710
|
1.986
|
Smoking Quit
|
567(7.9)
|
287(3.8)
|
0.000
|
2.584
|
2.230
|
2.995
|
Never smoke
|
4245(59.6)
|
5552(74.2)
|
|
1.00
|
reference
|
|
Alcohol consumption
|
|
|
0.000
|
|
|
|
Yes
|
1823(25.6)
|
1203(16.1)
|
|
1.794
|
1.654
|
1.947
|
No
|
5301(74.4)
|
6277(83.9)
|
|
1.00
|
reference
|
|
Family history of cancer
|
|
|
0.000
|
|
|
|
Yes
|
1541(21.6)
|
666(8.9)
|
|
2.824
|
2.562
|
3.113
|
No
|
5583(78.4)
|
6814(91.1)
|
|
1.00
|
reference
|
|
Profession
|
|
|
0.000
|
|
|
|
Corporate personnel
|
717(10.1)
|
3943(52.7)
|
0.000
|
0.334
|
0.301
|
0.372
|
Production/Service
|
3431(48.2)
|
2056(27.5)
|
0.000
|
0.601
|
0.559
|
0.646
|
Agriculture/fishing
|
2976(41.8)
|
1481(19.8)
|
|
1.00
|
reference
|
|
Five-year average PM2.5
|
7124(48.8%)
|
7480(51.2%)
|
0.000
|
1.060
|
1.057
|
1.062
|
Multivariate Analysis of Risk Factors
It showed the association between risk factors and lung cancer in table 2. Marital status of unmarried (OR, 0.678; 95% CI, 0.481-0.956), males (OR, 0.496; 95% CI,0.453-0.543), people with a history of lung-related disease (OR, 0.394; 95% CI, 0.341-0.456), corporate personnel and production (OR, 0.421; 95% CI, 0.371-0.478)/service personnel (OR, 0.784; 95% CI, 0.719-0.854) were protective factors for lung cancer. People younger than 40 years old (OR, 1.767; 95% CI,1.450-2.153) and 40-50 years old (OR, 1.630; 95% CI,1.436-1.850), people who were smoking (OR, 2.630; 95% CI, 2.364-2.925) and quit smoking (OR, 4.232; 95% CI, 3.505-5.110), people who had been drinking consistently (OR, 1.295; 95% CI,1.156-1.451), people with family history of cancer (OR, 2.785; 95% CI,2.462-3.150) and PM2.5 pollution exposure (OR, 1.062; 95% CI,1.059-1.065) were proven to be a risk factor for lung cancer.
We analyzed the impact of different variables on lung cancer stratifying by sex, smoking status, and air pollution. By sex, notable results were that there were big differences in smoking status for male and female. It may be due to the low smoking rate among Chinese women, persistent smoking (OR, 3.286; 95% CI, 2.918-3.700) and smoking quit (OR, 4.029; 95% CI, 3.328-4.878) were risk factors for lung cancer in men, while only smoking quit after smoking was a risk factor for lung cancer in women (OR, 6.242; 95% CI, 2.971-13.112), and the impact on women was greater than that in men. Consistent alcohol consumption added risk for lung cancer in men, but not in women. Men and women with a family history of cancer had 3.087-fold and 2.472-fold higher risk of developing lung cancer, respectively. PM2.5 pollution exposure was a risk factor for lung cancer in both men (OR, 1.055; 95% CI, 1.051-1.058) and women (OR, 1.072; 95% CI, 1.067-1.076). In addition, a history of lung-related disease was a risk factor for lung cancer in female (OR, 2.631; 95% CI, 1.853-3.737) (Figure 2).
By smoking status, male was risk factor for lung cancer in never smokers. Consistent alcohol consumption added risk for lung cancer in never smokers. Never smokers with a family history of cancer had 3.535-fold higher risk of developing lung cancer than ever smokers with 1.462-fold. Among never smokers, a history of lung-related disease was a risk factor for lung cancer. Obviously, the combined effects of PM2.5 pollution exposure and ever smoking aggravated the incidence of lung cancer (OR, 1.063; 95% CI, 1.058-1.0.68) (Figure 3).
According to air pollution, lung cancer risk factors are completely different in lightly and heavily polluted areas. In lightly polluted areas, a history of lung-related disease was a risk factor for lung cancer. In heavily polluted areas, male, consistent alcohol consumption, a family history of cancer, ever smokers and smoking quit were all risk factors for lung cancer (Figure 4).
TABLE 2 Estimated risks of lung cancer associated with multivariate factors
|
Factor
|
P-value
|
OR
|
95%CI
|
|
|
|
|
Age
|
0.000
|
|
|
|
<40
|
0.000
|
1.767
|
1.450
|
2.153
|
40-50
|
0.000
|
1.630
|
1.436
|
1.850
|
50-60
|
0.000
|
1.287
|
1.140
|
1.453
|
60-70
|
0.000
|
1.268
|
1.128
|
1.426
|
≥70
|
|
1.00
|
reference
|
|
Sex
|
0.000
|
|
|
|
Male
|
|
0.496
|
0.453
|
0.543
|
Female
|
|
1.00
|
reference
|
|
Marriage
|
0.000
|
|
|
|
Unmarried
|
0.027
|
0.678
|
0.481
|
0.956
|
Married
|
0.001
|
1.463
|
1.178
|
1.816
|
Widowed/Divorce
|
|
1.00
|
reference
|
|
Related disease
|
|
|
|
|
Yes
|
0.000
|
0.394
|
0.341
|
0.456
|
No
|
|
1.00
|
reference
|
|
Smoking situation
|
0.000
|
|
|
|
Smoking
|
0.000
|
2.630
|
2.364
|
2.925
|
Smoking Quit
|
0.000
|
4.232
|
3.505
|
5.110
|
Never smoke
|
|
1.00
|
reference
|
|
Alcohol consumption
|
0.000
|
|
|
|
Yes
|
0.000
|
1.295
|
1.156
|
1.451
|
No
|
|
1.00
|
reference
|
|
Family history of cancer
|
0.000
|
|
|
|
Yes
|
0.000
|
2.785
|
2.462
|
3.150
|
No
|
|
1.00
|
reference
|
|
Profession
|
0.000
|
|
|
|
Corporate personnel
|
0.000
|
0.421
|
0.371
|
0.478
|
Production/Service
|
0.000
|
0.784
|
0.719
|
0.854
|
Agriculture/fishing
|
|
1.00
|
reference
|
|
Five-year average PM2.5
|
0.000
|
1.062
|
1.059
|
1.065
|
Lung cancer nomogram
A nomogram was developed based on the significant factors identified in the log-rank model. In the nomogram estimation system, a weighted point value was attributed to each factor that implied a contribution to the lung cancer. We found that participants with higher scores had a higher chance of developing lung cancer than that observed in those with lower scores. The result showed that PM2.5 was the largest contributor to the occurrence of lung cancer, with the increase of PM2.5, the greater the possibility of lung cancer occurrence. The final nomogram model was developed to predict lung cancer probability and the calibration curve for the probability of lung cancer in the model was good. The C-statistic of the nomogram to predict lung cancer was 0.811 (95%CI, 0.805- 0.818) (Figure 5).