Cardiovascular disease (CVD) is a leading cause of death both in California and the United States.The environment problems and source of pollutions have been shown to increase the risk of CVD rate. PM2.5 and other air pollutants such as ozone, nitrogen dioxide, and sulfur dioxide have been considered in the development of health-based standards. Moreover, people living in communities that were identified as “disadvantage” by California Environmental Protection Agency (CalEPA) are more vulnerable to the effects of pollution than others.
The main purpose of this project is to evaluate the association between pollution burden and CVD rate while accounting for community’s vulnerability to this association in all California counties.
The data set was downloaded from (https://data.ca.gov/dataset/calenviroscreen-3-0-results) through API. Once downloaded, the desired information was extracted and formed into a data table. This is a data set including environmental, health, and socioeconomic information of all communities in California State. The key independent variables that were examined in this study were PM2.5, ozone, traffic density, cardiovascular disease rate and community category.
An modified Poisson regression model was used to estimate the association of pollutants and cardiovascular disease rate in all communities of California State. Analyses were performed by adjusting for community category confounding factor. Further model fit assessment was tested by Person chi-square test. The overdispersion of the model was also tested to determine if the variance was larger than what would be expected under a Poisson distribution. Negative binomial regression model was used to address overdispersion issue. P-values< 0.05 were considered statistically significant.
PM2.5 and ozone, are main particles considered in the study. Pollution score is an averaged pollution burden of environmental effects for each county. It is calculated as the average percentiles,the percentile represents a relative score for the indicator, of the environmental effects indicators. The county with higher score therefore has relatively high pollution burdens. In Figure 1, PM2.5 and ozone are positively related with CVD rate. The pollution score plot is also showing a positive correlation with CVD rate.
Figure 2: State map depict the pollution score and CVD rate data of each county in California.
Counties with high pollution score is consistent with higher CVD rate. For example, Madera, Fresno, Los Angeles, San Joaquin, San Bernadino and Riverside counties have higher pollution score as well as a high CVD rate( Figure 2 ). However, there are some inconsistency. For instance, the northern counties of state shows lower pollution burden but have relatively higher CVD rate. This suggests that other life characteristics and factors may contribute to their high CVD rate.
County | Pollution Score | Disadvantaged Community |
---|---|---|
Fresno | 7.483057 | 0.5979899 |
Stanislaus | 7.286043 | 0.6489362 |
Tulare | 7.245979 | 0.4487179 |
San Bernardino | 7.217753 | 0.4254743 |
Kern | 7.182617 | 0.4503311 |
Kings | 6.997034 | 0.4074074 |
Los Angeles | 6.976374 | 0.4430218 |
Merced | 6.913608 | 0.7551020 |
San Joaquin | 6.498406 | 0.5107914 |
Riverside | 6.470632 | 0.2229581 |
Figure 3: Figure 3 lists top 10 counties with the higher pollution score. Fresno county has the highest pollution burden and is approximately 60% disadvantaged communities in this county. A high pollution burden is not exactly positive related with high proportion of disadvantaged communities of the county.
Among the top 10 cities with high pollution burden, 90% of them are disadvantaged communities. CalEPA classifies communities as “disadvantaged” due to the environmental conditions and vulnerability of people living in those communities. Figure3 and table1 both indicate that living in an disadvantaged community increase the risk of CVD rate.
Figure 4 shows the association between CVD rate and pollution burden by community category. The average pollution burden in disadvantaged communities is obviously higher than non-disadvantage communities. However, the relationship between CVD rate and pollution burden is not as obvious as in Figure 1.
Traffic density level (vehicles-km/hr/km) | PM2.5 (ug/m3) | Ozone (ppm) |
---|---|---|
[22.4,442] | 9.975939 | 0.0489529 |
(442,700] | 10.304066 | 0.0469590 |
(700,1.19e+03] | 10.572666 | 0.0471831 |
(1.19e+03,4.57e+04] | 10.618026 | 0.0465895 |
Table 3: Traffic density vs PM2.5 and Ozone
The lowest traffic density level was 22.4-442 (vehicles-km/hr/km).The mean PM2.5 and ozone value associated with the lowest traffic density were 9.98(ug/m3) and 0.049(ppm), respectively. The highest traffic density level was associated the highest PM2.5 value 10.62(ug/m3). Thus, increased PM2.5 was associated with high traffic density level.
Unadjusted Model | Negative binomial Model | |||||
---|---|---|---|---|---|---|
Predictors | Incidence Rate Ratios | Conf. Int (95%) | P-Value | Incidence Rate Ratios | Conf. Int (95%) | P-Value |
(Intercept) | 1.53 | 1.42 – 1.66 | <0.001 | 1.74 | 1.60 – 1.89 | <0.001 |
community | 1.35 | 1.31 – 1.39 | <0.001 | |||
ozone_ctr | 2.27 | 2.15 – 2.40 | <0.001 | 2.21 | 2.08 – 2.35 | <0.001 |
PM2.5 | 1.01 | 1.01 – 1.02 | <0.001 | 1.00 | 0.99 – 1.00 | 0.463 |
PM2.5:traffic.q4(1.19e+03,4.57e+04] | 0.96 | 0.94 – 0.97 | <0.001 | 0.93 | 0.92 – 0.95 | <0.001 |
PM2.5:traffic.q4(442,700] | 0.99 | 0.98 – 1.00 | 0.173 | 0.99 | 0.98 – 1.00 | 0.043 |
PM2.5:traffic.q4(700,1.19e+03] | 0.98 | 0.97 – 1.00 | 0.008 | 0.97 | 0.96 – 0.98 | <0.001 |
traffic.q4: [22.4,442] | Reference | Reference | ||||
traffic.q4: (442,700] | 1.07 | 0.96 – 1.19 | 0.214 | 1.11 | 0.99 – 1.24 | 0.080 |
traffic.q4: (700,1.19e+03] |
1.16 | 1.02 – 1.32 | 0.025 | 1.30 | 1.13 – 1.49 | <0.001 |
traffic.q4: (1.19e+03,4.57e+04] |
1.47 | 1.24 – 1.75 | <0.001 | 1.84 | 1.53 – 2.21 | <0.001 |
Observations | 7951 | 7951 | ||||
R2 Nagelkerke | 0.184 | 0.239 |
Table 5: Adjusted association between the pollution and CVD rate.
After checking overdispersion, there was an 1.14 (P<0.001) overdispersion term in the Poisson model. A negative binomial regression model was performed to address overdispersion. In the final negative binomial model, PM2.5 was not statistically associated with CVD rate(p=0.463) compared to unadjusted model(p<0.001), and the exposure had the similar CVD rate as non-exposure(95% CI 0.99,1.00;p=0.463). Every one-unit ozone exposure is associated with 2.21 times the CVD rate(95% CI 2.08,2.35;p<0.001), while 2.27 times before adjusting for community. The highest traffic density exposure was associated with 1.84 times CVD rate (95% CI 1.53,2.21;p<0.001) which increased 25% compared to unadjustment model (95% CI 1.24,1.75;p<0.001). Moreover, there is a significant interaction between PM2.5 and traffic density,and increased PM2.5 was associated with high traffic density level( table3 ).
Pollutants such PM2.5, Ozone and traffic were statistically significantly associated with CVD rate and living in an disadvantaged community increases risk of CVD rate. The baseline of CVD rate after adjustment of community category increased 13%. The CVD rate in highest traffic density level increased 25% compared to unadjusted model. Although the CVD rate associated with PM2.5, and ozone exposure did not change much after adjustment, the people with ozone exposure still had 2.21 times the rate of CVD than the baseline. PM2.5 had a significant interaction with traffic density level. Thus, improvements in air quality would be helpful to reduce overall CVD prevalence across California. In addition, efforts to identify the pollution source that accounting for a community’s vulnerability would beneficial to those disadvantaged communities which suffered most from CVD occurrence.