Our analysis proceeded in four phases. First, we assembled spatial data from multiple sources and compiled them to a common analysis unit. Second, we developed an algorithm that would find the maximum urban reforestation realistically possible, given other land-use constraints. Third, we estimated the heat mitigation-related benefits of current tree canopy and of future planting scenarios, up to the maximum urban reforestation realistically possible. Benefits evaluated were avoided mortality, avoided morbidity, avoided electricity consumption, avoided release of greenhouse gases from avoided electricity consumption, and carbon sequestration in aboveground tree biomass. Fourth, we valued these benefits in monetary terms. In the following section, we discuss each of these phases in detail.
Data sources
Our study focused on all US urbanized areas larger than 500 km2 in extent38. Urbanized area is defined by the US Census Bureau as contiguous census blocks that exceed a population density threshold as well as adjacent territory that links nearby patches of high-density settlements61. There are 100 urbanized areas that are greater than 500 km2 in extent, housing 180 million people during the 2020 census. Each urbanized area contains many municipalities or other census-designated places; in total, our study area contains 5,723 such communities38. These span the range of population sizes and densities, from a large central city like New York City (8.4 million people) to small communities at the edges of metro areas with just a few thousand people. These urbanized areas also span the range of biomes in the United States, from forests to deserts to grasslands.
Our tree cover maps are taken from McDonald et al.38, which mapped tree cover for the same set of 100 urbanized areas, using the boundaries of the urbanized area set after the 2010 Census. For consistency with that dataset, we use the same boundary file for this analysis, rather than the 2020 urbanized area boundaries. Aerial photos taken during the summer growing season at approximately 2m resolution from the National Agricultural Imagery Program (NAIP) were classified into a forest/non-forest grid using Google Earth Engine’s cloud computing platform. Training data for the supervised classification came from Nowak and Greenfield67, which examined tree cover at 10,000 control points throughout urban and community areas of the United States. After creating several texture variables and band indices (e.g., NDVI), this information, along with information from the original spectral bands, was classified using a random forests classifier built off the training data. Pixel-level classification accuracy varies by biome but averages around 75%. In this study, we use tree cover estimates aggregated up to the US Census block level. Estimates at this scale were shown to be extremely accurate (R2 = 0.94) compared to an external validation dataset. Much more info on our classification methodology is available in McDonald et al.38.
Our information on demography and socio-economic status comes from the US Census Bureau’s 2020 decennial census, downloaded from National Historical Geographic Information System (NHGIS)68. Population and its breakdown by race and ethnicity were available at the Census block level. Details of the GIS analysis and processing of this data can be found in McDonald et al.38, which used the 2010 Census but otherwise had similar processing of Census data.
Impervious surface data were taken from the National Land Cover Database 2019 product69. This dataset was derived from Landsat imagery, and measures as a continuous variable the percent impervious surface in each pixel. While 30m data on impervious surface cover is coarser than our forest cover map, the fact that there is a continuous estimate of imperviousness helps address sub-pixel heterogeneity.
Land surface temperature (LST) was derived from the Collection 2 LST science product from Landsat 8 at 100 m resolution. Pixel level quality flags were used to remove cloud contaminated pixels, following the methodology of Chakraborty et al. 202250. All data between 2016 and 2020 were accessed through the Google Earth Engine platform70 to create mean summer (June, July, August; JJA) LST composites. This 5-year period averages over year-to-year variability in LST, while also being long enough to ensure that most regions have usable, cloud-free pixels for which LST can be estimated.
Maximum urban reforestation and tree planting scenarios
Next, we defined an algorithm to find the maximum urban reforestation realistically possible, given other land-use and climatic constraints. Conceptually, we wanted to consider two constraints: a physical constraint (trees can generally only be planted in non-impervious surfaces, excluding rare landscape features such as planter boxes) and a social/political/climatic constraint (planting trees is constrained by numerous other considerations, including competing land uses, landowner preferences, zoning and building codes, and climatic conditions).
Tree cover is negatively correlated with impervious surface cover38. We divided the landscape into five impervious surface categories (0–20%, 20–40%, 40–60%, 60–80%, 80–100%) to reflect the different landscape contexts from lightly settled suburbs to dense urban core. Our general strategy was to model different scenarios, in each US Census block, where tree cover was increased by intervals of 5% (e.g., from 5–10%) potentially up to the maximum possible where all non-impervious land surfaces were greened. However, tree planting was stopped when there were social/political constraints. Social/political/climatic constraints were defined at the 90th percentile of observed tree cover in each impervious surface category for each urbanized area. In other words, tree planting was halted at the 90th percentile of tree cover for sites in that urbanized area with similar landscape contexts (imperviousness). This approach implicitly accounts for differences among cities (e.g., in climate), as well as urban form and zoning and building codes.
Once the maximum possible urban reforestation was estimated at the Census block level, we constructed a set of scenarios for all our sample cities, each aiming to hit a certain nominal increase in tree cover. So, for instance, the 5% increase planting scenario aimed to increase tree cover in each census block by 5%. If this nominal increase exceeded the maximum possible urban tree cover increase for a census block, then tree cover for that census block was set at the maximum possible urban tree cover amount instead. We constructed planting scenarios with increases of 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, and Maximum (i.e., all census blocks set at the maximum possible urban reforestation amount), respectively.
Note that an increase in tree canopy cover, as seen from directly overhead a city and above the tree canopy, necessarily decreases other land covers, since total land cover in a region must sum to 100%. To account for this in our planting scenarios, we made the simplest possible assumption, that additional tree cover occurs randomly over other land uses, in proportion to their cover before planting. For instance, if in an impervious surface category in a particular city, 25% of the area not already in tree cover was impervious and 75% was in sparse vegetation, we assumed that any tree canopy increases would occur 25% over impervious surfaces and 75% over sparse vegetation, displacing these land cover types accordingly.
Finally, we calculated the number of new stems that would need to be planted to achieve this increase in tree canopy, based on the average canopy size of 19.6 m2/stem derived from Nowak and Greenfield71. Note that planted trees grow their tree canopy over time, with some tree species in mesic environments taking 3–5 years to reach 5m height and diameter and more than 10 years to reach 10m height and diameter72. Our tree planting scenarios assume spacing appropriate for adult trees, and we are estimating benefits for the tree canopy associated with those adult trees.
Estimating benefits
Values of average annual net carbon sequestration per m2 of urban tree canopy were taken from Nowak et al.73. They assembled data on tree growth rates and carbon accumulation in tree biomass and subtracted out GHG emissions during planting and maintenance to estimate there was 0.226 kg C annual average net carbon sequestration per m2 of canopy.
To estimate the likely reduction in summer air temperature that would occur due to urban reforestation, we used a two-step regression approach, similar to that of Zhang et al.74. First, we statistically related impervious cover and urban tree cover to LST at the census block level. Urbanized areas were divided into two climatic groups, mesic and xeric, based on biomes (see McDonald et al. 27 for details). For each climatic group, a linear regression was conducted using PROC GLM in SAS, relating LST (the dependent variable) to both impervious cover and urban tree cover. A fixed effect for urbanized area was also included. To avoid potential issues of spatial autocorrelation, we take a sparse sample of 10,000 census blocks out of the 2.3 million census blocks in our sample area. The sparse sample represents 0.4% of the total number of Census blocks and ensures that census blocks are on average quite far from one another and are thus more likely to be independent statistically in terms of the dependent variable, LST (i.e., any errors in estimation of LST in one census block are likely to be spatially uncorrelated with errors in other census blocks). Five observations were dropped due to missing data. Results from our regression are shown in Table 5. The R2 for xeric regions was 0.85, while that for mesic regions was 0.76.
Table 5
Model parameters from regression relating LST to tree cover and impervious surface cover. Separate regression analyses were undertaken for mesic and arid cities. A fixed effect for urbanized area (NAME) was included.
Census blocks in mesic cities (N = 8498 with complete data) |
---|
Variable | Type III SS | Mean SS | F value | P | Estimate | SE |
Tree cover (fraction) | 1443 | 1443 | 310 | < 0.001 | -2.88 | 0.16 |
Impervious surface cover (fraction) | 15684 | 15684 | 3367 | < 0.001 | 9.06 | 0.16 |
NAME (N = 86) | 68869 | 810 | 174 | < 0.001 | multiple | |
Census blocks in arid cities (N = 1497 with complete data) |
Variable | Type III SS | Mean SS | F value | P | Estimate | SE |
Tree cover (fraction) | 1031 | 1031 | 212 | < 0.001 | -8.06 | 0.55 |
Impervious surface cover (fraction) | 327 | 327 | 67 | < 0.001 | 2.78 | 0.34 |
NAME (N = 14) | 35906 | 2762 | 567 | < 0.001 | multiple | |
Second, we statistically related air temperature information with LST. The air temperature data came from the Global Historical Climatology Network75 for summer (JJA) average daily mean (see McDonald et al.27 for details on data handling and processing). There were 423 air temperature stations within our study area. A linear regression was conducted using PROC GLM in SAS, relating air temperature (the dependent variable) to LST. The fitted slope was allowed to vary by biome (i.e., biome is included as a fixed effect in interaction with LST in the model). A fixed effect for urbanized area was also included. The number of air temperature sensors is relatively small, they are measured independently by different machines, and they are relatively far from one another, so we did not suspect there would be spatial autocorrelation in the dependent variable after accounting for LST. Results from our regression are shown in Table 6. The R2 for this regression was 0.49, lower than for our statistical model explaining LST, perhaps reflecting the fact that there is a broad variety of factors that affect air temperature beyond the local surface temperature76. Regardless, we can propagate any uncertainty in estimating air temperature in our analysis, which is an advantage of our statistical approach as opposed to more mechanistic models.
Table 6
Model parameters from regression relating air temperature to LST. Regression slopes were allowed to vary by the biome in which an urbanized area is located. The overall regression also included an intercept term, and was highly significant (F = 57.8, P < 0.001).
Census blocks with air sensors (N = 423) |
---|
Biomes | Estimate | SE |
Temperate Broadleaf and Mixed Forests | 0.33 | 0.030 |
Temperate Coniferous Forests | 0.35 | 0.030 |
Tropical and subtropical grasslands, savannas, and shrublands | 0.40 | 0.033 |
Temperate Grasslands, Savannas, and Shrublands | 0.36 | 0.028 |
Flooded Grasslands and Savannas | 0.46 | 0.034 |
Mediterranean Forests, Woodlands, and Scrub | 0.25 | 0.026 |
Deserts and Xeric Shrublands | 0.35 | 0.023 |
Note that therefore, the air temperature impacts of our tree planting scenarios are a function of multiple factors: the amount of plantable area in a Census block; the fraction of impervious surface before planting, which determines how much impervious surface cover is reduced through planting; the aridity and biome of an urbanized area, and an urbanized area fixed effect. We summarize the effect of multiple factors in Table S1, which lists (by urbanized area and impervious surface category) the realized change in tree cover and impervious surfaces for the 5% target scenario, as well as the estimated change in LST and air temperature. Not surprisingly, the greatest decline in impervious surface area with planting is generally in impervious surface category 5 (defined as neighborhoods with 80–100% impervious surface cover). The only exception is when there is not enough plantable area in impervious category 5 neighborhoods to approach the 5% target for tree canopy increase. Conversely, the smallest decline in impervious surface area with planting is generally in impervious surface category 1 (defined as neighborhoods with 0–20% impervious surface cover). For this reason, the greatest decline in LST occurs in impervious category 5 while there are lesser declines in impervious category 1, with similar patterns across climate variables (aridity and biome). Asheville, NC, has the greatest decline in LST in impervious category 5 (0.49 ⁰C). Patterns for air temperature declines are similarly generally greater in impervious category 5 and less in impervious category 1, with similar patterns across climate variables (aridity and biome). Bonita Spring, FL, has the greatest decline in air temperature in category 5 (0.21 ⁰C). Finally, note that we estimate statistically the change in LST and air temperature at the scale of a US census block, which includes a variety of land-use types. This makes our estimates of temperature impact per unit increase in tree cover different from (and lower in magnitude than) the estimates of tree cooling efficiency of Zhao et al.77, who used Landsat-based estimates of tree cover and LST to quantify change in LST for those pixels with tree canopy.
This study focuses on tree canopy and its benefits relative to the race/ethnicity of neighborhoods. Within our sample of cities, race/ethnicity is correlated to the impervious surface category (Table 7), with most neighborhoods in categories 4 and 5 being majority POC, while most neighborhoods in categories 1 and 2 are white. This trend is discussed extensively in McDonald et al.38, and has to do with low-income households, which are more likely to be POC, being more frequently located in US Census blocks with high population density and high impervious surface cover in city centers rather than less dense suburbs. Race/ethnicity is also correlated with aridity, as the arid southwest of the US has a larger fraction of POC in all impervious categories than does the mesic region of the country. This trend primarily has to do with people of Hispanic origin, who are counted in the POC category for our study. Since impervious surface category and aridity are two of the explanatory variables used to predict the impact of tree canopy increases on air temperature, the correlation of race/ethnicity with these variables necessarily implies that the impact of tree canopy increases on air temperature varies with respect to race/ethnicity.
Table 7
Race/ethnicity composition of our sample, relative to impervious surface and aridity. Population totals, in millions, are shown. Percentage of population within each aridity category and each impervious surface category are shown in parentheses. For instance, in arid urbanized areas, there are 2.7 million POC in impervious surface category 5, 78% of the total population of arid cities in this impervious surface category.
| Arid population, millions (%) | Mesic population, millions (%) | |
---|
Impervious surface category | White | POC | White | POC | Total population, millions |
---|
1 (0–20%) | 0.7 (54%) | 0.6 (46%) | 15.6 (79%) | 4.1 (21%) | 21.0 |
2 (20–40%) | 2.0 (54%) | 1.6 (46%) | 25.9 (66%) | 13.4 (34%) | 42.9 |
3 (40–60%) | 4.5 (36%) | 8.1 (64%) | 22.9 (52%) | 20.7 (48%) | 56.2 |
4 (60–80%) | 3.0 (22%) | 10.8 (78%) | 10.6 (38%) | 17.2 (62%) | 41.8 |
5 (80–100%) | 0.8 (22%) | 2.7 (78%) | 5.2 (36%) | 9.2 (64%) | 17.8 |
Overall categories | (32%) | (68%) | (55%) | (45%) | |
To estimate the health impacts of an estimated change in air temperature due to tree canopy expansion, we follow the methodology of McDonald et al.27. For estimating avoided mortality, we use published epidemiological studies that relate changes in air temperatures to changes in mortality over time. Specifically, we use Bobb et al.78, who estimated heat-mortality relationship for 105 US cities. In this study, we use Bobb et al.’s regional estimates of the heat-mortality response curve. For morbidity estimation, our analysis is based on Gronlund et al.79, who estimated heat-related hospitalizations as a function of air temperature for a large population in the US, and then scaled from the Gronlund et al. numbers to estimate emergency department visits and doctor’s office visits27.
We estimated avoided electricity consumption due to tree cover, following the methodology in McDonald et al.27. We base our analysis off Santamouris et al. 41, a literature review that collected empirical estimates of increases in electricity use during periods of high air temperature. We subset the results in Table 1 of their paper to just look at US studies, which gave a range of 2.9–8.5% increase in electricity consumption per 1°C increase in air temperature. We assume that the shift in summer daily mean temperatures we modeled would only increase electricity consumption for space cooling during summer months (1/4 of the year). From this assumption, we calculated an estimated annual increase in electricity consumption, assuming no increase in other seasons. We then multiplied percent increase in electricity consumption by city-level data on residential electricity use. To estimate avoided GHG emissions due to avoided electricity consumption, we multiplied avoided electricity consumption by the average carbon intensity of US electricity generation, taken from EPA data.
Finally, when calculating statistics at the level of urbanized areas or nationally, we used population-weighted statistics, to give greater weight to Census Blocks with greater population.
Valuation and extrapolation
Our valuation methodology follows McDonald et al. 27 , and is extensively described in the supplementary methods section of that paper. To briefly summarize, for avoided mortality we use a value of a statistical life approach (VSL), adjusted for remaining life years e.g., 80,81 , using a range in the US of $5.4-13.4M (2015$) 82 . Most heat-related deaths occur in people 55 years or older 80,83 , and so we apply Murphy and Topel’s 81 life cycle shape for the value of a life-year for 11 age classes to calculate an overall average, age-weighted VSL, accounting for the age distribution of mortality from heat-related events, of $5.7M (2015$, range: $3.3-8.2M).
For morbidity, we use a cost-of-illness (COI) approach to assess the economic burden associated with heat-related illnesses (HRI) that were avoided due to tree cover. Economic calculations were done separately for avoided emergency department (ED) visits84, avoided hospitalizations85, and outpatient visits86. We also estimated lost work productivity as the product of total work loss days (from HRI-related hospitalizations, ED visits and outpatient visits) and the average US daily earnings rate (see McDonald et al.27 for details).
For electricity consumption, data on average household residential electricity consumption and average cost per KWh, by electric utility, were taken from the US Energy Information Administration (EIA, 2016) form EIA-861, submitted by electric utilities annually.
For carbon sequestration and avoided GHG emissions due to avoided electricity consumption, we used the new EPA proposed Social Cost of Carbon (SCC) for 2020 of USD 190 expressed at 2020 prices, assuming a 2% discount rate87. This estimate follows that of recent science by Rennert et al.88. The SCC measures the total (global) estimated economic damages, expressed in monetary terms, that result from one additional ton of carbon dioxide in the atmosphere. These damages from increased atmospheric CO2 concentrations are completely separate from, and fully additional to, the local damages from health and electricity use impacts that increases in urban tree canopy avoid via their local cooling effects. Where necessary, all economic values presented in this paper were standardized to USD in 2022, using the US Bureau of Labor Statistics Consumer Price Index89.