Study area
We examined data from 121 ‘communes’ or cities in Chile, which are distributed across 4,200 km from north to south. Latitudes of our study area range from 17°S (Arica) to 56°S of latitude (Cabo de Hornos), and altitudes range from 8 m a.s.l. (Pacific Ocean coast) to 3,962 m a.s.l. (San Pedro de Atacama, Andean mountain range). The study area covers the following five climatic zones: (i) desert (17°30' - 26°00’S), (ii) semiarid (26°00’-32°00’), (iii) mediterranean (32°00’-39°00’), (iv) marine west coast (39°00’ - 44°00’S) and (v) tundra (44°00 - 56°00’S); the only climatic zone excluded from the study was the ice sheet (located in the highest areas of the Andes mountain range) because of the absence of human population. In terms of macroclimates, ca. 41% of the country is temperate, 31% arid and the remaining 28% has a polar climate29.
Chilean population is 19.11 million inhabitants, of which 51% are women and 49% men. Life expectancies are 83 (women) and 78 (men) years old; 68.7% of the population is between 15-64 years old and 11.9% over 65 years old. The 88% of inhabitants live in urban areas and the estimated international migration rate is 12 per thousand inhabitants. The 13% of the population belongs to indigenous or native groups; 80% Mapuche, 7% Aymara and 4% Diaguita38. The population is aging as a result of the decline in fertility and the increased life expectancy32.
Chile has 16 administrative regions29,39, of which the Metropolitana region concentrates the largest population (7.1 million inhabitants), followed by the Valparaiso region (1.8 million inhabitants). In contrast, the Aysén and Magallanes regions, located in the southern extreme of Chile, have the smallest population (<200,000 inhabitants). Inhabitants > 65 years old mainly inhabit the areas with mediterranean climate in the cities of Santiago, Valparaíso and Concepción, and correspond to 6.28% of the total employed inhabitants in the country40. By 2050 it is projected that total population size reaches 21.6 million (i.e., an increase of 15.3% compared to 2020) under assumptions of birth and immigration surpassing mortality and emigration, with inhabitants > 65 years old predicted to exceed 3 million (25% of the population)32.
COVID-19 transmission data and predictive variables
We characterized the COVID-19 transmission in Chile from February 23 to April 16, 2020, based on two variables: (i) mean absolute infection rate (i.e., number of infected inhabitants per week), and (ii) mean relative infection rate (i.e., the former variable divided by population size). Data were obtained from official sources of the Government of Chile 41. We extracted daily climatic data from the databases of 121 meteorological stations in Chile42 corresponding to cities with presence of COVID-19, for the same period; these data were averaged per week to make them comparable with variables quantifying the disease transmission. The climatic variables extracted were the following: average, maximum and minimum atmospheric temperature (°C); relative (%) and absolute (g m-3) humidity, accumulated precipitation (mm), atmospheric pressure (mbar), ultraviolet solar radiation (Mj m-²) and wind speed (km h-1). Additionally, we obtained data for other relevant environmental, demographic and geographic variables as follows: air pollutant data, including particulate matter with aerodynamic diameter ≤10 μm (PM10) and ≤2.5 μm (PM2.5), obtained from a database of 30 air quality stations43; and city area (km-2), population size (ind), population density (ind km -2), latitude (absolute degrees), longitude (absolute degrees) and altitude (m a.s.l.), obtained from CONAF44 and IDE Chile45.
Statistical analyses
We examined all pairwise relationships between predictive variables with Pearson correlation coefficients [pairs.panels function, psych package (Revelle, 2016); R statistical software46, and discarded variables showing high correlations with others (r > 0.5) for further analyses. Air pollution data were also discarded due to the low sample size (i.e., 30 out of 121 cities for which epidemiological data were available). The retained variables were: average temperature (which was significantly correlated with maximum and minimum temperature, latitude, longitude and altitude), relative humidity (related to absolute humidity, accumulated precipitation, latitude, longitude and altitude), atmospheric pressure, wind speed and population size (see Supplementary Table S2 online).
We explored the effects of the predictive variables on absolute and relative COVID-19 infection rates through linear models and a backward model selection procedure, based on the Akaike Information Criterion (AIC)47. Prior to running the models, we examined Cleveland dot and boxplots for each response variable, which revealed outliers for absolute (1 oulier) and relative (3 outliers) COVID-19 infection rates that were confirmed with Cook’s distances (using the residuals of the final model) and removed; predictive variables were standardized to unit variance (z-scores). Additionally, validation of the model for absolute infection rate showed an increase of model residuals with population size (and thus a violation of parametric model assumptions), which was solved by including a power variance function structure in the model (varPower function using population size). The models were fitted using the gls function of the nlme R package48.