3.1 PurpleAir sensor (PA) data correction factors
The PurpleAir sensor shows a moderate correlation with reference PM2.5 concentrations (r = 0.56) under varying humidity and temperature. At lower humidity levels, PurpleAir sensors tend to underestimate PM2.5 and higher temperatures tend to cluster near the red dashed line, indicating better agreement between the sensor measurements in warm conditions. Humidity and temperature affect the spread of data points, though the overall correlation remains consistent. While these factors do not significantly affect the overall correlation, the variability in sensor performance suggests that humidity and temperature contribute to deviations from the reference PM2.5 concentrations. The MAE of 13.69 µg/m³ and a positive bias of 1.19 µg/m³ indicate a slight overestimation.
We compared the performance of four models in predicting PM2.5 concentrations using three key metrics: MAE, RMSE, and R². A lower MAE and RMSE along with a higher R² suggest a more accurate model in predicting PM2.5 concentrations. Selection of optimal model hyperparameters was conducted using parameter grid search and 5-fold cross-validation. The Polynomial Regression model has the best performance (MAE = 10.8 µg/m³, RMSE = 15.27 µg/m³, and R² = 0.35), explaining 35% of the variance in PM2.5 level and underscoring its precision and reduced deviation from observed values. In contrast, the MLR correction model shows slightly lower performance (MAE = 11.31 µg/m³, RMSE = 15.65 µg/m³, and R² = 0.32), explaining 32% of the variance and showing that the model has higher error in its predictions. Random Forest shows low performance (MAE = 12.2 µg/m³, RMSE = 16.1 µg/m³, and R² = 0.30) while XGBoost performs the lowest (MAE = 13.5 µg/m³, RMSE = 17.2 µg/m³, and R² = 0.28). These results indicate that the Polynomial Regression model is more accurate and explains more variability in PM2.5. Overall, 65% − 75% of the variability in PM2.5 is not captured by the models, suggesting that important factors might be missing. The residual variability not captured by the models indicates that there are limitations in sensor’s accuracy and model design. In the city of Antananarivo, the influence of additional factors such as emission sources (vehicle, fire smoke), and other meteorological variables (wind pattern, pressure) may be very important but not included in this analysis. Incorporating those factors into the models could provide a more comprehensive understanding and yield higher predictive accuracy.
3.2 Temporal and spatial variations of PM2.5 concentrations
The annual average of PM2.5 concentrations for each sensor located at Ambatobe, Ampandrianomby, Andraharo, Antsakaviro, Soanierana, Ambohidahy, and Miandrarivo during the year 2023 are: 21.6µg/m3, 24.5µg/m3, 26.2µg/m3, 28.4µg/m3, 28.7µg/m3, 29.3µg/m3, and 31.9µg/m3 respectively. Furthermore, Ambatobe and Ampandrianomby are the least polluted outskirts while Soanierana, Ambohidahy, and notably Miandrarivo are the most polluted. These differences found for each measurement point could be due to the location of the sensors, which are either close to areas of dense circulation or in a slightly more remote and less polluted area. Each of these annual averages far exceeds the WHO’s 5 µg/m3 annual air quality standards. During the year 2023, 83% of the days exceed the WHO’s daily threshold standards (15 µg/m3). During the period from September to November, only 11 days are below this threshold. The maximum of the daily averages is observed on October 11, which has a value of 130.42 µg/m3, while the minimum is on January 15, which is 8.4 µg/m3 (Fig. 3).
In Fig. 3, each bar represents the standard deviation calculated from daily averages. A clear seasonal pattern in the standard deviation can be observed, with larger variances occurring between October and November, as well as May and July, and smaller variances during the remaining months. PM2.5 levels exhibited higher standard deviations when daily averages were high, and lower standard deviations when daily averages were lower.
Figure 4 shows the monthly average concentrations of PM2.5 for each sensor. Overall, the monthly averages exceed the daily WHO threshold (15 µg/m3 ) throughout the month.There are fluctuations in the average values for each month from January to December. Two peaks were observed, the first in May-June (winter) and the second in October-November.
Due to poor road infrastructure and limited public transportation, traffic congestion is common in Antananarivo's city center. Many older diesel vehicles remain in use, contributing to incomplete fuel combustion and higher PM2.5 emissions. Additionally, the growing use of motorcycles, which also produce emissions, increases the pollution problem, especially as heavy traffic persists. Meteorological conditions such as weak wind speeds, a reduced boundary layer height, and increased temperature inversions are more frequent during the winter. These temperature inversions in the lower troposphere enhance atmospheric stability and inhibit the vertical dispersion of pollutants, creating favorable conditions for the accumulation of pollutants and leading to higher concentrations of PM2.5 (Bai et al., 2023; Liu et al., 2019).
Figure 5 shows the monthly air quality classification level for 2023 according to WHO’s guideline, averaged across all sensors, with detailed breakdowns for each sensor shown in Fig. S1. Overall, Antananarivo experienced Moderate air quality throughout the year. January had the best air quality, with 12.9% of days classified as Good and 87.1% as Moderate. However, the air quality worsened during certain months, with Unhealthy for Sensitive Groups and Unhealthy categories accounting for over 30% of days in May and June, and reaching about 70% and 40%, respectively, in October and November (Fig. 4). Air pollution levels varied significantly between locations. Some areas maintained lower pollution levels, while others experienced very Unhealthy conditions during certain months. For example, a location near a tunnel exhibited high pollution early in the year, with 60% of days classified as Unhealthy for Sensitive Groups and 40% as Unhealthy (Fig. S1).
3.3 Diurnal variations of PM2.5 concentrations
Figure 6 shows the overall diurnal variations of PM2.5 concentration based, showcasing the consistent bimodal pattern. We analyzed PM2.5 concentrations every one trimester: December-February, March-May, and June-August and September-November. In all seasons, particle mass concentrations exhibit a bimodal pattern, with notably higher levels in the mornings (around 2:00 am − 7:00 am) and evenings (around 5:00 pm − 9:00 pm). The amplitude and width of PM2.5 are most pronounced during September-November. Air pollution in Antananarivo is at its worst during this period throughout the day, consistently surpassing the World Health Organization's 24-hour recommendation for air quality, which suggests levels below 15µg/m³.
3.4 Impact of fires, rainfall and wind direction on air quality during September and November
Figure 7A represents a zoomed-in temporal distribution of daily averages of PM2.5 concentrations from September to November in 2023. Air quality reaches its worst levels between September and November, primarily due to the rise in forest fires during the dry season, which begins in July and August and peaks in September and October (Andriamanantena et al., 2021).
Figure 7B illustrates fire points detected by VIIRS SNPP and NOAA-20 and Fig. 7C presents daily precipitation data from the national weather station. In September, extensive fire activity was observed outside of Antananarivo, but PM2.5 concentrations remained relatively low. A notable decrease in both fire activity and PM2.5 levels occurred in early October, coinciding with the start of the rainy season in Antananarivo. However, in mid-November, a sharp drop in both fire incidents and PM2.5 concentrations were observed, followed by a significant pollution episode.
On October 11, PM2.5 concentrations ranged from 98 µg/m³ to 163 µg/m³, and on November 21, from 70 µg/m³ to 165 µg/m³. Fire points recorded in October and November were 3,239 and 2,470 for SNPP, and 4,128 and 4,727 for NOAA-20, respectively, with differences likely attributed to satellite overpass timings or cloud cover. The decrease in fire points may also be associated with precipitation events.
Robinson (2021) similarly reported a direct correlation between increased fire activity and elevated PM2.5 during pollution episodes. Rainfall reduces atmospheric PM2.5 concentration through collision-coalescence processes, where raindrops capture pollutants. However, on October 23, despite 38 mm of rainfall, PM2.5 concentrations increased, possibly due to substantial fire activity the previous day, with smoke transported to Antananarivo after the rain had temporarily cleared the air. This highlights the complexity of fire-related pollution events, where localized meteorological factors and smoke transport can significantly influence air quality.
Pollution rose plots (Fig. 8) for September, October, and November highlight the critical role of wind direction in dispersing PM2.5. The highest concentrations were linked to south-easterly winds, though October showed mixed wind directions, resulting in pollution peaks from multiple directions. Notably, PM2.5 contributions were highest from the southeast, northwest, and northeast. This period coincides with widespread fires across Madagascar, particularly in the eastern regions due to slash-and-burn agriculture (“tavy”), which generate PM2.5-laden smoke transported by trade winds (Andriamanantena et al., 2021).
The convergence of pollutants from distant fires and local sources, such as traffic emissions, contributed to elevated PM2.5 levels. These findings underscore the significant influence of wind direction in transporting pollutants and confirm the relationship between fire activity and deteriorating air quality in Antananarivo (Fig. 7A and 7B).
In Antananarivo, the significant increase in air pollution during the hot period of October and November can be attributed to a combination of local and regional factors. Local emissions from traffic, industries, and urbanization activities, including the burning of waste and the seasonal production of bricks, contribute to elevated pollution levels. During this period, brick kilns are highly active, releasing substantial amounts of particulate matter and other pollutants. Long-range transport of smoke from widespread bush and forest fires, which are common during this season, significantly increases air quality issues. These fires, driven by agricultural practices such as slash-and-burn farming (also known as "tavy") and deforestation, result in the emission of large quantities of smoke and fine particulate matter (PM2.5) that can travel across large distances, impacting urban areas. The combination of local pollution sources and regional smoke transport creates a situation of poor air quality, especially as meteorological conditions such as weak wind speeds and temperature inversions and rainfall delay further trap pollutants near the surface.
3.5 Analysis of the impact of meteorological parameters on PM2.5
Figure 9 shows a weak positive correlation between PM2.5 and temperature (+ 0.21). This suggests that temperatures may weakly lead to an increasing PM2.5 concentration. This pattern can be likely attributed to smoke transport during the warm, dry season, and the impact of temperature inversions, which slows PM2.5 dispersion. Temperature inversions limit vertical mixing and trap pollutants closer to the surface. Additionally, higher temperatures can enhance photochemical reactions, producing more precursors of PM2.5 and other secondary pollutants, thus driving an increase in PM2.5 concentrations (Chen et al., 2020; Zhang et al., 2015).
High concentrations of PM2.5 may also influence temperature in the opposite way by scattering solar radiation, and therefore reduce surface temperatures (Zhong et al., 2018). This reduction in near-ground temperature weakens atmospheric mixing, allowing further accumulation of PM2.5. During pollution episodes with temperature inversion layers, the cooling of the near-ground atmosphere and the stability of the inversion enhance PM2.5 accumulation (Zhu et al., 2018).
Negative correlations were found for relative humidity (-0.46) and wind speed (-0.29). With an increasing relative humidity PM2.5 concentrations decrease. When humidity surpasses 70%, suspended particles aggregate and fall out of the atmosphere through dry and wet deposition, leading to a significant reduction in PM2.5 levels (Wang and Ogawa, 2015; Li et al., 2015). Higher wind speeds promote the horizontal and vertical diffusion of PM2.5. Conversely, low wind speeds inhibit dispersion and promote the accumulation of PM2.5 near the surface. These findings are consistent with measurements taken in Antananarivo’s less polluted areas (INSTN, 2021), which also showed an inverse relationship between PM2.5 and both humidity and wind speed.