3.1. Characterization and analysis of trends
The average annual precipitation in the study area varies between 620 mm and 926 mm in the stations of Patos and Sousa, respectively (Fig. 2a). The distribution of the monthly average precipitation shows similar behavior in all rain gauge stations, with maximum values in March and minimum values between August, September and October, reinforcing when wet and dry periods occur, a common characteristic of semi-arid regions of Brazil (Fig. 2b).
Trend analysis is one of the most important measures in the study of time series data. In recent years, several methods have been developed and used in climatological studies to analyze trends in time series, such as linear regression, Mann-Kendall test, Spearman’s rank-order correlation coefficient, Theil–Sen estimator, innovative trend analysis, among others (Gedefaw et al. 2018; Chervenkov and Slavov 2019; Shahid and Rahman 2021). Among these methods, the Mann-Kendall test is the most popular, due to its efficacy in identifying trends related to the variation of the sample and the magnitude of the trend itself (Wang et al. 2020a, b).
In the present study, the Mann-Kendall test showed statistically significant changes (p < 0.01) in the annual comparison for the stations of Água Branca and Teixeira, with increase rates in precipitation of 4.06 mm/year and 4.15 mm/year, respectively. In the other stations, the annual comparison showed no significant trends (Table 2).
In addition to these, in monthly comparisons it was also possible to observe changes in the distribution of precipitation. January (in Água Branca, Coremas and Teixeira), April (in Teixeira), May (in Água Branca) and July (in Água Branca and Princesa Isabel) showed increasing trends, with values between 0.15 mm/year (p < 0.05) and 0.91 mm/year (p < 0.05). The increases were concentrated from January to July (Table 2).
However, for March, the month considered with the highest average precipitation for the studied area (Fig. 2b), there was a significant trend of reduction in precipitation volume at a significant rate of 2.92 mm/year (p < 0.10), only for the municipality of Sousa (Table 2).
Table 2 Trends detected through the Mann-Kendall test, quantified by Sen’s slope, for the study area
Period
|
Água Branca
(mm/year)
|
Aguiar
(mm/year)
|
Coremas
(mm/year)
|
Patos
(mm/year)
|
Princesa
Isabel
(mm/year)
|
Sousa
(mm/year)
|
Teixeira
(mm/year)
|
January
|
0.65**
|
0.58 ns
|
0.91*
|
1.57 ns
|
0.20 ns
|
0.79 ns
|
0.60**
|
February
|
0.44 ns
|
0.54 ns
|
0.21 ns
|
0.73 ns
|
-0.23 ns
|
-1.05 ns
|
0.25 ns
|
March
|
0.25 ns
|
-0.60 ns
|
-1.16 ns
|
-2.79 ns
|
0.07 ns
|
-2.92+
|
0.10 ns
|
April
|
0.01 ns
|
0.30 ns
|
-0.41 ns
|
0.30 ns
|
0.11 ns
|
-1.76 ns
|
0.77+
|
May
|
0.65*
|
0.33 ns
|
0.01 ns
|
-0.35 ns
|
0.11 ns
|
0.08 ns
|
0.18 ns
|
June
|
0.28 ns
|
-0.09 ns
|
-0.03 ns
|
0.06 ns
|
0.01 ns
|
0.14 ns
|
-
|
July
|
0.35*
|
0.00 ns
|
0.02 ns
|
0.00 ns
|
0.15*
|
0.06 ns
|
-
|
August
|
-
|
-
|
-
|
0.00 ns
|
-
|
0.00 ns
|
-
|
September
|
-
|
-
|
-
|
0.00 ns
|
-
|
0.00 ns
|
-
|
October
|
-
|
-
|
-
|
0.00 ns
|
-
|
0.00 ns
|
-
|
November
|
-
|
-
|
-
|
0.00 ns
|
-
|
-0.11 ns
|
-
|
December
|
-
|
0.00 ns
|
0.14 ns
|
0.34 ns
|
-0.02 ns
|
-0.11 ns
|
-
|
DJF
|
1.82**
|
0.82 ns
|
1.67+
|
2.80 ns
|
-0.35 ns
|
0.36 ns
|
1.82**
|
MAM
|
0.82 ns
|
0.04 ns
|
-1.25 ns
|
-5.96 ns
|
0.68 ns
|
-3.88 ns
|
1.25 ns
|
JJA
|
1.00**
|
-0.07 ns
|
0.08 ns
|
0.27 ns
|
0.26 ns
|
0.14 ns
|
-
|
SON
|
0.04 ns
|
-0.05 ns
|
-0.02 ns
|
0.08 ns
|
-0.33*
|
-0.36 ns
|
-
|
Annual
|
4.06**
|
1.29 ns
|
2.11 ns
|
0.29 ns
|
0.08 ns
|
-4.25 ns
|
4.15 **
|
ns: not significant; +p < 0.10; *p < 0.05; **p < 0.01; -: not conclusive.
The remaining months, that is, between August and December, did not show significant trends in any of the stations. It should be emphasized that, for the municipalities of Aguiar and Patos, the trends were not conclusive or not significant in all comparisons (annual, quarterly, and monthly).
In order to apply the traditional Mann-Kendall non-parametric statistical test to a time series, one neglects the correlation between its elements and assumes that the probability distribution should remain the same, that is, it should be a simple random series. When the variance between its elements is null, the test results in non-significant (ns) (Kendall 1975; Mann 1945; Wang et al. 2020a).
For these two weather stations, the coefficients of variation for each series were approximately equal to one, mainly for the months from January to July, indicating that the mean values and the respective standard deviations were close. For the months from August to December, the coefficients of variation were higher. However, these are the months with the lowest precipitation levels, in which the absence of precipitation was repeatedly noted.
Therefore, if in the same series there are many equal elements, the Mann-Kendall test will be unable to detect a significant trend, which is possibly the main reason to explain some non-significant and inconclusive trends in the results of the present study. Other authors consider this as one of the limitations of trend analysis (Wang et al. 2020a; Yue et al. 2002a, b). Therefore, cluster analysis was used to overcome this deficiency, so that it was still possible to investigate the existence of climate change.
Some of the quarterly series showed significant trends, especially of increase in precipitation. In the first quarter, DJF, the increases were significant in Água Branca (1.82 mm/year), Coremas (1.67 mm/year) and Teixeira (1.82 mm/year). In the third quarter, JJA, there was also a significant increase in Água Branca (1.00 mm/year) and, in the last quarter, SON, in Princesa Isabel, there was a significant trend of reduction in precipitation (0.33 mm/year).
The results presented through the precipitation data time series are in accordance with those of other studies already published in the literature, in which significant trends are observed in weather stations of Paraíba (Alves et al. 2017; MEDEIROS et al. 2019; Silva et al. 2020).
3.2. A new approach to cluster analysis
Cluster analysis was used in order to aggregate in the same cluster the years in which the precipitation distribution showed similarity (similar seasonality, with equivalent monthly values). Two or three clusters were created for each of the weather stations, so that it was possible to identify changes in monthly and annual patterns and compare them to the observed trends. In addition, the ‘median year’ of each cluster was used as an indicator to confront recent and past periods in the series, in order to better understand the distribution of precipitation over time. These results are presented in Table 3.
Thus, Table 3 presents for each of the stations the number of clusters, the centroids of each monthly series and the annual average (based on the clusters, called Annual) and historical average (HA). The median year (Med Year) of each cluster indicates the behavior of recent and past periods in relation to precipitation volume. Finally, the percentage number of elements of the series allocated in each cluster (%) is also presented.
Table 3
Descriptive information of each cluster for each of the locations in the study area
Cluster
|
Jan
(mm)
|
Feb
(mm)
|
Mar
(mm)
|
Apr
(mm)
|
May
(mm)
|
Jun
(mm)
|
Jul
(mm)
|
Aug
(mm)
|
Sep
(mm)
|
Oct
(mm)
|
Nov
(mm)
|
Dec
(mm)
|
Annual
(mm)
|
Med Year
|
%
|
Água Branca
|
1
|
68.4
|
88.1
|
145.8
|
100.7
|
56.0
|
40.2
|
28.0
|
9.9
|
5.2
|
2.5
|
14.3
|
9.5
|
568.7
|
1973
|
59.0
|
2
|
93.5
|
134.0
|
157.0
|
172.2
|
81.7
|
76.4
|
71.9
|
14.2
|
4.3
|
22.0
|
2.9
|
96.4
|
926.4
|
1985
|
16.7
|
3
|
74.0
|
117.9
|
229.0
|
193.7
|
177.7
|
55.3
|
40.6
|
28.4
|
16.0
|
7.6
|
26.1
|
16.2
|
982.4
|
1986
|
24.4
|
HA
|
74.0
|
103.0
|
168.0
|
135.3
|
89.9
|
49.9
|
38.4
|
15.1
|
7.7
|
7.0
|
15.2
|
25.6
|
729.1
|
|
|
Aguiar
|
1
|
72.3
|
140.1
|
214.8
|
147.1
|
45.7
|
17.2
|
8.6
|
3.3
|
0.9
|
7.7
|
22.3
|
44.2
|
724.3
|
1962
|
59.2
|
2
|
160.8
|
156.3
|
224.0
|
253.5
|
144.8
|
45.4
|
25.0
|
7.8
|
3.0
|
10.5
|
11.6
|
37.3
|
1079.9
|
1991
|
40.8
|
HA
|
108.4
|
146.7
|
218.6
|
190.5
|
86.1
|
28.7
|
15.3
|
5.1
|
1.8
|
8.9
|
17.9
|
41.4
|
869.4
|
|
|
Coremas
|
1
|
109.6
|
133.9
|
203.6
|
132.8
|
47.7
|
26.3
|
16.4
|
2.2
|
0.9
|
2.2
|
5.9
|
19.7
|
701.1
|
1968
|
54.4
|
2
|
80.1
|
107.0
|
184.0
|
166.8
|
96.5
|
48.8
|
16.3
|
16.1
|
11.0
|
47.0
|
19.0
|
60.7
|
853.3
|
1972
|
23.5
|
3
|
140.5
|
199.4
|
327.4
|
294.5
|
158.1
|
33.2
|
19.3
|
5.9
|
2.5
|
2.8
|
50.5
|
41.3
|
1275.4
|
1977
|
22.1
|
HA
|
109.5
|
142.0
|
226.3
|
176.5
|
83.5
|
33.1
|
17.0
|
6.3
|
3.6
|
12.9
|
18.8
|
34.1
|
863.6
|
|
|
Patos
|
1
|
35.3
|
58.8
|
359.0
|
183.7
|
141.0
|
13.2
|
21.6
|
4.0
|
0.6
|
8.4
|
17.9
|
81.5
|
924.7
|
1992
|
15.4
|
2
|
114.1
|
118.7
|
131.4
|
91.8
|
42.9
|
25.2
|
7.4
|
1.8
|
0.6
|
5.3
|
7.5
|
18.2
|
564.9
|
2000
|
84.6
|
HA
|
101.9
|
109.5
|
166.4
|
105.9
|
58.0
|
23.4
|
9.6
|
2.1
|
0.6
|
5.8
|
9.1
|
28.0
|
620.2
|
|
|
Princesa Isabel
|
1
|
83.5
|
117.3
|
188.8
|
102.9
|
52.9
|
24.0
|
13.2
|
3.0
|
4.5
|
6.3
|
28.6
|
45.9
|
670.8
|
1956
|
51.0
|
2
|
114.1
|
154.8
|
189.5
|
186.5
|
100.5
|
51.4
|
40.8
|
16.4
|
11.4
|
18.9
|
30.6
|
60.0
|
974.9
|
1970
|
49.0
|
HA
|
98.5
|
135.7
|
189.2
|
143.9
|
76.2
|
37.4
|
26.7
|
9.6
|
7.9
|
12.5
|
29.6
|
52.8
|
819.9
|
|
|
Sousa
|
1
|
179.0
|
272.4
|
226.5
|
335.8
|
171.0
|
45.7
|
25.1
|
9.9
|
16.6
|
24.0
|
13.9
|
56.9
|
1376.7
|
1981
|
22.2
|
2
|
98.3
|
161.5
|
228.8
|
140.0
|
59.4
|
25.9
|
19.6
|
3.0
|
1.8
|
16.9
|
10.7
|
31.5
|
797.4
|
1982
|
77.8
|
HA
|
116.3
|
186.1
|
228.3
|
183.5
|
84.2
|
30.3
|
20.8
|
4.5
|
5.0
|
18.5
|
11.4
|
37.1
|
926.1
|
|
|
Teixeira
|
1
|
62.9
|
123.1
|
223.5
|
108.4
|
40.0
|
14.0
|
9.5
|
2.4
|
1.7
|
1.4
|
10.7
|
13.4
|
610.9
|
1961
|
69.1
|
2
|
56.1
|
291.8
|
232.4
|
96.3
|
8.5
|
11.4
|
14.2
|
10.5
|
73.4
|
28.5
|
11.7
|
47.7
|
882.3
|
1970
|
2.4
|
3
|
108.5
|
141.9
|
210.5
|
212.5
|
110.7
|
39.7
|
18.8
|
5.9
|
2.0
|
11.8
|
12.9
|
75.1
|
950.3
|
1987
|
28.6
|
HA
|
75.7
|
132.5
|
220.0
|
137.8
|
59.5
|
21.3
|
12.3
|
3.6
|
3.5
|
5.0
|
11.4
|
31.9
|
714.3
|
|
|
HA: historical average; Med Year: median year.
|
For the Água Branca station, 3 clusters were created. Cluster 1 (median year 1973) is the one with the lowest value of annual precipitation (568.7 mm), while cluster 3 (1986) has the highest annual value (982.4 mm). Cluster 2 (1985) is an intermediate group. When comparing the clusters in relation to the volume of precipitation and the median years, it can be inferred that there is an increase in precipitation volumes in the most recent periods (Fig. 3a). This was also observed by trend analysis (Table 2).
In addition to these considerations, in Fig. 3b and Fig. 3c it is possible to visualize how the distribution occurred in each cluster. For cluster 3 (1986) in MAM, the precipitation values were much higher than those of the other clusters, as well as the historical average.
The month of December in cluster 2 (1985) has values much higher than those of the other clusters. In all years belonging to this cluster, the month of December showed high precipitation levels. Higher values were also found in January, February, June, July and December. However, November had the lowest rates for cluster 2. April is the month with the highest centroid value among the monthly values of the same cluster. Thus, cluster 2, with 16.7% of the elements of the series, showed a different behavior compared to the other two clusters, while in more recent years (cluster 3) it has been raining greater volumes, but with a seasonality similar to the historical average.
Although no significant trends were detected for the station of Aguiar, two clusters were created by the cluster analysis and it is also noticed that in more recent years, represented by cluster 2 (1991), the volume of precipitation is higher than in the initial years (cluster 1, 1962) (Fig. 4a). The same occurs in quarterly comparisons, except in SON, where the values remain low as in the historical series (Fig. 4b). In cluster 2 (1991), there is a slight shift in the precipitation regime in the rainy season, so that April becomes the rainiest month (Fig. 4c).
In the station of Coremas, by the annual evaluation, it can be seen that the precipitation levels increased gradually. Cluster 3 (1977), the most recent, had the highest values of precipitation (Fig. 5a). This is also observed in the quarterly comparisons of DJF and MAM, that is, in the rainy season (Fig. 5b). According to the trend analysis (Table 2), only the significant increase in DJF was verified.
It is important to note that, during the rainy months, cluster 2 (1972) had higher values in JJA and SON, as well as in August, September and October. Moreover, cluster 2, composed of 23.5% of the series elements, is the one which had precipitation levels close to those of cluster 1 (1968) in the rainy season and the highest values in the dry season (Fig. 5c).
The results for the station of Patos are different from those observed so far. Cluster 1 (1992) had higher values; therefore, in the older years of the series the precipitation level was higher than in recent years, represented by cluster 2 (2000) (Fig. 6). Therefore, in recent years it has rained less in the locality of Patos and such reduction was observed mainly in the rainiest quarter of the year (MAM). In contrast to this, by the Mann-Kendall test (Table 2), no significant trends were detected for this station in any month or quarter, nor annually. However, with the cluster analysis approach applied in this study, it is possible to observe how precipitation was distributed over the years/months among the clusters. Thus, it was found that cluster 1 (1992) had much higher values, especially during the rainy season, with values below the average for January, February and June (Fig. 6c).
In Princesa Isabel, as well as in Água Branca, Aguiar and Coremas, the application of cluster analysis shows the increase in precipitation in recent years, especially in the rainy season (Fig. 7). It can be noted that there was an increase in precipitation values from cluster 1 (1956) to cluster 2 (1970), and cluster 2 consisted of 49% of the years of the series.
All quarters for cluster 1 (1956) had lower values than in cluster 2 (1970), with the largest differences observed in MAM and JJA. Fig. 7c highlights the difference between the monthly precipitation values for the rainy season, except for the peak month of precipitation (March). In the driest period the values are closer.
The Mann-Kendall test proved to be sensitive to increases in precipitation only for the month of July. While for July there was a significant trend of increase (0.15 mm/year), in SON there was a decrease of -0.33 mm/year in precipitation volumes (Table 2).
It is possible to note that, for the station of Sousa, precipitation values have decreased in recent years: cluster 1 (1981), composed of 22.2% of the years of the series, had annual precipitation of around 1376.7 mm. For cluster 2 (1982) it was 797.4 mm. However, the difference between centroids was almost zero (1981 and 1982). As pointed out by Alashan (2018), the trend inclinations of each cluster could be detected by comparing their centroid points.
Compared to the Mann-Kendall test, it is possible to note the sensitivity of the trend analysis for the reduction of precipitation in March (-2.92 mm/year), while in the cluster analysis the values were very close in this month for the two clusters (Fig. 8c) and it was not possible to observe this change pointed out by the Mann-Kendall test. This is because of how clusters and their respective centroids are defined in Ward’s method. Clusters are formed by minimizing the squares of the component deviations from the average value of each cluster, that is, their centroid (Majerova and Nevima 2017).
All quarters in Sousa showed higher precipitation values for the cluster of the oldest years (cluster 1). This joint analysis clarifies how the changes in rainfall regime occurred, enabling the use of this information to improve the management of plantations, for instance, and a more efficient use of water resources.
It was detected, both by cluster analysis and Mann-Kendall test, an increasing behavior for precipitation in the municipality of Teixeira, especially in the annual comparison (Fig. 9 and Table 2). In the DJF and SON quarters, cluster 2 (1970) stood out, showing the highest averages among the clusters. In MAM and JJA, cluster 3 (1987) was the one which stood out. Increases in precipitation in recent years are more evident in MAM and JJA.
The semi-arid region of northeastern Brazil, which encompasses the study area of the present study, is characterized by the irregular distribution of precipitation, that is, it has high temporal and spatial variability (Lima et al. 2019; Costa et al. 2020a). Annual precipitation is less than 500 mm in a great portion of these semi-arid areas (Oliveira el al. 2017). Cluster analysis, in conjunction with trend analysis, allowed clarifying these changes in precipitation patterns over the years for the studied locations.
Historically, the Northeast has always been affected by prolonged periods of drought or, diametrically and at specific points, by floods (Costa et al. 2020b). The causes of drought in the northeastern semi-arid region of Brazil in 2010-2016 were discussed in the study conducted by Marengo et al. (2018), who observed that after 2012 the droughts became more intense in the northeastern semi-arid region and this may be associated with the singular increase of temperature in the Tropical North Atlantic Ocean, generating an atypical position north of the Intertropical Convergence Zone (itcz), causing less rainfall in northeastern Brazil. Dry years, in general, coincide with the occurrence of the El Niño phenomenon. This scenario aggravates the situation of reservoirs, in addition to agriculture and livestock farming in the region.
On the other hand, the rainiest years for the Northeast region of Brazil were 1964, 1967, 1974, 1985, 1986, 1988, 1989, 1994, 2004 and 2009. In general, this was also observed in the present study, considering the localities which showed significant trends of increase in precipitation, in accordance with cluster analysis, indicating that it has rained more in recent years. One of the possible causes was the transport of moisture from the tropical Atlantic and the Amazon basin to the Brazilian Northeast (Marengo et al. 2018).
However, both droughts and floods cause problems for the population. Therefore, public policies aimed at mitigation and coexistence with extreme weather events are necessary, as they cause important economic losses in this region (Oliveira el al. 2017).
Another point to be highlighted refers to the capacity of widely used methods to recognize trends in climate change studies. In the case of the Mann-Kendall test, one of the advantages of using it is that no assumptions are required about the distribution of data and it is also little sensitive to outliers or limited absence of values in time series. In addition, it becomes more effective with the increase in the slope coefficient of the line given by the Sen’s method, that is, with the increase in the amplitude of the observed trend. However, it is inversely proportional to the increase in sample variance, which is very common in precipitation data, and it becomes even less robust when the elements of the series have autocorrelation (Wang et al. 2020a; Yue et al. 2002a, b).
Cluster analysis was used in the present study to fill this gap of the traditional Mann-Kendall test, since in the hierarchical clustering method the focus is on variability.