Terrain classifications
The terrain classification scheme proposed by Iwahashi and Pike (2007) by using the three surface morphometric features of topographic slope, surface texture, and local convexity automatically identifies 16 terrain categories from DEM data (brief scheme shown in Fig. 4). Mountains and basins can be identified by slope, terraces (convex-upward) and alluvial fans (convex-downward) can be identified by convexity, and alluvial fans with smooth surface and mountain slopes with rough surface can be distinguished by texture (Iwahashi 2001).
Using VS30 values from California, Yong et al. (2012) proposed a method for estimation of VS30 based on the I07 terrain classification map. Subsequently, Zhang et al. (2022) found significant differences between the threshold values used for terrain classification in Xinjiang and Hebei provinces of China, and highlighted that there would be a smoothing effect and inadequate recognition of local geomorphology when consistently using the global thresholds provided by Iwahashi and Pike (2007). Therefore, the terrain classification threshold values of the three regions were calculated using regional DEM data (results shown in Fig. 5), and the terrain categories of the three regions were reclassified accordingly. The Z22 and I07 terrain-classification maps of the three regions are shown in Fig. 6.
Using 30arc-second DEM data (http://www.webgis.com/srtm30.html), and in accordance with the terrain classification scheme of Iwahashi and Pike (2007), the taxonomic thresholds of the three regions were calculated, as shown in Fig. 5(a). The threshold values for Xinjiang and Hebei provinces provided by Zhang et al. (2022) and for world provided by Iwahashi and Pike (2007) are compared in Fig. 5(b).
Comparison of Fig. 5(a) and 5(b) reveals that the threshold values of topographic slope, surface texture, and local convexity in the CM Region are approximately the same as those reported for the Hebei province in Zhang et al. (2022) after the addition of the Beijing and Tianjin regions. The topographic slope of the CM Region is the lowest among the four local regions (i.e., the CM Region, CD Region, GG Region, and Xinjiang province) owing to the large area of plain landform in the CM Region. The texture values of both the CD Region and the GG Region are close to 75%, indicating similarity in surface roughness. However, the slope and convexity values are higher in the CD Region than in the GG Region, reflecting the surface characteristic of more mountains in the CD Region and more hills in the GG Region. Therefore, the CD Region has more upward-convex landforms and steeper topographic slopes than the GG Region. Overall, the geomorphological features of steepest topographic slope, roughest surface texture, and most upward convexity all occur in the CD Region, reflecting the most complex changes of terrain.
The 16 terrain categories of each region were partitioned according to the classification scheme of Iwahashi and Pike (2007) (Fig. 4) and the calculated thresholds shown in Fig. 5(a), and the results (Z22) are shown in Fig. 6. The results extracted from I07 are also shown in Fig. 6 for comparison.
For the CM Region, the classification of terrain categories can be more fully identified using the regional taxonomic thresholds. For example, in the North China Plain, the categories according to I07 can be partitioned only as Class16, while the categories according to Z22 can be partitioned as Class11, Class13, Class15, and Class16, which can better distinguish the uneven and rough surfaces of areas with gentle low slopes that correspond to alluvial plains, alluvial–diluvial plains, and diluvial plains (Fig. 1). The grain size of the alluvial plains is coarser than that of the diluvial plains, which results in them having different characteristics of VS30 (Wills et al. 2006). Therefore, it is necessary to further partition the plain areas.
For the CD Region, I07 unified Sichuan and most of western Yunnan into Class1, whereas the elevation of western Sichuan is actually higher than that of western Yunnan. Matsuoka et al. (2005) found that the VS30 value of the same geomorphic unit is affected by elevation. Therefore, further division and more detailed analysis of these areas are necessary. For the GG Region, owing to the complex terrain and varied geomorphic landforms, the mountains, hills, and plains are interlaced and the terrain categories obtained from Z22 and I07 are relatively fragmented. Therefore, intuitive comparison of the two results is not possible and detailed analysis in combination with VS30 data should be conducted later.
Development of VS30 prediction models
Iterations of random selection and cross-validation methods were used by Yong (2012) and Yong et al. (2016) to establish the Y12 and Y16 VS30 prediction models, respectively, while Ahdi et al. (2017) and Contreras et al. (2018) considered \(\mu \ln {V_{{\text{S}}30}}\) and\(\sigma \ln {V_{{\text{S}}30}}\) from all boreholes in each terrain category as representative. To determine the differences in the VS30 characteristics between the I07 and Z22 terrain classification results, this study adopted the \(\mu \ln {V_{{\text{S}}30}}\) and \(\sigma \ln {V_{{\text{S}}30}}\) from all boreholes in each terrain category as representative as used by Ahdi et al. (2017). The number of boreholes and the results of the predicted VS30 values for each terrain category from I07 and Z22 are listed in Table 1, and the predicted VS30 values corresponding to the NEHRP site classification boundary are shown in Fig. 7.
Table 1
Predicted values and standard deviations of VS30 in logarithmic coordinates derived from I07 and Z22.
| CD Region(I07) | CD Region(Z22) | CM Region(I07) | CM Region(Z22) | GG Region(I07) | GG Region(Z22) |
\(\mu \ln {V_{{\text{S}}30}}\) | \(\sigma \ln {V_{{\text{S}}30}}\) | Num | \(\mu \ln {V_{{\text{S}}30}}\) | \(\sigma \ln {V_{{\text{S}}30}}\) | Num | \(\mu \ln {V_{{\text{S}}30}}\) | \(\sigma \ln {V_{{\text{S}}30}}\) | Num | \(\mu \ln {V_{{\text{S}}30}}\) | \(\sigma \ln {V_{{\text{S}}30}}\) | Num | \(\mu \ln {V_{{\text{S}}30}}\) | \(\sigma \ln {V_{{\text{S}}30}}\) | Num | \(\mu \ln {V_{{\text{S}}30}}\) | \(\sigma \ln {V_{{\text{S}}30}}\) | Num |
Class1 | 348.25 | 0.25 | 183 | 314.49 | 0.18 | 39 | 507.65 | 0.38 | 3 | 459.44 | 0.39 | 15 | 478.16 | 0.61 | 5 | 534.48 | 0.59 | 8 |
Class2 | 321.42 | 0.00 | 1 | 392.81 | 0.24 | 51 | NaN | NaN | 0 | NaN | NaN | 0 | 364.95 | 0.00 | 1 | 324.30 | 0.07 | 2 |
Class3 | 331.12 | 0.22 | 46 | 359.17 | 0.19 | 6 | 454.49 | 0.36 | 30 | 410.37 | 0.41 | 10 | 340.27 | 0.27 | 34 | 309.98 | 0.40 | 7 |
Class4 | 350.12 | 0.30 | 22 | 346.30 | 0.27 | 12 | 338.95 | 0.27 | 11 | 349.70 | 0.34 | 2 | 310.61 | 0.00 | 1 | 335.73 | 0.26 | 17 |
Class5 | 330.77 | 0.34 | 56 | 371.42 | 0.28 | 30 | 319.97 | 0.27 | 7 | 394.94 | 0.35 | 51 | 399.01 | 0.37 | 18 | 397.30 | 0.45 | 11 |
Class6 | NaN | NaN | 0 | 379.06 | 0.29 | 11 | NaN | NaN | 0 | 276.92 | 0.00 | 1 | NaN | NaN | 0 | 270.70 | 0.00 | 1 |
Class7 | 286.61 | 0.28 | 105 | 336.42 | 0.30 | 7 | 412.04 | 0.28 | 63 | 362.75 | 0.27 | 72 | 366.13 | 0.31 | 107 | 388.45 | 0.23 | 23 |
Class8 | 296.13 | 0.21 | 30 | 318.20 | 0.26 | 26 | 297.61 | 0.20 | 36 | 311.51 | 0.22 | 19 | 273.73 | 0.27 | 6 | 307.98 | 0.27 | 16 |
Class9 | 316.33 | 0.20 | 18 | 330.57 | 0.22 | 27 | 341.30 | 0.31 | 4 | 270.16 | 0.21 | 281 | 369.33 | 0.29 | 9 | 442.79 | 0.23 | 18 |
Class10 | 340.37 | 0.09 | 7 | 342.22 | 0.37 | 9 | 291.20 | 0.14 | 18 | 226.97 | 0.21 | 17 | NaN | NaN | 0 | 300.02 | 0.26 | 11 |
Class11 | 308.00 | 0.26 | 68 | 312.27 | 0.32 | 8 | 383.14 | 0.23 | 58 | 284.71 | 0.32 | 144 | 340.54 | 0.33 | 117 | 346.46 | 0.27 | 24 |
Class12 | 311.37 | 0.17 | 33 | 328.22 | 0.25 | 36 | 283.36 | 0.21 | 43 | 220.66 | 0.29 | 58 | 256.99 | 0.50 | 5 | 334.64 | 0.29 | 23 |
Class13 | 319.32 | 0.04 | 4 | 344.20 | 0.38 | 21 | 318.83 | 0.28 | 4 | 231.05 | 0.19 | 210 | 298.79 | 0.13 | 19 | 365.09 | 0.28 | 34 |
Class14 | 353.57 | 0.07 | 2 | 358.26 | 0.24 | 21 | 265.14 | 0.17 | 180 | 193.92 | 0.12 | 110 | 208.88 | 0.24 | 2 | 287.16 | 0.32 | 60 |
Class15 | 305.70 | 0.18 | 61 | 296.63 | 0.24 | 44 | 330.98 | 0.23 | 74 | 268.35 | 0.25 | 47 | 342.29 | 0.38 | 122 | 396.01 | 0.32 | 52 |
Class16 | 334.03 | 0.19 | 71 | 305.58 | 0.23 | 359 | 208.54 | 0.20 | 706 | 186.80 | 0.17 | 200 | 226.53 | 0.31 | 37 | 310.10 | 0.38 | 176 |
Note: “NAN” means there is no data under this terrain category. |
It can be determined from Table 1 that from the I07 results for the CM Region, no boreholes are listed under Class2 and Class6 and only 3 boreholes are listed under Class2, while the 706 boreholes listed under Class16 account for approximately 57% of the overall data. Additionally, only 4 boreholes are listed under Class9 and Class13. The Z22 results indicate that no boreholes are listed under Class2 (similar to the I07 results) and that only 1 and 2 boreholes are listed under Class6 and Class14, respectively, while over 200 boreholes are listed under Class9, Class13, and Class16.
In the CD Region, the I07 results show that the boreholes under Class6 is empty, and that fewer than 3 boreholes are listed under Class2 and Class14. The terrain category with the largest number of boreholes (183) is Class1, and that with the largest standard deviation (0.34) is Class5 with 56 boreholes. The Z22 results show that each of the terrain categories has more than 5 boreholes, and that Class16 has the highest number (359). These boreholes are mainly located in the plain area of the Sichuan Basin, with similar components comprising sands and pebbles. The terrain category with the largest standard deviation (0.38) is Class13 with a total of 21 boreholes.
In the GG Region, the I07 results indicate that no boreholes are listed under Class6 and Class10, only 1 borehole is listed under Class2 and Class4, and just 2 boreholes are listed under Class14. The exceptionally small numbers of boreholes under these terrain categories reflect both the uneven distribution of boreholes and the smoothing effect in I07 with the application of global DEM data resulting in the absence of some terrain categories in local regions. The terrain category with the largest number of boreholes (122) is Class11. The Z22 results indicate that boreholes are listed under each of the terrain categories, but only 2 boreholes are listed under Class2. The terrain category with the largest number of boreholes (176) is Class16.
It can be seen from Fig. 7b that in the CD Region, the predicted VS30 values derived from I07 under all categories are located in Site D of the NEHRP classification. The VS30 fluctuates between 280 and 360 m/s, with only a small range of variation. The predicted VS30 values derived from Z22 under the Class2, Class5, and Class6 categories are located in Site C of the NEHRP classification, and the remaining categories are in Site D, which is considered relatively reasonable in terms of site identification. With the exception of Class1 and Class16, the predicted VS30 values of each terrain category derived from Z22 are higher than or close to the predicted values derived from I07. In the CM Region, only the predicted VS30 value under Class5 from Z22 is significantly higher than that from I07, but the trends of the predicted VS30 values with the variation of terrain categories derived from the two terrain classification results are almost parallel. The predicted VS30 values under all terrain categories are located in Site D or Site C of the NEHRP site classification, and the predicted VS30 value under Class16 derived from Z22 is only 186.80 m/s, approaching that of Site E of the NEHRP site classification. In the GG Region, the predicted VS30 values under all terrain categories derived from Z22 are higher than those from I07, but the differences between the highest predicted values and the lowest predicted values of the two are close, i.e., 263.3 m/s from Z22 and 251.63 m/s from I07.
To compare the validity of the predictions from Z22 and I07, the linear relationship between the predicted VS30 values and the measured VS30 values was analyzed, and the correlation coefficient is shown in Fig. 8. It can be seen that in the CD Region, the correlation coefficient between the predicted VS30 values and the measured VS30 values derived from Z22 is 0.32, while the equivalent correlation coefficient derived from I07 is 0.24, i.e., significantly lower. A similar result is evident for the GG Region; however, in the CM Region, the correlation coefficient derived from I07 (0.73) is higher than that derived from Z22 (0.65). A possible reason is that the boreholes with lower VS30 values are clustered in Class16 with a smaller standard deviation, resulting in a higher overall correlation.
Cross validation of classification considering geology dependency
It can be determined from Table 1 that the I07 and Z22 terrain classification results might lead to concentration of boreholes under a certain terrain category. For example, the Z22 result lists 359 boreholes under Class16 in the CD Region and 176 boreholes under Class16 in the GG Region. Similarly, the I07 result lists 706 boreholes under Class16 in the CM Region. The concentration of boreholes under a certain terrain category also corresponds to concentration of boreholes under a geological unit. For example, in the CD Region, 226 of the 359 boreholes under Class16 in the Z22 result belong to the Holocene unit, comprising mainly modern river alluvial, diluvial, mud, sand, and gravel material. These 226 boreholes connect to the I07 results classified as Class8, Class10, Class12, Class13, Class15, and Class16. Note that it is impossible that these 226 boreholes fall into the same terrain category of I07, only the terrain category with boreholes belonging to the Holocene unit that accounted for > 50% of the total number of boreholes under this terrain category in the I07 result were considered. It is considered that if the number of boreholes belonging to the Holocene unit under a certain category does not exceed 50% of the total number of boreholes under this category, a relationship between the terrain category and the geological unit cannot be inferred; therefore, such terrain categories were not considered. Given the similarity and variability of boreholes under the same geological units (Wills et al. 2006, Thelen et al. 2006), it is necessary to confirm whether it is reasonable to divide these boreholes into the same category or multiple categories, and to verify whether the VS30 data between the categories are statistically different. For this purpose, the F-test was adopted in which absence of statistical difference between the two terrain categories under the same geological unit was consider the null hypothesis, and evidence of statistical difference between the two terrain categories under the same geological unit was considered the alternative hypothesis. For the terrain categories mentioned above that belong to the Holocene unit in the I07 result, the cross-validation results of the F-test are shown in Table 2. It can be determined that Class8–Class13, Class12–Class13, Class13–Class15, and Class13–Class16 show statistically significant differences, which can only indicate that Class13 is different from other the categories; however, there is no evidence to suggest that the remaining categories differ from each other. Therefore, it is unreasonable to divide the boreholes under the Holocene unit into different terrain categories in the CD Region, whereas it is more reasonable to divide them into the same category in the Z22 result.
Table 2
Cross-validation results of categories belonging to the Holocene unit in I07 in the CD Region
Group-pairs | p-value | Null hypothesis |
Class8-Class10 | 0.166 | Accept |
Class8-Class12 | 0.792 | Accept |
Class8-Class13 | 0.032 | Rejected |
Class8-Class15 | 0.434 | Accept |
Class8-Class16 | 0.322 | Accept |
Class10-Class12 | 0.128 | Accept |
Class10-Class13 | 0.162 | Accept |
Class10-Class15 | 0.300 | Accept |
Class10-Class16 | 0.066 | Accept |
Class12-Class13 | 0.027 | Rejected |
Class12-Class16 | 0.449 | Accept |
Class12-Class15 | 0.267 | Accept |
Class13-Class15 | 0.048 | Rejected |
Class13-Class16 | 0.018 | Rejected |
Class15-Class16 | 0.047 | Rejected |
In the CM Region, 518 of the 706 boreholes under Class16 in theZ22 result are Holocene alluvial units, which connect to I07 classification results classified as Class10, Class12, Class13, and Class14. The remaining terrain categories that even include a few Holocene alluvial boreholes but with numbers of < 50% of the total under this category were not considered. The differences of VS30 between the different terrain categories of the same geological unit were verified by the F-test, and the results are shown in Table 3. Almost all categories show statistical differences and therefore the Z22 classification is more reasonable.
Table 3
Cross-validation results of categories belonging to Holocene alluvial unit in I07 in the CM Region
Group-pairs | p-value | Null hypothesis |
Class10-Class12 | 0.031 | Rejected |
Class10-Class13 | 0.033 | Rejected |
Class10-Class14 | 0.001 | Rejected |
Class10-Class16 | 0.424 | Accept |
Class12-Class13 | 0.001 | Rejected |
Class12-Class14 | 0.001 | Rejected |
Class12-Class16 | 0.001 | Rejected |
Class13-Class14 | 0.001 | Rejected |
Class13-Class16 | 0.017 | Rejected |
Class14-Class16 | 0.001 | Rejected |
For the GG Region, given that the boreholes were not concentrated in any specific geological unit in particularly large numbers in either classification result, comparative analysis was not performed. However, it can be seen from Fig. 8 that the correlation between the predicted VS30 values and the measured VS30 values derived from Z22 was better than that from I07, confirming that the Z22 terrain classification is more reasonable.
Based on the above analysis, regional VS30 models were established for China using the predicted VS30 values derived from Z22.