Abstract
In this study, we use the strong motion records and seismic intensity data from 11 moderate-to-strong earthquakes in the mainland of China since 2008 to develop new conversion equations between seismic intensity and peak ground motion parameters. Based on the analysis of the distribution of the dataset, the reversible conversion relationships between modified Mercalli intensity (MMI) and peak ground acceleration (PGA), peak ground velocity (PGV), and pseudo-spectral acceleration (PSA) at natural vibration periods of 0.3 s, 1.0 s, 2.0 s, and 3.0 s are obtained by using the orthogonal regression. The influence of moment magnitude, hypocentral distance, and hypocentral depth on the residuals of conversion equations is also explored. To account for and eliminate the trends in the residuals, we introduce a magnitude-distance-depth correction term and obtain the improved relationships. Furthermore, we compare the results of this study with previously published works and analyze the regional dependence of conversion equations. To quantify the regional variations, a regional correction factor for China, suitable for adjustment of global relationships, has also been estimated.
1. Introduction
As a qualitative index to describe the ground vibration and degree of damage caused by an earthquake, the seismic intensity provides a simple macroscopic scale to characterize the magnitude of an earthquake’s impact on human communities, so it is widely used in earthquake disaster assessment, loss estimation, structural response analyses, historical earthquake disaster recurrence, and other fields all over the world [1]. Traditionally, seismic intensity is typically obtained by experts who investigate the building damage and human response to an earthquake in an area where that earthquake strikes. Ground motion parameters, such as PGA, PGV, and PSA, which are used as quantitative indexes to describe the degree of ground vibration, are usually obtained from the records of seismic observation instruments deployed in earthquake areas. How to transform the ground motion parameters into seismic intensity and establish the conversion equation between them is of great significance for the rapid assessment of earthquake disaster and loss (such as the generation of ShakeMap [2]), the comparative analysis of different types of intensity scales, and other engineering applications.
In some countries or regions with abundant seismic records (e.g., the United States, Japan, Italy, Turkey, New Zealand, and Taiwan), researchers have explored the correlation between seismic intensity and ground motion parameters and established the corresponding ground motion to intensity conversion equations (GMICEs) [3–12]. At present, there are numerous seismic intensity scales, such as modified Mercalli intensity (MMI), Mercalli–Cancani–Sieberg (MCS) intensity, JMA intensity in Japan, and so on in which MMI is the most widely used. With respect to these seismic intensity scales, most researchers calibrate their conversion equations using PGA, PGV, and PSA as physical indicators. In early works, PGA was employed as the ground motion parameter that would provide accurate seismic intensity estimates because it was assumed that the product of acceleration and mass would accurately characterize the inertial forces acting on the structure [13–15]. However, because PGV is directly related to the kinetic energy, some studies have found that the correlation between PGV and seismic intensity is stronger than it is for PGA [16, 17]. Wald et al. proposed that low seismic intensities were strongly correlated to both PGA and PGV, while high seismic intensities were only strongly correlated with PGV [3]. Bilal and Askan concluded that the seismic intensity was more closely related to PGV for ductile structures and PGA for brittle structures [8]. For the response spectrum parameters, which are directly related to the seismic response and structural damage, many researchers found that they were also closely related to seismic intensity. As Spence et al. demonstrated, the correlation between seismic intensity and PSA (5% damping ratio) over a period of 0.1 s to 0.3 s was stronger than it is for PGA [18]. Boatwright et al. determined that seismic intensity correlated best with the velocity response spectra (5% damping ratio) at a period of 1.5 s [19]. In addition, Caprio et al. derived new global relationships, assembling a database from active crustal regions in California, Italy, Greece, and global data from the Central and Eastern United States catalog. The authors obtained the PGA and PGV to intensity conversion equations and provided regional correction factors to the global relationships for Italy, Greece, and California, as well as a regional similarity study for other regions of the world [10]. Moratalla et al. considered a large amount of data derived from New Zealand earthquakes and developed the updated GMICEs between MMI and PGA/PGV [11]. The work also proposed a regional correction factor for New Zealand, suitable for the global relationships of Caprio et al. [10].
The research on the correlation between seismic intensity and ground motion parameters in China began in the 1960s. The intensity scale used was MMI, and the PGA value corresponding to the intensity of VII-IX was given in the seismic building design code at that time. Due to the lack of local strong motion records in China, Liu et al. used a small amount of seismic data from the United States and some foreign research results [20], took PGA and PGV as the measures of seismic intensity, and gave the conversion equations between MMI and PGA/PGV [21], which were applied in Chinese seismic intensity scale in 1980. Du et al. derived a new empirical relationship between MMI and PGA/PGV for Western China, using the ground motion records of earthquakes that occurred before 2014 [22]. The work used a weighted least-squares regression method, resulting in an irreversible conversion from PGA/PGV to MMI, and the relationship between MMI and PSA was not considered in the literature. It is worth noting that the conversion equations from PGA/PGV to MMI are not scrutinized and revised by using recent local seismic data; in addition, as of now, the robust correlation relationships between MMI and PSA, which are also very useful because the spectral quantities are strong indicators of the structural response which directly influence the observed intensity values, have not been studied in detail for Chinese earthquake dataset. So, there is an urgent need to re-examine the GMICEs in China thoroughly and develop the updated relationships using the latest local seismic data and including PSA in addition to PGA and PGV. Furthermore, two realistic factors also inspire us to develop the new GMICEs. (1) Since 2008, the operation of China Digital Strong Motion Network has greatly increased the quantity and quality of the strong motion observation stations, and in recent years, the mainland of China has experienced a number of moderate-to-strong destructive earthquakes, resulting in large volumes of strong motion records and post-earthquake seismic intensity data. (2) With the rapid development of national economy, the structure of buildings, seismic resistance, and the types of earthquake damage have changed significantly, and the distribution of the peak ground motion corresponding to the seismic intensity may also change accordingly. Thus, in this study, we intend to use the recent seismic dataset of China and develop the new conversion equations between seismic intensity and PGA, PGV, and PSA.
2. Data Source and Analysis
2.1. Data Source
In this study, the seismic data consist of the strong motion records from 11 moderate-to-strong earthquakes (5.9 ≤ Mw ≤ 7.9) that occurred in the mainland of China from 2008 to 2019. We downloaded the data from the China Strong Motion Network Center (see Data Availability), which used a new digital strong motion network after 2008, and obtained a large volume of strong motion data from destructive earthquakes. The locations of the 11 earthquakes as well as the spatial distribution of the strong motion stations that recorded these events are displayed in Figure 1, the detailed information of these earthquake events is given in Table 1, and the magnitude versus hypocentral distance distribution of the dataset is shown in Figure 2. We employ the method of Yu et al. [23] to perform baseline correction and filtering and obtain PGA, PGV, and PSA (5% damping ratio) at natural vibration periods of 0.3 s, 1.0 s, 2.0 s, and 3.0 s of the two horizontal components for each record. The maximum value of the two horizontal components is used in the subsequent regression analysis.


The macroseismic intensity data (in terms of MMI scale) were compiled from the isoseismal maps and damage reports of the past earthquakes (see Data Availability). The isoseismal maps were obtained by detailed post-earthquake survey and site evaluations after the earthquakes at the meizoseismal areas by reconnaissance teams. All the maps and bulletins used in this paper were prepared by the China Earthquake Administration, involving the MMI range from V to X. For the past earthquakes with an observed isoseismal map or a detailed earthquake damage report from which MMI values were easy to obtain, the peak ground motion corresponding to MMI can be paired straightforwardly. We assigned one MMI value for each recorded accelerogram considering the location of accelerograph stations on the isoseismal maps or considering the observed intensity value near the stations (≤2 km), and a total of 242 peak ground motion-intensity pairs were obtained.
2.2. Data Analysis
Using the seismic dataset described in the previous section, we have calculated the max, min, and mean values, as well as the standard deviations of PGA, PGV, and PSA (hereafter denoted as PGM, i.e., peak ground motion) in logarithmic scale for different intensity units ranging from V to X (Table 2). It can be seen from Table 2 that the discreteness of seismic parameters in each intensity unit is very large, but the mean value has a good positive correlation with the intensity value. We also draw the frequency distribution of the datapoints in each intensity unit for logPGM (Figures 3 and 4, for PGA and PGV; Figures 5–8, for PSA). From these figures, we can see that the sample data in intensity units of V to VII have approximate normality, which is less obvious in the intensity units of VIII to IX due to its smaller amount of data. It also should be noted that the mode and mean values of PGA (Figure 3) in each intensity unit in our dataset are evidently higher than the corresponding values of Liu et al. [21]. This significant finding further illustrates the necessity to re-examine the GMICEs used in China and make necessary improvements.



(a)

(b)

(c)

(d)

(e)

(a)

(b)

(c)

(d)

(e)

(a)

(b)

(c)

(d)

(e)

(a)

(b)

(c)

(d)

(e)
In addition, the dataset in this study is also compared with the datasets of America and Turkey [2–5, 8] (Figure 9, taking PGA and PGV as examples). It can be seen from Figure 9 that the distribution of Chinese data is inconsistent with that of the other two regions. As a whole, the PGA of Chinese data in each of the intensity units is smaller than that of America and larger than that of Turkey, and the PGV of Chinese data is smaller than that of America and Turkey. This regional difference of the data distribution indicates that the GMICEs for a certain area must be obtained using the local seismic dataset, and the introduction of nonlocal seismic data may lead to inaccurate results. Therefore, only the seismic dataset of China (Table 1) is used in this study to obtain the most suitable GMICEs for local use.

(a)

(b)
3. Regression Analysis
3.1. Outlier Test of Data
To ensure the accuracy of the regression results, it is necessary to screen and delete outliers from our dataset prior to the statistical analyses. Because the conversion equations between MMI and PGM are usually characterized by a linear function with respect to logPGM, it is necessary to search for outliers in the dataset of logarithmic scale for each intensity unit.
Because the sample data of logPGM in most of the intensity units approximately obey the normal distribution, according to the recommendations in Chinese National Standard GB/T 4883-2008 (Statistical Interpretation of Data: Detection and Treatment of Outliers in the Normal Sample) [24], we conduct the skewness-kurtosis test on sample data in intensity units ranging from V to VII because of the high number of samples in these intensity units. Conversely, there are fewer sample data in the intensity units of VIII and IX, and the Dixon test is used to identify the outliers. In addition, the outlier test is not performed for the intensity unit of X because there is only one sample data in this intensity unit.
According to the test results, we find only one straggler of logPGA value (0.38), which came from the 51BTD station located at the intensity V region in the 2014 Mw6.5 Ludian earthquake. This station, consisting of a MR2002/SLJ-100 seismometer, is located in Butuo County, Liangshan Prefecture, Sichuan Province. Its Vs30 is 368.12 m/s, and the observation site is the soil layer. Butuo County is built on a valley with special topography, which may lead to an abnormal PGA value that should be deleted from our dataset. It is also worth noting that the sample data of MMI X are not used in the subsequent analysis, which is because we use the mean value of the sample data in each intensity unit to develop the regression but there is only one sample data in MMI X. To avoid the fitting deviation caused by the single data and ensure the overall stability of the regression results, we delete the data of MMI X from our dataset. As such, our conversion equations are only applicable within the intensity range of V ≤ MMI ≤ IX.
3.2. Regression Results
In this present study, the orthogonal regression, which is also known as total least square regression or Deming regression [25], has been used to fit our MMI-logPGM data pairs and develop the GMICEs for China. The orthogonal regression calculates the residuals as the minimum perpendicular distance from a point to the regression line. It considers the error on both the dependent and independent variables and has an important advantage for making the equations reversible. This is very useful for GMICEs because we can use the reverse equations to analyze the historical earthquakes which have only seismic intensity information but no ground motion measure data. In addition, according to the statistical results described in the previous section that the low intensity units have a large number of samples while the high intensity units have a small number of samples, we use the mean value of the sample data in each intensity unit to develop the regression to assure the equal weight for all intensity units.
The GMICEs to estimate MMI from PGM are provided in the following equation:in which PGM corresponds to PGA, PGV, PSA (0.3 s), PSA (1.0 s), PSA (2.0 s), and PSA (0.3 s) and c1 and c2 are the fitting coefficients, where c1 is the slope obtained from the linear regression and c2 is the intercept.
Given the reversibility of the equation, the equation for calculating PGM from MMI is
Table 3 provides a summary of the coefficients in equations (1) and (2), as well as the corresponding standard deviations of the residuals obtained by applying these equations to the complete dataset, for both logPGM and MMI. Regression lines for PGA and PGV are shown in Figure 10 (regression lines for PSA are shown in Figure 11).

(a)

(b)

(a)

(b)

(c)

(d)
In Figure 10(a), we take the MMI-PGA relationship derived by Liu et al. [21] as a reference. It is clearly observed that there are significant differences between the two relationships. Under the same value of PGA, the MMI calculated by the equation of Liu et al. [21] is greater than that obtained by the formula developed in this study. The possible reason is because nowadays the buildings in China are more earthquake resistant than those in 1980. Similar trends can be found in the comparison of MMI-PGV relationships shown in Figure 10(b), which is less obvious than that in Figure 10(a).
4. Residual Analysis for Magnitude, Distance, and Depth Dependence
We perform a residual analysis of the dataset to identify possible trends related to earthquake magnitude, hypocentral distance, and hypocentral depth. The objective is to quantify the dependence on these factors and calculate residual corrections. Based on the regression relationships in equation (1), we calculate the residuals of MMI values (i.e., the difference between the observed MMI and the predicted MMI) and plot the trends of residuals with moment magnitude, hypocentral distance, and hypocentral depth (Figures 12–14, for PGA and PGV; Figures 15–17, for PSA).

(a)

(b)

(a)

(b)

(a)

(b)

(a)

(b)

(c)

(d)

(a)

(b)

(c)

(d)

(a)

(b)

(c)

(d)
To eliminate the dependence of residuals on magnitude, distance, and depth, it is necessary to introduce new variables into the prediction model (equation (1)). Here an approach similar to Worden et al. [6] is adopted. Firstly, we introduce the magnitude correction term:where M is the moment magnitude and c3 and ct1 are the parameters of the line defining the trend in residuals with magnitude, which are obtained from linear least-squares fit. We then add this correction term to equation (1) and yield an intermediate relationship:
Secondly, using the intermediate relationship (equation (4)), we calculate the new residuals of MMI values and introduce the distance correction term:where R is the hypocentral distance and c4 and ct2 are the parameters of the line defining the trend in residuals with distance. Similarly, the distance correction term is added to equation (4) to get an updated intermediate relationship:
Lastly, using this new relationship (equation (6)), we calculate the depth correction term:where D is the hypocentral depth and c5 and ct3 are the parameters of the line defining the trend in residuals with depth. Then, we can obtain the improved prediction relationship:where c6 = ct1 + ct2 + ct3.
Table 4 lists the fitted coefficients and the new standard deviations of residuals after applying the correction terms. When we examine the residuals obtained using equation (8) to predict MMI (instead of equation (1)), we note that the trends have been removed. Moreover, as can be seen from a comparison of the sigma values in Table 4 with those in Table 3, the inclusion of magnitude-distance-depth correction terms reduces the standard deviations of the residuals for all ground motion types.
5. Comparison of the Proposed GMICEs with Previous Studies
The GMICEs developed in this study (equation (1)) are compared with similar relationships developed for different regions worldwide [2, 4, 5, 7, 8, 10–12, 17, 20–22, 26–28] to assess the possible variations from one region to another. The corresponding equations derived by various researchers for comparison are given in Table 5. Figures 18 and 19 present these comparisons in terms of logPGA, logPGV, and logPSA, respectively.

(a)

(b)
As can be seen from Figure 18(a), there are significant discrepancies in the various studies for MMI-PGA relationships. Liu et al. [21], Trifunac and Brady [20], and Faenza and Michelini [2] overestimate intensity values at all intensity levels, whereas Wald et al. [3] and Worden et al. [6] underestimate the intensity values approximately for all intensity levels. Additionally, Atkinson and Kaka [5], Ma et al. [27], Caprio et al. [10], Du et al. [22], Ahmadzadeh et al. [12], and Mortalla et al. [11] present different slopes leading to underestimating the values at low levels and overestimating at higher levels.
By comparing the MMI-PGV relationships in Figure 18(b), it is observed that Liu et al. [21], Atkinson and Kaka [5], Du et al. [22], and Ahmadzadeh et al. [12] are nearly consistent with our proposed relation. Faenza and Michelini [2] overestimate intensity values at all intensity levels, whereas Wald et al. [3], Worden et al. [6], and Mortalla et al. [11] underestimate the intensity values for all intensity levels. Additionally, Ma et al. [27] and Caprio et al. [10] overestimate the intensity values at low levels but underestimate at higher levels, whereas Trifunac and Brady’s study [20] has the opposite trend, leading to underestimating the values at low levels and overestimating at higher levels.
Similar discrepancies can be found from the comparison of MMI-PSA relationships shown in Figure 19. It should be also noted that, when comparing the MMI-PSA curves under different natural vibration periods (0.3 s, 1.0 s, 2.0 s, and 3.0 s) for a given intensity unit, the PSA value is larger in the shorter period and smaller in the longer period, which is consistent with the results of Worden et al. [6] and Panjamani et al. [26]. This interesting observation may be related to the types of buildings and structures in the study areas.

(a)

(b)

(c)

(d)
Through the above analysis, we can see that the GMICEs obtained by different scholars exhibit a big discrepancy. Because these relations are obtained from the strong motion records and seismic intensity dataset in different regions, the differences between them are believed to originate from the variable ground motion characteristics(frequency components, duration, and so on) as well as different building and damage styles in these regions. As a result, it is important to note that the GMICEs carry regional characteristics and should only be applied in regions with similar seismic design and construction practices.
6. Regional Scaling Factor for Global Relationships
To quantify the differences to other regions in the globe by having the global relationships as reference, as well as providing an alternative approach to the GMICEs proposed here, a regional correction factor for China to the global relationships proposed by Caprio et al. [10] is applied. Through a residual analysis of the global relationships applied to the China database, the mean and variance are calculated and defined as the regional correction factor. Caprio et al. [10] gave the regional correction factors for Italy, Greece, and California, and our regional correction factor for China can now be added to the global relationships, as shown in the following equation:
These equations indicate that the global relationships underestimate the China dataset by about 0.23 ± 0.96 intensity units for PGA and 0.03 ± 0.81 for PGV. It should be noted that since there is no global conversion relationship for PSA at present, we cannot get the regional correction factor of PSA for China in our study.
7. Conclusion
In this study, we use the strong motion records and seismic intensity data from 11 moderate-to-strong earthquakes in the mainland of China since 2008 and develop new conversion equations between MMI and PGA, PGV, PSA (0.3 s), PSA (1.0 s), PSA (2.0 s), and PSA (3.0 s). Unlike earlier studies in China (e.g., Liu et al. [21] and Ma et al. [27]), we elect to employ orthogonal regression and obtain the reversible relationships, which provide a stable conversion from ground motion to MMI or MMI to ground motion. It is believed that these new GMICEs proposed herein can be used in the future whenever necessary in China, particularly for preparing intensity maps, earthquake disaster and loss assessment, and other engineering applications.
We analyze the effects of the moment magnitude, hypocentral distance, and hypocentral depth on the residuals of conversion equations. By introducing a magnitude-distance-depth correction term, we obtain the improved relationships and remove the trends in residuals. It should be noted that Vs30, a factor characterizing the site conditions at the locations of interest, is also a potential independent variable in the conversion equations. However, as of now, there are numerous strong motion stations in China without detailed information on local site conditions. Therefore, we do not include a site correction term in our improved conversion relationships.
The proposed conversion equations are also compared with similar relationships from previous studies. These comparisons confirm that such relationships should be regionally dependent. Since the limited dataset used in this study mainly comes from the central and western regions of China, the conclusions in this paper are only applicable to these regions. An alternative approach is also proposed with a regional correction factor for China, suitable for the global relationships proposed by Caprio et al. [10]. This makes the regional characteristics of China comparable to other regions in the globe having the global relationships as reference.
It is also worth noting that the GMICEs in the present study should not be used beyond the PGM and MMI ranges indicated in Tables 1 and 2 (MMI range between V to IX, Mw range between 5.9 and 7.9, and hypocentral distance range covering 11.2–640.5 km), and PGM data used to develop the equations correspond to the maximum of the two horizontal components. In the future, similar studies need to be conducted with more seismic records and intensity data to further optimize and improve the GMICEs for China.
Data Availability
The strong motion records used in this study were provided by the China Strong Motion Network Center. The macroseismic intensity dataset, including the intensity maps and post-earthquake disaster investigation reports, was acquired from the provincial seismological bureaus operating under the China Earthquake Administration.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This study was supported by Basic Scientific Research Fund of Science and Technology Innovation Base of Lanzhou, Institute of Earthquake Forecasting, China Earthquake Administration (grant no. 2019IESLZ06). The authors would like to thank the China Strong Motion Network Center for providing strong motion records and the provincial seismological bureaus operating under the China Earthquake Administration for providing the isoseismal maps and damage reports of the past earthquakes.