Abstract

In this study, an improved second-order difference plot is proposed to analyze the variability of heart rate variability. Although the variation of the physiological status of the cardiovascular system can be shown graphically by the second-order difference plot, the descriptive ability of existing indicators for this plot is insufficient. As a result, the physiological information contained in the second-order difference plot cannot be extracted adequately. Addressing the problem, the temporal variation measure analysis is presented to describe distribution patterns of scatter points in the second-order difference plot quantitatively and extract the acceleration information for a variation of heart rate variability. Experiment results demonstrate the effectiveness of the temporal variation measure analysis. As a quantitative indicator, the temporal variation entropy is properly designed and successfully applied in the recognition and classification of the physiological statuses of the heart.

1. Introduction

An electrocardiogram (ECG) records the abundant physiological information of the heart, which has been widely applied in the diagnosis for a variety of cardiovascular diseases. The research of the analytical method of ECG is always a hot spot in the field of biomedical signal processing [15]. The nonlinear analyses are usually employed to analyze ECG since it reflects a nonlinear characteristic of the biomedical signal. In numerous nonlinear analytical algorithms, the Poincaré plot analysis and the second-order difference plot analysis (SODP) are classical and often employed in the research of heart rate variability (HRV) [616].

The theoretical foundations of the Poincaré plot and SODP is state space reconstruction [17], which has been widely applied in various fields of research [18, 19]. Based on the theory, the motion state of a nonlinear system can be well reconstructed and revealed in a reconstructed state space. The system performance can further be adequately analyzed via reconstructed trajectories. Poincaré plot is a two-dimensional space constructed by a time series. The plot has the function to graphically reveal the physiological status of the cardiovascular system. Based on Poincaré plot, HRV can be effectively analyzed and various cardiovascular diseases display distinct distribution patterns [2022], for instance, the heart rate of a healthy human appears “comet-shape,” the patient with serious heart failure is “fan-shaped,” and the heart rate produced by the artificial pacemaker is “round.” For describing these distribution patterns of scatter points quantitatively, many analysis algorithms were presented, such as the minor and major axis of the fitted ellipse (SD1, SD2) [23], the fitted ellipse area of scatter points (S) [24], Ehler index (EI) [25], Guzik index (GI) [26], Porta index (PI) [27], and the local distribution entropy [6]. Numerical quantitative indicators have been proposed during the past decade, making Poincaré plot analysis widely applied in biomedical signal analysis.

As an extension of the Poincaré plot, Cohen et al. [28] presented the second-order difference plot and central tendency measure (CTM) based on the chaos theory in 1996, which were firstly used to quantify the variability of the physiological status of the heart. As SODP is constructed by the second-order difference of successive RR intervals, the variation of RR intervals can be reflected in a two-dimensional space graphically with various distribution patterns of the reconstructed data points representing different physiological statuses of the cardiovascular system. Although a great deal of information on the variability of HRV is contained in SODP, a bit of information is hard to be adequately extracted by existing quantitative indicators yet. These indicators only depict the overall distribution of points in SODP, whereas the variation tendency for successive RR intervals contained in a single point is ignored natively.

The motivation of the study comes from the existing limitations of SODP. We mainly focus on the relationship between the position of the reconstructed scatter point in SODP and the increment of variation of RR intervals. The information of increment reflects the variation tendency of heart rate within a short time and helps us analyze the physiological status of the heart more comprehensively. On the basis of our previous research studies [6, 2933], we design an improved second-order difference plot in this study and a more effective feature of the physiological status of the heart can be extracted via the novel algorithm. The main contributions of this paper are as follows.(1)Temporal variation measure algorithm (TVM) is presented in this paper. Via TVM, both the information of the variation and the variation tendency for successive RR intervals can be fused and graphically reflected in a three-dimensional second-order difference plot.(2)For TVM, the temporal variation entropy is proposed, which can assess the distribution of points in the three-dimensional second-order difference plot more objectively.

The arrangement of this paper is organized as follows. SODP are briefly introduced in Section 2. Section 3 provides the theory and steps of TVM in detail. In Section 4, experiments are given to demonstrate the performance of our method. Finally, the major outcomes are given in Section 5.

2. Second-Order Difference Plot Analysis

2.1. Second-Order Difference Plot

As a significant extension of the Poincaré plot analysis, SODP focuses primarily on describing the variability of successive RR intervals. Here, SODP is computed by plotting Y(n) against X(n) which are given by equations (1) and (2), respectively.where x(i) is the ith time interval of successive RR interval series.

Based on the theory of state reconstruction, the coordinates (X(i) and Y(i)) of all points can be calculated and the distribution pattern of all scatter points constructed by the time series is visually shown in SODP.

Figure 1 illustrates the SODP of a normal human. In the plot, the coordinates of point Pi are composed of three successive RR intervals. The horizontal and vertical coordinates of this point are and , respectively. In the plot, the points are located in different quadrants representing the diverse physiological statuses of heart rate. For the point Pi in the first quadrant (I), the coordinate values of X(i) and Y(i) are greater than zero, which means that x(i + 2) > x(i + 1) > x(i). Thus, these points in the first quadrant indicate that successive RR intervals increase monotonically and the heart rate decrease steadily. Oppositely, the points in the third quadrant (III) imply increasing heart rate. If the changes of three successive RR intervals are inconsistent, the corresponding points will be located in the second quadrant (II) or the fourth quadrant (IV). Therefore, SODP can describe the variability of heart rate intuitively.

2.2. Central Tendency Measure

As a matched indicator of SODP, the central tendency measure (CTM) can quantify the degree of variability of the heart rate. In SODP, CTM is a ratio of the number of points in radius r to the number of all points. The indicator is calculated as follows:

Figure 2 shows that CTM counts the number of all points in the origin-centered circles with radius r. As the quantitative feature of distribution density for scatter points, the parameter r decides the value of CTM directly, which influences the classification accuracy significantly. Nevertheless, how to reasonably choose an optimal radius r is still problematic.

SODP describes the variability of successive RR intervals graphically. The scatter points in different quadrants of SODP contain abundant physiological information. Even though some points are located in a certain quadrant, there are still much information of the cardiovascular system waiting to be analyzed further.

In Figure 3, four points are located in the same area according to the algorithm of CTM. Hence, the four points will be counted indiscriminately and used to calculate CTM. However, the four points represent the different variation tendencies of the RR interval series, which cannot be distinguished by CTM effectively. For example, as the large coordinate difference of the point P1, the point is located next to the y-axis and the position of P1 means different variation for three successive RR intervals is an incremental tendency.

Conversely, the point P2 near the x-axis reflects different variations of successive time intervals as a diminishing tendency. Therefore, CTM cannot comprehensively describe the variation of the RR intervals. Addressing the problem, an improved second-order difference plot analysis is presented in the next section, which will improve the performance of SODP further.

3. Temporal Variation Measure Algorithm

For the limitations of SODP, in this section, the temporal variation measure algorithm is introduced in detail, which is an improvement for standard SODP. The algorithm is composed of three-dimensional second-order difference plot and temporal variation entropy. In Section 3.1, the RR interval series is reconstructed in a three-dimensional second-order difference plot. In the space, the distribution patterns of reconstructed scatter points reflect more physiological information of cardiovascular system. Then, for analyzing these patterns quantitatively, a temporal variation entropy is designed in Section 3.2.

3.1. Three Dimensional Second-Order Difference Plot

Rich physiological information of the heart is shown by the distribution pattern of scatter points. In SODP, the position and Euclidean distance of points reflect the variation tendency of the RR intervals essentially, which can be explained by the concept of acceleration. However, the acceleration of the variation for the RR intervals cannot be reflected by the indicators of CTM effectively. For this limitation, the temporal variation measure algorithm is presented. Based on the novel analysis method, the change and the acceleration of the change for the RR intervals will be calculated adequately.

In Figure 4, there are three points P1, P2, and P3 in SODP. As mentioned above, the distance between the points P1 and the origin is the same as the distance between P2 and the origin, whereas the two points reflect the different conditions of the heart. On the other hand, the different distances also represent the different physiological statuses of the heart. For instance, the distance between P3 and the origin indicates that the changes of three successive RR intervals are smaller than the other two points’. Hence, the two factors of position and distance of points should be properly fused in the novel algorithm.

Various heart rate accelerations can be described by the different positions of scatter points in SODP. Those scatter points near the vertical axis of SODP reflect that the accelerations of heart rates are increasing. On the other hand, if the points are located at the abscissa axis of SODP nearby, this situation shows that the accelerations of variation for the heart rate are reducing. Hence, the relative positions between the points and coordinate axes can describe the acceleration of variation for heart rate graphically. Here, the relative position of the point in SODP can be well calculated by the parameter Dco.where Dco(i) is the difference of the vertical and horizontal coordinates of point Pi. If the parameter Dco is greater than zero and the point is closer to the y-axis, which means that the variation tendency of RR intervals is rising in a short time. Otherwise, the point is near the x-axis and the variation tendency decreases. In fact, the parameter Dco provides the acceleration information of variation for RR intervals and well depicts the laws of the variation of the heart conditions.

On the other hand, the variation of RR intervals can be described by the Euclidean distance between the point and the origin of SODP, and the RR intervals is positively associated with the distance. The distance can be calculated by the parameter L.where LE(i) is the Euclidean distance between point Pi and the origin. xi and yi are the coordinate values of Pi, respectively. is the mean of all LEs. Here, formula (6) is the sigmoid function, which can scale the distance LE. Via the nonlinear function, the Euclidean distance LE is mapped to the interval (0, 1).

For fusing the two factors further and reserving the characteristics of SODP, we design a novel coordinate for standard SODP, which is calculated as

In the formulas, two parameters Dco and L are fused in , which can describe the variation and the acceleration of variation for RR intervals quantitatively. Hence, the coordinates of points are composed of three coordinates x, y, and . In other words, the RR interval time series will be reconstructed in a three-dimensional second-order difference plot. In the novel space, the change of the RR interval series will be shown more comprehensively.

Figure 5 shows a three-dimensional second-order difference plot. The x-y plane is standard SODP in which the projections of scatter points contain all distribution patterns of standard SODP. Meanwhile, the zTVM coordinate of the scatter point provides more detailed variation tendency information of the time series, which makes up for the deficiency of SODP. Therefore, these scatter points in the high dimensional SODP can describe the variation of the RR interval time series adequately.

3.2. Temporal Variation Entropy

The scatter points are fully distributed in the three-dimensional space. Compared with standard SODP, there are more distribution patterns shown in the high dimensional space. For extracting the distribution information of the three-dimensional SODP quantitatively, a quantitative indicator is designed. Inspired by the concept of the local distribution entropy [6], the temporal variation entropy is presented in this study.

In Figure 6, for the three-dimensional SODP, the x-y plane is standard SODP in nature. The axis coordinate zTVM is perpendicular to the x-y plane. In the figure, there is a closed cubic space. The height of the space jointly depends on the maximum value and the minimum value of the zTVM coordinates of all points. The cross-section of this space is a rectangle. The maximum value and minimum value of the x-coordinate of all scatter points are chosen and the difference of two extreme values is calculated as the length of the cross-section. Similarly, the extreme values of the y-coordinate can be selected from all scatter points and we will obtain the width of cross-section.

Here, the space is divided into several subspaces appropriately. In the different subspace, these scatter points will reserve the distribution information of SODP. Meanwhile, various change tendencies are reflected by these subspaces with different heights. Therefore, we can obtain the distribution information of scatter points by counting the number of the points in every subspace. Here, the concept of entropy is employed to measure information. The temporal variation entropy is defined as follows:where N is the number of the subspace, ni is the number of the scatter points in the ith subspace, is the coordinate of the point in the jth subspace, M is the number of all points, and is the average number of points. To emphasize the quantity of scatter points and the difference of positions for these points, two weighting factors ni and are added in the formula (8). Thus, the variation of the RR interval series and the distribution pattern of scatter points can be fully described by ETV.

Here, the steps of the temporal variation measure algorithm are listed as follows:(1)Calculate the positions of points in SODP by equation (5)(2)Compute the distance between the point and the origin of SODP by equations (6) and (7)(3)Fuse the position parameter Dco and the distance parameter L, and calculate the coordinate by equation (8)(4)Select the proper number of subspaces N and calculate the temporal variation entropy ETV by equation (9)

In virtue of TVM, the variation of RR interval time series is adequately revealed in the novel space. Meanwhile, the characteristics of the time series can be quantified by the indicator ETV accurately. To assess the performance of TVM objectively, the algorithm will be applied in the classification of cardiovascular diseases in the next section. Then, the effectiveness of the method will be fully demonstrated by experimental results.

4. Application of the Temporal Variation Measure Algorithm

To verify the performance of TVM, in this section, it is applied to extract the features from the normal sinus rhythm and four cardiovascular diseases. In this experiment, five datasets are chosen from PhysioNet/PhysioBank, which are Normal Sinus Rhythm RR Interval Database (nsr2db), CU Ventricular Tachyarrhythmia Database (cudb) [34], MIT-BIH Arrhythmia Database (mitdb) [35], Post-lctal Heart Rate Oscillations in Partial Epilepsy (szdb) [36], and Smart Health for Assessing the Risk of Events via ECG Database (shareedb) [34], respectively.

In order to demonstrate the performance of TVM, here three indicators, CTM, component CTM (CCTM), and mean distance of the points within the circular radius (D) [37], are tested experimentally. Here, the quantitative indicator D is the mean distance of these points in a circular radius r. The other indicator CCTM is the development of CTM. Based on CCTM, the number of the points in the four quadrants of SODP are counted, respectively, and the CTM of each quadrant is calculated.where n is the number of all points, is the number of points in the kth quadrant of SODP.

All three indicators are derived from radius r. Thus, how to choose an optimum r is a crucial procedure, which can influence the results of the experiment directly. Figure 7 shows the values of CTM(r) and D(r) for five datasets, nsr2db, cudb, mitdb, szdb, and shareedb. In Figure 7(a), experiment results of CTM(r) indicate that there is a distinct difference for these datasets when the radius r is greater than 3. Therefore, in the experiment, parameter r is set to 3 for the indicator CTM(r). Similarly, radius r is set to 6 for the indicator D(r) in Figure 7(b).

The values of CCTM(r) in four quadrants are displayed in Figure 8. Figure 8(a) shows the values of CCTM1(r) in the first quadrant, which indicate a substantial difference when the . Figures 8(b)∼8(d) are CCTM2(r), CCTM3(r), and CCTM4(r), respectively. From the three figures, we can find that once the value r is beyond 3, a prominent difference would appear for CCTMk(r). Thus, for CCTM, the parameter r is set to 3 in the experiment.

Here, the quantitative characteristics of the five datasets are calculated by the indicators, CTM, CCTM, and D. Figures 9(a)∼9(f) show the boxplots of experiment results for CTM, D, and CCTMk (k = 1, 2, 3, 4), respectively.

For the indicator CTM, the numerical ranges of results for nsr2db, shareedb, and szdb are shown in Figure 9(a). Meanwhile, in Table 1, the means and standard deviations of nsr2db, shareedb, and szdb are 0.93 ± 0.01, 0.98 ± 0.01, and 0.94 ± 0.02, respectively. From the boxplots and numerical results, we can find that as the numerical ranges of experiment results are seriously overlapped, CTM is no longer effective as an indicator to classify or recognize the mentioned cardiovascular diseases. In Figure 9(b) and Table 1, for indicator D, considerable overlap exists between the three datasets, nsr2db, cudb, and mitdb, which means that the quantitative indicator is not sufficiently suitable to distinguish the three physiological statuses of the cardiovascular system effectively. The other figures are experiment results of CCTMk (k = 1, 2, 3, 4). Similarly, except for cudb, CCTMs for the other four datasets have no significant difference and cannot be feasibly taken as a feature to distinguish between the datasets.

The indicators are usually employed to quantify the variation of the physiological status of the heart. Apparently, for the five datasets, the mentioned indicators CTM, CCTM, and D cannot well meet the expectations of this study.

For TVM, experiment results of four quadrants are shown in Figure 10 and Table 2. The boxplots of ETV show that the results of the four quadrants are largely similar. For the four datasets nsr2db, cudb, mitdb, and szdb, the numerical ranges almost have no overlap. For example, the means and standard deviations of ETV in the quadrant I are 118.57 ± 35.39, 4.10 ± 1.20, 6.78 ± 2.41, and 16.11 ± 5.58, respectively. The statistical results indicate that, as an indicator, ETV in quadrant I has the ability to distinguish the four physiological statuses of the heart and ETV as a kind of feature can be employed in the classification and recognition of the mentioned cardiovascular diseases. In the other quadrants, there are still a great difference of ETV between the four datasets. Thus, the physiological features of the aforementioned datasets can be extracted more effectively by ETV and experimental results indicate that the indicator has the consistency of performance in the four quadrants.

On the other hand, there is a certain overlap of the numerical range of ETV between the box (interquartile range) of nsr2db and whisker (lower quartile) of chareedb. However, for the standard deviations of experiment results, there are significant differences between nsr2db and chareedb. Figure 10 and Table 2 show that the stand deviation of chareedb is greater than nsr2db’s, which means that the distribution density of the experimental results of chareedb is partly lower than nsr2db’s. Hence, the features of two datasets nsr2db and chareedb can be described effectively by ETV. Via the indicator, two datasets can be still recognized.

To adequately illustrate the performance of TVM further, here ETV and the other indicators are used for the classification of normal sinus rhythm and the other four cardiovascular diseases. Through the comparison of the classification accuracy of ETV and the other indicators, the performance of TVM will be demonstrated intuitively. As a basic classification method, in this section, the k-means algorithm is adopted to accomplish the classification task. Meanwhile, RI is used to calculate the classification accuracy, which is defined as follows:where CD and TD are the quantities of correct and total decisions, respectively.

Table 3 shows the classification results between the five datasets. Except for the classification tasks of two datasets nsr2db and chareedb, the classification accuracies of ETV in four quadrants are significantly superior to the other indicators’, which indicates that the physiological status of cardiovascular can be described by TVM more reasonably and precisely. For the classification of nsr2db and chareedb, the classification accuracy of ETV is lower than CTM and D, which means that CTM and D can better extract the feature of the dataset chareedb. However, as the significant difference of standard deviations for the two datasets, the classification accuracy of TVM should be improved further, if a density-based clustering algorithm is adopted in an experiment.

In this section, the performance of the temporal variation measure algorithm is preliminarily verified. By analyzing experiment results, TVM is suitable for properly describing the variation of RR interval time series and extracting the physiological status of the cardiovascular system precisely.

5. Conclusion

As a development of the Poincaré plot, the second-order difference plot is widely employed to depict the variation of a physiological signal. In SODP, the different distribution pattern of scatter points represents various physiological status of the heart. In the study, many quantitative indicators are introduced briefly, such as CTM, CCTM, and D. However, the distribution pattern of points in SODP are sketchily described by these indicators and the abundant physiological information of the dynamic change of the cardiovascular system cannot be adequately extracted. Therefore, the temporal variation measure algorithm is presented to remedy the limitations of the mentioned indicators partly.

In this paper, besides the study of the distribution pattern of scatter points, the focus for the research is twofold. Firstly, it is the change and span of time increment of successive RR intervals. Secondly, how can we extract the variation information of the physiological status of the heart contained in a single point. Through the research, the difference of horizontal and vertical coordinates of a point in SODP reflects the change of time increment effectively. The span of time increment can be properly depicted by the Euclidean distance between the point and the origin. Then, the temporal variation measure algorithm is designed and the steps of the algorithm is presented. Via TVM, the mentioned variation information is reconstructed in a three-dimensional second-order difference plot. In the novel space, the variation information contained in the overall and local distribution pattern of all scatter points are well retained.

Experiment results preliminarily indicate that TVM is effective for extracting more physiological features from these scatter points in SODP. For the task of classification, the sinus rhythm can be effectively identified from the other physiological statuses via TVM and the accuracy of our method is higher than the others. Therefore, TVM can be as a valuable analytical method to be applied in the physiological signal recognition and classification.

Data Availability

The data used to support the findings of this study are available at https://www.physionet.org/.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by National Natural Science Foundation (NNSF) of China (Grant no. 61867005), and Natural Science Foundation of Ningxia Province (Grant no. 2020AAC03068). The authors gratefully acknowledge the insightful comments and suggestions from the reviewers and editor, which have helped to improve the presentation.