Statistical Inference for Piecewise Affine System Identification

Jianwang, Hong; Ramirez-Mendoza, Ricardo A.; Yan, Xiang

doi:https://doi.org/10.1155/2021/4618030

Mathematical Problems in Engineering

On this page

Abstract Introduction Conclusion Data Availability Disclosure Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Emerging Trends in Integrated Engineering, Computing, and Mathematical Methods for Unmanned Vehicles

View this Special Issue

Research Article | Open Access

Volume 2021 | Article ID 4618030 | https://doi.org/10.1155/2021/4618030

Statistical Inference for Piecewise Affine System Identification

Hong Jianwang,¹Ricardo A. Ramirez-Mendoza,¹and Xiang Yan²

Academic Editor: Zain Anwar Ali

Received16 Aug 2021

Accepted08 Sept 2021

Published05 Oct 2021

Abstract

This short note studies the problem of piecewise affine system identification, being a special nonlinear system based on our previous contribution on it. Two different identification strategies are proposed to achieve our mission, such as centralized identification and distributed identification. More specifically, for centralized identification, the total observed input-output data are used to estimate all unknown parameter vectors simultaneously without any consideration on the classification process. But for distributed identification, after the whole observed input-output data are classified into their own right subregions, then part input-output data, belonging to the same subregion, are applied to estimate the unknown parameter vector. Whatever the centralized identification and distributed identification, the final decision is to determine the unknown parameter vector in one linear form, so the recursive least squares algorithm and its modified form with the dead zone are studied to deal with the statistical noise and bounded noise, respectively. Finally, one simulation example is used to compare the identification accuracy for our considered two identification strategies.

1. Introduction

Over the past few decades, the rapid evolution of computing, communication, and sensor technologies has brought about the proliferation of new dynamic systems, mostly technological and often highly complex. When to control these new dynamics systems, we must need some prior knowledge about them, i.e., their corresponding mathematical models or equations are firstly constructed for the latter controller design. The concept of the system and model is the basis of the engineering disciplines, where one is generally interested in a quantitative assessment of the behavior of a dynamical system. Therefore, it is necessary to obtain a mathematical description of it. Starting from the model, it is necessary to develop formal tools of analysis and synthesis. Basically, there are two ways of constructing mathematical models, i.e., mathematical modeling and system identification. More specifically, mathematical modeling is an analytic approach, and prior knowledge and physical insight about the considered system are used to describe the dynamic behavior of a system. System identification is an experimental approach, for which some experiments are performed on the considered system in order to learn a mathematical model from the collected input-output data by means of parameter estimation.

The goal of system identification is to derive one appropriate mathematical model or equation for the considered unknown system; then, this constructed mathematical model is a reference basis for the next controller design. During these years, lots of research studies exist on linear system identification, but the linear system is one ideal case in practice, as we all know that all phenomena are nonlinear in nature. The research direction corresponds to the linear system identification; then, the obtained results for linear system identification are tried to extend to more general case-nonlinear system identification. The most difficulty for nonlinear system identification is about which nonlinear form is used to describe the considered nonlinear phenomenon. After extending the nonlinear phenomenon as one detailed expression, then all the obtained results about linear system identification are modified to be suited for nonlinear system identification. One more widely used way to extend the nonlinear phenomenon is that orthonormal function expansion or its generalized form. As the neural network can approximate the nonlinear system with low approximation error, this nice property of the neural network attracts lots of researchers in studying nonlinear system identification based on orthonormal function expansion and neural network. The detailed description based on nonlinear system identification is omitted due to space limitations. Our mission in above explanation is to show research on nonlinear system identification is worth, and it is more suited for practical phenomenon in nature.

To apply the obtained or existed results on linear system identification to nonlinear system identification well, the nonlinear phenomenon is formulated as almost one special linear form, i.e., this special linear form has not only linear form but also some properties, coming from nonlinearity. This special linear form is our considered piecewise affine system here, which shows one nonlinear form in the whole region, but one linear form in each subregion. It is a trade-off between the linear system and nonlinear system. Furthermore, if we can determine which region it belongs to, then it reduces to linear system identification. It means all results on linear system identification are applied directly, such as identification algorithm, optimal input design, and model structure validation.

Due to the trade-off for this piecewise affine system, it bridges the closed relation between linear system identification and nonlinear system identification, so the identification for this piecewise affine system is widely studied in these recent years. For example, one convex optimization strategy-mixed integer programming is introduced to identify the piecewise affine system in [1]. Nonlinear auto-regressive with exogenous input model [2] is an input-output description, often used in piecewise affine system identification. The current output is obtained by means of a nonlinear functional expansion of lagged inputs, which outputs elementary terms [3]. The piecewise affine system reduces the number of model terms and combinations of elementary terms, with respect to other parametric class, such as Volterra series [4]. This peculiarity is very important when the model structure is unknown since it allows to reduce the number of possible candidate structures [5], and the idea of piecewise affine means switching linear too [6]. The computational complexity on piecewise affine system identification [7] is analyzed from the point of complexity theory. Although the piecewise affine system could be in the principle done in the linear case, it is not feasible in the nonlinear one since the number of possible model terms increases rapidly with the number [8]. One recursive multiple least squares algorithm is proposed to identify piecewise affine regression [9], where multicategory discrimination is deemed for data clustering in case of stochastic noise, i.e., zero mean white Gaussian and mutually uncorrelated. Moreover, for unknown but bounded noise, existing in the piecewise affine system, a bounded error approach can achieve the identification goal in [10], where one guaranteed interval is yielded to include the parameter estimation, and the center of this guaranteed interval is chosen as the final estimation. Furthermore, other aspects are studied in detail for the piecewise affine system, such as its input-output representation [11], recursive identification algorithm [12], and statistical clustering technique [13]. The detailed review of piecewise affine system identification can be seen [14], where the similarity between our considered piecewise affine system and hybrid system is also analyzed. The classical observability and controllability of the linear system are extended for the piecewise affine system and hybrid system in [15], where observability and controllability analysis are two important factors for regulating the future state converge to its stable point. In the year, research on piecewise affine system identification are still done, for example, Federico Bianchi Valentina Breschi [16] develops a general framework to alternate between parameter estimation and sample mode assignment. In [17], given a set of input-output data, the identification of a switched nonlinear system for the underlying system involves the simultaneous identification of the mode sequence. Our previously published paper [18] proposes one zonotope parameter identification algorithm to identify the unknown parameter vector in each separated region in presence of unknown but bounded external noise. The identification idea can be combined with optimal control theory; then, the data-driven control is yielded [19]. The difference between system identification and data driven control is that the system identification is to identify the unknown plant from observed data, but the data-driven control is to apply the observed data to design the considered controller. As lots of information are included in observed data, the control task concerns on extracting these useful information from data [20]. As the number of published works on piecewise affine system identification is vast, we cannot list all of them here, but only main works in recent years. Le Quang et al. [21] investigate the identification of continuous piecewise affine systems in state space form with jointly unknown partition and subsystem matrices. A novel online fault detection and identification strategy was established for a class of continuous piecewise affine systems, namely, bimodal and trimodal piecewise affine systems in [22], where the recursive nature of the proposed scheme and the consideration of parametric uncertainties in both partitions and in subsystems parameters were proposed there.

Based on abovementioned works and our previous contribution on piecewise affine system identification, this technical note continues to do it deep. As the nice property of the piecewise affine system is that the linear form holds during each subregion, which combines the whole considered region. It means the whole considered region is divided into many subregions, and in these subregions, the form is the special linear form. Based on this explanation on the piecewise affine system, the first step for piecewise affine system identification is to classify the observed input-output data into their own subregion, as the observed data are included in different subregion. After classifying all observed input-output data, then least squares algorithm can be applied to identify the unknown parameter vector in each subregion. More precisely, assume the whole considered region is divided into different subregions, i.e., the number of unknown parameter vectors is . Our mission for piecewise affine system identification is to estimate these parameter vectors by using the observed input-output data. The identification process of applying all observed input-output data to estimate all unknown parameter vectors corresponds to previous centralized identification, which does not care any subregion without any classification. For this centralized identification, all unknown parameter vectors are obtained simultaneously. On the contrary, the other identification process of combining classification and parameter estimation is the distributed identification. For this distributed identification, firstly all observed input-output data are classified into their own subregion; then, secondly, the classified data are dealt with to estimate the unknown parameter vector in each subregion. The reason about why we study these two different identification strategy is that through comparing them, we find if the number of subregions is low, i.e., , the previous centralized identification can give a nice identification results. But when , we need the distributed identification to complete our given identification accuracy, and it means the centralized identification is useless for the case of .

The main contributions of this short note are listed as follows. (1) Two different identification strategies are proposed to identify the special piecewise affine system respectively, i.e., centralized identification and distributed identification. (2) Comparisons are given for these two identification strategies. (3) One novel least squares algorithm with the dead zone is proposed in the identification process. Generally, these two identification strategies and least squares algorithm with the dead zone can be widely applied in other research fields, such as signal processing, spectral estimation, and adaptive control.

This short note is organized as follows. In Section 2, the basic piecewise affine system is presented, and its identification problem is also given. In Section 3, the previous centralized identification is proposed to estimate all unknown parameter vectors simultaneously, through introducing one discrete variable and least squares algorithm. Furthermore, the distributed identification is given to classify the observed data and estimate the unknown parameter vectors step by step in Section 4. To complete the drawback of our previous work, one lease squares algorithm with the dead zone is proposed to consider the unknown but bounded noise. Section 5 gives one example to compare these above identification strategies and illustrate our proposed theory. Finally, conclusions and comments about next work are presented in Section 6.

2. Problem Statement

2.1. Piecewise Affine System

Consider the following linear form, existing in linear system identification many times aswhere, in equation (1), and are the corresponding input and output signal at time instant .

Parameters and are the unknown parameters, which are needed to identify in the latter study. Two model orders and are given or prior known. is one external noise or disturb, and it may be one statistical noise or bounded noise. To apply the existing identification algorithm in estimating these two kinds of unknown parameters and , linear form (1) is always rewritten as one single linear regressor form, i.e.,where regressor vector and parameter vector are defined as follows:

In regressor vector , and are also named as time delay.

When to extend classical linear regressor form (2) into our considered piecewise affine system, we observe equation (2) is always time invariant within the whole region. Then, we divide the whole region into different subregions, so in each subregion, parameter vector is not the same with each other. This property makes us reformulate the original linear regressor form (2) as the other piecewise affine form, i.e.,where in equation (4), is the number of subregions, i.e., .

Equation (4) means firstly we need to know which subregion the regressor vector belongs to; then, secondly, one approximated parameter vector is used to correspond it. Comparing the original linear regressor form (2) and its extended piecewise affine system (4), the computational complexity about the identification process is increased, as only one constant parameter vector exists in equation (2), but different parameter vectors are needed to be identified in equation (4).

2.2. Main Mission

Those subregions constitute the whole considered region . From a practical point of view, these subregions are always assumed to be convex and connected, i.e., their closely relations are shown in Figure 1, where and . As all regions have one common equilibrium point, i.e., origin point, and this origin point means all elements in that regressor vector correspond to zero, i.e., , it is no useful in analyzing, so we always delete this special original point to let .

From above description about the piecewise affine system, our mission is to estimate these different unknown parameter vectors by using only the input-output data , where is the number of observed input-output data. Then, this short note proposes two identification strategies-centralized identification and distributed identification to combine the least squares algorithm.

3. Centralized Identification

Observing that piecewise affine system (4), we must know which subregion the regressor variable belongs to; then, its corresponding unknown parameter vector is used. The above process means one data clustering process in machine learning. Here, one centralized identification is introduced without the above data clustering process, i.e., all observed input-output data are used to estimate those unknown parameter vectors simultaneously.

Introduce one discrete variable , satisfyingusing this defined discrete variable ; then, equation (4) is rewritten as follows:

Reformulating equation (6) as follows:where the new and are defined as follows:

Based on our defined discrete variable , then piecewise affine system (4) can be also reformulated as one linear regressor form (7). The unknown parameter vector , existing in equation (7), can be obtained easily through the classical least squares algorithm, i.e., the parameter estimation is given as thatwhere denotes the number of observed input-output data. Furthermore, to increase the computational speed, its recursive least squares algorithm is thatwhere and are the parameter estimations at time instant and , respectively.

To start the above recursive least squares algorithm, the initial parameter value can be chosen aswhere is a column vector with element 1.

From equation (8), we see that the computational complexity of this centralized identification is vast, as the dimensions of the regressor and parameter vector is ( is the dimension of each parameter vector ), so if and are all small, then this centralized identification holds.

4. Distributed Identification

Distributed identification is different from centralized identification, as the observed input-output data are used to estimate each unknown parameter vector , not the total unknown parameter vector . But one obvious problem appears, it is about which parameter vector is determined by the observed input-output data, and we must be sure that the observed input-output data, used to estimate that unknown parameter vector , belong to that corresponding subregion , so firstly one additional process is added here.

Let

When given one observed data , we need to classify it in its right subregion, through computing its projection on each subregion or set, i.e.,where denotes one norm.

Identify its minimal component and treat this index of this observed data as our guess from the number of the class to which belongs, i.e.,

Then, we regard the following fact:

By the way the idea case is one index, satisfying , it means

After we divide all observed input-output data into their right subregions, then we describe the considered piecewise affine system as follows:where number satisfies

Based on above clustering or classifying process, then some observed input-output data, belonging to its corresponding subregion, are applied to estimate only one corresponding unknown parameter vector. For notational convergence only, we use the observed input-output data to estimate the unknown parameter vector . It is rewritten as follows:

Least squares algorithm (9) and its recursive form (10) are applied to identify the unknown parameter vector .

From the existing results in linear system identification, the identification result is nice in case of noise . It means when external noise is a white noise, then the parameter estimation is unbiased and consistent. But this nice result does not hold for bounded noise. Our previously work [18] proposes zonotope parameter identification in case of bounded noise. As knowledge of zonotope is little for some readers, so here we modify the recursive least squares algorithm (10) into one new least squares algorithm with the dead zone.

As that external noise is bounded, without loss of generality, assume that , where is one upper bound. This new least squares algorithm with the dead zone is listed as follows:where coefficient and

The above iterative algorithm is terminated by . After subtracting the true parameter vector on both sides of equation (20), we havewhere and are parameter errors at time instant and . Due to the definition of , it holds thatwhere in equation (23), is one model error, i.e.,

From an applied point of view, for any , we have

Based on our above mathematical derivation, we see is a decreasing sequence, and it also satisfies that

It means the parameter estimation , obtained by the new least squares algorithm with the dead zone, converges to its true parameter value with the iterative step increases. The parameter error is one decreasing sequence, not related with the upper bound .

5. Simulation Example

In this simulation part, one example is used to prove the efficiency of our proposed theories in this short note. The piecewise affine system is defined as follows:

It meansor

Firstly, the centralized identification strategy is studied to estimate the parameter vector simultaneously through the observed input-output data. Sinusoidal signal is used to excite the above piecewise affine system, and Figure 2 shows our chosen sinusoidal signal. We measure the corresponding output data by some physical devices or sensors; then, the observed output signal is plotted in Figure 3. Combining our observed input-output data in Figures 2 and 3, three unknown parameters are identified by the recursive least squares algorithm in case of a white noise, and the corresponding identification result for the centralized identification strategy is shown in Figure 4, where the true parameters and their estimations correspond to each other. It tells us when the number of subregions is 3, the identification result for the centralized identification strategy is tolerable.

Secondly, an additional subregion is added in equation (24), i.e.,

The observed input-output data are also seen in Figures 2 and 3, where the number of subregion is 4. To compare our mentioned two identification strategies for estimating these four unknown parameters, we simulate them in the framework of bounded noise, i.e., . For this case of bounded noise in the piecewise affine system with four subregions, the least squares algorithm for centralized identification and least squares algorithm with the dead zone for distributed identification are all considered to estimate those four unknown parameters. We do not plot the parameter estimations; just like in Figure 4, here we substitute the parameter estimations into the original piecewise affine system to get the identified model, and the original system is named as the true model. Two identification results are plotted in Figure 4, where the black dotted line and red line are for centralization identification and distributed identification, respectively. Their output responses correspond to the true output and identified output. Figure 5 and 6 show the comparisons between the true output and identified output for centralized identification and distributed identification, respectively.

More specifically in Figure 5, the black curve is the true output and red curve is the identified output from centralized identification. Then, we see the deviation error between these two curves is not accepted due to the large deviation error. But for the distributed identification in Figure 6, the blue curve denotes the identified output. Then, these two curves are very closely to each other, it means their deviation error is little, and the parameter estimations converge to their corresponding true values. Furthermore, from Figures 5 and 6, if the number of subregions does not exceed to 3, then centralized identification is efficient. But on the contrary, it is better to apply that distributed identification to complete our identification task.

6. Conclusion

As the piecewise affine system is one trade-off between the linear system and nonlinear system, further analysis on the piecewise affine system is considered here. From the point of knowledge about classification process or not, centralized identification and distributed identification are studied in detail. To deal with statistical noise and bounded noise, the recursive least squares algorithm and its modified form with the dead zone are applied to estimate the unknown parameter vector. As this short note is our continuous contribution on piecewise affine system identification, the detailed computational complexity of our considered two identification strategies is our next ongoing work.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Disclosure

This paper was not presented at any IFAC meeting.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was partially supported by Jiangxi Provincial National Science Foundation (no. 20202BAL202012).

References

J. Roll, A. Bemporad, and L. Ljung, “Identification of piecewise affine systems via mixed-integer programming,” Automatica, vol. 40, no. 1, pp. 37–50, 2004.
View at: Publisher Site | Google Scholar
G. Ferrari-Trecate, M. Muselli, D. Liberati, and M. Morari, “A clustering technique for the identification of piecewise affine systems,” Automatica, vol. 39, no. 2, pp. 205–217, 2003.
View at: Publisher Site | Google Scholar
A. Lj Jouloski and S. Weiland, “A Bayesian approach to identification of hybrid systems,” IEEE Transactions on Automatic Control, vol. 50, no. 10, pp. 1520–1532, 2013.
View at: Google Scholar
H. Ohlsson and L. Ljung, “Identification of switched linear regression models using sum-of-norms regularization,” Automatica, vol. 49, no. 4, pp. 1045–1050, 2013.
View at: Publisher Site | Google Scholar
F. Lauer, G. Bloch, and R. Vidal, “A continuous optimization framework for hybrid system identification,” Automatica, vol. 47, no. 3, pp. 608–613, 2011.
View at: Publisher Site | Google Scholar
F. Lauer, “On the complexity of switching linear regression,” Automatica, vol. 74, no. 12, pp. 80–83, 2016.
View at: Publisher Site | Google Scholar
F. Lauer, “On the complexity of piecewise affine system identification,” Automatica, vol. 62, no. 12, pp. 148–153, 2015.
View at: Publisher Site | Google Scholar
L. Bako, “Identification of switched linear systems via sparse optimization,” Automatica, vol. 47, no. 4, pp. 668–677, 2011.
View at: Publisher Site | Google Scholar
V. Breschi, D. Piga, and A. Bemporad, “Piecewise affine regression via recursive multiple least squares and multicategory discrimination,” Automatica, vol. 73, no. 11, pp. 155–162, 2016.
View at: Publisher Site | Google Scholar
A. Bempora, A. Garulli, S. Paoletti, and A. Vicino, “A bounded-error approach to piecewise affine system identification,” IEEE Transactions on Automatic Control, vol. 50, no. 10, pp. 1567–1580, 2005.
View at: Publisher Site | Google Scholar
S. Paoletti, J. Roll, A. Garulli, and A. Vicino, “On the input-output representation of piecewise affine state space models,” IEEE Transactions on Automatic Control, vol. 55, no. 1, pp. 60–73, 2010.
View at: Publisher Site | Google Scholar
L. Bako, K. Boukharouba, E. Duviella, and S. Lecoeuche, “A recursive identification algorithm for switched linear/affine models,” Nonlinear Analysis: Hybrid Systems, vol. 5, no. 2, pp. 242–253, 2011.
View at: Publisher Site | Google Scholar
H. Nakada and K. T. Takaba, “Identification of piecewise affine systems based on statistical clustering technique,” Automatica, vol. 41, no. 5, pp. 905–913, 2005.
View at: Publisher Site | Google Scholar
S. Katayama and A. L. Jouloski, “Identification of hybrid systems a tutorial,” European Journal of Control, vol. 13, no. 2, pp. 242–260, 2008.
View at: Google Scholar
A. Bemporad and G. Ferrari Trecate, “Observability and controllability of piecewise affine and hybrid systems,” IEEE Transactions on Automatic Control, vol. 45, no. 10, pp. 1864–1876, 2010.
View at: Google Scholar
F. B. Valentina Breschi, “Model structure selection for switched NARX system identification: a randomized approach,” Automatica, vol. 125, no. 2, Article ID 109415, 2021.
View at: Google Scholar
F. Bianchi and M. Prandini, “A randomized two stage iterative method for switched nonlinear system identification,” Nonlinear Analysis-Hybrid Systems, vol. 35, no. 4, Article ID 100818, 2020.
View at: Publisher Site | Google Scholar
J. Wang, “Zonotope parameter identificaiton for piecewise affine systems,” Journal of Systems Engineering and Electronics, vol. 31, no. 5, pp. 1077–1084, 2020.
View at: Google Scholar
J. Berberich, J. Kohler, M. A. Muller, and F. Allgower, “Data-driven model predictive control with stability and robustness guarantees,” IEEE Transactions on Automatic Control, vol. 66, no. 4, pp. 1702–1717, 2021.
View at: Publisher Site | Google Scholar
J. Berberich, M. A. Muller, and A. Frank, “A computationally efficient robust model predictive control framework for uncertain nonlinear system,” IEEE Transactions on Automatic Control, vol. 66, no. 2, pp. 794–801, 2021.
View at: Publisher Site | Google Scholar
T. Le Quang, T. Van den Boom, and S. Bold, “Online identification of continuous bimodal and trimodal piecewise affine systems,” in Proceedings of the 2016 European Control Conference, Aalborg, Denmark, July 2016.
View at: Publisher Site | Google Scholar
N. Moustakis, B. Zhou, and T. Le Quang, “Fault detection and identification for a class of continuous piecewise affine systems with unknown subsystems and partitions,” International Journal of Adaptive Control and Signal Processing, vol. 32, no. 7, pp. 980–993, 2021.
View at: Google Scholar

Copyright

Copyright © 2021 Hong Jianwang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies