Abstract
A distributed model predictive control method is used to transform the formation and maintenance problem of fixed-wing UAV formation during flight into an online rolling optimization problem to solve in this paper. Firstly, the state estimation model of the neighborhood UAV is established according to the relative information of the UAV. Secondly, the error state model in the three-dimensional inertial coordinate frame of the UAV is established without considering the time delay, sensor error, and external interference. Thirdly, a cost function is designed by introducing the error state of the UAV in the neighborhood. Finally, four UAVs are used to verify that under the action of the controller, the UAVs can quickly form and maintain the desired formation while tracking the reference line.
1. Introduction
In recent years, with the improvement of UAVs performance and control technology, UAVs have been widely used in both civilian [1–3] and military fields [4, 5]. The coordinated control of UAV formations to perform reconnaissance and surveillance, search and rescue, and the realization of communication relays has become a research hotspot and a difficult problem.
Although there have been many successful studies in this field, in leader-follower formation flying, most of the formation problems are solved on the premise of the existence of active communication links [6–9]. The main problem of the above studies is that the task will fail when the communication link fails. Therefore, in [10–12], it is considered to reduce unnecessary communication or realize formation control in the case of communication link failure. However, in previous studies, vision-based formation control focused on the control of mobile robots [13–16]. In [17], they employ a line-of-sight principle to assign the desired speed and develop an extended state observer based on nonperiodic sampling to provide accurate disturbance estimation with guaranteed convergence. Then, they proposed a robust antidisturbance dynamics control protocol and verified the effectiveness of the algorithm by simulation. But With the development of UAV technology, vision-based formation flight research has become an active research topic. In [18], the formation control law is proposed by using the UAV line-of-sight angle information in the neighborhood, and the formation control of the aircraft is realized by controlling the yaw angle and speed. In [19], Wilson et al. uses a quaternion-based unscented Kalman filter to fuse UAV sensors with relative visual observers to improve the accuracy of relative state estimation. In [20], an improved transfer network is used to reduce the misclassification of images and false alarms of noise during the deep learning process which is able to control the formation of two UAVs in the case of communication refusal. In [21] an appointed-time funnel control consisting of a modified funnel variable is proposed to realize the control of quadrotors suffering from environmental disturbances and parametric uncertainties.
Since in the actual application process, it is not only necessary for a single UAV to enter the formation, but also to realize the coordinated control of multiple UAVs. Therefore, an important issue in practical design is the cooperative control of multiple UAVs. In [22], in order to solve the problem of formation control of multiple UAVs, Dehghani et al. take fixed-wing UAVs as the research object, and give a leader-follower formation kinematics formula based on seeker measurement, and adopted robust controllers and disturbance compensators are used to reduce errors in speed measurements and the effects of bounded disturbances. The effectiveness of the controller when the three UAVs fly in a straight line is verified by simulation. In [23], a double-layer formation control is proposed to solve the problem of UAV formation with relative information. Reference [22] uses sliding mode control to realize the control of UAV formation based on relative information. Reference [24] builds on paper [22] to achieve UAV formation control when one UAV cannot directly measure the relative information of the virtual leader. Also to achieve state estimation, [25] embeds a sampler based on switching thresholds to implement a state observer in the case of intermittent measurements. The paper also proposed a tracking differentiator (TD) -based prescribed performance control(TDPPC) in order to enforce system profiles evolve within the redefined performance boundaries with reduced chattering. Finally, a sigmoid function-based TD is utilized to overcome the complexity explosion.
However, when the number of UAVs increases in the above centralized control method, the amount of calculation increases and the real-time performance is poor. The distributed control algorithm has been widely concerned by researchers because it can effectively reduce the overall calculation amount and improve the autonomy of a single UAV [26, 27]. Among them, distributed model predictive control (DMPC) is more suitable for UAV formation control because of its high structural fault tolerance and strong robustness [28]. For example, in [29], a rendezvous algorithm is proposed for heterogeneous unmanned clusters based on DMPC under limited communication. Reference [30] is focused on problem of time extension and packet loss of the networked control system of UAV with DMPC and event-triggered. Moreover, DMPC is used to avoid collision [31, 32]. In [33], state keeping, obstacles avoidance and precision formation control are performed with the use of DMPC based on introducing binary variables. However, a fundamental distinction of the above methods, this paper adopts different nonlinear error models and control methods to realize formation control which can reflect the relationship between the input and the error more intuitively.
The main contribution of this paper compared with existing research in the literature devoted to three dimension UAV formation control can be expressed in two points of view:(1)Compared with existing studies [17, 21, 22], those in two-dimensional or simple three-dimensional planes, this paper will realize the control of formation in three-dimensional complex trajectories. Unlike [22], which can only achieve formation control of UAVs with direct access to relative information, and [24], which can achieve formation control of four unmanned vehicles with one UAV cannot directly measure the relative information of the virtual leader, the algorithm in this paper can not only achieve formation control with only neighboring UAV information, but also achieve formation control in complex paths of multiple UAVs.(2)A new cost function is designed by introducing the UAV error state model in the formation neighborhood, and the stability of the formation system is proved by using the Lyapunov function. And different from [21, 24] which requires the use of large value inputs to achieve the stability of the estimated state, this paper adopts the method of DMPC, which considers the UAV state as well as the input constraints to avoid generating large loads on the UAV and can effectively ensure the robustness of the system.
The rest of the paper is organized as follows. Section 2 establishes a stand-alone motion model of UAV, and the neighborhood UAV state estimation model. In Section 3, we derive a three-dimensional error state model through the stand-alone motion model and design the DMPC controller and give the algorithm process. In Section 4, we present the analysis process of the sufficient conditions for the convergence of the single-machine error model and the stability of the formation system. Section 5 conducts simulation and numerical analysis of three examples through MATLAB. Finally, in Section 6, we draw conclusions and give suggestions for future work.
2. Problem Modeling
In order to introduce the UAV formation movement and modeling process, we introduce three coordinate frames , , and as the inertial coordinate frame, the follower velocity frame of UAV , and the line of sight frame. The origin of the follower velocity frame and the line of sight frame is the mass point of UAV. The relative relationship between the coordinate frames is shown in Figure 1.

2.1. Single UAV Motion Model
Assuming that there are fixed-wing UAVs in the formation, the dynamics between UAVs are decoupled, the influence of the roll of the UAVs is not considered, and the influence of the external wind effect is ignored. By considering as the follower velocity vector with respect to the frame , we havewhere is the rotation matrix of with respect to , is the speed of the th follower UAV, and is the yaw angles of follower UAV, and is the pitch angle of follower UAV. is the position vector of the UAV in the inertial coordinate frame.
By denoting as the velocity differential in inertial frame, as the angular velocity vector in the follower velocity frame , as the acceleration vector in the follower velocity frame, we have the following Coriolis equation:
By denoting as the states of followers, and by using equations (1) and (2), the motion equation of follower can be express as follows:
By considering as the input of the system, we have the expression of the UAV state space as follows:where is the corresponding nonlinear function.
2.2. Neighborhood UAV State Estimation Model
By denoting as the relative distance between UAV and , and are the relative pitch and yaw angles; we have the relative kinematics of the leader-follower, as shown in Figure 2.

By assuming as the vector of position of UAV with respect to line of sight frame , we have the position vector of UAV in the inertial framewhere . By introducing as the relative angular velocity in the frame , we havewhere . Moreover, the velocity of UAV can be described as follows:
Then, the velocity of UAV in frame can be described as follows:where is the Euclidean norm of vector. By taking the time derivative of equation (6), we can getwhere . Moreover, the angular velocity of can be given as follows:
By assuming which means that the UAV does not roll to simplify the model, we have
By considering equations (9) and (11), we can have the acceleration of UAV in the frame as follows:where . Then, the state estimator of the UAV can be described as follows:where . In the following DMPC algorithm design, the state of UAV in equation (10) will be used as the reference state of the UAV that cannot directly measure the relative state of the leader.
3. Distributed Model Predictive Controller Design
In view of the problem that there is no accurate modeling of the three-dimensional space UAV formation error model in the existing literature, this section first establishes the error model based on the state estimation model in the previous section. Then, a cost function based on the error model is given.
Without loss of generality, we give two assumptions of the algorithm as follows:
Assumption 1. Only the leader can obtain the reference line. Follower UAVs can identify other UAV numbers and obtain relative information of UAVs in the neighborhood without delay according to the information flow transmission topology.
Assumption 2. The algorithm does not consider the influence of external wind and sensor noise on the UAV.
3.1. State Error Model
Different from [22] and [24], in order to better realize formation control, this paper firstly establishes the UAV error state model in three-dimensional. By assuming as the reference state of UAV , the error between UAV and its reference state in the frame can be described as follows:
By taking the time derivative of above equation, one can getwhere , . By using equations (3) and (11), the error state can be expressed as follows:
To simplify the calculation, define a virtual input:
In actual operation, the input can be calculated according to equation (17). By linearizing equation (16) at , we havewhere
3.2. Formation Control Design
The DMPC-based control design can be described as, at time , the UAV solves the optimal control problem with the predicted number of steps starting from the initial error . In the UAV formation, we design different cost function for leader and followers, which can be expressed as follows:
3.2.1. Cost Function of Leader
3.2.2. Cost Function of Followers
where represents the quadratic form of the vector. represents the set of followers. represents the set of UAVs in the neighborhood of the UAV . indicates the error state which the UAV starts from time and applies the control sequence in turn to obtain within the number of predicted steps. is the set of state constraints of UAV which can be expressed as follows: . is the input constraint set of the system, which can be described as follows:where indicates the maximum value of control input in three directions and indicates the maximum value of control input change in three directions. The weight matrix satisfies .
3.3. Algorithm Flow
We use leader-follower strategy to realize formation control and there is a fixed information flow topology in the formation. In order to distinguish the leader from other UAV in the formation, we use UAV1 to represent the leader. The basic principle of the algorithm is shown in Figure 3.

The process of DMPC can be described as follows: Step 1: Bind UAV number and priority. Bind the reference to UAV1. Step 2: At , the UAV in formation are in a constant cruise state. Determine the UAV in the neighborhood and measure the relative state quantities between the UAV. For the follower UAV: take the current state as the initial state of the UAV, and take the measured state quantity of the UAV in the neighborhood as the reference state quantity. Calculate the current time error as the initial state of rolling time domain planning, which can be described as follows . Step 3: Calculate the optimal input sequence which can be described as according to equations (20) and (21). The first item of the optimal input sequence is used as the input of the UAV at . Step 4: At , based on the new error state of the formation, repeat to Step 2
4. System Stability Analysis
In the process of solving the UAV formation problem, the stability of the formation plays an important role, which not only reflects the performance of the controller, but also is an important guarantee for the UAV formation system to achieve the expected formation. The main idea of stability analysis is to find a suitable Lyapunov function and prove its degressivity.
According to the above ideas, in the process of cluster stability proof, the following two problems need to be considered: one is the stability of the error state of the single-machine system, and the coefficient relationship is given to ensure the stability of the single-machine system; the second is the stability of the formation system. In the following part, we first give and prove sufficient conditions for single-machine error stability. Then the cost function is used as the Lyapunov function of the system to prove the stability of the formation system.
Theorem 1. There exists , such that the virtual input satisfiesSystem equation (18) is asymptotically stable, where , ,
Proof. From equation (18), we haveWe can rewrite the above equation in the following form:where . By assuming as the expected value of , we can describe the error as follows:By taking the time derivative of the above equation, one can getChoose the Lyapunov function asBy considering equations (26) and (27), we haveSince , we have , where . By assuming as the expected value of , we haveThe error for can be expressed asSubstituting equations (31) and (32) into (30), we haveBy substituting equations (26) and (31) into (32) and taking the time derivative of it, we haveLet the Lyapunov function with beBy taking the time derivative of equation (35) and substituting equation (33) into it, we havewhere , By substituting equations (26), (32), and (34) into the above equation, one can get the following equation:By rewriting the above equation, we can get the equation of the input:By substituting equations (38) into (34), we haveBy substituting equations (26) and (31) into (28), one can getLet the eigenvalues of system equation (40) be , corresponding to the eigenvalues of the 1st to 6th rows of the eigenmatrix. We havewhere represents the determinant of . Since , , considering equation (41), one can get that the real part of the conjugate eigenvalue of system equation (40) has the same sign. And by considering Gershgorin’s disc theorem, we haveBy considering equation (42), when , the eigenvalues of system equation (40) are all in the negative half-plane and are asymptotically stable. By considering and , and substituting equations (27) and (32) into (38), we haveHence, the system is stable under the action of the input equation (43). Theorem 1 gives the sufficient conditions for the stability of the single-UAV error system. Then the cost function is taken as the Lyapunov function to prove the stability of the UAV formation system. According to the one-to-one correspondence between the state space, the error state space, the virtual input and the actual input, the solution of the actual input sequence can be transformed into the solution of the virtual input sequence . By rewriting equations (20) and (21) with the virtual input in equation (17), we have a new expression for the cost function as follows:At , the optimal solution sequence of equation (45) is . The optimal state sequence corresponding to the solution is . The error state sequence corresponding to the solution is . Obviously, all components of the optimal state sequence satisfy the constraints.
At , we construct the solution of equation (45) which can be expressed as follows:where , . By considering equation (45), we can get the state sequence and error state sequence of the system expressed as follows:By considering the first of are the components of , the first of are the components of , all of them satisfy the constraints. By substituting into (10), one can getBy considering equation (40) and the input which can be expressed as , we haveIt can be rewriting as . By considering all the above equations, we can get that meet terminal constraints and is a set of feasible solutions to equation (45). Similarly, we can get a set of feasible solutions to equation (44). At , the cost functions of leader and followers can be expressed as follows:At ,By considering and , we can rewriting the above equations as follows:By subtracting the cost function at time from both the sides at the same time, we can getIt can be rewritten aswhere the equal sign holds if and only if . According to equation (54), the UAV DMPC system designed in this paper is asymptotically stable.
5. Simulation
In this section, four UAVs are used for simulation. The UAV formation is shown in Figure 4, and the arrows indicate the flow of information.

Table 1 shows the initial state of the UAV in the inertial frame and the expected relative position of each UAV and UAV1. Set the maximum value of the input in the three directions of the UAV to 12 m/s and the minimum value to -12 m/s; the maximum value of the speed in the three directions is 20 m/s; the maximum value of the acceleration change is 10 m/s2. The number of simulations is 200 times, and the sampling time is 0.1s. In order to verify the effectiveness of the algorithm, the simulation will be carried out in three cases: straight flight, fixed turning radius flight and variable turning radius.
Case 1. Straight Flight
Figure 5 shows the result of the formulation of the UAV formation when they are flying in a straight line. Figure 5(a) shows the trajectories of the UAV formation. It can be found that the four UAVs can form and maintain a desired formation shape and track the reference line. Figure 5(b) shows the relative distance between UAVs. In this case the distance converges after about 4.3 s. Figure 5(c) shows the velocity of each UAV in three directions. It can be found that the velocity in each direction does not exceed the maximum value and changes smoothly. The control trajectories of UAV are shown in Figure 5(d). In Figure 5(d), the values of the first moment are not equal. This is because the value is the input calculated after the first sample period. In order to test the effect of the algorithm when the formation performs a turning flight, we also conduct experiments of Case 2.

(a)

(b)

(c)

(d)
Case 2. Fixed Turning Radius Flight
Figure 6 shows the result of the formulation of the UAV formation while they are flying with a fixed turning radius. Four UAVs can form and maintain desired formation when they are flying with a fixed radius (Figure 6(a)). Compared to Case 1, the convergence rate of distance shown in Figure 6(b) is slower. It can be found in Figure 6(c) that the velocity can converge to a fixed value determined from the radius of rotation and does not exceed a maximum value. Compared to Figure 5(d), the input curve converges to a fluctuating curve. This is because the direction of the drone needs to be adjusted in time to track the trajectory. In order to further verify the effectiveness of the algorithm, we conduct simulation experiments in Case 3.

(a)

(b)

(c)

(d)
Case 3. Variable Turning Radius Flight
Figure 7 shows the simulation results of the UAV formation under the condition of varying turning radius. The formation of four UAVs can ensure the tracking of the reference line when the turning radius is not large in the early stage. When the turning radius becomes larger, the formation has an error in the tracking of the reference line because the speed has reached the maximum value (Figure 7(c)). This problem will be solved in the next step by improving the performance of the UAV or the control algorithm. However, it can be found in Figure 7(b) that although the distance between UAVs is slower than when flying in a straight line, the four UAVs can form and maintain the ideal formation error. It can be seen from Figure 7(d) that the four UAVs can achieve convergence under the limit of the maximum input value. In order to further verify the control effect of the algorithm on multisortie UAV clusters, we set up Case 4.

(a)

(b)

(c)

(d)
Case 4. Multi-UAVs. (Table2)
Unlike the previous cases with four UAVs, Case 4 simulates the tracking and formation control of eight UAVs in complex trajectories with changing turn radius, and the simulation results are shown in Figure 8. From Figure 8(a), we can see that the algorithm has a better control effect for large-scale clusters. At the same time, the interaircraft distance of UAVs can converge to the specified value relatively quickly (Figure 8(b)). From Figures 8(c) and 8(d), we can see that the algorithm can guarantee the control of the formation while ensuring that the UAV states and inputs satisfy the performance constraints.

(a)

(b)

(c)

(d)
6. Conclusions
The DMPC-based scheme for formation control of multiple UAVs is studied in this paper. The UAV formation error state equation in three-dimensional case is proposed. By introducing the error of UAV formation, two cost functions are established according to the difference between the leader and the follower. The sufficient conditions for the stability of the UAV formation system are given and proved and the algorithm is simulated in three cases. The numerical results show that the UAV formation of four UAVs can form and maintain the desired formation under condition of straight flight and fixed turning radius. However, in the case of variable turning radius flight, when the turning radius is large, there is an error between the UAV and the reference line due to the limitation of the maximum speed of the UAV. Future work will focus on improving the performance of the DMPC algorithm in order to realize the tracking the reference line with large turning radius and resist external disturbances such as wind speed and sensor errors.
Data Availability
The data used to support the findings of this study are included within the article.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This research was funded by the Shandong Natural Science Foundation of China (ZR2020QF057).