Abstract

Acrobot is a robotic system that is actuated to achieve a certain degree through a controller and/or motion motors. In order to control acrobot in a given position, the designers have to come up with appropriate algorithms. In fact, input signals into the system are affected by noise signals. Therefore, it is necessary to remove noisy signals. In this paper, a method to control the balance under the influence of noise signals affecting the system (LQG) is proposed. Through acrobot's equations of motion, the system is linearized to facilitate the use of the above regulator for positions around the equilibrium point. The role of a Kalman filter of an LQG regulator is also depicted. Besides, a PID controller was used to survey this system as a comprehensive assessment of the effectiveness of control methods for acrobot. As acrobot stability surveys are rare in recent years according to the data referenced below, this paper is useful for acrobot control purposes and can serve as an important database for the application of acrobots in practice. Simulations are performed on Matlab.

1. Introduction

Nowadays, controllers such as PID, fuzzy, and sliding mode are applied to control all kinds of robots. This is a strong step toward helping robots achieve the requirements set out. For the acrobot system, the algorithms to control the balance are always focused on the selection of controllers. Modern robots are controlled with classical algorithms and genetic algorithms [1] and intelligent algorithms [2] where noise signals are not taken into account. In [1, 2], it is possible that the noise signals considered are negligible. Algorithms can be widely applied in the future such as self-driving subways, self-driving fighter planes, and so on. In this paper, a method to control the balance under the influence of noise signals affecting the system (LQG) is proposed. In the section presented below, the role of Kalman filter to form an LQG regulator is depicted. This method will optimally control a system in the presence of unwanted signals.

Other systems have similar motion characteristics like acrobot, including pendulum, inverted pendulum [3], and reaction wheel inverted pendulum [4], which are nonlinear systems such as single-input multi-output (SIMO). In SIMO systems, the designers have a lot of difficulty in determining the exact position that has been placed for the pendulum according to the requirements. The flexible structured models using the algorithms in [57] are interesting experiences. Robustness in the optimal control method based on multivariable control design [8] is a new topic. Some extensions of loop transfer recovery [9] for flexible structures are regarded as a promising application. Loop recovery and robust state estimate feedback designs [10] for robotic systems are always welcome. Loop transfer recovery with nonminimum phase zeros [11] for electric motors is also interesting work. The LQG/LTE procedure for multivariable feedback design [12] is necessary to experience the application of any particular model. Multivariable feedback design concepts for a classical/modern synthesis [13] are useful as a reference for setting up newer algorithms. Feedback property of multivariable systems [14] is an interesting subject for its application to a system with a flexible structure. It is necessary to develop algorithms [1519] so that they are more in line with today's trend. Based on [2023], the author applied algorithms [2023] for robot models. The author used modern algorithms such as neural control, adaptive fuzzy control, and sliding fuzzy control to control the model [2427], and efficiency levels of the above control methods are presented in detail. This paper focuses on the process of investigating the effectiveness of using two control methods as mentioned in the problem. The purpose of this is to cater to practical requirements and considerations in the selection of control methods. Acrobot is a new automated system, which has many utilities in life. The aforementioned studies have not dealt with acrobots in detail, so it is always an interesting experience to investigate the properties of acrobot systems. The acrobot system is described by a link between link 1 and link 2 as shown below (Figure 1).

2. Dynamic Equation of Model System

It is necessary to establish a mathematical equation that describes the motion of any model. Through these equations of motion, controllers can be installed inside the system. They will be connected to the system via wireless devices or wired devices. Usually, control systems have one or more controllers connected to the systems by wired devices. With the successful establishment of the mathematical equations, the author calculated the values of K in LQG regulator. For example, acrobot or an inverted pendulum is described by mathematical equations. Acrobot has a similar structure to Pendubot [3], but the difference between the two lies in their control joint positions. The two-dimensional coordinate system: Ox and Oy, is depicted in Figure 1 [28].

Parameters of the acrobot system are listed in Table 1. Variables are q1, q2, and , where q1 and q2 are output signals and is input signal. The selection of variables was based on the structure of the system as shown in Figure 1. Variables have unknown values. In the process of designing controllers for these systems, readers can better understand the nature of the relationships between input signals and output signals through voltage signals and the torque of an electric motor. That is, these values do not have a specific value during the analysis, but their influence on the system is important.

In Figure 1, the x-axis of the Cartesian coordinate system was chosen to be the reference level of zero potential energy. Letting be the absolute position of the COM of the ith link gives [24]

The kinetic energy is , and the potential energy is V(q):where

Considering that friction is very small, the author used Lagrangian of acrobot; then, the dynamic equation of the mechanical system iswhere and . Equation (5) is equivalent towhere

State-space equations are created as

LetDynamic equation (6) can be described aswhereP(i, j) and Z(i, j) are matrices having the ith row and jth column of P and Z. Therefore, P and Z are determined as

The establishment of acrobot’s equation of state is based on the values of Table 2.

The state variable equations of the system are described as follows:

With the above parameters and matrices A and B of the state-space model, G1(s), G2(s), G3(s), and G4(s) are calculated:

Transfer function of acrobot:

3. LQG Regulator

LQG Regulator (linear quadratic Gaussian) is used for optimal control of a system while noise signals “affect” the system. The author considered the following diagram (Figure 2).

The goal of adjusting is to stabilize the output at zero; under the influence of input white noise signals () and noise signals due to (), the control signal is “u,” and the equation of state is

LQG Regulator consists of using a state feedback unit and a state estimator (Kalman filter) (Figure 3).

The quality indicator (J) is used to find the state feedback matrix .

The control signal is is inferred aswhere “L” is the Kalman gain and “L” is calculated by the following commands:where Qn, Rn, and Ts represent the variance of the noise signal., and kest is the model of the Kalman filter.

Kalman filter calculates to minimize the variance:

The equation of state for a regulator (LQG) iswhere is created by the following commands:where rlqg is the state-space model of a set of LQG, the output is u, the input is , and the state is .

Finally, the command is requested:

The author established a closed system with the value of “sys” as the object model, and the value of “K” is the state feedback matrix, “L” is the state estimation matrix, “sensors” represent a subset of outputs, the value of “y” returns the estimator, “known” represents the inputs, “ud” affects the estimator, and “controls” are inputs of “sys” that is used for control.

4. Simulation Results and Discussion

Simulation results are shown in Figures 435.

Diagram of the system using the LQG regulator is shown in Figures 3639.

Diagram of the system using the PID controller is shown in Figures 4043.

In Figures 3639, the value of “d” is a color noise signal with a spectral density of less than 10 rad/s, and the value of “n” is a white noise signal . The quality indicator is ‘J’: . The model of the object is .

The impulse response for the closed system (highlighted in red in Figure 6) is better than that for the open system (highlighted in green in Figure 7). The value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in blue in Figure 4) is better than that for the open system (highlighted in red in Figure 5). The value of amplitude of the oscillation of the closed system in this case is −0.6, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals.

The impulse response for the closed system (highlighted in red in Figure 10) is better than that for the open system (highlighted in blue in Figure 11). The value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in green in Figure 8) is better than that for the open system (highlighted in green in Figure 9). The value of amplitude of the oscillation of the closed system in this case is −30, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals.

The impulse response for the closed system (highlighted in blue in Figure 14) is better than that for the open system (highlighted in green in Figure 15). The value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in red in Figure 12) is better than that for the open system (highlighted in red in Figure 13). The value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals.

The impulse response for the closed system (highlighted in blue in Figure 18) is better than that for the open system (highlighted in green in Figure 19). The value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in green in Figure 16) is better than that for the open system (highlighted in red in Figure 17). The value of amplitude of the oscillation of the closed system in this case is −200, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well to the presence of noisy signals.

The impulse response for the closed system (highlighted in green in Figure 20) is worse than that for the open system (highlighted in blue in Figure 21). The value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in red in Figure 22) is better than that for the open system (highlighted in green in Figure 23). The value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system does not respond well.

The impulse response for the closed system (highlighted in green in Figure 28) is better than that for the open system (highlighted in blue in Figure 29). The value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in red in Figure 30) is better than that for the open system (highlighted in green in Figure 31). The value of amplitude of the oscillation of the closed system in this case is 1.0, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well.

The impulse response for the closed system (highlighted in red in Figure 24) is better than that for the open system (highlighted in green in Figure 25). The value of amplitude of the oscillation of the closed system in this case is zero, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in green in Figure 26) is better than that for the open system (highlighted in red in Figure 27). The value of amplitude of the oscillation of the closed system in this case is 0.992, and the closed system reaches a steady state. For this type of regulator, the closed system responds well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system responds well.

The impulse response for the closed system (highlighted in green in Figure 32) is better than that for the open system (highlighted in blue in Figure 33). The value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well. The value of amplitude of the oscillation of the open system in this case is large, and the open system does not reach a steady state.

The step response for the closed system (highlighted in blue in Figure 34) is better than that for the open system (highlighted in red in Figure 35). The value of amplitude of the oscillation of the closed system in this case is large, and the closed system does not reach a steady state. For this type of regulator, the closed system does not respond well. Meanwhile, the open system cannot respond well. In general, for a regulator of this type, the system does not respond well.

The efficiency of the above control methods is in the following descending order:(A) LQG regulator(B) PID controller

5. Conclusions

LQG regulator in the case of interference signals affecting this system has been proposed in this study. Simulation shows positive results. This allows the closed system to reach steady state in a short time. At the same time, it also helps the open system to reach steady state in half the time compared to the closed system at the beginning of the operating cycle of the system. Although the steady state is achieved, simulation results of this type of regulator are only true in the case where the link angle between link 1 and link 2 is small compared to the equilibrium point. To widen the link angle between link 1 and link 2, further studies can be conducted in the future.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The author declares that there are no conflicts of interest.