Abstract
The power-voltage (P-V) characteristic curve of the centralized thermoelectric generation (TEG) system under nonuniform temperature distribution (NTD) exhibits multiple extreme point characteristics, and the traditional maximum power point tracking (MPPT) algorithm is prone to fall into the local maximum power point (LMPP) and takes a long time to track. This paper designs a BP-IPSO algorithm based on back propagation neural network (BPNN) and improved particle swarm optimization (IPSO) for MPPT. The algorithm firstly utilizes the good nonlinear function fitting ability of BPNN to obtain the fitting curve of the relationship between system control input and power output to establish the TEG array power prediction model. Then, the dynamic learning factor and weight coefficient are introduced into the traditional particle swarm optimization (PSO) algorithm to search the output power prediction model and realize MPPT control. MATLAB/Simulink experiment results show that BP-IPSO algorithm can effectively avoid falling into LMPP, quickly and accurately track the global maximum power point (GMPP), and effectively suppress the oscillation of voltage and power during the tracking process. Especially in the start-up test experiment, compared with perturb and observe (P&O), PSO, and grey wolf optimizer (GWO), the energy generated by BP-IPSO increased by 12.84%, 3.18%, and 4.75%, respectively, which improved the system power generation efficiency.
1. Introduction
Energy is one of the most critical factors affecting national development and is the driving force for social development. As people’s demand for energy increases, the energy situation becomes more critical, and it is imminent to develop and utilize renewable energy extensively [1]. Renewable energy development can help reduce greenhouse gas emissions and dependence on fossil fuels, and the global contribution of renewable energy to the electricity supply is expected to increase to 31% by 2035 [2]. Compared with traditional power generation, thermoelectric generation is a power generation technology that uses the thermoelectric potential of thermoelectric materials to generate electricity. It can use various forms of thermal energy to generate electricity and has the advantages of long working life, compact structure, and high reliability, which has broad application prospects [3]. At present, it is widely used in the power supply of wearable personal electronic products [4], waste heat recovery in automobiles [5], photovoltaic-thermoelectric hybrid power generation [6, 7], and large-scale industrial waste heat recovery [8, 9]. Using thermoelectric technology to recover industrial waste heat is one of the methods to improve energy utilization.
In the process of large-scale industrial waste heat recovery and utilization, due to the low output voltage and power of a single thermoelectric generator module, there is a mismatch between energy supply and demand. Therefore, multiple modules are usually connected in series and parallel to form a thermoelectric generation (TEG) system to increase the output power, and an effective maximum power point tracking (MPPT) algorithm is designed to improve the energy conversion efficiency. Figure 1 illustrates the three main architectures of TEG systems, which are centralized, string-type, and modularized [10]. The centralized TEG system connects TEG modules in series and parallel to form an array and transmits the power to the load after centralized transformation through an MPPT controller, as shown in Figure 1(a). The string-type TEG system places each string along the isotherm of the heat source, and the system is equipped with an MPPT controller for each string, as shown in Figure 1(b). In the modularized TEG system, each TEG module independently tracks its own maximum power point, but the cost of system operation and maintenance is the highest and the control is complicated, as shown in Figure 1(c).

(a)

(b)

(c)
When the TEG system works in MPPT mode, the string-type and modularized architectures need to coordinate and control multiple MPPT controllers, which increases the control complexity and implementation cost. In contrast, the centralized architecture only needs to control one MPPT controller, which greatly reduces the system cost and the inherent power consumption of energy in the power conversion. However, due to the nonuniform distribution of thermal energy in space, the temperature of each TEG module varies with its location, resulting in differences in output characteristics [11]. The environmental mismatch problem will not only reduce the output power of the centralized TEG system but also cause its output static characteristic curve to exhibit several local maximum power points (LMPPs) and a unique global maximum power point (GMPP) [12].
Traditional MPPT algorithms, such as perturb and observe (P&O) [13] and incremental conductance (INC) [14], have been widely used in the field of TEG systems. Reference [15] improved the traditional P&O method by introducing the support vector regression (SVR) algorithm to improve the tracking accuracy. Reference [16] proposed an MPPT technique based on linear extrapolation, which uses two random operating points to calculate MPP, which shortens the tracking time. However, the above MPPT algorithms are prone to fall into LMPP under nonuniform temperature distribution (NTD) and cannot accurately track the GMPP, which greatly reduces the output power of the system. In contrast, heuristic algorithms are flexible, model-free, and can avoid local optimization [17], which can be used as an efficient way for the centralized TEG system to realize MPPT under NTD conditions. Reference [18] utilizes the advantages of particle swarm optimization (PSO) algorithm with few parameters and fast convergence speed and applies it to MPPT control of TEG system under dynamic temperature and achieves a relatively ideal control effect. Reference [19] used the grey wolf optimizer (GWO) for MPPT control in the presence of several LMPPs under NTD conditions and traced to GMPP. Although the technique based on stochastic search can successfully track GMPP under NTD, the tracking speed is still low due to the randomness of the search, which requires a large number of populations to iteratively find the best, leading to dynamic power loss.
Artificial Neural Network realizes the mathematical mapping of complex nonlinear relationships by simulating some mechanisms of the brain, which is widely used in power prediction [20], pattern recognition [21], etc. Back propagation neural network (BPNN) is a widely used method. Using BPNN to predict the output power of the TEG array under NTD does not need to consider the detailed mathematical model of the TEG array, but starts from the real and representative collected data, and only through its own training to establish the power prediction model.
Based on the above discussion, a BP-IPSO algorithm based on BPNN and improved particle swarm optimization (IPSO) is designed in this paper for MPPT of centralized TEG system under NTD. The method trains the BPNN by collecting the real-time operation data of the TEG system, completes the fitting of the nonlinear relationship between the control input and power output of the TEG system, and establishes the power prediction model of the TEG array. After that, the search of the fitted I/O curve by IPSO accurately distinguishes LMPP and GMPP to realize MPPT control. Finally, the effectiveness of the method is verified by simulation tests.
2. Centralized TEG System Modeling
2.1. Mathematical Modeling of TEG
Figure 2 shows the schematic diagram of TEG module and its equivalent circuit. The TEG module can be modeled as a series connection of a voltage source and an internal resistance , and the parameters change with the temperature difference between the two ends of the module. The principle is that the P-type and N-type semiconductor materials are connected with conductive copper sheets, and when there is a temperature difference between the two sides, a temperature difference electric potential will be generated at both ends of the material. can be described as where is the Seebeck coefficient and and are the temperature on the hot side and cold side, respectively, and is the temperature difference between the hot side and cold side.

When the current flows through a uniform conductor with a temperature gradient, reversible energy absorption and release occurs, known as the Thomson effect. The Thomson coefficient can be expressed as where is the average temperature.
In fact, the Thomson effect is non-zero and makes the Seebeck coefficient dynamic. Therefore Eq. (3) is used for accurate modeling and incorporates the effect of dynamic average temperature as [22]. where is the initial rate of change at the reference temperature and is the variation rate of the Seebeck coefficient.
When there is a load resistance, the output power of the TEG module can be calculated by where and are the internal resistance and load resistance of the TEG module, respectively. When , the TEG operates at MPP and the load obtains the maximum power.
2.2. Modeling of Centralized TEG System under NTD
As shown in Figure 3, the centralized TEG system consists of four parts: TEG array, boost circuit, MPPT controller, and load. The system inputs different duty cycles to the boost circuit through the MPPT controller to achieve the matching of the equivalent load resistance and the internal resistance value of the TEG array, so that the system operates at GMPP and improves the energy conversion efficiency under NTD conditions.

The TEG array constructed in this paper has a total of 40 TEP1-09656-0.5 model TEG modules, arranged in 8 strings, and each string is connected by 5 TEG modules in series. The TEG array structure is shown in Figure 3. A bypass diode is added to each TEG module in the TEG array to prevent the TEG module from working in reverse voltage; in order to prevent the reverse current flowing into the array, the parallel branch is connected in series with blocking diodes. Table 1 provides the main parameters of the TEG module and the boost converter.
The output power of the th string in the TEG array and the total output power can be calculated by Eq. (5) and Eq. (6), respectively, as where and are the number of TEG modules connected in series and parallel, respectively, is the TEG array output port voltage, and and are the open-circuit voltage and internal resistance of the th TEG module.
The Eqs. (5)–(6) indicate that there is a parabolic relationship between the output power of each string in the TEG array and the array output port voltage , and the P-V characteristic curve of the TEG array under uniform temperature distribution is shown in Figure 4(a). Under NTD conditions, each blocking diode is triggered at different voltage levels, causing the P-V characteristic curve of the TEG array to be superimposed by multiple different parabolas, showing complex multipeak characteristics, as shown in Figure 4(b).

(a)

(b)
3. BP-IPSO Search Algorithm
3.1. BP Neural Network Construction
Neural networks have good nonlinear function fitting ability and can directly summarize the prediction model based on actual and representative sample data [23]. Applying neural network modeling only needs to know the input and output data of the system and does not need to deeply understand the complex internal structure of the system, which avoids tedious mathematical formula modeling, making system modeling convenient and effective.
Back propagation neural network (BPNN) is a neural network with three or more layers: input layer, hidden layer, and output layer. The structure of BPNN realizes a full connection between the front and rear layers, and there is no connection between neurons in each layer [24], as shown in Figure 5.

According to Section 1, the P-V characteristic of the TEG array under NTD is a nonlinear curve with multiple extreme points. Therefore, BPNN is used to complete the modeling of the TEG array. (1)Input Layer Design. The number of nodes in the input layer is usually determined by the influencing factors of the problem being solved. For the maximum power point tracking problem, the MPPT controller dynamically adjusts the output power of the TEG array by outputting different duty cycles and feeds it back to the MPPT controller so that the TEG system can operate near the maximum power point. Therefore, the input layer is designed as one node, and the duty cycle of the boost converter is used as the input variable, which is represented by .(2)Hidden Layer Design. Considering that the boost converter’s input duty cycle and the TEG array’s output power is a single input-single output curve, it is decided to adopt the single hidden layer. The determination of the number of neurons in the hidden layer is the critical issue in the design of BPNN. Too few neurons in the hidden layer may lead to lower fitting accuracy, while too many neurons will increase the network training time and cause overfitting.
For the selection of the number of neurons in the hidden layer, first refer to the relevant literatures and use the empirical formula to calculate the approximate boundaries. Then, under the temperature environment set by the start-up test experiment, the accuracy of the power prediction model is tested in turn when different numbers of hidden layer nodes are selected. MAE and MAPE are used as evaluation indicators for quantitative analysis. According to the number of sample data and the analysis of multiple experiments, the number of neurons in the hidden layer is finally determined to be 5. Since the sigmoid function can reflect the saturation characteristics of neurons and the predicted value of the output power cannot be negative, the sigmoid function is selected as the transfer function of the hidden layer. The sigmoid function and the output value of the hidden layer can be expressed as follows: where is the output value of the th hidden layer neuron, is the connection weight of the input layer to the th hidden layer neuron, is the threshold value of the th hidden layer neuron, and is the number of hidden layer nodes. (3)Output Layer Design. Since the purpose of the network is to predict the output power of the TEG array under different duty cycle inputs, the output layer is designed as one node and denoted by . The output layer transfer function adopts a linear function. The output value of the output layer can be expressed aswhere is the connection weight between the th hidden layer neuron and the output layer, is the threshold value of the output layer, and is the number of hidden layer nodes.
After the above design, the specific structure of BPNN is determined. A three-layer neural network is adopted, with one neuron in the input layer, five neurons in the hidden layer, and one neuron in the output layer. The input data is the duty cycle of the boost circuit, and the output is the predicted output power of the TEG array under the current duty cycle.
3.2. IPSO Search
The PSO algorithm is an effective method for global optimization of multiple extreme point functions, where the population intelligence generated by cooperation and competition among the particles in the population guides the optimization search. At the ()-th iteration, the update process of each particle’s flight velocity and flight position can be expressed by Eq. (10) and Eq. (11), respectively, as [25], where is the inertia weight, and are the self-learning factor and social learning factor, respectively; and are random numbers on [0, 1], is the particle order, is the number of iterations, is the individual optimal value of the th particle at the th iteration, is the global optimal value at the th iteration.
For the maximum optimization problem in function optimization, if the value of the fitness function is larger, the fitness value is better, so the update process for the individual optimal value and the global optimal value can be calculated, respectively, as
The selection of parameters , , and has a significant influence on the iterative process of the algorithm. The inertia weight determines the inheritance of particle flight velocity. If is set too small, it is easy to fall into the local optimum; if set too large, it is difficult to guarantee the convergence speed in the later stage. The learning factors and determine the speed at which the particle moves to the individual optimal position and the global optimal position, respectively. To improve the adaptability of the PSO algorithm, an improved PSO algorithm is designed. The method introduces dynamic learning factors and weight coefficients in the iterative process to improve the speed and accuracy of algorithm convergence.
In the initial stage of the algorithm, a larger inertia weight, a self-learning factor, and a smaller social learning factor are set to improve the global search ability, so that the algorithm is not easy to fall into local optimization. In the later stage, it updates with the number of iterations and sets smaller inertia weight, self-learning factor, and larger social learning factor, which improves the local search ability of the algorithm and makes the system converge faster. The process of dynamic adjustment of parameters is expressed in Eq. (14)–(16). where is the maximum number of iterations and , , , , , and are the maximum and minimum values of , , and in the iteration process, respectively.
4. MPPT Design of BP-IPSO Algorithm
4.1. Control Framework
The MPPT control structure of the centralized TEG system based on BP-IPSO under NTD is shown in Figure 6. The combination of the power prediction model fitted by BPNN and IPSO search is applied to the MPPT control of the centralized TEG system. The essence is the fitting and searching of the nonlinear relationship between the system control input and power output. The basic idea of this composite MPPT control algorithm is as follows.

Firstly, the duty cycle of the initial training samples is output in turn, and the output voltage and current data of the TEG array are collected for neural network pretraining. Secondly, the IPSO performs three iterative searches for the I/O curve fitted by the BPNN and sequentially collects the optimal duty cycle obtained by the three searches and supplements it to the training samples so as to improve the accuracy of the BPNN in fitting the I/O curve near the maximum power point. Finally, the new training samples are used to fit the I/O curve again, and the IPSO search is guided by fitting the I/O curve to realize MPPT. When the external environment changes, the algorithm restarts.
4.2. Power Prediction Model Establishment
The structure of the BPNN power prediction model designed in this paper for MPPT of centralized TEG system is shown in Figure 6. The establishment process of the power prediction model is as follows: (1)Within the duty cycle variation range of DC/DC converter [0.95], ten groups of data are uniformly selected as the duty cycle of the initial training samples, and the TEG array output voltage and current data are collected in turn. The output power of the TEG array is normalized and used for neural network pretraining.(2)Determine the prediction error function of the BP neural network, that is, the error between the expected output and the actual output. The target error of training is set to 0.001. The prediction error function can be expressed aswhere is the number of samples in the training set, is the actual value of output power of the th sample, and is the predicted value of output power of the th sample. (3)Start training the neural network, and the maximum number of training times for the network is set to 100. By evaluating the error function and updating the connection weights, the output value of the network is continuously approached to the desired output until it reaches the predetermined error accuracy.
4.3. GMPP Search
(1)Initial Parameters Setting. Based on the MPPT control algorithm of BP-IPSO, each particle position in IPSO is defined as the duty cycle of boost converter, and the fitness value is the output power value of the TEG array predicted by BPNN under the given duty cycle and achieves MPPT by multiple iterative searches.
The number of particles is set to 4. Since the variation range of each particle is [0 0.95], the initial position is 0.1, 0.3, 0.6, 0.95, the maximum speed is 0.3, and the maximum number of iterations is 10. The learning factor values are , , , and , and the weight coefficient values are and . (2)Termination Strategy. Since repeated iterations will bring long-term power fluctuations, in order to stabilize the power to the maximum power point as soon as possible and reduce the power oscillation when the system approaches a steady state, the termination strategy adopted in this paper is as follows. When the maximum distance between the particle positions is less than 0.1 or the maximum number of iterations is reached, the system considers that the GMPP has been found at this time, stops the iteration, and completes the convergence.(3)Algorithm Restart Conditions. When the temperature of the cold and hot source changes, the output power of the TEG array also changes, and thus the BP-IPSO algorithm needs to be restarted to make the system work stably at the new maximum power point. The power variation can be expressed aswhere is the output power value of the TEG array at the current sampling time and is the output power value of the TEG array at the last sampling time. When , restart the algorithm.
The trained neural network is used in the MPPT control of the TEG array to predict the power output value of the TEG array under different duty cycle inputs, combined with IPSO search, and finally realize the tracking of the maximum power point. The overall execution procedure of the MPPT control algorithm based on BP-IPSO is demonstrated in Procedure 1.
|
5. Simulation Analysis
In order to verify the MPPT performance of BP-IPSO, this section compares it with P&O, PSO, and GWO in three cases, namely, start-up test, step variation of temperature, and accuracy analysis. Since the parameter values will affect the performance of the algorithm, in order to ensure a fair comparison, the common parameters for different heuristic algorithms are set to the same.
The population numbers of PSO and GWO are both set to 4, and the initial positions are 0.1, 0.3, 0.6, and 0.95, respectively; the maximum number of iterations is 10, and the control period is 0.02 s; the convergence condition is that the maximum distance between population individuals is less than 0.1 or the maximum number of iterations is reached. In addition, the P&O parameters are set: fixed step size is 0.005, and control period is 0.02 s; PSO parameters are set: , , and .
5.1. Start-up Test
This experiment tests the MPPT performance of BP-IPSO when the centralized TEG system starts at zero point under NTD conditions. The strings 1-8 in the TEG array are divided into four groups, and the cold side temperatures of the four groups are set to 50°C, 50°C, 50°C, and 30°C; the hot side temperatures are set to 290°C, 145°C, 96°C, and 75°C, respectively. Figure 7 shows the MPPT performance of the four methods under the start-up test. Table 2 presents a quantitative comparison of the tracking results of the four methods.

(a)

(b)

(c)

(d)
From Figure 7(c)–7(d), it can be seen that the MPPT with the P&O algorithm cannot distinguish between GMPP and LMPP, resulting in the lowest output power and the least energy obtained, although the power oscillation during the tracking process is the smallest. Except for P&O, the other three algorithms have global search ability and can get more energy. According to Table 2, compared with PSO and GWO, since BP-IPSO uses the I/O curve fitted by the BPNN to replace the actual circuit model for search, the data sampling time in the search process of the algorithm is shortened and completes the tracing process in 0.26 s, which significantly improves the convergence rate. In addition, the energy output obtained by BP-IPSO is the highest among the four methods, which are 112.84%, 103.18%, and 104.75% of P&O, PSO, and GWO, respectively. Therefore, BP-IPSO can converge to the highest quality GMPP with faster speed and less power oscillation and with good start-up performance.
5.2. Step Variation of Temperature
In the actual operation of the TEG system, the temperature of the cold and hot sources often changes, so the adopted MPPT algorithm must be able to track accurately when the temperature changes abruptly. In order to test the effectiveness of the MPPT algorithm under step temperature change, this experiment divided string1 to 8 into four groups according to the TEG array structure in Figure 3, each group consists of two strings and is under the same temperature environment. After that, four groups of continuous step temperature changes are set for the TEG array in this experiment, and the temperature change curves of the cold side and hot side are shown in Figure 8(a) and Figure 8(b), respectively.

(a)

(b)
The MPPT performance comparison of the four methods under the above conditions is shown in Figure 9. Table 3 shows the quantitative comparison of the tracking results of the four methods under step temperature variation. It can be seen that P&O has the smallest power oscillation but easily falls into the low quality LMPP. PSO and GWO are prone to generate large power oscillations during the tracking process and need a longer convergence time to approach GMPP. In particular, when the second temperature step change occurs, GWO cannot accurately identify GMPP and LMPP and falls into LMPP together with P&O, as shown in Figure 9(c). In contrast, BP-IPSO can quickly approach high-quality GMPP in step temperature variation. This indicates that the global search ability of the BP-IPSO is higher than that of the metaheuristic algorithm, and the GMPP can also be accurately tracked when the temperature changes abruptly.

(a)

(b)

(c)

(d)
5.3. Accuracy Analysis
To further analyze the GMPP tracking accuracy of the four methods under different temperature inputs, the system was tested with a series of different temperature ratios imposed by [0,100%]. The temperature ratio change interval is set to 5%, and the rated temperatures of the cold side were set to 50°C, 50°C, 40°C, and 50°C, respectively; the temperatures of the hot side were set to 295°C, 190°C, 98°C, and 115°C, respectively. Under the experimental conditions of different temperature ratios, the power tracking results of each method are shown in Figure 10.

RMSE, MAE, and MAPE are used as error evaluation indexes to quantitatively analyze the tracking accuracy, and the calculation formula is as follows. The calculation results of the error evaluation indexes of each method are shown in Table 4. where is the number of samples, is the global maximum power, and is the output power of the TEG system after the MPPT algorithm is tracked.
From Figure 10, it can be obtained that as the temperature ratio increases, the power tracked by each algorithm generally increases accordingly. PSO falls into LMPP when the temperature ratios are 75%, 85%, and 95%, and GWO falls into LMPP when the temperature ratios are 90% and 95%. As depicted in Table 4, BP-IPSO achieves the minimum value in three error evaluation indexes, which proves that BP-IPSO has a certain improvement in tracking accuracy and higher convergence stability than the comparison algorithm. According to Table 4 and Figure 10, it can be seen that the performance of BP-IPSO proposed in this paper is better than the other three comparison algorithms, which can ensure greater power output and higher convergence stability under different temperature ratios.
6. Conclusion
The P-V characteristic curve of the centralized TEG system under NTD condition exhibits multiple extreme point characteristics. In order to utilize the energy more efficiently, this paper designs a BP-IPSO algorithm based on BPNN and IPSO search and applies it to MPPT control. The main advantages are summarized as follows: (1)Using the good nonlinear function fitting ability of BPNN, we can establish a neural network power prediction model by collecting multiple sets of TEG system control input and power output data, which can directly predict the TEG array power output value based on the system control input, avoiding the tedious analysis of the system internal mechanism(2)The dynamic learning factors and weight coefficients are introduced into the traditional PSO algorithm, which has a stronger performance in global search and local exploitation. The accuracy analysis results show that BP-IPSO algorithm can effectively avoid falling into LMPP(3)BP-IPSO algorithm uses the neural network power prediction model to replace the actual circuit for search. During the tracking process, there is no need to change the voltage or current value in the actual circuit many times so as to effectively reduce the voltage, current, and power oscillations and shorten the time of data sampling in the tracking process of the algorithm. The simulation results show that the tracking can be completed in 0.26 s, which significantly improves the energy efficiency and convergence speed
Data Availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Conflicts of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
This work was supported by the Fundamental Research Program of Shanxi Province (202203021211175), the Shanxi Provincial Key Research and Development Project (201903D121015), and the Research Project Supported by Shanxi Scholarship Council of China (2020-039).