Abstract
Artificial neural network has been widely used in air conditioning systems as an effective method for predicting parameters, and the accuracy of ANN model relies on training data and network structure. In order to increase the quality of chilled water loops model, this paper develops an optimal data processing algorithm combining Kalman filtering with particle swarm optimization to compensate for uncertain factors and disturbances of collected data from the case building and establishes the nonlinear variation trend database. Based on Elman and BP neural networks, this paper proposes the improved network structures to avoid the local optimum predicted value of chilled water loops and increase data training speed. Simulation results show that this algorithm improves the data accuracy of current percentage (CP) of chillers and chilled water temperatures 12% and 9%. Compared with Elman and BP models, mean absolute errors of CP improved models are improved 24.1% and 10.3%, and mean squared errors of water temperature improved models are improved 5.2% and 4.8%. For the purpose of energy conservation control in air conditioning systems, this work has an application value and can be used for predicting other parameters of buildings.
1. Introduction
With the development of artificial pattern recognition and intelligence technology, the information construction of air conditioning systems has been improved. Numerous sensors have been installed in energy management and control system (EMCS), and data classifications are becoming more detailed. When measured data are transmitted to servers using communication technology, it will produce great value. How to use these huge data to resolve the management and optimal configuration problems for the limited energy resources is always the hot topic, which depends on the support of the relevant data processing and an analysis tool. Recently, data analysis technology has been applied to the heating, ventilation, and air conditioning (HVAC) systems. For instance, Chang et al. [1] developed an effective data acquisition and air conditioning control system using Labview; it could realize the real-time data acquisition as well as the data transmission, processing, and display. In addition, this system could save a lot of manual operations and material resources. Wang et al. [2] analyzed the information between the supply air temperature and the other variables acquired from an air conditioning system and presented a data preprocessing and association scheme. Piette and Hannah [3] designed a performance-monitoring tool using data storage and Internet; it could get more high-quality time-series data for performance analysis. The results showed that this tool facilitated the detection and the diagnosis of energy and other performance problems in buildings. Hou et al. [4] developed a strategy based on the data analysis method to detect and diagnose sensor faults in HVAC systems. Real test results from HVAC systems showed that this strategy could identify temperature sensor faults of the supply chilled water and the return chilled water. Doubtlessly, data quality is the precondition for increasing the analytical efficiency and the database category. Furthermore, it needs an advanced data preprocessing scheme.
Research about the modeling of air conditioning systems can be traced back to the 1930s, and the development was accelerated after the theory of intelligent air conditioning was presented in 1990s [5]. Generally, the evolution of air conditioning system model can be divided into three stages according to different principles, as shown in Figure 1. Physical model is the initial one that describes the heat transfer process or physical property of components using energy conservation and dynamic balance theories [6]. For instance, Zhang et al. [7, 8] discussed the physical structure and the operating characteristic of a chiller using partial differential equation; this model reasonably predicted the chiller transient performance. Whalley and Abdul-Ameer [9] indicated that the air conditioned room model was similar to the circuit resistance and capacitance in electrical system. They introduced the room temperature and the ventilation equipment model and analyzed the response. It is undeniable that physical model can effectively express the intrinsic characteristic of components in air conditioning systems, but many external nonlinear disturbances and phenomena cannot be accurately described. Therefore, many engineers have adopted the optimized or simplified scheme to get the satisfactory performance. He et al. [10] developed a simplified thermal model for predicting the cooling load and the thermal environment in buildings. Compared with other building simulation tools, test results showed that root mean square values of operative temperature and cooling load were below 1% and 12%. Thosar et al. [11] designed a nonlinear control system for various air conditioning plants to reduce the building energy consumption, and this system was compared with a PI control scheme. The results showed that the reduced model was chosen to ensure the mathematical tractability, and optimal performances in terms of comfort and energy were obtained using this method. However, empirical model often amends many performance coefficients of description equation based on engineers’ experience. For instance, Yiu and Wang [12] developed a multiple autoregressive moving average exogenous model for air conditioning system and analyzed the difference in performances when the system model adopted SISO or MIMO structures. The results showed that the predicted values were significantly closer to the measured values. Xu et al. [13] formulated a real-time dynamic load model for the terminal equipment. In their study, the experimental results showed that the empirical model had the well forecast effect, and more effective energy saving control strategy could be obtained using this model. Nowadays, air conditioning systems have entered into “Data Time”, and data model has emerged. Data mining and intelligent algorithms are able to provide a new development trend for the modeling of air conditioning systems [14, 15].

Air conditioning systems contain many subsystems, including water system, ventilation system, air supply system, and control system. Among them, the chilled water loops (CWL) are most complicated, and it is difficult to accurately describe this arbitrary component. In multitudinous modeling tools, artificial neural network (ANN) has been widely used as the representative of intelligence algorithm and plays an important role in the data processing. Engineers can get well predicted data according to the network analysis, without knowing the detailed structure about the component [16]. Therefore, ANN and CWL can be combined together. In this field, Frey et al. [17, 18] designed the ANN model to describe the chiller performance equation; results showed that the predicted temperature was very close to the test temperature. Hou et al. [19] combined ANN with rough set theory to predict the load of an air conditioned room; results showed that the actual error was within 4%. Using ANN with 6-6-9-1 structure, Manohar et al. [20] presented a steady state double effect absorption chiller model to optimize the coefficient of performance (COP). The predicted COP had an error of ±1.2%. Zhao et al. [21] built a screw chiller mode using the polynomial ANN, which provided a method to improve the part-load energy efficiency. In their study, the predicted performance by the chillers model was better than existing ones in the literature. Facticity and accuracy are the targets in the air conditioning system model and also decide the performance of the control system. According to these investigations, ANN has the superior performance for data modeling. Unfortunately, if there is any serious nonlinear influence hiding in the collected data, the output value may fall into the local optimal solution. Meanwhile, the process of the collected data will be influenced by multitudinous uncertain and nonlinear factors. Thus, it is difficult to predict the future state.
For taking advantage of measured data and building more precise prediction model, this paper develops an optimization data processing scheme combining Kalman filtering with particle swarm optimization (KPOS) to improve the accuracy of measured data and analyzes the data variation trends for establishing the nonlinear variation trend database (NVTD). Meanwhile, this paper adds evaluation (Eva) layers which are used to update the weight calculation and constrain the boundary conditions based on NVTD in the Elman and BP networks. Finally, this paper predicts the current percentage (CP) of two chillers and chilled water temperatures in a case building using the improved model as a prediction example. The results show that the authenticity of the improved model is enhanced effectively, and locally optimal solution can be avoided using this method.
2. System Descriptions
The case building is located in Shenzhen, China. This building was reconstructed into ice storage air conditioning system in 2009, where office area adopts fan coil system, and shopping area (opening hours are from 9:00 am to 22:00 pm) adopts all air systems. EMCS includes 34 temperature monitoring points and 2 CP monitoring points as shown in Figure 2. About 2 years’ data are collected to expound the data processing scheme and improved models. Some details are presented as follows.(i)Architecture (5 floors, 30000 m2, shopping and office)(ii)HVAC system (ice storage air conditioning system)(iii)Chiller (two screw chillers (total power: 258 kW), cooling capacity (air conditioning): 1436 kW, cooling capacity (ice storage condition): 859 kW)(iv)Data acquisition(1)Sample interval: 0.5 h(2)Total data records: 1345975(3)Data for modeling: CP, temperatures of chilled water: supply and return water temperatures (TCHWS and TCHWR)

3. Methodology
3.1. Data Processing Scheme
The quantitative analysis about limited energy management and optimal configuration in CWL depends on the accuracy collected values (CVs) from EMCS and the mathematical model. Due to the interference and the noise which are the primary uncertainties in measurements when recording the data, missing and abnormal values (MAVs) are unavailable. If MAVs have a serious effect on the prediction model, error between the predicted value and the true value will be increased. Therefore, the observed value cannot be regarded as the real value. For obtaining ideal real values (IRVs), this paper adopts Kalman filtering [22] and particle swarm optimization [23] (KPSO) algorithms to compensate for MAVs.
Kalman filtering algorithm calculates the optimal estimated value from the collected data; state equation and observation equation of collected data can be described using the following:
The deviation of collected data can be calculated using
The deviation gain can be calculated using
Since the design value is used as IRVs, the deviation gain should be always within the range of . The estimated value can be calculated using
The deviation of IRV will be updated usingwhere is sampling instant. The important variables in Kalman filtering are and , which are always obtained based on the experience of designers and values are confirmed. However, the actual systems are dynamic; data processing needs to update and . Then the PSO algorithm is adopted for searching the optimal values of and . In this algorithm, every particle has a fitness value; it can be calculated using
In the iteration, the position of particle is updated using where = [1~30]; is the best position of particle; is the best position of swarm. Figure 3 shows the flowchart of KPOS; it starts with the initial sample data and optimizes MAVs using the Lagrange interpolation until the output value meets the requirements. The observation data ( is updated using and as well. Tables 1 and 2 show the summary of water temperature and chillers energy consumption data. The CVs are inconsecutive because of interference and uncertain factor. For instance, when the data of 1# chiller water temperature in 7 days is analyzed, is 0.093 and is 0.165 through KPOS algorithm. Figure 4 shows the comparison between CVs and IRVs. Obviously, MAVs still exist in preprocessing data and the average errors for supply and return chilled water temperatures are 0.67°C and 0.43°C, respectively. Apparently, this method improves the authenticity of the measured data and avoids MAVs effectively. The observed value will be replaced by IRVs when the process of data modeling is executed.


(a)

(b)

(c)

(d)
3.2. Nonlinear Variation Trend Database
Data collected from EMCS represents the superficial features of air conditioning systems, so it is necessary to excavate the variation hiding data. Meanwhile, load areas are always influenced by uncertain interference factors, leading to the inevitable nonlinear phenomena in the input and output places of CWL. In this work, for analyzing the operation of chillers and chilled water temperatures, the supply chilled water temperatures are regarded as input and return chilled water temperatures are regarded as output. CP of chillers is used to illustrate the operation characteristic. To make full use of collected data, the variation trend is divided into two directions, as shown in Figure 5. Similar to the automatic control theory, the ordinate represents the sample data in one day (“hysteresis of water temperature”) while the abscissa represents the sample data in a corresponding cycle (“chaos of water temperature”), and these trends are nonlinear. The new database (NVTD) is used for establishing evaluation layer in the process of modeling.

3.2.1. Abscissa: Chaos of Water Temperature
The sample data at the same time of different days is obvious due to the influence of equipment working condition, random employee turnover, and variability of outdoor temperature. So the trend value of this phenomenon can be defined as the chaos characteristic of water temperature [24]. Generally, if the chilled water system has higher nonlinearity, the chaos characteristic may become more obvious. Chaos value can be calculated usingwhere is IRV of water temperature in the sampling point (°C); is IRV of water temperature in the adjacent cycle sampling point (°C).
3.2.2. Ordinate: Hysteresis of Water Temperature
Not only does the current input value determine the output value of chilled water system, but also the variation tendency (VT) will produce a certain influence. For instance, VT of the outlet water temperature of 1# chiller in one day can be divided into three intervals: rising (), declining (), and constant (), as shown in Table 3. According to the analysis and the statistics of the operating data, the sampling time is divided into seven intervals (including probability (Pro) and average slope (_Slope) when these conditions happen). Therefore, the system has different output values while the input has the same set value in the water temperature rising and declining processes; this phenomenon is recognized as hysteresis. Thus, the relationship between input and output is not simply mapping. The hysteresis value can be calculated using where is collection value in temperature rising processes (°C); is collection value in temperature falling processes (°C).
3.2.3. Random Chillers CP
Chiller provides the required cooling capacity for terminal devices. Since the cooling and heating loads vary randomly, the chiller operates randomly as well. If designers use the partial load ratio to reflect the operation characters of chiller, various sensors should be installed in the indoor and outdoor environment, and the process is quite complicated. The most simple and economical method is using CP to replace the partial load ratio, and the chiller is controlled sequentially. The chiller power can be calculated using (13). Figure 6 shows the current distribution and percent of sampling point in five days; the chiller operates usually under partial load and CP value is random.

Both the fixed frequency and the variable frequency chillers have different CP in the adjacent days. For describing the random value, average value is used as the evaluation criterion; it can be written as the following equation:
3.2.4. Case Study
Figure 7 shows the percentages of each average index of CP and chilled water temperatures using the collected data in both July and August. The chaos amplitudes of the supply and the return chilled water increase, indicating that the load fluctuation of CWL gets serious with the rising outside temperature in August. However, the hysteresis amplitudes of the supply and the return chilled water fluctuation vary within a narrow range, which shows the water temperature variation is unconspicuous when the same set value are in both the rising and falling processes. The radar chart analytical method is a comprehensive assessment and multivariate analysis technology. This method especially suits the object of multiple attribute description and can make an overall evaluation. Using the radar chart, the status of the evaluation objects can be seen intuitively. Figures 7(c) and 7(d) show the quota distribution of nonlinear phenomena in the adjacent days. According to the analysis, chaos characteristics and randomness are the largely influenced nonlinear index, and the development tendency of these characteristics can be predicted using NVTD.

(a)

(b)

(c)

(d)
4. Data Model
Artificial neural network processes the information by imitating the biological neural network. It is a mathematical model based on the physiological research results about the brain, and often simulates some mechanisms to achieve the prediction function for the performance and parameters in air conditioning systems. The major training method is guided learning, which is able to adjust parameters according to the training sample and make the output value of network close to the known samples. There is not any specified network structure or learning algorithm suitable for all issues, and it is necessary to improve the network structure for specific issues. In this paper, the evaluation (Eva) layer based on NVTD is added to the Elman and BP networks.
The function of the evaluation layer is to indicate the nonlinear trends during the modeling process and to judge the output of the model. The output value of the hidden layer should meet the corresponding nonlinear trends value. Further, evaluation layer reduces the deviation between the predicted value and the actual value and enhances the calculation pertinence in the big fluctuation region.
4.1. Chiller Operation Model
Elman neural network structure includes input layer, hidden layer, connection layer, and output layer [25, 26]. The transfer functions of the output layer and the hidden layer are linear and nonlinear functions, respectively. The transmission function of the connection layer is linear and acts as the input of hidden layer together with the network input, so this network can achieve the dynamic memory. In this paper, the structure of Elman neural network is combined with the evaluating layer of the nonlinear trend. The form of improved Elman neural network (N-Elman) is 5-2-2-5-2 (5 input neurons, 2 connection neurons, 2 hidden neurons, 5 Eva neurons, and 2 output neurons) as shown in Figure 8.

4.1.1. Data Selection Rule of N-Elman
During the training cycle, every 6-day data is used to update the CP model as the training pattern, while every 5-day data is selected as the input vector. Data in the 6th day is selected as the target vector, while data in the 7th day is selected as the test data. Figure 9 shows the data training rule, and the chiller operation model will stop the calculation until meeting the requirements.

4.1.2. Network Formula of N-Elman
In the hidden layer, the output in network can be calculated usingwhere
In the evaluation layer, it is applied to approximate the value of nonlinear phenomena indicator function. The output target can be calculated using
The output of evaluation layer can be calculated using
The error function of evaluation layer can be calculated using
The minimize objective function of evaluation layer can be calculated using (20), and weights can be updated using (21)~(22).
In the output layer, can be calculated using
The objective function can be calculated using
4.2. Chilled Water Temperature Model
BP neural network is a multilayer feed-forward network, including input layer, hidden layer, and output layer. Among them, the number of hidden layers can be adjusted according to the requirements of designers. BP neural network is widely used for predicting parameters in air conditioning systems [27]. It has been proven that BP neural network can realize any complex nonlinear mapping and is particularly suitable for solving problems with complex inner mechanisms. However, some localized problems are difficult to be solved in the counting process of BP neural network. Therefore, the BP network structure in most studies is often improved according to the actual situation. For enhancing the accuracy of predicted value and avoiding the localized problems, the improved BP model adds the Eva layer combining with NVTD. The form of improved BP neural network (N-BP) is 6-10-6-3 (6 input neurons, 10 hidden neurons, 6 Eva neurons, and 3 output neurons), as shown in Figure 10.

(a)

(b)
4.2.1. Data Selection Rules of N-BP
If data in a sample has the bad relevance, it is hard for the network to predict the performance. Therefore, it is very rigorous to select the parameters. In this case, the correlation between the supply water temperature and the return water temperature is close. In order to carry out the network calculation, N-BP adopts 5856 data patterns of chilled water temperature. 80% of them are used for training and 10% are used for verifying, while the other 10% are used for testing.
4.2.2. Network Formula of N-BP
Chaos and hysteresis of chilled water temperatures from NVTD are selected as constraint conditions for the network output. The input layer includes TCHWS and TCHWR; will be in the range from 70% to 85% and will be in the range from 10% to 15% in evaluation layer according to the data from NVTD.
TCHWS and TCHWR in hidden layer can be calculated usingwhere is nerve cell threshold value in the hidden layer (). In the evaluation layer, the output target of evaluation layer can be calculated using
The evaluation layer output can be calculated using
The error function of evaluation layer can be calculated using
The minimize objective function of evaluation layer can be calculated using (30), and weights can be updated using (31)~(32).
In the output layer, and can be calculated usingwhere and are nerve cell threshold value in output layer. MSE (Mean Squared Error) is used as evaluation index; it can be calculated using
5. Results
5.1. Evaluation of CP Model
Figure 11 shows the predicted results of CP for two chillers using the N-Elman network and the Elman network. In Figure 11, the predicted CP of two chillers (2# chiller is spare unit) adopting the N-Elman network model is more accurate, and its average errors are less than Elman network model. Table 4 shows the detailed comparisons, where chillers power is derived from (9). Although the iteration of the improved model is increased, the increment is acceptable.

(a)

(b)
For describing the long-term predictive effect, Figure 12 shows the predictive results of chillers power in 7 days. When the ice storage is melted, the values will be zero. There are 13 anamorphic predicted points using Elman network model in Figure 13(c) and 4 anamorphic predicted points in Figure 13(d) due to the uncertain data between two collection points with less training data near the zero time. However, CP model using the N-Elman overcomes this problem, NVTD provides the variation trend of data, and it is especially suitable for the interval of less data span. In addition, mean absolute errors (MAE) of two chillers using the N-Elman (0.135 and 0.113) are lower than those of using Elman model (0.376 and 0.216). Therefore, the performance of the improved model is more accurate and more stable than Elman model.

(a)

(b)

(c)

(d)

(a)

(b)

(c)

(d)
5.2. Evaluation of Chilled Water Temperature Model
Figure 13 describes the predicted results for TCHWS and TCHWR of CWL. The predicted performance of N-BP model is more excellent than BP model, with output values more close to the measured values. Table 5 shows the detailed comparisons. The iterations of the modified network are increased, but it reduces the mean square error (MSE) and its value is lower than BP network.
To analyze the long-term performance of improved model, predicted values in 7 days are listed as shown in Figure 14. During the training period, the predicted data of N-BP network model are more concentrated than BP network; the reason is that NVTD provides the variation trend of data. Meanwhile, MSEs of TCHWS and TCHWR using N-BP model (5.03% and 3.75%) are less than BP model (10.19% and 8.54%). So the chilled water temperature with improved model is more accurate and effective than BP model.

(a)

(b)

(c)

(d)
5.3. Evaluation of NVTD and Nonlinear Indexes
The major functions of NVTD are providing the learning knowledge of ANN and constraint. For some special operating conditions such as ice storage, there are many sharp variation values (before and after making and melting ice). In this period, the performance of the traditional ANN exists as “jumping value.” According to the results of CP model using the N-Elam network, this phenomenon is effectively avoided. In the training process, either Elman or BP network, often falls into the local optimum because of various data, so the output value should be controlled. According to the comparisons, NVTD has solved this problem using (17)~(21) and (26)~(31).
Figure 15(a) shows the predicted random index () of two chillers. Compared with the Elman model, evaluated by the N-Elman models (74.7% and 64.2%) are more close to the real value from NVTD. Moreover, Figure 15(b) shows the nonlinear indexes and of water temperature using BP and N-BP networks, the predicted index of chaos using BP is lower than the actual value, and hysteresis predicted index is higher than the actual value from NVTD.

(a)

(b)
6. Conclusions
With the increase of sensors in the practical projects, data with high precision for air conditioning system model is required. However, the external and internal nonlinear interferences have great influences, reducing the measurement precision and the data quality. Meanwhile, through excavating data variation trend hiding in collection data from EMCS, optimized parameter prediction data model can be designed. The main goal of this study is to guarantee the accuracy of CWL modeling data using KPOS. This algorithm is derived from recursion using the dynamic state space and is very suitable for multidimensional random estimation. This algorithm also effectively improves the stability and accuracy of the results.
In this study, the data expressions of NVTD are present to analyze the historical data of a case building. Although the predicted values of Elman and BP model are acceptable, improved models (N-E and N-BP) further enhance the predicted performance. Results demonstrate that these improved models acquire more actual operation data compared with Elman and BP model, and they are able to avoid the local optimum value due to the evaluation layer. Therefore, the data processing scheme and model can be regarded as powerful tools for data mining and for increasing the database types. These improved models are also beneficial to the energy distribution and control.
The further study in this field will focus on other parameters (including pressure, flow, and indoor air quality) and designing the controller to reduce the nonlinear influence and energy consumption.
Symbols
: | State equation |
: | Observation equation |
: | State transition matrix |
: | Measurement matrix |
: | Input disturbance (value is ) |
: | Observing disturbance (value is ) |
: | Deviation function |
: | Deviation gain |
: | Temperature (°C) |
: | Fitness function |
, : | Learning factors |
, : | Random numbers 0~1 |
: | The position of particle |
: | Chaos index |
: | Sampling period (h) |
: | Hysteresis index |
: | Energy consumption of chiller (kW) |
: | Chiller power (kW) |
: | Operation time (h) |
: | Random index |
: | Average CP (%) |
: | Hidden layer |
: | Output target |
: | Output of evaluation layer |
: | Error function of evaluation layer |
: | Output matrix of weight |
: | Input matrix of weight |
: | Primary data (%, °C) |
: | Activation function (Purelin function) |
: | Objective function of output layer. |
: | Perturbation matrix |
: | Particle number () |
: | Inertia factor (≥0) |
: | Weight factor |
: | Constraint factor |
: | Activation function |
: | Learning rate |
: | Connection weight matrix. |
: | Searching dimensions |
: | Transport matrix |
: | Connection weight matrix between associative unit and hidden unit |
: | Connection weight matrix between input unit and hidden unit |
: | Connection weight matrix between hidden unit and output unit. |
IRV: | Ideal real value |
: | Supply water |
: | Return water |
: | Neuron connection weight from the input layer to the hidden layer |
: | Chaos and hysteresis value. |
Conflicts of Interest
The authors declare that they have no conflicts of interest.