Abstract

This study proposes a novel framework that couples the general likelihood uncertainty estimation (GLUE) method with a deterministic forecasting approach to conduct a new uncertainty analysis approach for assessing the energy production of operational wind turbines installed in the Jhongtun wind farm at Penghu (an island in the middle of Taiwan Strait). The 10-year measured data of wind speeds and energy output collected on these wind turbines is divided into two 5-year data sets for the present analysis framework of execution and validation to demonstrate the predictability of the GLUE method. The present study considers 15 scenario testing cases with various time periods, i.e., twelve months, the strong-wind (October-March) regime, the weak-wind (April-September) regime, and one year, for the framework to investigate the applicability of the GLUE method on long-term wind energy forecasting. In the execution framework, the 5-year measured data is used by the GLUE method to access the uncertainties involved in the deterministic approach (i.e., the shape and scale parameters of the Weibull wind speed distribution (WWSD), the performance curve, and the capacity factor) with two confidence intervals of 50% and 90%. The framework is then validated by the measured capacity factors in the last 5-year data and compared with the results of the uncertainty analysis approach by the Monte Carlo (MC) approach to discover the applicability of the new uncertainty analysis approach. From the simulated results, it is found that the proposed uncertainty analysis approach provides predictions of confidence intervals that match the measured data better than the MC-based uncertainty analysis approach. Specifically, the proposed approach can match the measured capacity factors in all the simulated scenarios. Conversely, the MC-based approach is found to create narrow confidence intervals that cannot completely capture the measured capacity factors, particularly for the strong-wind, weak-wind, and one-year scenarios. Therefore, this novel uncertainty analysis approach is proven to be useful in predicting the uncertainties of wind energy production.

1. Introduction

Wind energy has been recognized as a promising alternative to nonrenewable energy resources. Furthermore, wind energy is proven to introduce a noteworthy reduction in greenhouse gases (i.e., CO2) and water consumption, which reveals the effectiveness of wind energy in preserving environmental sustainability [1]. So far, the installation of wind turbines and wind farms worldwide has reached a total capacity of 906 GW in 2023 [2]. Such a large amount of energy is harvested from wind turbines installed in the atmospheric boundary layer where the flow is highly turbulent, intermittent, energetic, and unstable, which leads to significant uncertainties in wind energy production. In this aspect, the integration of wind energy into electricity power systems has a great impact on power system operations, stability, and planning. The most direct way to tackle this problem is to improve the accuracy and reliability of wind energy forecasting. As a result, many studies associated with the investigation of wind energy forecasting techniques have been made in many countries during the past two decades [39]. These approaches have been built for various forecasting horizons according to their objectives, i.e., the very short-term forecasting horizon with a time scale from seconds to minutes for wind turbine control, the short-term forecasting horizon with a time scale from hours to days for economic dispatch, the medium-term forecasting horizon with a time scale from days to weeks for maintenance scheduling, and long-term forecasting horizon with a time scale from weeks to months or years for wind power planning, as summarized by Zhang et al. [10].

Summarizing the aforementioned approaches of various wind energy forecasting technologies in the past two decades, as shown in Figure 1, the authors have classified wind energy forecasting approaches into deterministic forecasting and uncertainty analysis based on whether uncertainties are quantified or not. The deterministic forecasting approaches are further divided into two categories: physics-driven forecasting and data-driven forecasting. In the physics-driven forecasting approaches, the physical causes of wind speeds, i.e., pressure and altitude, are considered. These approaches can be then subdivided into wind physics and turbine physics approaches. For the wind physics approaches, the wind speed data can be represented by a probability density function established based on the measured wind speed data. Among various distributions, the Rayleigh or Weibull distributions are often adopted to describe the wind speed distribution. Basically, the Rayleigh distribution is a special case of the Weibull distribution with a constant shape parameter of 2 and a constant location parameter of 0. As the Rayleigh distribution only consists of one parameter (the scale parameter), it is relatively more convenient to use. Nevertheless, the Weibull distribution is more widely used as it has more flexibility from the inclusion of the shape parameter to match the wind speed data. On the other hand, since the fitness of the adopted wind speed probability density function can significantly influence the accuracy of the predicted wind energy, there are various ways to find the optimal parameter set. For example, AI-Quraan et al. [11] developed an artificial intelligence technique with the whale optimization algorithm (WOA) to search for the optimal parameter set and found that the WOA outperforms the maximum likelihood and moment methods. Besides the probability density functions, the wind speed data can be simulated by numerical weather prediction (NWP) models such as the Weather Research and Forecasting (WRF) model. The corresponding wind energy output can be computed in several ways including using the performance curves of wind turbines or from functions that relate the measured wind speed and generated energy output [3, 12, 13]. In terms of the turbine physics approaches, the wind speed is directly given and the corresponding wind energy output under this wind speed is computed based on the empirical formula from wind tunnel experiments, which is conceptually similar to the use of the standard performance curve of a wind turbine. In contrast to the physics-driven forecasting approaches, the data-driven forecasting approaches, including the stochastic and artificial intelligence (AI) approaches, completely utilize the measured data of wind speeds and energy output to forecast wind energy output through a predictive function [14]. For the stochastic approaches, the predictive function is expressed by probability theory and stochastic processes, e.g., the Kalman filter (KF) by Zuluaga et al. [15] and Bayesian multiple kernel regression by Wang et al. [16]. Nevertheless, the stochastic approaches are found inappropriate for nonlinear data and could lead to significant delays [16]. The AI approaches have recognized robustness and effectiveness in predicting the wind energy output [14], e.g., the artificial neural network (ANN) by Tu et al. [5, 6] and the extreme learning machine (ELM) by Liu et al. [17]. However, the accuracy of wind energy forecasting based on AI approaches significantly relies on the training set. Concerning all the aforementioned deterministic approaches, because of the inevitable uncertainties in the atmosphere that bring natural variations of wind speeds, the predicted wind energy may not match the actual one in some circumstances, e.g., in seasons where the direction or magnitude of winds change frequently [4]. Furthermore, the generated wind energy is found to have nonlinear and no-stable properties which subsequently decrease the predictivity of these approaches.

To resolve the aforementioned difficulties of the deterministic approaches, uncertainty analysis approaches are subsequently proposed based on the aforementioned deterministic forecasting approaches to quantify the uncertainties in the prediction of wind energy [4]. In the uncertainty analysis approaches, a distribution of wind energy is provided instead of a fixed value in the deterministic forecasting approaches. Currently, there are three main categories of uncertainty analysis, i.e., the risk index, scenario, and probabilistic forecasting approaches [10]. For a risk index approach, a risk index is computed to reflect the expected wind energy forecasting error that is directly related to the predictability of wind energy forecasting. Different indexes such as the Meteo-Risk Index (MRI) by Pinson and Kariniotakis [18] and MaxMin/MaxMinMax by Holmgren et al. [19] are proposed for quantifying such a risk. From the viewpoints of the scenario approaches, they consider the spatiotemporal correlation of the uncertainties at different look-ahead time points and geographical positions such that the scenario approaches are suitable for dynamic decision-making problems [20, 21]. For more details on the risk index and scenario approaches, the readers are referred to Zhang et al. [10].

For the probabilistic forecasting approaches in the category of uncertainty analysis, the wind energy output is a probability density function that provides additional information on the quantity of the involved uncertainties. Currently, there are nonparametric and parametric approaches for building the probability density function [10]. Concerning the nonparametric approaches, no assumption is required for describing the distribution shape of any uncertainty such that the nonparametric approaches are distribution-free. Various approaches are built to estimate the wind energy distribution in various ways, such as the quantile regression method (e.g., the Local Quantile Regression (LQR) model by Bremnes [22, 23]), the kernel density estimation (KDE) method (e.g., the KDE method based on copula method by Bessa et al. [24]), the ensemble forecasting method that uses massive WRF ensembles (e.g., the approach that uses the Bayesian Model Averaging (BMA) to average the predicted results of three AI approaches by Wang et al. [25] and the approach by Xie et al. [26] that uses a nonparametric time series forecast model (Infinite Markov Switching Autoregressive, IMSAR) for wind power generation and applies the Bayesian inference with a sampling procedure to quantify the uncertainty based on measured data), and the AI method (e.g., the nonparametric approach by Wan et al. [27]). Although no parameters of the distribution shape are required to be determined, as noted by Zhang et al. [10], the nonparametric approaches are more computationally expensive than the parametric approaches that will be introduced in the next paragraph. Hence, based on the reviews of the deterministic approaches in the second paragraph and the nonparametric approaches in this paragraph, the target of this research is to build a parametric approach based on a physics-driven forecasting approach comprising both the wind and turbine physics approaches.

For the parametric approaches in the category of probabilistic forecasting, the distribution shapes of parameters with uncertainties are analytically defined [10]. Based on the aforementioned objective of the present study (convert a deterministic forecasting approach (physics-driven forecasting approach) to a probabilistic forecasting approach (parametric approach)), as displayed in Figure 2, the authors herein classified various parametric approaches into the MC approaches, GLUE approaches, Bayesian approaches, stochastic approaches, and AI approaches. In the MC approaches, the distributions of uncertain factors are defined, e.g., applying the Weibull distribution for wind speeds, and massive samplings are created to form the resultant wind energy output distribution. For example, Kwon [28] considers the uncertainties in the WWSD (i.e., the shape parameter), ground roughness and performance curve, etc., and uses the MC method to create a huge number of simulated values to evaluate the uncertainties of the wind energy. Mokryani and Siano [29] establish an integrated approach that comprises the MC method and market-based optimal power flow to evaluate the impact of wind turbine integration into the distribution networks within a market environment. Zhao et al. [30] examine the performance of the probabilistic uncertainty analysis based on the MC method and chaos least squares support vector machine algorithm (Chaos-LS-SVM) for predicting confidence levels of wind energy output. The Bayesian approaches utilize the Bayesian inference method to estimate the parameters of defined distributions of the uncertain factors and then apply the MC method to form the desired distribution, e.g., Bracale and Falco [31] establish an advanced Bayesian approach for short-term wind energy forecasting that includes a mixture of two WWSDs and a Bayesian interface method to find the parameters of the two WWSDs. Then, the MC method is used to find the desired wind energy distribution. Jung et al. [32] propose a Bayesian approach to predict the parameters of the distributions of wind speeds under the condition that there is limited measured wind speed data at the study site. Thus, the uncertainties in the air density, performance curve, and wind speeds at the reference site are both considered. For the stochastic approaches, there are different researches, e.g., the Self-Exciting Threshold Autoregressive (SETAR) model by Gallego et al. [33] that is based on the Regime-Switching Autoregression (RS-AR) nonlinear time-series model, the Conditional Parametric Autoregression eXtraneous (CPARX) model by Nielsen [34], and the neural network by Sideratos and Hatziargyriou [35], the Autoregression-Generalized Autoregressive with Conditional Heteroscedasticity (AR-GARCH) model [36]. In the AI approaches, the AI is used again to predict the parameters for defining the distribution, e.g., the approach that uses a feedforward neural network to predict wind energy output by Khosravi et al. [37].

As to the GLUE approaches within the category of the parametric approaches (as drawn in Figure 2), the uncertainty analysis for wind energy forecasting is conducted by incorporating the GLUE (generalized likelihood uncertainty estimation) method. Basically, in the GLUE method, a user-defined likelihood function is used to quantify the difference between the simulated and measured data [38], such that the simulated results with better accuracy can be emphasized. Thus, it is conceptually a mixture of the MC, AI, and stochastic approaches. The GLUE method has been used in many fields because of its simplicity for implementation and its demonstrated capability for uncertainty analysis, e.g., in the hydrological models [3941], flood inundation models [4245], and rainfall-runoff models [46, 47], to access the uncertainties of studied problems. Nevertheless, currently, there is no research using the GLUE method to conduct the uncertainty analysis of wind energy generation.

To fill this gap, this study proposes a novel uncertainty analysis approach that couples the GLUE method with a deterministic forecasting approach to assessing the energy production of operational wind turbines at Penghu. Collected 10-year wind speed and energy output data of these wind turbines are utilized for this analysis. For demonstration, the data is divided into two 5-year data sets for the framework of execution and validation. The framework comprises 15 scenario testing cases with various time periods as twelve months, one strong-wind regime, one weak-wind regime, and one year to investigate the applicability of the GLUE method on long-term wind energy forecasting. For each testing case, the first 5-year data is utilized to analyze the uncertainties of the Weibull wind speed distribution parameters, the wind energy production, and the capacity factor by the GLUE method with two confidence intervals of 50% and 90%. Next, the framework is evaluated with the measured capacity factors based on the last 5-year data. To discover the advantages of the GLUE method, the MC method is selected for comparison because of its simplicity and applicability to various approaches. The results of the GLUE-based uncertainty analysis approach are compared with the results of the MC-based uncertainty analysis approach, and attention is devoted to the accuracy of the confidence intervals of wind energy output.

2. Deterministic Approach

2.1. Weibull Wind Speed Distribution (WWSD)

In general, wind energy density per unit area can be estimated using a probability density distribution fitting to a set of wind speed data. Considering that the natural distribution of measured wind speeds often matches a Weibull shape [3], the Weibull density function comprising two parameters is used to represent wind energy density: where is the wind energy density function, is the wind speed (m/s), is the scale parameter highly related to the mean of the wind speed distribution, and is the shape parameter that is usually in a range between 1.5 and 3. The corresponding cumulative distribution function can be obtained by integrating Equation (1):

There are various ways to find the scale and shape parameters, i.e., the numerical approaches [48] and the artificial intelligence approaches [11, 49] with higher accuracy but heavier computational demands than the numerical approaches. For simplicity and acceptable accuracy in predicting the parameters (Chang et al. [7]), the maximum likelihood method as in the category of the numerical approach is used to compute the two parameters through iterations using the following equations [4]: in which subscript refers to the th time period and is the number of nonzero measured wind speed data. The average wind speed at a local site is given by where denotes the Gamma function as .

2.2. Wind Energy Production

Wind energy production is the process that converts the energy of wind into mechanical energy and stores the mechanical energy as power energy. Nevertheless, since there are several types of losses in the system, the whole mechanical energy cannot be completely converted into power energy. Currently, for estimating the energy output from a wind turbine at a specific wind speed, there are two commonly used approaches. In the first approach, the energy output is computed by utilizing the standard performance curve from the manufacturers which describes the energy output at various wind speeds under a standard environment. The other approach estimates the energy output by the regression methods or artificial neural networks from the measured data set of wind speeds and energy output. Nevertheless, although the second approach can lead to higher accuracy than the first approach, it requires massive data for the computations. As a result, the present study adopts the first approach hereinafter to calculate the energy output of a wind turbine at a specific wind speed. The corresponding formula for determining the energy output by a standard performance curve is given as where , , and refer to the cut-in wind speed, rated wind speed (m/s), and cut-out wind speed (m/s), respectively. is the rated power which is the maximum energy output of the wind turbine. , , , , and are coefficients that are provided by the corresponding manufacturer of the used wind turbine. From Equation (5), when the wind speed is less than the cut-in wind speed, the fans of the wind turbine are not rotated such that the subsequent energy output is zero. When the wind speed is greater than the cut-in wind speed, the fans of the wind turbine are rotated and the energy output is subsequently produced. Constant energy output (i.e., the maximum energy output) is produced by the wind turbine under the condition that the wind speed is between the rated and cut-off wind speeds. When the wind speed is greater than the cut-off wind speed, the wind turbine is turned down to avoid potential permanent damage under such a violent wind; hence, there is no energy output. The standard performance curve is displayed in Figure 3 for illustration.

Combining the WWSD aforementioned in Section 2.1 and the standard performance curve in this subsection, the energy output () in a time period can be expressed as which leads to the form of

Equation (7) is solved by applying numerical integration. In practice, the energy output is often divided by the maximum energy output to derive a dimensionless factor as the capacity factor to give a more objective inspection of the produced energy output [3, 50]. Hence, the produced energy output is represented by the capacity factor hereinafter. The capacity factor is defined as (Chang et al. [7])

The formula for computing the capacity factor can be deduced by combining Equations (7) and (8), leading to

Equation (9) cannot be directly integrated. Instead, the present study adopts the recursive adaptive Simpson quadrature to numerically solve Equation (9) to find the capacity factor .

3. Uncertainty Analysis Approaches

3.1. Generalized Likelihood Uncertainty Estimation (GLUE) Method

The GLUE method is currently one of the main methods used for uncertainty analysis in hydrology and water quality simulations [38]. It utilizes the likelihood function and considers the parameter space and probability distribution patterns to appropriately describe the involved physical mechanisms. Under general circumstances, as long as a model has several parameter combinations and measured simulation results, the analysis process can be initiated. The first step in performing the uncertainty analysis by the GLUE method is to decide a representative likelihood function that measures the difference between the simulated and measured results. Then, the decided likelihood function is used to perform a sensitivity analysis of all the involved parameters. A likelihood value scatterplot of each involved parameter is conducted to observe where the optimal performance occurs. Subsequently, the parameter range for the parameter is determined. Then, a sufficiently large number of parameter combinations are created based on these derived parameter ranges. The simulated values and the corresponding likelihood values of these parameter combinations are computed and recorded. For a parameter combination, the likelihood value is updated when a new likelihood value is computed due to the use of new data: where is the updated likelihood value, refers to the likelihood value based on the original data, is the likelihood value based on the new data, and and are the weights for the original and new data, respectively. After the likelihood value of each parameter combination is computed and/or updated, the simulated results are then sorted according to the simulated value of each parameter combination. After that, the likelihood values are normalized to obtain the desired distribution. The confidence intervals are subsequently derived based on the distribution.

3.2. Monte Carlo (MC) Estimation Method

Monte Carlo simulation is a method for solving problems via massive random samplings to produce a huge number of simulations that can describe the desired population [51]. Nowadays, through the use of computers, a lot of samplings and corresponding simulations can proceed to obtain an estimated answer in a relatively short time. The MC method is currently being applied in many fields such as physics, chemistry, and even social sciences. The basic principle of the MC method is to define all possible events via the considered probability density functions and to cumulate the probability density functions into cumulative probability functions from 0 to 1. Through massive simulated data, a link can be established between random sampling and actual problem simulation. Then, the most likely result of an actual problem is conducted. According to the central limit theorem, the sample average after massive simulation is the unbiased estimation of the population’s average. Also, the confidence interval can be obtained by statistically analyzing the outcomes of these simulations.

4. Study Site and Data Background

4.1. Analysis of Wind Speed and Power Data of Wind Turbines at the Study Site

This study uses measured data of wind speeds and power output at four Enercon E44-600KW wind turbines in the Jhongtun wind power station at Penghu which is an offshore island in the middle of the Taiwan Strait, as Figure 4(A) depicts. The Jhongtun wind power station has been operating since 2001. It is the second median-scale wind power plant in Taiwan and the first wind power plant located on the offshore island of Taiwan. The station is set in the Baisha village of Penghu, as Figure 4(B) displays. The four Enercon E44-600KW wind turbines are erected by the Taiwan Power Company on the north side of the village, as depicted in Figure 4(C). The four wind turbines are marked as 1,2, 3, and 4 from east to west in Figure 4(D). An Enercon E44-600KW wind turbine has a capacity of 600 kW with 3 m/s, 13 m/s, and 25 m/s for the cut-in, rated, and cut-off wind speeds, respectively. The parameters for the standard performance curve in Equation (5) are given as , , , , and . Time series of wind speeds and power output of the four wind turbines have been recorded in a 10-minute time resolution from 2002 to 2011. Unfortunately, after 2012, the measured data is no longer available to the public. Nevertheless, this situation will not influence the quality of the present paper. On the other hand, despite the measured data of the four wind turbines can all be adopted for the rest of uncertainty analysis, the records of the last three wind turbines are not used since the wind speeds around these three wind turbines are influenced by the first wind turbine (wind turbine #1) in the wind velocity field of northeast monsoon during the strong-wind regime (from October to March). Furthermore, the 10 min measured data of wind turbine #1 is shortened to 1-hour data by averaging the six 10 min data within an hour to increase the computational efficiency. In terms of the pattern of the wind velocity field, it can be classified as the strong-wind and weak-wind (from April to September) regimes. In the strong-wind regime, the wind speeds can be up to 10 m/s to 16 m/s, whereas the wind speeds are between 5 m/s and 10 m/s in the weak-wind regime. Therefore, to discover the capability of the developed uncertainty analysis in various timespans, the present study has 15 scenario testing cases, i.e., twelve months, the strong-wind regime, the weak-wind regime, and one year. The obvious difference in the weak-wind and strong-wind regimes also leads to the difference in capacity factors [5], which can be seen in Table 1. Table 1 lists the monthly capacity factors determined by the 1-hour time series of each scenario during the 10 years. Table 1 also displays the estimated monthly capacity factors by the deterministic approach aforementioned in Section 2 where the measured wind speed data for each year is used to find the fitted Weibull distribution in each scenario. From Table 1, the aforementioned two prominent wind regimes can be inspected within a year in Taiwan. In the strong-wind regime, the monthly capacity factors are in the range of 0.509 to 0.793. On the other hand, the monthly capacity factors in the weak-wind regime are only between 0.188 and 0.373. Besides, it is seen that a larger discrepancy between the measured and estimated capacity factors is found in the weak-wind regime (up to 28.2%) than in the strong-wind regime (up to 8.2%). Such a phenomenon can also be seen in monthly scenarios. The average error for the scenario with weak wind (months from April to September) is in a range between 8.8% and 25.13%, and the average error for the scenario with strong wind (months from October to March) is between 3.3% and 18.7%. Thus, the above phenomenon indicates the necessity to use uncertainty analysis approaches to predict the capacity factor of wind turbine #1.

Finally, as aforementioned, there is a fitted Weibull distribution for each year in each scenario. Consequently, there are 150 fitted Weibull distributions in the present study. Overall, these fitted Weibull distributions can all reasonably represent the distributions of the measured wind speeds provided that the coefficients of determination () are all acceptably close to 1. Owing to the space limitations, the fitted results for all 15 scenario testing cases in 2004 with an averaged of 0.81 are displayed in Figure 5 for demonstration. From this figure, regarding monthly scenarios, there might exist two peaks in a measured wind speed distribution where the wind speed turns from strong to weak or weak to strong (i.e., February and October). This can also be seen in the one-year scenario where a peak for stronger winds and another peak for weaker winds are observed.

4.2. Uncertainty Analysis of the Wind Energy Production

In the present study, the prediction of the energy output of wind turbines through the adopted deterministic approach involves the determination of wind speeds by the WWSD and the wind-to-energy conversion through the performance curve of the wind turbines. As a result, the uncertainties of the wind energy output, which is represented by the capacity factor in the present study, are related to the WWSD and performance curve. Specifically, for the WWSD, the uncertainties come from the shape parameter and scale parameter , as will be discussed in Subsection 4.2.1. Nevertheless, the uncertainties in selecting the Weibull distribution instead of other distributions to describe wind speeds are not considered in the present study since the Weibull distribution is proven to have the best fitness than the other distributions. The uncertainties of the WWSD and the performance curve are next discussed in Subsections 4.2.1 and 4.2.2, respectively. The distinctive procedures for implementing the MC and GLUE methods are introduced in Subsection 4.2.3 and 4.2.4, respectively.

4.2.1. Uncertainties of the WWSD

As the amount of wind speed data in the distribution selection process is often very large, a perfect description for the measured wind speed data by a probability density distribution is unable to be obtained. Chang et al. [3] collect wind speed data from 25 weather stations between 1996 and 1999 and find that the Weibull distribution coincides with the measured wind speed data with a high degree of precision. Furthermore, concerning the wind speed distribution in Taiwan, the Weibull distribution is again to be the most representative. Hence, this study selects the Weibull distribution to represent wind speed data, and the uncertainties in using the Weibull distribution instead of other distributions are not considered as aforementioned. Nevertheless, the uncertainties of the WWSD exist due to many factors. First, the anemometer is positioned behind the wind turbine’s blades such that the wind turbine’s slipstream or other wind turbines nearby may cause a draft disturbance. Also, human activities, machine malfunction, or other issues may cause discrepancies between the measured and actual wind speeds. In the present study, the uncertainties of the WWSD are assumed to be mainly attributed to the uncertainties of the WWSD parameters which are obtained via measurement of the local long-term wind speeds, i.e., the shape parameter and scale parameter . In such a way, the aforementioned discrepancy between the measured and actual wind speeds is reasonably included in the analysis.

4.2.2. Uncertainties of the Performance Curve

For a given wind speed, the energy output is obtained by substituting the wind speed into the performance curve provided by the wind turbine manufacturer or from the regression analysis of the measured data. However, these two methodologies are known to underestimate the monthly capacity factor in the strong-wind regime and overestimate the monthly capacity factor in the weak-wind regime [4]. Equivalently speaking, there is not yet a perfectly accurate method to determine the energy output. Consequently, the process of determining the capacity factor has some degree of uncertainty. Also, this process is subject to the limits imposed by the amplitude of the data, the size of the error in the model itself, and the errors due to human factors and machine malfunction. These involved uncertainties are all considered by accessing the uncertainties of the performance curve.

4.2.3. Procedures for the Uncertainty Analysis by the MC Method

In the adopted MC method, to consider the uncertainties within the WWSD (as the shape parameter and scale parameter ) and the performance curve, for each scenario, a massive number of simulations are conducted via random sampling from 0 to 1 based on the cumulative density distributions of these parameters. The sequential procedures for creating a simulation for a specific scenario are introduced next.

In the first step, the uncertainties of the shape parameter and scale parameter in the WWSD are considered. Specifically, the uncertainties of the scale parameter are herein considered by analyzing the uncertainties of the average wind speed . The shape parameter and average wind speed are both assumed to be in normal distribution, and the statistics (i.e., averages and standard deviations) of the two parameters in the scenario are computed by the 10-year measured data according to the period of the scenario. Based on the computed two normal distributions, a shape parameter and an average wind speed are randomly sampled. Then, the sampled shape parameter and average wind speed are utilized to compute the scale parameter through Equation (4). A WWSD and its cumulative density function are thus decided based on the shape parameter and scale parameter .

In the second step, a wind speed is randomly sampled based on . Then, the capacity factor of wind turbine #1 is computed by using the performance curve based on the sampled wind speed by Equation (9). To consider the uncertainties of the performance curve, the present study assumes that the energy output under a given wind speed is a normal distribution. To find this normal distribution, the measured wind speeds and energy output in the Jhongtun wind plant are drawn in Figure 6. As shown in this figure, under a given wind speed, the distribution of the energy output is almost symmetrical despite some data being far away from the performance curve due to manual or mechanical problems. Hence, the present study assumes that the energy output is in the normal distribution under a given wind speed. The uncertainties of the performance curve are considered by assuming that the energy output on the performance curve is the average value of the normal distribution. Also, the standard deviation of the distribution is given as the variance of the measured energy output. For the variance of the measured energy output, it is assumed to be a negative-slope straight line between the cut-in and rated wind speeds and a horizontal line between the rated and cut-off wind speeds [28]. By analyzing the measured data in the Jhongtun wind plant, the coefficient of variation is computed and a regression curve is found (Figure 7(a)). Correspondingly, the standard deviation of the energy distribution () is given as where is the computed energy output under a given wind speed from the standard performance curve. Thus, for a sampled wind speed , the corresponding energy output is computed by the performance curve and then added by a deviation that is randomly sampled from a normal distribution with zero mean and a standard deviation of . Taking the used Enercon E44-600KW wind turbine as an example, the generated energy output using the normal distribution together with Equation (11) is displayed in Figure 7(b). By repeating step 2 many times, the capacity factor can be computed. Finally, for the considered scenario, steps 1 and 2 are repeated 100,000 times to produce 100,000 capacity factors that can be used to derive the confidence interval of the capacity factor. The results for all 15 scenarios are displayed and analyzed in Section 5. The flowchart of the M-C method is displayed in Figure 8 for illustration.

4.2.4. Procedures for the Uncertainty Analysis by the GLUE Method

To implement the GLUE method for accessing the uncertainties of the energy output, the present study modifies the formula presented by Nash and Sutcliffe [52] as the likelihood function since each simulated capacity factor is a single value [53]: where is the likelihood value, is the simulated value, and is the measured value. As Equation (12) depicts, this likelihood function is 1 minus the corresponding error value. As a result, the likelihood value has a similar effect as the efficiency coefficient which represents the quality of the simulation result. After determining the likelihood function, the GLUE method then processes a parameter sensitivity analysis of the three considered parameters (the shape parameter , scale parameter , and performance curve) by using the likelihood values of all possible parameter combinations to discover the parameter range of each parameter that can lead to a simulated result with better accuracy. Although each scenario can have its determined parameter ranges, for the sake of simplicity, the parameter ranges are decided to have the best accuracy in all 15 scenarios. Usually, to discern the variation of the likelihood value in the simulation result, a larger parameter range will be used. Furthermore, to lower the complexity of using the GLUE method, the parameter range is defined by two ratios that describe the lower and upper bounds, respectively. In this way, concerning the shape parameter , the parameter sensitivity analysis finds the optimal parameter range as . Similarly, for the scale parameter , the optimal parameter range is defined as . The uncertainties of the performance curve are considered by multiplying a computed capacity factor with an adjusted coefficient . The parameter range of the performance curve is actually the parameter range of this coefficient as .

After finding the parameter ranges of the three uncertain parameters, for each scenario, the GLUE method separates the measured data into the training set (2002 to 2006) and the validation set (2007 to 2011) to demonstrate the predictability of the GLUE-based approach. In the training set, for each year, the GLUE method randomly samples values of parameter and values of parameters from their corresponding parameter ranges with the assumption that and are both in uniform distribution. Subsequently, there are distinctive WWSDs and corresponding capacity factors computed by Equation (9). Next, to include the uncertainties of the performance curve, values of the adjusted coefficients are randomly sampled with the same assumption that the adjusted coefficient is in uniform distribution. The previously computed capacity factors are multiplied by these adjusted coefficients, resulting in parameter combinations and their corresponding capacity factors. After that, the likelihood values of these parameter combinations are computed. In the next step, for each parameter combination, an average likelihood value over the 5 years in the training stage is calculated by the following equation that is modified from Equation (10): where subscript refers to the th parameter combination. The weights in Equation (13) (, , , , and ) are all given as 1/5 based on the assumption that each year is an independent event. The flowchart of the GLUE method in the training set is displayed in Figure 9 for illustration. In the validation set, as displayed in Figure 10, the GLUE method also creates parameter combinations and computes the corresponding capacity factors by the same means as the training set. The average likelihood values from the training stage are directly used as the likelihood values for these parameter combinations in the validation stage. These likelihood values are then utilized as the weights of these parameter combinations. In such a way, the probability and cumulative density functions of the capacity factor are established and the confidence interval of the capacity factor for the last 5-year validation set can be derived. The results of all the scenarios are discussed in Section 5.

5. Results and Discussion

In the present study, the MC and GLUE methods are used to establish the confidence intervals of the capacity factor of wind turbine #1 at the Jhongtun wind power station for the 15 scenario testing cases. For demonstration, for each scenario, the 50% and 90% confidence intervals of the two uncertainty analysis approaches are compared with the measured capacity factors in 2007-2011 to examine the applicability of the two approaches. Theoretically, when the established distribution of capacity factor is reasonable, there should be about two or three measured capacity factors falling within the 50% confidence interval of the distribution and about four or five measured capacity factors falling within the 90% confidence interval. The larger the number of the measured capacity factors falling within the confidence interval is, the more the accuracy of the uncertainty analysis approach is. The results of the MC-based and GLUE-based uncertainty analysis approaches are displayed in subsections 5.1 and 5.2, respectively. The comparison between the two uncertainty analysis approaches is stated in subsection 5.3.

5.1. Results of the MC-Based Uncertainty Analysis Approach

The 50% and 90% confidence intervals of the capacity factor for all 15 scenarios of the MC-based approach are drawn in Figure 11. Overall, the MC-based approach can give acceptably accurate predictions in all 15 scenario testing cases. Specifically, in terms of the monthly scenarios, when concerning the 90% confidence interval, the MC-based approach can provide quite good predictions in January to May, July, and October. Particularly, the MC-based approach gives a satisfactory prediction in October. Nevertheless, in terms of the 50% confidence interval, the MC-based approach gives relatively poor predictions in almost all the months except April, August, October, and November. Specifically, in March and September, all the measured capacity factors are out of the 50% confidence interval. Comprehensively speaking, the MC-based approach provides relatively better results in April and October in which the weak/strong winds are changed into strong/weak winds. In other months, the accuracy of the MC-based approach on the confidence intervals, particularly for the 50% confidence interval, is relatively low. In terms of the strong-wind, weak-wind, and one-year scenarios, it is found that almost every measured capacity factor is out of the ranges of the 50% and 90% intervals, especially for the weak-wind and one-year scenarios. Also, the MC method tends to overestimate the capacity factors in all three scenarios, which is also seen in monthly scenarios with strong winds. Concerning the differences among the three scenarios, the 50% and 90% intervals in the weak-wind and one-year scenarios are both relatively narrower than the strong-wind scenario.

The above result indicates that the MC-based uncertainty analysis approach underestimates the uncertainty in the three scenarios compared to the other monthly scenarios, especially in the weak-wind and one-year scenarios. Also, the MC-based approach tends to overestimate the capacity factors in all scenarios. The above phenomenon may be attributed to the use of the Weibull wind speed distribution and standard performance curve for wind-to-power conversion. In the present study, the two-parameter Weibull distribution is adopted such that the wind speed distributions in scenarios with more than one significant wind speed peak cannot be completely considered. Also, the Weibull wind speed distribution may significantly overestimate the capacity factors in the weak-wind periods and slightly underestimate the capacity factors in the strong-wind periods, as reported in [4]. Besides, there may still be unresolved uncertainty in the wind-to-power conversion by the standard performance curve. Nevertheless, the MC-based uncertainty analysis approach can still provide an acceptably accurate estimation of the wind power energy uncertainties.

5.2. Results of the GLUE-Based Uncertainty Analysis Approach
5.2.1. Parameter Sensitivity Analysis

For the parameter sensitivity analysis of the GLUE method, the values of the shape parameter , scale parameter , and adjusted coefficient of the performance curve range from -50% to +50% ( and ), -50% to +50% ( and ), and -30% to +30% ( and ), respectively. Only the data in the training set (2002 to 2006) is used for this analysis, and there are 10,000 simulations in each scenario. It is noted that the computed capacity factor is limited to be less than or equal to 1 to avoid unrealistic results. The closer the corresponding likelihood value (by Equation (12)) is to 1, the better the simulation result is. Based on the computed likelihood values in all of the 15 scenarios, the optimal parameter ranges for the three parameters can be found. Owing to the space limitations, the results of the strong-wind and weak-wind scenarios are displayed in Figures 12 and 13, respectively, for showing the observed trends. The red dots in these figures are the averaged likelihood values used to depict the average trend of the likelihood value.

Generally, concerning the distributions of likelihood values, for each parameter, the values of likelihood functions are concentrated to 1 in the strong-wind scenario, which is also seen in the monthly scenarios with strong wind (January to March and October to December). On the other hand, the weak-wind scenario and the weak-wind monthly scenarios (April to September) all have wider distributions of the likelihood values than in the strong-wind scenario and monthly with strong wind. As for the one-year scenario, its pattern of likelihood values lies between the strong-wind and weak-wind scenarios. Based on these results, to simultaneously get high likelihood values in all the scenario testing cases, the parameter ranges for , , and the performance curve are determined to be -20% to +10%, -30% to +10%, and -10% to 10%, respectively. Equivalently speaking, and for the parameter range of , and for the parameter range of , and and for the parameter range of the adjusted coefficient. On the other hand, the shape parameter is found to be the most sensitive, while the uncertainties of the performance curve are almost neglectable. In terms of the uncertainties in different scenarios, the uncertainties in the weak-wind regime are relatively larger than in the strong-wind regime.

5.2.2. Uncertainty Analysis

The GLUE method conducts the uncertainty analysis by using the determined parameter ranges of the three considered parameters, as discussed in subsection 5.2.1. The analyzed outcomes of the GLUE method are displayed in Figure 14.

From the viewpoint of the monthly scenarios, concerning the 90% confidence interval, the GLUE method leads to accurate results in almost all months except April. As for the 50% confidence, the GLUE method only fails in October where only one measured capacity factor is within the 50% confidence interval. Thus, when simultaneously concerning the performance on the 50% and 90% confidence intervals, the accuracy of the GLUE method is quite good except for April and October. For the strong-wind, weak-wind, and one-year scenarios, the GLUE method also successfully maintains its high accuracy on both the two confidence intervals in which sufficient numbers of the measured capacity factors fall within the intervals. The reason why the results in April and October are less accurate is because the two months fall in either spring or autumn, a time when both strong/weak winds meet and bring greater fluctuations in wind speeds. Nevertheless, if the time scale is extended to 6 months for the strong-wind, weak-wind, or even one-year scenarios, the differences in the annual capacity factors are small since the homogeneity of the data is increasing.

5.3. Performance Comparison between the MC-Based and GLUE-Based Approaches

For comparing the two approaches, for each scenario, the numbers of the measured capacity factors, as listed in Table 1, that fall within the 50% and 90% confidence intervals are counted and listed in Table 2. The lower and upper bounds of the capacity factor (denoted as and , respectively) for the two approaches are also listed in Table 2 for illustration. Furthermore, a visual comparison is also conducted by inspecting Figures 11 and 14. Based on Table 2 and the two figures, both two approaches are found to provide acceptably accurate estimations of the wind energy output uncertainties. Nevertheless, the GLUE-based approach provides results that match the measured data better than the MC-based approach. When considering the monthly scenarios, the GLUE-based approach gives pretty accurate results in almost all months except for April and October in which the numbers of the measured capacity factors in the 50% or 90% confidence intervals are relatively smaller. Compared to the GLUE-based approach, the MC-based approach only gives the same accurate results in April and October. In other months, the 50% and 90% confidence intervals, particularly for the 50% confidence intervals, are found to be less consistent with the measured. Specifically, there is no measured capacity factor falling within the 50% confidence intervals in March and September.

Regarding the strong-wind scenario, the GLUE-based approach has 3 and 4 data points within the 50% and 90% confidence intervals, respectively, which demonstrates the capability of the GLUE-based approach to grasp the degree of variations in the capacity factor. For the MC-based approach, there are only 1 and 2 measured capacity factors falling within the 50% and 90% confidence intervals, respectively. Hence, compared to the GLUE-based approach, the variations of capacity factors are relatively underestimated by the MC-based approach. Also, inspection of Figures 11 and 14 and Table 2 indicates that the confidence intervals by the MC-based approach are relatively narrower than the GLUE-based approach. This result means that the MC-based approach relatively underestimates the degree of uncertainty. For the weak-wind scenario, almost all measured capacity factors fell within the 50% and 90% confidence intervals of the GLUE-based approach. The simulation results of the MC-based approach are found to be distorted given that no measured capacity factor is within the 50% and 90% confidence intervals. For the one-year scenario, the GLUE-based approach has 2 and 4 measured capacity factors that fell within the 50% and 90% confidence intervals, respectively, which is good. The simulation results of the MC-based approach are found to be distorted again provided that no measured capacity factor is found to fall within the two confidence intervals.

In summary, whether using the quantitative method to calculate the number of measured capacity factors falling within the confidence intervals or conducting a direct comparison through graphs, despite the MC-based and GLUE-based approaches are both considered to be accurate enough, the simulation results of the GLUE-based approach are relatively superior to the MC-based approach. The former produces relatively better simulation results in each simulated scenario. Also, in some scenarios in which the MC-based approach estimates distorted results, the GLUE-based approach is still able to grasp the measured capacity factor. In light of this, the simulation results of the GLUE-based approach meet the actual uncertainties of the capacity factor better than the MC-based approach in the selected wind power station. This result is mainly attributed to the fact that the GLUE method has verification procedures whereas the MC method does not. In the GLUE method, the likelihood value that measures the difference between the simulated and measured data is used as the weight such that better-simulated results can be efficiently selected and emphasized. For the MC method, it may lead to results that are less consistent with the measured data, which is due to the use of the WWSD and unresolved uncertainty in the wind-to-power conversion. The adopted WWSD cannot completely represent the wind speed distributions in scenarios with more than one wind speed peak (e.g., one-year scenario). Also, the WWSD may significantly overestimate the capacity factors in weak-wind periods and slightly underestimate the capacity factors in strong-wind periods, as found by [4]. Finally, there may be unresolved uncertainties in the wind-to-power conversion by the standard performance curve. On the other hand, when the steps of simulations are lengthened, the MC method is not only time-consuming but also inefficient. In contrast, in the GLUE method, other events can directly use the previous results and effectively obtain an appropriate amount of data. Thus, the GLUE method is more effective to be implemented than the MC method.

6. Conclusions

This study combines the GLUE method with a deterministic forecasting approach to propose a novel uncertainty analysis approach for wind energy forecasting. The 10-year measured wind speed and energy output data of operational wind turbines in the Jhongtun wind farm at Penghu located in the middle of the Taiwan Strait is collected and divided into two 5-year data sets for the framework of execution and validation to demonstrate the GLUE method. To thoroughly discover the performance of the proposed uncertainty analysis approach in forecasting long-term wind energy output, there are 15 scenarios testing cases with various time periods (twelve months, the strong-wind regime, the weak-wind regime, and one year) in the framework. In the execution framework, the uncertainties in the wind speed distribution, performance curve, and capacity factors are accessed by the GLUE method. Next, the framework is validated by comparing the 50% and 90% confidence intervals of the wind energy output with the measured capacity factors in the last 5-year data. Besides, the proposed framework is compared with the results of the uncertainty analysis approach by the MC method. Based on the results, some conclusions are given as follows.

First, it is found that both the two uncertainty analysis approaches can give acceptably accurate predictions of 50% and 90% confidence intervals. Nevertheless, both 50% and 90% confidence intervals of the proposed uncertainty analysis approach are found to be more consistent with the measured capacity factors than the MC-based uncertainty analysis approach in all the considered scenarios. Specifically, the MC-based approach even has some undesired distortion in the weak-wind and one-year scenarios whereas the proposed approach still behaves well. On the other hand, concerning the performance of the proposed approach in various scenarios, the results reveal that the accuracy in April and October is relatively less than those in the other months because of relatively greater wind speed fluctuations in these two months. Overall, the proposed uncertainty analysis approach is demonstrated to provide relatively better simulation results than the MC-based approach, indicating that the GLUE method is relatively more efficient in conducting the uncertainty analysis of wind energy data and their distribution.

Data Availability

The result of the study is available from the corresponding author under request.

Conflicts of Interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Acknowledgments

The authors are grateful for the financial support of this work partially provided by the National Science and Technology Council, Taiwan, under Grant No. 112-2625-M-002-023.