Abstract
CuO is a narrow band gap semiconductor with distinct features that render it indispensable in many industrial and technological applications such as environmental friendly catalysts for organic pollutant removal, sensors, photovoltaic, solar cells, batteries, and storage media among others. Engineering of its energy gap becomes imperative and necessary for tailoring its light absorption capacity to a desired level required for a particular application. Elemental doping mechanisms with accompanied lattice distortion symmetry breaking effectively enhance the optical property of this semiconductor and serve as a major route through which material design is achieved. This work develops an extreme learning machine intelligent predictive (ELM-IP) model and stepwise regression (SWR) based model for estimating energy gap of a doped CuO semiconductor. The developed ELM-IP-Sin model which employs sine activation function performs better than the ELM-IP-Sig model (that utilizes sigmoid activation function) and SWR model with a percentage improvement of 14.15% and 50.05%, respectively, using root mean square error (RMSE) metric, while the developed ELM-IP-Sig model outperforms the SWR-based model. The developed models further investigate the dependence of CuO energy gap on iron and cobalt impurity incorporation, and the obtained results agree well with measured values. The outstanding performance of the developed models is highly meritorious in tailoring the light response capacity of CuO semiconductor for photocatalytic and optoelectronics applications at a reduced cost while the experimental stress is circumvented.
1. Introduction
Copper II oxide (CuO) is a transition metal oxide monoclinic p-type semiconductor with fascinating properties such as nontoxicity, low cost, good stability, narrow energy gap, and natural abundance of its elemental constituents [1, 2]. The attractive features exhibited by CuO semiconductor motivate its potentialities in technological and industrial applications such as sensors, solar cell, photocatalysis, supercapacitors, lithium-ion batteries, spintronics, adsorbents, and electrode of photocells among others [3, 4]. Application of this semiconductor for solar energy conversion as well as catalysts attracts significant attention due to its high selectivity and activities in reduction and oxidation reaction [3]. Photodegradation and photocatalytic activities of CuO semiconductor can be enhanced by band gap engineering through incorporation of suitable foreign materials known as dopants (such as iron and cobalt) with appropriate concentrations [5]. The incorporated dopants frequently stretch and distort the monoclinic crystal structure of CuO semiconductor and ultimately shrink or widen the energy gap of the semiconductor. Experimental synthesis of crystalline and high-quality CuO semiconductor with desired energy gap optical property is challenging which necessitates the need for computational technique that can aid material design and enhances tailoring of energy gap to a value desired for specific applications. This work develops the extreme learning machine intelligent predictive (ELM-IP) model and stepwise regression (SWR) based model for determining the energy gap of CuO semiconductor after dopants incorporation using structural lattice parameters as model descriptors.
The p-type nature of CuO semiconductor comes into being due to the presence of acceptor levels attached to vacancies in copper atoms, where charge compensation is achieved through incorporation of impurities or formation of Cu3+ with equivalent concentration [6]. Improvement in its optical properties due to dopants inclusion as well as varying experimental conditions has been reported in the literature [7–9]. Doping introduces crystal defects into CuO semiconducting crystal structure which serves as points of photon scattering. The optical transition of pure CuO semiconductor occurs directly from valence to conduction band while 3d state doping leads to the creation of surface states within the semiconductor band gap. Oxygen vacancies due to doping are responsible for the modified surface states and redshift behavior that signifies energy gap reduction [10, 11]. CuO semiconductor photo-induced behavior can be easily modified by oxygen vacancies which serve as photo-excited trap sites for charge recombination suppression [12]. Foreign material incorporation as well as varying experimental conditions potentially changes the bond length (Cu-O) and bond angles (Cu-O-Cu) of CuO semiconductor which in turn alters the lattice parameters [13]. The number of oxygen vacancies present in pure and un-doped CuO lattice is altered when Cu2+ in the CuO semiconductor is replaced by dopant atoms. This oxygen vacancy variation influences the number of Cu-O bond as well as Cu-O-Cu bond angle which subsequently affects the O-Cu-O bond angle of the material. The substitutional sites continue to accommodate the dopant atoms until thermodynamic limit is attained after which interstitial sites are occupied [13]. When dopant atoms are introduced into the CuO semiconductor, free electrons are released depending on the nature of the dopants. Electron exchange interaction occurs due to the generated free electrons which leads to electron exchange potential that shrink the semiconductor energy gap [13]. Similarly, effective oxidation states differences due to substituting Cu2+ ions with ions of higher oxidation states which leads to electron donation which induces Cu s-orbital hybridization that causes energy gap alteration. Extreme learning machine is proposed in this work to approximate nonlinear relationship between energy gap and monoclinic crystal lattice parameters of CuO semiconductor purposely to develop a robust model through which the energy gap of CuO semiconductor can be easily controlled to enhance light harvesting capacity of the semiconductor. The stepwise regression- (SWR-) based model is also developed in this work due to ease of implementation of its empirical relations for attaining quick and precise material design procedures that facilitate technological and industrial application of tailored energy gap semiconductors.
Extreme learning machine (ELM) is an extended form of artificial neural network with distinct characteristic features that translate to robust function approximation strength which promotes its wider applicability in solving many real life sciences and engineering problems [14–17]. ELM is characterized with three different layers which include input, hidden, and output layers. The algorithm trains a single hidden neural network with randomized input biases and weights while the output weights are analytically assessed [18]. It ensures higher efficiency with excellent learning accuracy [19, 20]. This approach of learning adopted by ELM algorithm gains learning speed and attains reliable generalization capacity as compared to a conventional single hidden neural network. Other merits of extreme learning machine include circumvention of overfitting and ease of convergence to global solution. These unique qualities are utilized in this work in modeling the energy gap of CuO semiconductor using monoclinic lattice distortion as model descriptors.
The stepwise regression algorithm is a regression model that systematically combines descriptors in accordance to their significance for effective prediction of the desired target [21, 22]. Its strength for best subset selection conserves computational time while model precision as well as accuracy is further strengthened or maintained [23]. The descriptors that have the highest impact and contribution to the residual sum of squares are given priority while executing addition (forward process) and deletion (backward process) procedures of model development [24–26]. This algorithm is employed in this work to determine the influence of incorporated dopants and distortions along -axis, -axis, and -axis of CuO crystal structure on the energy gap. The ease of implementation of stepwise regression generated empirical equations further contributes to its choice in the addressed CuO energy gap engineering problem.
The remaining part of the manuscript is arranged as follows: Section 2 presents the mathematical formulation of extreme learning machine intelligent predictive algorithm as well as the operational principles of stepwise regression. The computational methodology and strategies of the developed ELM and stepwise regression-based models are presented in Section 3 of the manuscript. The description, sources, and data acquisition strategies are also presented in Section 3. Section 4 presents and discusses the outcomes of the developed ELM and stepwise regression-based models while manuscript conclusions are presented in Section 5.
2. Mathematical Background of the Employed Intelligent and Stepwise Regression-Based Algorithms
Mathematical formulation of extreme learning machine is discussed in this section. The section further presents the operational description of stepwise regression algorithm.
2.1. Description of Extreme Learning Machines
Supposing neural network of a single hidden layer is to be developed for approximating nonlinear function connecting lattice parameters of number of CuO semiconductors (doped with different materials) with their corresponding band gap energy. The dataset from which patterns are to be approximated and acquired is represented by , where input lattice parameters of doped CuO semiconductor and band gap energy are, respectively, represented by and . Equation (1) expresses network function with number of hidden layer nodes. where input weights and the activation function are, respectively, represented as and . The bias and output weight of unit of hidden layer are represented as and , respectively. stands for dot product between and . The algorithm is aimed at minimizing error associated with the predicted band gap of CuO semiconductor as presented in Equation (2) in such a way that condition contained in Equation (3) regarding , , and holds.
Equation (4) presents the objective of the ELM algorithm in which and , respectively, represent hidden layer node output and the estimated band gap using the algorithm while stands for output weight.
The matrix form of Equation (4) is presented in Equation (5) while expression for output weight matrix and estimated band gap of doped CuO semiconductor are presented in Equations (6) and (7), respectively.
Parameters , , and should be obtained in order to train neural network of single hidden layer through Equation (8) in accordance to the operational principle of ELM.
Minimization of loss function expressed in Equation (9) becomes necessary with the constraint in order to effectively attain the goal of ELM approximation principle.
The algorithm uniquely determines the output matrix of the hidden layer after and are randomly obtained. Therefore, the network is conveniently transformed to a linear system while is determined through where represents the More-Penrose inverse of matrix
2.2. Stepwise Regression Algorithm
Stepwise regression is a type of multiple linear regression that uses forward-adding and backward-deleting variables to find the best-fitting combination of independent variables for dependent variable prediction [27]. When dealing with a large number of independent factors, determining the appropriate combination of these variables to predict the dependent variable is critical. Stepwise regression can efficiently locate the significant variables from a large amount of raw data based on their special statistical values, allowing the best-fitting model to be built [22]. The stepping technique starts with a model definition and then adds a variable to the preceding model in a stepped forward manner. Although the process is most commonly employed in regression analysis, the basic approach can be used in a variety of model selection situations. Forward selection is all about adding one variable at a time to the subset that has already been selected [24]. For each new variable added or eliminated from the previous model, the forward selection yields the greatest reduction in the residual sum of squares. It can go forward, backward, or both at the same time. Forward selection continues until all variables are incorporated in the model, with no termination constraint [28]. Backward stepwise variable selection begins by selecting subset models from the whole model and deleting each variable whose removal causes the residual sum of squares to grow minimally at each step and continues until only one variable is present in the subset model. The termination rule for stepwise variable selection uses both forward and backward elimination criteria. When all variables in the model meet the criteria to remain and no variables outside the model met the requirement to enter, the variable selection method is brought to stop.
2.3. Mathematical Description of the Employed Performance Measuring Parameters
The feature generalization of the developed models is evaluated using root mean square error (RMSE), correlation coefficient (CC), and mean absolute error (MAE). Equations (11), (12), and (13), respectively, depict mathematical formulation of the parameters. where is the experimental band gap energy, is the predicted band gap energy, is the mean of experimental band gap energy, is the mean of predicted band gap energy, is the error while .
3. Computational Description and Data Acquisition
The description of computational details for the developed ELM-IP-Sin, ELM-IP-Sig, and SWR models is presented in this section. Sources as well as description of dataset employed for modeling and simulation are also presented.
3.1. Description of Data Acquisition and Data Source
Modeling of energy gap of doped CuO semiconductors was carried out using fifty-seven samples of the semiconductor as extracted from the literature [8, 10, 11, 29–37]. The lattice parameters of the semiconducting samples after dopant incorporation serve as the model inputs.
Statistical assessment of the employed dataset from correlation coefficient perspective shows weak degree of linear relationship between the lattice parameters (along -axis, -axis, and -axis) and the energy gap. Correlation coefficients of 8.96% as shown in Figure 1(a), 7.08% as depicted in Figure 1(b), and 10.6% as presented in Figure 1(c) were, respectively, obtained between energy gap and -axis lattice parameter, energy gap and -axis lattice parameter, and energy gap and -axis lattice parameter. This is a clear indication that the linear model becomes inadequate for handling this kind of pattern acquisition. Hence, the need for nonlinear modeling tool becomes essential. Doping of CuO semiconductor results into absorption range blue shift exhibition which influences the semiconductor energy gap due to defect formation in the crystal lattice where photons are scattered [29]. This physically justified relationship between CuO lattice parameters and the energy gap is well captured through the developed model in this work.

(a)

(b)

(c)
3.2. Computational Strategies for the Developed Extreme Learning Machine Intelligent Model
The computational development of ELM model through which band gaps of CuO semiconductor are estimated took place within MATLAB computing environment. The distorted lattice parameters of the semiconductor which serve as the model inputs and their corresponding energy gaps were initially randomized purposely to ascertain that even distribution of data points within stages of model development is ensured. Subsequently, the dataset was partitioned into training and testing sets in the ratio of 8 : 2. With this partitioning, forty-six of data samples from the semiconductor are employed for model training and weight acquisition while eleven of samples were allotted for model assessment. Computational procedures of the developed ELM-based model are summarized as follows: (Step A)Hidden node () specification and activation function () selection: select hidden nodes from the search space with upper and lower bounds of 100 and 1, respectively. Selection of a function which serves as activation function also falls into this stage of model development. The pool of activation functions includes tribas function, sine function, radbas function, and sigmoid function, among others.(Step B)Random generation of and : generate the bias and input weights randomly by implementing initial seed approach within MATLAB computing environment.(Step C)Matrix computation of the output of hidden layer node : compute the output of the hidden layer node through implementation of Equation (5) with the inclusion of the training set of data.(Step D)Determination of output weight : calculate the output weight through Equation (10).(Step E)Model prediction strength evaluation: determine the prediction strength of the developed ELM-based model by incorporating the testing set of CuO semiconducting samples into randomly determined biases from Step B, randomized input weights from Step B, and computed output weights from Step D. Root mean square error serves as the parameter for the evaluation.(Step F)Best model search through various activation function and hidden nodes: repeat Step A to Step E for different activation functions while the number of hidden nodes spans from 1 to 100. Select the model with the lowest root mean square error for any best two activation function. Save the parameters and details of the best models for future implementation. Figure 2 presents the computational flow chat of the developed ELM-based models.

3.3. Computational Method of the Developed Stepwise Regression-Based Model
Development of stepwise regression algorithm with its implementation on CuO semiconductor energy gap estimation was conducted within MATLAB computing environment. The available dataset extracted from fifty seven doped samples of CuO semiconductor was partitioned into training and testing in the ratio of 4 : 1 while the remaining part of modeling and simulation procedures are summarized as itemized below. (Step 1)Null model initialization: this step initiates and develops the regression model that is devoid of input descriptors and contains only intercept.(Step 2)Descriptor inclusion: suppose there are numbers of descriptors for which , linear regression equation is developed for using the separated training dataset and null regression model developed in Step 1. This procedure is repeated for other values of until number of regression functions is attained.(Step 3)Model performance characterization through implementation of defined criteria: the generalization performance of each of the developed regression model, as evaluated through testing set of data with root mean square error performance metric, is computer using some defined criteria such as adjusted , akaike information, sum of squared error, and information criteria.(Step 4)Substitution of subsequent descriptor into regression model: this step constructs and modifies the regression function obtained in Step 3 through addition of another distortion along other axis. The modified regression function is further validated using testing set of data.(Step 5)Development of interaction and quadratic terms: there is a construction of interaction terms and pure quadratic of the best regression function with performance evaluation.(Step 6)Model performance evaluation using root mean square error: the developed regression equations are validated through root mean square error implementation.
4. Results and Discussion
The empirical relations obtained from extreme learning machine and stepwise regression-based models are presented and discussed in this section. The performance comparison of the developed relations is presented. Energy gap tuning effect of different dopants on CuO semiconductor is also presented in this section.
4.1. Intelligent and Stepwise Regression-Based Empirical Functions Connecting Lattice Parameters with the Energy Gap
The empirical results of the developed stepwise regression (SWR), extreme learning machine predictive model with sine activation function (ELM-IP-Sin), and extreme learning machine predictive model with sigmoid activation function (ELM-IP-Sig) are, respectively, presented in Equations (11), (12), and (13). The developed SWR model linearly combines the distortion along -axis, -axis, and -axis due to the incorporated dopants while ELM-based models incorporated nonlinearity existing between lattice parameters and the energy gaps.
It is expected that the developed SWR-based model performs lower than the ELM-based models since it has been established previously from Figure 1 that the lattice parameters are not linearly correlated with the energy gaps of the distorted CuO semiconductor. Although the developed SWR model enjoys ease of implementation as compared with the ELM-based model. However, weights that enhance easy implementation of the developed ELM model have been included in Tables 1 and 2. The optimum number of hidden nodes for the developed ELM-IP-Sin and ELM-IP-Sig models is thirty and thirty-one, respectively, as presented in Tables 1 and 2. The performance superiority of the developed intelligent predictive ELM-based models can be attributed to the uniqueness of ELM algorithm to approximate any nonlinear relationship as a result of its strong mathematical background as well as implementation of Moore Penrose inverse matrix. With the provided model weights and biases, the developed ELM models enjoy better performance and ease of implementation as compared with the SWR-based model.
4.2. Performance Evaluation of the Developed Intelligent and Stepwise Regression-Based Models
The performance of the developed extreme learning machine intelligent predictive (ELM-IP) models and stepwise regression (SWR) model is assessed through computation of root mean square error (RMSE), correlation coefficient (CC), and mean absolute error (MAE) during training and testing stages of model development. The results of the model performance are presented in Table 3. Figures 3(a)–3(c), respectively, compare the performance of developed ELM-IP-Sin, ELM-IP-Sig, and SWR models on the bases of CC, MAE, and RMSE metrics during training stage of model development while compari4sons using testing dataset are, respectively, presented in Figures 3(d)–3(f). The developed ELM-IP-Sig model performs better than the ELM-IP-Sin model during training phase with performance improvement of 1.37%, 4.06%,4 and 3.13% on the basis of CC, MAE, and RMSE, respectively, as presented in Figures 3(a)–3(c) while it outperforms the SWR model during training and testing stage. During the training phase, the developed ELM-IP-Sig outperforms the SWR model with performance superiority of 82.52%, 58.85%, and 45.41% using CC, MAE, and RMSE metrics while performance enhancements of 29.26%, 51.12%, and 41.82% were, respectively, obtained during the testing stage of model development using the same performance metrics as presented in Figures 3(d)–3(f).

(a)

(b)

(c)

(d)

(e)

(f)
The developed ELM-IP-Sin model outperforms the ELM-IP-Sig model during testing stage with performance improvement of 3.73%, 9.26%, and 14.15% using CC, MAE, and RMSE performance metrics, respectively, while it outperforms the SWR model during the training phase with performance enhancement of 82.28%, 57.11%, and 43.64% using CC, MAE, and RMSE metrics, respectively. While testing the developed ELM-IP-Sin model, the model outperforms the SWR model with performance improvement of 31.89%, 55.65%, and 50.05% using CC, MAE, and RMSE, respectively. The developed ELM-IP-Sin model has demonstrated a high level of performance superiority over the SWR and ELM-IP-Sig models during the testing stage of model development.
Table 1 also presents the results of performance superiority of each of the developed model at each stage of model development. Despite the fact that the developed ELM-IP-Sin model demonstrates superior performance over the SWR and ELM-IP-Sig models, its testing phase shows better performance as compared with the training phase with performance improvement of 4.65%, 18.38%, and 4.48% on the basis of CC, MAE, and RMSE metrics, respectively. The superiority of testing stage over training stage demonstrated by the ELM-IP-Sin model further justifies it robustness, precision, and future potentials to effectively determine the energy gap of doped CuO semiconductor with high level of accuracy.
4.3. Investigating the Effect of Iron Particle Inclusion on Energy Gap of CuO Semiconductor Using Developed Models
The effect of iron particle inclusion in the lattice structure of CuO semiconductor using the three developed models is presented in Figure 4 with a comparison to the measured energy gap [4].

The difference between ionic radius of the incorporated iron and the host copper constitutes the observed variation in the unit cell volume which translates to energy gap at different concentrations of iron particles. Increase in iron concentration enhances particle agglomeration formation due to improvement in particle small mass, surface tensional force, and surface area/volume ratio. The results of the developed ELM-IP-Sin and ELM-IP-Sig models capture and match the experimental trends excellently well while the estimates of SWR show deviation which can be attributed to inadequacy of the SWR algorithm to address nonlinear relationship exhibited by the lattice parameters descriptors and the energy gaps.
4.4. Effect of Cobalt Foreign Material Inclusion in CuO Semiconductor Matrix on Energy Gap Using Developed Models
Bang gap engineering effect of cobalt on CuO semiconductor is investigated by the developed models and presented in Figure 5. The comparison of the model estimates with the measured energy gaps [37] is also presented in the figure. Increase in the concentration of cobalt particles lowers the energy gap of the semiconductor. The observed suppression of energy gap can be attributed to the influence of crystal defects (substitutional and interstitial defects) created by the dopants which enhance photon scattering effect. The doping cobalt ions generate acceptor level (which is shallow) above valence band since atoms of cobalt act as a p-type acceptor which results into energy gap of doped CuO lower than pure undoped CuO semiconductor.

The existence of d-p strong coupling between 3d and 2p of oxygen which raises the level of oxygen 2p is also attributed to the observed variation in energy gaps of doped CuO semiconductor [37]. The estimates of ELM-IP-Sin show consistent closeness with the measured values more than that of ELM-IP-Sig model. The developed SWR model performs the least performance with deviation of its estimates from the measured values. The outstanding performance of the developed ELM-IP-Sin model justifies the potential of the developed model in adjusting and tuning energy gap of CuO semiconductor to a level desired for a particular application.
5. Conclusion
The energy gap of cupric oxide (CuO) semiconductor with incorporated dopants is modeled using the extreme learning machine intelligent predictive model (ELM-IP) with different activation function (sine and sigmoid) and stepwise regression (SWR) algorithm. The distortions in monoclinic structure of the parent CuO consequent upon the incorporated dopants, which resulted into crystal defect formation as well as photon scattering, serve as model descriptors. The developed ELM-IP-Sin model outperforms the ELM-IP-Sig and SWR models when assessed using correlation coefficient, mean absolute error, and root mean square error metrics. The developed intelligent-based and stepwise regression models investigate the influence of iron and cobalt particle incorporation in CuO semiconductor matrix on energy gap of the semiconductor and the obtained energy gap agree excellently well with the measured values. The outstanding performance of the developed ELM-IP-Sin model justifies the potential of the model in adjusting and tuning energy gap of CuO semiconductor to a level desired for a particular technological and industrial application.
Data Availability
The data employed in building the presented models are contained in the cited sources in Section 3.1 of the manuscript.
Conflicts of Interest
The author declares that they have no conflicts of interest.
Acknowledgments
The support of Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia, is acknowledged.