Abstract

In order to predict the freshness of grass carp, a novel data preprocessing method was proposed for electronic nose (E-nose) signals. The signal sequences from six sensors were selected and subsequently normalized. The direct signal sequence merging (DSSM) and reversed signal sequence merging (RSSM) modes were used for signal sequence merging. Subsequently, the genetic algorithm (GA) was used to evaluate the contribution of diverse sensors, and the merged data sequence was compressed using wavelet transform (WT). Using approximation coefficient and detail coefficient based on different scales and different signal sequence merging modes, principal component analysis (PCA) discriminated successfully storage time of chilled fish fillet. The PCA plots clearly demonstrated that all extracted feature data fully retain the signal characters. The partial least squares (PLS) and artificial neural network (ANN) models were used to establish prediction models for the freshness of grass carp during storage. The DSSM-ANN- and DSSM-PLS- models were chosen as the TVB-N content prediction models, while the DSSM-ANN- and RSSM-PLS- models were selected as the K value prediction models. The values of these models are higher than 0.9, and they have a good coefficient of determination. The results of this study suggest that it using E-nose signals to predict TVB-N content and K value is an effective method for assessing the freshness of grass carp during storage.

1. Introduction

A number of degradative reactions begin in the fish flesh after the death of a fish. These reactions are caused by chemical, biochemical, and microbial metabolic activity. Various components decompose and new compounds form during fish spoilage, resulting in changes to the perceived quality (odor, flavor, and texture) of the fish meat [1].

Freshness is very important when assessing the quality of fish and fishery products [2]. It can be described by various indicators based on biochemical changes after slaughter, including the K value [3], total volatile basic nitrogen (TVB-N) [4, 5], and thiobarbituric acid contents [6]. The K value, which measures the extent of the breakdown of adenosine triphosphate (ATP) in fish flesh [7], can be obtained by measuring inosine (HxR), hypoxanthine (Hx), ATP, adenosine diphosphate (ADP), adenosine monophosphate (AMP), and inosinic acid (IMP) with high-performance liquid chromatography (HPLC). A range of methods have been used to measure TVB-N since the measurement method was first described in Conway and Byrne [8]. Generally, the meat sample or an extract of meat is alkalized; the volatile bases are collected and titrated with acid, and then the total bases are calculated in accordance with the amount of acid. Moreover, chromatographic techniques such as GC-MS, SPME-GC-MS, and HPLC have produced convincing results and proved to be suitable for the determination of TVB-N [911]. However, these conventional methods are cumbersome, expensive, time-consuming, and destructive. Therefore, developing a convenient, cost-effective, rapid, sensitive, and reliable method for freshness determination is necessary.

In recent years, many studies have focused on developing rapid and nondestructive methods to evaluate the freshness of food. Among them, the electric nose (E-nose) is a system composed of a series of chemical sensors with specific response and appropriate pattern recognition algorithms, which can rapidly detect and recognize single or mixed complex gases while also providing comprehensive odor information for the tested sample [12]. Li et al. [4] developed simple mathematical models relating E-nose signals to variations in total viable count (TVC) and TVB-N for packaged pork during refrigerated storage. An input-modified convolution neural network combined with E-nose and hyperspectral imaging was utilized to evaluate TVB-N of mutton [5]. Furthermore, the prediction of K value with E-nose has also been reported in recent literature. Li et al. [3] predicted the freshness of horse mackerel by utilizing the E-nose, electronic tongue (E-tongue), and colorimeter combined with a data fusion strategy and different pattern recognition algorithms. The square correlation coefficient of the test set for the prediction model of K value was 0.936. Previous studies have successfully showcased the efficacy of E-nose in predicting the freshness of food. However, a significant portion of these studies focuses on selecting appropriate pattern recognition algorithms, especially deep learning algorithms, in recent years [13], to improve the accuracy of E-nose prediction for food freshness. For the E-nose, in addition to the pattern recognition algorithm, data processing plays an important role in improving the prediction accuracy.

The mean value or maximum value is typically used as a feature signal in the analysis of E-nose data [14, 15]. However, one sensor can only provide one kind of data, which may not necessarily correspond to characteristic responses. Consequently, acquiring more data necessitates the use of additional sensors, but increasing the number of sensors leads to more complex and expensive operations. Therefore, improving the data processing method is necessary to obtain majority of the response data from the E-nose and obtain higher prediction accuracy with fewer sensors. Li et al. [16] used discrete wavelet transform (WT) to extract important features from dynamic sensor responses and then evaluated egg storage time and yolk index with satisfactory predictions. This inspired us to think about a new idea for predicting the freshness of grass carp (Ctenopharyngodon idella) using an E-nose. The WT method can remove high frequency noise and compress data. Hence, the features can be selected for determining multicomponent samples by WT, and different wavelets give rise to different results. As a minute tool for time-frequency dynamic responses, WT has been applied in various fields [1719]. Here, WT is tried as a new data processing method to extract feature signal from E-nose.

This paper presents a feature extraction method for the purpose of evaluating freshness of grass carp during 4°C storage using an E-nose effectively. Specifically, the measured signal sequences of the E-nose are merged and the features are extracted to obtain more useful information from the merged signal sequence as a feature signal by using the WT method. Then, combined with chemometric methods, both the TVB-N content and the K value can be predicted quantitatively.

2. Materials and Methods

2.1. Raw Material

Twenty grass carps were obtained from a local supermarket near the laboratory in Nanchang, China, and each fish weighed about 2–2.5 kg. These grass carps were killed by a sharp blow to the head, and then fish muscles were dissected carefully from the dorsal and ventral regions of the lateral line and weighed between 50 g and 55 g. Four fillets can be obtained from each grass carp. Immediately, these fillets were stored inside a plastic container at 4°C prior to determination.

Eight fillets were randomly taken out from the container on days 0, 2, 4, 6, 8, 10, 12, 14, and 16, respectively. Each fillet was homogenized for 1 min and then divided into three parts. One part of the samples was distributed for E-nose sample preparation, another part was used for K value measurement, and the remaining part was utilized for TVB-N content measurement.

2.2. TVB-N Content Measurements

The TVB-N content in the fillet was measured according to GB/T 5009.44 (2003). Each fillet sample was ground individually, and then 10 g of grounded sample was taken into a conical flask and impregnated with 100 mL distilled water for 30 min; the conical flask was often shaken before filtration to ensure uniform dispersion of the sample within the solution. Subsequently, 5 mL of filtrate was taken and blended with 5 mL of 10 g/L magnesia (MgO), distilled by a Kjeldahl distillation unit for 5 min. The distillate was absorbed with 10 mL of 20 g/L boric acid solution and titrated with 0.01 M of hydrochloric acid solution. The TVB-N content was calculated as shown in equation (1) by GB/T 5009.44 [20]:

In equation (1), X is the TVB-N content (mg/100 g), is the titration volume for the tested sample (mL), is the titration volume for the blank (mL), C is the concentration of HCl (mol/L), and m is the weight of ground fillet (g).

2.3. Determination of K Value with HPLC

The method for measuring the ATP-related compounds with HPLC was modified and validated based on the reference described by Barat et al. [21] and published in a Chinese journal [22].

Two grams of minced muscle were placed into a centrifuge tube and oscillated under vortex movement with 10 mL of 10% perchloric acid for 2 min and then centrifuged at 10,000 g under cold conditions (4°C) for 10 min. The supernatant was taken out, and the sediment was treated with 5.0 mL of 10% perchloric acid repeatedly. Repeat the operation once and put all supernatants together. The pH of the supernatant was adjusted to 6.5 with 1 mol/L sodium hydroxide solution. It was made up to/diluted to 50 mL with deionized water and filtered through a 0.45 mm membrane filter for further analysis.

HPLC analysis was performed by the Agilent 1260 high-performance liquid chromatography equipped with an UV-vis absorbance detector (Agilent Technologies, USA). ATP and its related compounds were separated in a Waters C18 column (4.6 mm i.d. ×150 mm, 5 μm). The mobile phase was triethylamine phosphate solution (3.5 mL of phosphoric acid solution and 7.2 mL of triethylamine solution was made up to 1,000 mL with deionized water, and pH was adjusted to 6.5 with triethylamine): methanol = 95 : 5. A sample (20 μL) was injected with a flow rate at 1 mL/min, and peaks were detected at 254 nm.

The amounts of ATP and its related compounds were determined and calculated based on the extra-calibration curves of standard ATP, ADP, AMP, IMP, HxR, and Hx. K value was defined by the following equation [7]:

2.4. Analysis of Aroma by Electronic Nose

A sensor array system (Gemini, Alpha M.O.S., France) with 6 metal oxide sensors was used to acquire the signal of aroma from minced muscle. The sensors were sensor T70/2 (Sensing species: toluene, xylene, carbon monoxide), sensor PA/2 (Sensing species: ethanol, ammonia, amine compounds), sensor P30/1 (Sensing species: hydrocarbons, ammonia, ethanol), sensor P40/2 (Sensing species: chlorine), sensor LY2/GH (Sensing species: ammonia, amine compounds), and sensor LY2/gCTL (Sensing species: hydrogen sulfide).

2 g minced fillet was placed in a 20 mL vial (Alltech, USA). The vials were sealed with a 20 mm silicone/PTFE magnetic crimp-top cap which was obtained from CNW Technologies GmbH (Dusseldorf, Germany). The vials were equilibrated for 30 min at 30°C. In the previous experiment, the incubation condition has been optimized. The response of the E-nose increased with the incubation temperature and time. The incubation temperature was selected to be 30°C because it is a normal temperature. The response increased quickly with time at 30°C and became slow after 30 min. So, this incubation condition was chosen. Then 1,000 μL of headspace gas was injected into the sensor chamber, and it was measured for 200 s, with a 0.5 s sampling interval. Dried air (purity 99.999%) was employed as the carrier gas, and the flow rate was 600 mL/min. After sample analysis, the system was purged for 300 s.

2.5. E-Nose Signal Pretreatment

The 101 data points (0–50 s) extracted from each sensor of E-nose are selected, and they are normalized between 0 and 1, respectively, by the following equation [23]:where R: the normalized response; : the response value at a sensor of E-nose; : the minimum response value at a sensor of E-nose; and : the maximum response value at a sensor of E-nose.

The merging signal sequence is compressed by WT before analyzing data with statistical techniques. The different Daubechies (dbN) wavelets and decomposition levels are used, and the optimum dbN mother wavelet and decomposition level are determined for the wavelet transform of E-nose signal.

2.6. Data Analysis

The genetic algorithm (GA) was performed to evaluate the influence of the diverse sensors on prediction of the K value. GA is a custom program written by Leardi, executed on Matlab [24]. The evaluation per run of the GA was set to 500, with 300 runs, while the remaining parameters were kept at their default values.

Compressed data were analyzed by partial least squares (PLS), artificial neural network (ANN), and principal component analysis (PCA). PLS was performed by means of the software SIMCA 14.1 (Umetrics, Umea, Switzerland); ANN and PCA were performed on Matlab 7.0 (MathWorks, Natick, MA, USA). PCA was applied to identify the different storage times of chilled grass carp fillet; ANN and PLS were employed to build predictive models for K value and TVB-N content with full cross-validation (leave-one-out cross-validation) approach. As for the performance of the established models, the evaluation indicator systems were mainly related to the square correlation coefficient () and root mean square error of cross-validation (RMSECV), respectively.

3. Results and Discussion

3.1. TVB-N Content and K Value Changes of Grass Carp during Storage

Figure 1 presents the changes of TVB-N content and K value of grass carp fillet during storage at 4°C. The TVB-N content of grass carp fillet increases as storage time increases as shown in Figure 1(a). The initial TVB-N content is 15.4 mg/100 g, and the final TVB-N content is 38.2 mg/100 g when the flesh decayed in the end. The TVB-N content on day 6 is 20.9 mg/100 g, which indicates that the fillet is beginning to decay. In addition, the error bars reveal that the relative standard deviation (RSD) of measured TVB-N content of each storage group is less than 5%.

The concept of the K value as a freshness indicator was introduced by Saito et al. [7]. The larger the K value is, the lower the freshness of the fish is. Generally, the K value of live fish is between 0 and 10, while K value of fresh fish is between 15 and 35. The fish flesh is not regarded as fresh when the K value is greater than 50 [6]. Figure 1 shows an almost linear increase in the K value of grass carp flesh during storage at 4°C. The initial K value is 7.72, and the K value is 91.81 when the flesh completely decayed in the end. However, the increase of the K value is significant from the sixth day to the eighth day of storage, and the K value exceeds the fresh threshold (50) after the sixth storage day. The RSD of measured K value of each storage group is less than 10%.

3.2. E-Nose Signal

The signals extracted from E-nose are shown in Figure 2(a). Each curve represents a different sensor response. They are denoted as T70/2, PA/2, P30/1, P40/2, LY2/Gh, and LY2/gCTL, respectively, according to their sensors. Different sensors have varying sensitivities to gas, resulting in differing responses from each electronic nose sensor, with some exhibiting higher responses than others. Figure 2(a) shows that signals from sensors (P30/1, P40/2, LY2/Gh, and LY2/gCTL) change little compared with the signal from other sensors. Hence, the enlarged signals are presented in Figures 2(b)2(d).

Figure 2 shows that the responses of the six sensors vary a lot with time in the beginning of curve and stabilize gradually until the end of the measurement. Each curve contains much information. In most studies, each E-nose sensor only adopts one data point (mean value or max value of signal), and much useful information is wasted. In this paper, full utilization of information included in the signals of each E-nose sensor is attempted. A larger signal variability is observed at the initial period of the curve, indicating that this range may account for the majority of the original information. Therefore, a selection of 101 data points from the first 50 seconds signal of each sensor was selected as tentative character responses. However, it is crucial to extract feature signal sequences from these selected data points.

3.3. Evaluation of Diverse Sensors with GA

The GA was used to evaluate the contribution of diverse sensors to prediction. GA can simulate natural selection and evolution and find the optimal solution globally by searching through the entire solution space.

In dataset of GA, there is a one-to-one correlation between one direct signal sequence merging and one measured K value. After 300 runs, GA provided a total of 88 feature data points. The frequency of selection is shown in Figure 3. It can be observed that data points 304–308 and 345 have a high frequency of selection, indicating that data points acquired by P40/2 are more relevant to predicting the K value. The remaining feature data points, whose selected frequency is nearly equal, are distributed on diverse sensors. So, the data points from all six sensors must be used.

3.4. Merging E-Nose Signal Sequence

In Li et al. [16], WT was used to extract characteristic information from the sensor response of E-nose for evaluating egg storage time and yolk index. They decompose the original signal from each sensor, respectively, and obtain a series of coefficients sets. Then, new corresponding signal was reconstructed from the coefficient sets for qualitative and quantitative analysis.

Similarly, WT was adopted in this paper. However, Figure 2 shows that signal from different sensors differs significantly. Thus, each selected data sequence from different sensor has to be normalized according to equation (3) unlike the WT. These selected data sequences are denoted as T70/2, PA/2, P30/1, P40/2, LY2/Gh, and LY2/gCTL, respectively, according to their sensors. Two signal sequence merging modes are compared: the first mode is that normalized data sequences are merged from end to head as T70/2-PA/2-P30/1-P40/2-LY2/Gh-LY2/gCTL, shown in the top subgraph of Figure 4. This mode is referred to as direct signal sequence merging (DSSM). The beginning of each data sequence has a larger value compared to the end, so a sharp increase appears in the joining of two data sequences. The second mode is to reverse the normalized data sequences from the 3 sensors (PA/2, P40/2, LY2/gCTL), followed by combining them with the other three data arrays, with the linking sequence same as the DSSM mode. This mode is called the reversed signal sequence merging (RSSM) mode. The top subgraph of Figure 5 shows that the junctions of two data sequences of RSSM mode are smoother compared with the DSSM mode. These two signal sequence merging modes are adopted in the following data analysis in order to evaluate the effect of two merging modes on the prediction of K value and TVB-N content.

3.5. Extracting Feature Signal with WT

WT was developed for the analysis of a merged signal sequence with Matlab 7.0. The data compression process is used to reduce data size. Apart from data compression, WT is also expected to minimize noise and other unwanted contents present in the signal [25].

The merged signal sequence could be decomposed into two sets of coefficients with the best mother wavelet and scale. The coefficients are obtained by convolving the merged signal sequence with the low-pass filter for approximation and with the high-pass filter for detail. The coefficients contain a series of set and a series of set where A represents the approximation coefficients, D represents the detail coefficient, and j represents the level of decomposition. The set and set retain the low-frequency and high-frequency content of the signal, respectively. The different scales of the approximation coefficients and detail coefficient are shown in Figure 4 (DSSM mode) and Figure 5 (RSSM mode). There are 606 data points in , and the number of data points reduces by half when the decomposition level increases by 1. Finally, the data size is reduced by WT, with only 19 data points in the and sequence.

In this study, the optimization of WT parameters is performed using different signal sequence merging modes, approximation coefficients, and detail coefficients of different scales. Meanwhile, several mother wavelets are tried and evaluated. The first-order wavelet transform of the Daubechies’ family (db1) is selected as mother wavelet by comparing and computing different wavelet bases.

3.6. PCA

PCA has been widely applied in the fields of pattern recognition and multivariate calibration. The newly generative variables are utilized to represent the original ones after processing by PCA, which effectively simplifies calculation. Samples can be grouped according to variance in principal component scores.

Figure 6 displays the two-dimensional PCA results of DSSM and RSSM modes for sets and sets based on WT. The arrangement of Figure 6 corresponds to Figures 4 and 5: PCA of is based on merged signal sequence, and PCA of each or is based on the or dataset obtained with WT, respectively.

The PCA plots of D-, D-, D-, D-, D-, R-, R-, R-, R-, and R- exhibit similarities. The sample groups are clearly separated by PC1 within the first 8 days, decreasing along PC1. The samples from day 0 gather closely, while those from day 2 are more dispersed. The sample groups from day 6 to day 16 increase along PC2. Most sample groups are distinguished clearly except for a partial overlap between day 6 and day 10. The first two principal components (PC1 and PC2) explain 79.44% to 83.94% of the data variance. PCA plots of D-, D-, D-, R-, R-, R-, R-, R-, and R- also group most samples effectively, albeit with different arrangement. However, the D- and R- models showed no distinct grouping. The predictive effect of the K value and TVB-N content of grass carp flesh is compared by using the sets and sets based on WT with the ANN and PLS.

3.7. Prediction of K Value and TVB-N Content

The original data and all feature data from WT using approximation coefficients and detail coefficients of different scales are compared while doing analysis using PLS and ANN. PLS is positioned as a statistical technique for prediction renowned for their effectiveness in estimating structural equation models with latent variables [26]. ANN is a powerful, efficient, and nonlinear method with pattern recognition abilities, which makes it perfectly suited for the extraction of feature information from large data, especially due to complex biological, environmental, and instrumental variations [27]. Compared with other techniques, such as covariance and regression, ANN and PLS have better predictive ability and are frequently used in the prediction of texture, freshness, and safety of food [2729]. For example, Basile et al. [27] employed nondestructive NIR spectroscopy and machine learning techniques to predict the texture parameters and total soluble solids content of grape. The results indicated that the multivariate models, created by constructing ANN and applying PLS regressions, displayed improved predictive capabilities following the removal of uninformative spectral ranges.

Several architectures of the network were investigated to predict the K value and TVB-N content. ANNs used in this paper are backpropagation (BP) and radial basis (RB) neural networks. The RB training algorithm is faster than BP for practical problems. The only optimized parameter is SPREAD. Despite various modifications to SPREDA, the predictive performance for both K value and TVB-N content remains subpar. It is proven that the RB model is not suitable in this study.

In this paper, the BP-ANN adopts the steepest gradient descent backpropagation training algorithm which updates the network weights and biases along the direction of the negative of the gradient. “Squashing” functions such as sigmoid transfer functions, that compress an infinite input range into a finite output range, are usually used in the hidden layers. If the input values are large, the gradient can have a very small magnitude and therefore cause small changes in the weights and biases, even though the weights and biases are far from their optimal values.

The PLS uses X (measured K value or TVB-N content) to construct a model of Y (predicted K value or TVB-N content), where the objective is to predict the latter from the former for new samples in the prediction set.

The two models are further evaluated with full cross-validation using a “leave-one-out” technique. RMSECV and were adopted to evaluate the prediction ability of K value and TVB-N content. The small RMSECV means the prediction model is better. As for , the closer it is to 1, the better the model is. A model performs well when the value of is in the range of 0.82–0.90; the model performs inaccurately while the value of is lower than 0.82; and the value of higher than 0.90 shows excellent performance of the model.

3.7.1. TVB-N Content

The prediction results of PLS and ANN models based on signal sequence merging (DSSM and RSSM modes) and WT for TVB-N content are shown in Table 1. It is interesting that the prediction results with sets are better in ANN models, in which the best model is the DSSM- model ( is 0.9683, RMSECV is 1.6582 mg/100 g); on the contrary, the prediction results with sets are better in PLS models, in which the best model is the DSSM- model ( is 0.9685, RMSECV is 1.6521 mg/100 g). The best ANN and PLS models are based on the DSSM mode. Most values of are higher than 0.90, except for and models built with ANN. PLS, in addition to the RSSM- model, demonstrated strong performance in predicting TVB-N content, exceeding an of 0.9. Furthermore, no significant differences were observed between DSSM and RSSM models in predicting TVB-N content, except for a few models.

According to Table 1, the ANN model was built using the sets, while the PLS model was built using the sets. Figures 7(a)7(d) visualize linear relationships between the predicted and measured TVB-N content with DSSM-ANN-, DSSM-ANN-, DSSM-PLS-, and DSSM-PLS- models, respectively. The four models have a good coefficient of determination. Although R2 of the DSSM-ANN-A3 model is the highest in the ANN model, the R2 of DSSM-ANN-A5 model is still higher than 0.90 (R2 is 0.9474, RMSECV is 2.1423 mg/100 g). Thus, it can be inferred that the DSSM-ANN-A5 model retains the original signal features, with a dataset size of 19 data points significantly less than the 76 data points in the DSSM-ANN-A3 model. The gradient descent training algorithm in the ANN model was too slow for practical problems. It cost 8 h to complete a performance when using merged signal sequence with 606 data points, and the run time decreased with fewer data numbers. It only cost 31 and 5 min to complete a performance when using DSSM-ANN- and DSSM-ANN- models, respectively. Although the running time of PLS model based on SIMCA is almost not affected by the number of data points, compared with the DSSM- model, the DSSM- model can better distinguish the storage time of chilled fish fillets in PCA plots, and the of DSSM-PLS- model still exceeds 0.9 ( is 0.9606, RMSECV is 1.8437). Finally, DSSM-ANN- and DSSM-PLS- models are selected as prediction models for measuring TVB-N content.

3.7.2. K Value

The prediction results of PLS and ANN models based on signal sequence merging (DSSM and RSSM modes) and WT for K value are shown in Table 2. PLS models consistently demonstrate highly beneficial prediction results, with all exceeding 0.9 and no influence observed on the due to varying coefficient sets or signal sequence merging modes. Furthermore, the best model is the RSSM- model ( is 0.9898, RMSECV is 4.7357). The K value prediction models with Aj sets, as established by ANN, perform well except the A5 model, which exhibits a low R2 possibly due to the limited number of data points. On the other hand, excellent performance is exhibited by the K value prediction model with sets in ANN models, except for . In this case, the best model is the DSSM- model ( is 0.9731, RMSECV is 7.6938). Compared with set, the K value prediction result with sets in ANN model has better performance. Different from the TVB-N content prediction model, the performance of the K value prediction model is not significantly different between the DSSM and RSSM modes.

Figures 7(e)7(h) visualize linear relationships between the predicted and measured K value with DSSM-ANN-, DSSM-ANN-, RSSM-PLS-, and RSSM-PLS- models, respectively. Figures 7(d) and 7(e) indicate that the predicted K value is much greater than the measured K value on day 6, and greater deviations occur between predicted and measured K values on day 8. The reason is that the K value increases swiftly from day 6 (mean K value: 21.38) to day 8 (mean K value: 69.49) as shown in Figure 1(b). Parts of the samples may not be analyzed with HPLC and E-nose meantime. Measurement errors caused by analyzing time will be great. The DSSM-ANN- model, with fewer data points and great than 0.9 ( is 0.9503, RMSECV is 10.4359), is selected as the K value prediction model. On the other hand, compared to the RSSM-PLS- model, the RSSM-PLS- model more effectively differentiates the storage time of chilled fish fillet in PCA plots, while still maintaining an greater than 0.9 ( = 0.9897, RMSECV = 4.7495). Finally, DSSM-ANN- and RSSM-PLS- models are selected as prediction models for measuring K value.

In a similar work conducted by Huang et al. [14], the TVB-N content of pork meat was measured by integrating near infrared spectroscopy (NIRS), computer vision (CV), and E-nose techniques. PCA was employed to achieve data fusion, and the prediction model for TVB-N content was built using BP-ANN. The results revealed the outstanding performance of the data fusion model of NIRS, CV, and E-nose (RMSECV = 2.73 mg/100 g,  = 0.9527). However, this method of data fusion is complex, expensive, time-consuming, and not suitable for rapid nondestructive testing requirements. Moreover, it is worth noting that the maximum response value response of each sensor was extracted as the characteristic variable, resulting in data waste. Consequently, the prediction performance of the E-nose model based on BP-ANN was found to be poor (RMSECV = 5.97 mg/100 g,  = 0.6495). Conversely, in this study, only an E-nose was used to measure chilled fish fillet, and an innovative electronic nose signal data preprocessing method was employed, which ultimately established excellent prediction models for the freshness of grass carp during storage.

4. Conclusions

In order to predict the freshness of chilled grass carp flesh, a signal pretreatment method was developed based on two types of signal sequence merging modes, with wavelet transform applied. The PCA analytical results reveal that the different scales and two kinds of signal sequence merging modes of approximation coefficients and detail coefficients can be used to distinguish the grass carp fillets of different storage time. In addition, successful utilization of PLS and ANN was achieved to build prediction model for measuring TVB-N content and K value of grass carp fillet. The DSSM-ANN- and DSSM-PLS- models are finally chosen as the TVB-N content prediction models, while the DSSM-ANN- and RSSM-PLS- models are finally selected as the K value prediction models. Even though the number of data points is reduced to 19 ( and ) from 606 (), they still show excellent performance of the model. There was no significant difference between the DSSM and RSSM models in the results of the established prediction model. This study proves that WT can condense and extract feature E-nose signal effectively. Not only can this E-nose signal preprocessing method be used to predict TVB-N content and K value, but it also has the potential capacity for other rapid determination with E-nose.

Data Availability

The data supporting the findings of this study are available from the corresponding authors upon reasonable request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

Guoqiang Zhao was responsible for data curation, formal analysis, software, visualization, original draft preparation, and review and editing. Yuanyuan Chen was responsible for investigation, software, resources, and visualization. Mei Xie and Yihong Tan were responsible for investigation, resources, and visualization. Li Zhao and Yong Jiang were responsible for conceptualization, supervision, project administration, review and editing, and funding acquisition.

Acknowledgments

This study was supported financially by the Earmarked Fund for Jiangxi Agriculture Research System (Grant No. JXARS-[2023]-03) and Open Project of Aquatic Product Processing and Engineering Research Center of Safety Control of Jiangxi (No. KFJJ2301).