Abstract

Traditional denoising algorithms cannot effectively deal with underwater images that exhibit varying degrees of blur and color deviation. In underwater operations in particular, the captured images are unclear, which makes it difficult for operators to carry out their tasks. To address this problem, this paper proposes an underwater image enhancement algorithm that combines bright-channel color compensation and fusion with a generative adversarial network (GAN). First, the original image is color-compensated using the bright channel to obtain a color-compensated image; then, adaptive contrast stretching is applied to the color-compensated image to obtain a clear, high-contrast image. Experiments show that on marine landscape images the PSNR reaches 21.9329 and the SSIM reaches 0.7329, which can provide useful support for the field of underwater image enhancement.

1. Introduction

Underwater images are widely used in the marine environment, industry, and military fields, but their quality is degraded by the effects of the underwater imaging environment. This degraded quality often affects the accuracy of applications that rely on underwater image recognition and understanding [1]. With the decrease of available resources on land, the development and utilization of marine resources has become an urgent issue for human beings. In recent years, underwater robots have been widely used in marine resource exploration and other areas. Due to the complex underwater environment, the images captured by underwater robots are often severely degraded, mainly in the form of blur, low contrast, and color deviation [2, 3]. Underwater enhancement methods include bright-channel color compensation and fusion and multiscale Retinex. These methods use objective measures to enhance the quality of underwater images without considering the physical process of underwater imaging; they are therefore not suitable for underwater scenes with different physical characteristics and may introduce more obvious noise during enhancement [4–6]. Moreover, because the attenuation coefficients of different wavelengths of light underwater differ, the dark channel prior (DCP) algorithm cannot achieve the expected results when applied directly to underwater image recovery. Underwater image recovery methods, by contrast, rely on an underwater imaging model: they consider the influence of the optical parameters of the water body, the camera parameters, the point spread function of the water, and the scene distance on underwater imaging, and finally obtain a clear underwater image by solving the model inversely [7].

The nonphysical model-based methods achieve image enhancement by adjusting the image pixel values to improve visual perception. In recent years, the pyramid attention mechanism has been successfully applied in the field of semantic segmentation [8, 9], where the combination of a pyramid structure and an attention mechanism can capture richer high-level features and better learn image details, thus effectively improving model performance [10–13].

This paper proposes an underwater image enhancement algorithm based on bright-channel color compensation and fusion together with a generative adversarial network (GAN). The bright channel is used to perform color compensation on the original image to obtain a color-compensated image; adaptive contrast stretching is then performed on the color-compensated image to obtain a clear, high-contrast image.

2. Related Work

Underwater images are widely used in the marine environment, industry, and military fields, but they generally suffer from poor clarity, low contrast, and severe noise due to underwater light absorption, scattering, and suspended particles [14]. The method in [15] used image blurring and light absorption to estimate the transmittance map and improved the estimation of the background light to optimize the recovery effect. However, most of these underwater recovery methods consider only the estimation of the transmittance map and the background light, ignoring the influence of the large amount of noise and of the edges in the underwater image on the recovery results [16, 17]; they may therefore aggravate the noise of the image and even destroy its original edge information [18, 19].

For example, in [20–23], a wavelength-dependent compensation algorithm was combined with an image-defogging algorithm to recover underwater images, which can effectively eliminate the blue-green hue of underwater images and the effect of artificial light sources. In [24], a total variational Retinex (TV-R) model based on the TV regularization term was proposed. In [25], a new fractional-order variational framework was proposed by changing the order of the regularization term from integer to fractional while introducing nonlocal operators.

3. Algorithms in This Paper

3.1. Variational Recovery Model Based on Laplace Operator

The underwater optical imaging model under natural illumination conditions can be expressed as

$$I^{c}(x) = J^{c}(x)\,t^{c}(x) + B^{c}\left(1 - t^{c}(x)\right), \quad c \in \{r, g, b\}, \qquad (1)$$

where $I^{c}(x)$ is the acquired underwater image; $t^{c}(x)$ is the transmittance map of each channel; $J^{c}(x)$ is the undegraded underwater image; and $B^{c}$ is the ambient background light. Following the variational model for foggy-sky image recovery, Equation (1) can be rearranged as $I^{c}(x) - B^{c} = t^{c}(x)\left(J^{c}(x) - B^{c}\right)$. Because light propagating in water attenuates at different rates in the red, green, and blue channels (red attenuates the most, with blue and green next), unlike the imaging process of foggy-sky images, we add a constraint term to correct the color distortion of underwater images and thus establish the variational energy equation for underwater image recovery, Equation (2).
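To make the model concrete, the following is a minimal NumPy sketch of Equation (1) and its direct inversion; the function names and the clamping threshold are our own illustrative choices, not from the paper.

```python
import numpy as np

def apply_imaging_model(J, t, B):
    """Forward underwater imaging model of Eq. (1):
    I^c(x) = J^c(x) * t^c(x) + B^c * (1 - t^c(x)).

    J: undegraded image, H x W x 3, float in [0, 1]
    t: per-channel transmittance map, H x W x 3, float in (0, 1]
    B: ambient background light, length-3 vector
    """
    B = np.asarray(B).reshape(1, 1, 3)
    return J * t + B * (1.0 - t)

def invert_imaging_model(I, t, B, t_min=0.1):
    """Inverse solution for the undegraded image J from Eq. (1),
    clamping t from below to avoid amplifying noise where the
    transmittance is tiny."""
    B = np.asarray(B).reshape(1, 1, 3)
    return (I - B) / np.maximum(t, t_min) + B
```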

3.2. Estimation of Background Light and Transmittance Map

The accuracy of the background light estimation directly affects the results of underwater image recovery. The adaptive red channel prior (ARC) algorithm considers the relationship between the attenuation coefficients of the three underwater channels, describes the high-intensity pixels of the underwater image in terms of saturation, and estimates the background light from the green, blue, and inverted red channels, where the saturation and the red channel map are defined as

$$S(x) = \frac{\max_{c} I^{c}(x) - \min_{c} I^{c}(x)}{\max_{c} I^{c}(x)}, \qquad (3)$$

$$I_{\mathrm{red}}(x) = \min_{y \in \Omega(x)} \min\left(1 - I^{r}(y),\; I^{g}(y),\; I^{b}(y)\right), \qquad (4)$$

where $\Omega(x)$ is a local patch centered at $x$.
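As an illustration, here is a short NumPy/SciPy sketch of these two quantities, assuming the standard formulation of the red channel prior; the patch size and function names are our own.

```python
import numpy as np
from scipy.ndimage import minimum_filter

def saturation_map(I):
    """Per-pixel saturation 1 - min(R,G,B)/max(R,G,B), as in Eq. (3);
    I is an H x W x 3 float image in [0, 1]."""
    mx = I.max(axis=2)
    mn = I.min(axis=2)
    return 1.0 - mn / np.maximum(mx, 1e-6)

def red_channel_map(I, patch=15):
    """Red channel map of Eq. (4): local minimum over a patch of
    (1 - R, G, B), i.e., a dark channel with the red channel inverted
    to reflect its stronger attenuation underwater."""
    inv = np.stack([1.0 - I[..., 0], I[..., 1], I[..., 2]], axis=2)
    return minimum_filter(inv.min(axis=2), size=patch)
```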

To ensure the accuracy of the background light estimation, the red channel map of the original image is first calculated according to Equation (4). All pixel values of the map are sorted from largest to smallest, and the coordinates of the points corresponding to the top 0.1% of values are recorded; the corresponding pixel values are then looked up at those coordinates in each of the three channels, and the average of these pixels is used as the background light value of each channel:

$$B^{c} = \frac{1}{|P|} \sum_{x \in P} I^{c}(x), \quad c \in \{r, g, b\},$$

where $P$ is the set of coordinates of the top 0.1% of pixels.
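A minimal sketch of this selection-and-averaging step, assuming the red channel map from the previous snippet:

```python
import numpy as np

def estimate_background_light(I, red_map, top_fraction=0.001):
    """Average the original image's pixels, per channel, at the coordinates
    of the brightest 0.1% of values in the red channel map (Eq. (4))."""
    h, w, _ = I.shape
    n = max(1, int(h * w * top_fraction))
    idx = np.argsort(red_map.ravel())[::-1][:n]   # sorted largest to smallest
    ys, xs = np.unravel_index(idx, (h, w))
    return I[ys, xs, :].mean(axis=0)              # B^c for c in {r, g, b}
```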

Since information loss occurs when solving the underwater imaging model, the amount of information lost is inversely proportional to the transmittance, and the attenuation coefficient of the red channel is the largest, the loss of information can be reduced by estimating the transmittance map through the red channel prior. Based on the red channel prior, the transmittance map of the red channel is obtained as

$$t^{r}(x) = 1 - \min_{y \in \Omega(x)} \min\left(\frac{1 - I^{r}(y)}{1 - B^{r}},\; \frac{I^{g}(y)}{B^{g}},\; \frac{I^{b}(y)}{B^{b}}\right). \qquad (7)$$

The relationship between the attenuation coefficient and the wavelength for the transmittance maps of the different channels of an underwater image can be expressed as

$$\frac{\beta^{c}}{\beta^{r}} = \frac{B^{r}\left(-0.00113\,\lambda^{c} + 1.62517\right)}{B^{c}\left(-0.00113\,\lambda^{r} + 1.62517\right)}, \quad c \in \{g, b\},$$

where $\beta^{c}$ is the attenuation coefficient and $\lambda^{c}$ the wavelength of channel $c$.

Based on the estimated transmittance map of the red channel and the attenuation coefficient ratios, the transmittance maps of the green and blue channels are reestimated as

$$t^{g}(x) = \left(t^{r}(x)\right)^{\beta^{g}/\beta^{r}}, \qquad t^{b}(x) = \left(t^{r}(x)\right)^{\beta^{b}/\beta^{r}}.$$
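The two steps together can be sketched as follows, assuming the background light components lie strictly inside (0, 1); the patch size and function names are again our own.

```python
import numpy as np
from scipy.ndimage import minimum_filter

def red_channel_transmittance(I, B, patch=15):
    """Transmittance of the red channel via the red channel prior (Eq. (7)):
    one minus the local minimum of the background-light-normalized
    inverted-red, green, and blue channels."""
    norm = np.stack([(1.0 - I[..., 0]) / (1.0 - B[0]),
                     I[..., 1] / B[1],
                     I[..., 2] / B[2]], axis=2)
    return 1.0 - minimum_filter(norm.min(axis=2), size=patch)

def rescale_transmittance(t_r, ratio_g, ratio_b):
    """Re-estimate green/blue transmittance from the red channel via the
    power law t^c = (t^r) ** (beta^c / beta^r)."""
    return np.power(t_r, ratio_g), np.power(t_r, ratio_b)
```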

Three underwater images are selected, and their red channel transmittance maps are derived via Equation (7). Figure 1 shows the three degraded underwater images together with the red channel transmittance maps and background light values estimated by the red channel prior method.

3.3. ADMM Algorithm for Variational Recovery Model

The proposed variational model is solved numerically with the ADMM algorithm, following the iterative form of the Laplace-term operator to improve its convergence speed:

(i) Auxiliary variables are introduced to replace the corresponding terms of the energy functional and are assigned according to the ADMM splitting scheme; the energy equation (2) is thereby transformed into an augmented Lagrangian form with nonnegative penalty parameters and a Lagrangian multiplier for each constraint.

(ii) The subproblems are then solved alternately, as sketched in the code below:

(1) With the other variables fixed, the restored-image subproblem is solved through its Euler-Lagrange equation.

(2) With the other variables fixed, the second subproblem is likewise solved through its Euler-Lagrange equation.

(3) With the other variables fixed, the third subproblem is solved through its Euler-Lagrange equation.

(4) With the other variables fixed, the fourth subproblem is solved in closed form by the generalized soft-threshold formula.

(5) With the other variables fixed, the fifth subproblem is also solved in closed form by the generalized soft-threshold formula.

(iii) Finally, the Lagrangian multipliers are updated, and the iteration repeats until convergence.
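The following is a structural NumPy sketch of such an ADMM loop, reduced to a single splitting variable. It is not the paper's solver: the actual subproblems of Equation (2) involve the Laplace and color-constraint terms and several auxiliary variables, and the variable names, penalty parameter, and threshold weight here are illustrative assumptions.

```python
import numpy as np

def admm_recover(I, t, B, n_iter=30, rho=1.0, lam=0.05):
    """Schematic ADMM iteration mirroring steps (i)-(iii) in structure:
    a quadratic image subproblem, a generalized soft-threshold
    (shrinkage) subproblem, and a multiplier update."""
    # data-fidelity estimate of J from the imaging model, used as anchor
    J_data = (I - B * (1.0 - t)) / np.maximum(t, 0.1)
    J = J_data.copy()            # restored image
    Z = np.zeros_like(I)         # auxiliary (splitting) variable
    U = np.zeros_like(I)         # scaled Lagrangian multiplier
    for _ in range(n_iter):
        # J-subproblem: quadratic, hence a closed-form averaging update
        J = (J_data + rho * (Z - U)) / (1.0 + rho)
        # Z-subproblem: generalized soft threshold (shrinkage) on J + U
        V = J + U
        Z = np.sign(V) * np.maximum(np.abs(V) - lam / rho, 0.0)
        # multiplier update
        U = U + J - Z
    return np.clip(J, 0.0, 1.0)
```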

4. FPAGAN Enhancement Model

The algorithm takes paired underwater images as training data, constructs a multi-term loss function to constrain the output of the network model, and finally, by training the network model, obtains underwater images with improved contrast, vivid colors, and greater clarity.

4.1. Model Structure

FPAGAN (generative adversarial network with feature pyramid attention) enhances degraded underwater images by using an encoder and a decoder to extract image features and combining them with a feature pyramid attention module. The structure of FPAGAN is divided into two parts: the generative network and the discriminative network. The network structure and data processing flow of FPAGAN are shown in Figure 2.
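Since the layer-level details are not reproduced here, the following is a simplified Keras sketch of a feature pyramid attention block of the kind described: multiscale convolution branches form an attention map that reweights the projected input features, plus a global-pooling branch for scene-level context. The branch depths, kernel sizes, and function name are our assumptions, not the paper's exact configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers

def feature_pyramid_attention(x, channels):
    """Simplified FPA block; input height/width must be divisible by 8."""
    # pyramid branches at 1/2, 1/4, and 1/8 resolution with 7x7/5x5/3x3 kernels
    b1 = layers.Conv2D(channels, 7, strides=2, padding="same", activation="relu")(x)
    b2 = layers.Conv2D(channels, 5, strides=2, padding="same", activation="relu")(b1)
    b3 = layers.Conv2D(channels, 3, strides=2, padding="same", activation="relu")(b2)
    # fuse bottom-up, upsampling back to the input resolution
    m3 = layers.UpSampling2D(2, interpolation="bilinear")(
        layers.Conv2D(channels, 3, padding="same")(b3))
    m2 = layers.UpSampling2D(2, interpolation="bilinear")(
        layers.add([layers.Conv2D(channels, 5, padding="same")(b2), m3]))
    attn = layers.UpSampling2D(2, interpolation="bilinear")(
        layers.add([layers.Conv2D(channels, 7, padding="same")(b1), m2]))
    # reweight the 1x1-projected input features with the pyramid attention map
    proj = layers.Conv2D(channels, 1, padding="same")(x)
    out = layers.multiply([proj, attn])
    # global average pooling branch adds scene-level context (broadcast add)
    gap = layers.Conv2D(channels, 1)(layers.GlobalAveragePooling2D(keepdims=True)(x))
    return layers.Lambda(lambda ts: ts[0] + ts[1])([out, gap])
```

In a generator, such a block would typically sit between the encoder and the decoder, e.g., `feats = feature_pyramid_attention(feats, 128)`.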

4.2. Construction of the Loss Function

In this paper, the FPAGAN model is optimized by adversarial training of the generative network and the discriminative network, and the optimization process of the network corresponds to the optimization of the loss function. The adversarial loss is

$$\mathcal{L}_{\mathrm{GAN}}(G, D) = \mathbb{E}_{y}\left[\log D(y)\right] + \mathbb{E}_{x}\left[\log\left(1 - D(G(x))\right)\right],$$

where $D$ denotes the discriminative network; $G$ denotes the generative network; and $\mathbb{E}$ denotes the mathematical expectation.

During model training, the discriminant network is updated once, and then, the generator network is updated once to prevent the model gradient from disappearing. The model reaches dynamic equilibrium when the discriminant network cannot distinguish between the generated image and the reference image, and then, a clear underwater image is obtained. In order to reduce the human interference in the output image, three aspects are considered in this paper, namely, global similarity, image content, and color perception loss.

Global similarity loss: existing methods show that adding an $L_{1}$ loss to the objective function allows the generative network to learn the global similarity between images. Since the $L_{1}$ loss is less prone to introducing blur, an $L_{1}$ term is added to the objective function:

$$\mathcal{L}_{1}(G) = \mathbb{E}_{x, y}\left[\left\| y - G(x) \right\|_{1}\right],$$

where $x$ denotes the degraded underwater image and $y$ denotes the reference image corresponding to the degraded image.

Color perception loss: this loss term encourages the generated enhanced image to have colors similar to those of the reference image. It is defined as

$$\mathcal{L}_{\mathrm{col}}(G) = \mathbb{E}_{x, y}\left[\left(y_{r} - G(x)_{r}\right)^{2} + \left(y_{g} - G(x)_{g}\right)^{2} + \left(y_{b} - G(x)_{b}\right)^{2}\right],$$

where the subscripts $r$, $g$, and $b$ denote the red, green, and blue color channels of the image, respectively.

FPAGAN uses a multi-term loss function for network model learning, which accelerates the convergence of the network parameters and improves the robustness of the model. The overall objective is obtained by linearly combining the adversarial loss $\mathcal{L}_{\mathrm{GAN}}$, the content-aware loss $\mathcal{L}_{\mathrm{con}}$, and the color perception loss $\mathcal{L}_{\mathrm{col}}$:

$$\mathcal{L} = \lambda_{1}\mathcal{L}_{\mathrm{GAN}} + \lambda_{2}\mathcal{L}_{\mathrm{con}} + \lambda_{3}\mathcal{L}_{\mathrm{col}},$$

where $\lambda_{1}$, $\lambda_{2}$, and $\lambda_{3}$ are scaling factors, set empirically to 0.6, 0.3, and 0.1, respectively.
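A hedged TensorFlow sketch of this linear combination follows, using the 0.6/0.3/0.1 weights quoted above. The content term is stood in by a plain L1 difference, since the paper's content-aware loss (typically computed on deep features) is not specified here; the names and signatures are our own.

```python
import tensorflow as tf

bce = tf.keras.losses.BinaryCrossentropy(from_logits=True)

def generator_loss(disc_fake, fake, real, w_adv=0.6, w_con=0.3, w_col=0.1):
    """Multi-term generator objective: adversarial + content + color terms,
    linearly combined with the scaling factors described above."""
    adv = bce(tf.ones_like(disc_fake), disc_fake)        # fool the discriminator
    con = tf.reduce_mean(tf.abs(real - fake))            # L1 stand-in for content
    col = tf.reduce_mean(tf.square(real - fake), axis=[1, 2])  # per-channel MSE
    col = tf.reduce_mean(tf.reduce_sum(col, axis=-1))    # summed over R, G, B
    return w_adv * adv + w_con * con + w_col * col
```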

5. Experimental Setup

The experiment is implemented under the Windows OS, based on the TensorFlow and Keras open-source deep learning frameworks. The computer configuration is as follows: an NVIDIA GeForce RTX 2070 Max-Q (8 GB) GPU, an Intel Core i7-10750H CPU at 2.60 GHz, and 16 GB of RAM.

In this paper, all training samples were resized to a uniform size and normalized to the interval [-1, 1]; the batch_size was 8; five convolutional kernels of different sizes were used in the network model; in addition, three different nonlinear activation functions were used to improve the generalization ability of the model.
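A minimal tf.data input pipeline consistent with this setup might look as follows; the 256x256 target size and the file pattern are assumptions, since the paper's exact values are not reproduced here.

```python
import tensorflow as tf

def preprocess(image_path, size=256):
    """Load a training sample, resize it to a uniform size, and normalize
    pixel values to [-1, 1], as described above."""
    raw = tf.io.read_file(image_path)
    img = tf.image.decode_jpeg(raw, channels=3)
    img = tf.image.resize(img, [size, size])
    return img / 127.5 - 1.0

# batched pipeline with the paper's batch_size of 8 (hypothetical file paths)
paths = tf.data.Dataset.list_files("train/*.jpg")
dataset = paths.map(preprocess).batch(8)
```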

5.1. Analysis and Discussion

The test set is divided into test set A and test set B. Test set A contains 46 underwater images with references, and test set B contains 23 underwater images without references. The validation set is used during training: the enhancement effect can be verified on it after each epoch in order to detect problems with the model or its parameters early and to check the generalization ability of the model, so that countermeasures can be taken in time. The test set is used after training to evaluate the performance of the final trained model.

In order to verify the effectiveness of the algorithms in this paper, we compare them with existing underwater image enhancement methods, including nonphysical model-based methods (GC), physical model-based methods (UDCP, LDCP), and data-driven methods (UWCNN, FUnIE-GAN), through extensive experiments. In this paper, the effectiveness of the introduced FPA modules is firstly analyzed, and then, the experimental results of the above algorithms are compared qualitatively and quantitatively on test set A and test set B, respectively.

In order to demonstrate the effectiveness of the FPA module added to the generative network, a comparison between this model and the GAN model without the FPA module is performed. The two models are consistent in terms of the training set and the values of their hyperparameters. The results of the subjective experiments on the test set are shown in Figure 3.

As shown in Figure 3, the GAN model without the FPA module can effectively correct underwater images with blue-green tones, but at the same time it introduces a yellowish cast, and the overall image is dark, so its enhancement effect is poor. The algorithm in this paper captures dense features and contextual information of the image, which effectively improves the visual perception of the image, corrects the color bias, and enhances the contrast.

To further validate the effectiveness of the FPA module, the performance of the two models on test set A and test set B is quantitatively analyzed. First, two full-reference evaluation metrics, namely, PSNR and SSIM, were considered. Among them, the larger the PSNR and SSIM values are, the closer the brightness, contrast, overall structure, and information contained in the image to be evaluated are to the reference image. Secondly, three nonreference image quality evaluation metrics—UIQM, IE, and NIQE—were considered. The results of quantitative analysis are shown in Tables 1 and 2.
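For reference, the two full-reference metrics can be computed with scikit-image as in the sketch below; the `channel_axis` argument assumes a recent scikit-image version.

```python
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def full_reference_scores(enhanced, reference):
    """PSNR and SSIM against the reference image; both are higher-is-better
    full-reference metrics. Inputs are uint8 RGB arrays of equal shape."""
    psnr = peak_signal_noise_ratio(reference, enhanced, data_range=255)
    ssim = structural_similarity(reference, enhanced,
                                 channel_axis=2, data_range=255)
    return psnr, ssim
```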

The experimental results in Table 1 show that after adding the FPA module, the distortion between the enhanced underwater image and the reference image is smaller, and the brightness, contrast, and structural information of the image are closer to that of the reference image.

The experimental results in Table 2 show that the enhanced underwater image performs better in the nonreference image quality evaluation index after adding the FPA module.

In order to verify the effectiveness of this algorithm, several experiments were conducted on test set A to compare it with existing underwater image enhancement methods (including GC, UDCP, LDCP, UWCNN, and FUnIE-GAN) qualitatively and quantitatively. The results of the qualitative comparison of the different methods on test set A are shown in Figure 4. It can be seen that the nonphysical model-based GC method does not enhance the image sufficiently, resulting in a lighter color and a foggy appearance. The physical model-based methods (UDCP, LDCP) introduce unexpected colors, and the images show a red or blue color bias. In turn, the data-driven method designed as a lightweight convolutional neural network effectively removes the fogging and improves clarity but introduces a yellowish color bias.

To further verify the performance of the algorithm in this paper, two full-reference image quality evaluation metrics, PSNR and SSIM, were used for quantitative comparison and analysis with the above methods. The results of quantitative analysis of different methods are shown in Table 3. From Table 3, we can see that the algorithm outperforms the other comparison algorithms in both PSNR and SSIM evaluation indexes, which indicates that the algorithm can effectively recover the information in the image, improve the contrast, and make the obtained image more realistic and natural.

In order to evaluate the performance of this algorithm more accurately, qualitative and quantitative comparisons between this algorithm and the above underwater image enhancement methods were conducted on test set B. The qualitative comparison of the different methods on test set B is shown in Figure 5. As shown in Figure 5(b), although the GC method corrects the color deviation, it does not completely eliminate the blue-green hue of the underwater image and introduces a fogging effect. From Figures 5(c) and 5(d), it can be seen that the physical model-based methods overexpose the underwater images and introduce red and blue color bias. As shown in Figure 5(e), the method of [12] leaves room for improvement in terms of image brightness and haze removal. As shown in Figures 5(f) and 5(g), both the method of [17] and the algorithm in this paper perform well in blue-green hue correction, sharpness enhancement, and visual perception of the image; however, the algorithm in this paper performs better in color correction, defogging, and visual perception, and the obtained images are clearer and more natural. Therefore, the algorithm in this paper has better generalization performance.

In order to objectively verify the performance of the algorithms in this paper, four nonreference image quality evaluation metrics, UIQM, IE, NIQE, and gradient correlation-based image quality assessment (OG-IQA), are selected to evaluate and compare the underwater images enhanced by the above methods. IE is an important indicator of the richness of information in an image, and the larger the value, the more informative the image is. OG-IQA uses the correlation between adjacent gradients to describe the degree of distortion of an image, and the smaller the value, the better the image quality. The average scores of different methods on each evaluation index are shown in Table 4.
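As an example of one of these metrics, IE can be computed directly from the intensity histogram; a short sketch (on a grayscale version of the image) follows.

```python
import numpy as np

def image_entropy(gray):
    """Information entropy (IE) of an 8-bit grayscale image:
    H = -sum_i p_i * log2(p_i) over the 256-bin intensity histogram.
    Larger values indicate a more information-rich image."""
    hist, _ = np.histogram(gray, bins=256, range=(0, 256))
    p = hist / hist.sum()
    p = p[p > 0]                      # drop empty bins to avoid log(0)
    return float(-(p * np.log2(p)).sum())
```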

Table 4 shows that, except for the NIQE evaluation index, the values of the other three evaluation indexes of this algorithm are better than those of the comparison algorithm. For example, the UIQM evaluation index of this algorithm is higher than that of the comparison algorithm, with a value of 2.9457, which indicates that this algorithm can effectively improve the contrast and sharpness of the image and correct the color deviation. The values of IE and OG-IQA evaluation indexes are 4.6925 and -0.8002, respectively, which indicate that the enhanced images contain more information, less distortion, and higher visual quality.

6. Conclusion

This paper focuses on the low contrast and color deviation of ocean underwater images and studies specific methods for ocean underwater image enhancement. A neural network is used for enhancement: a bright-channel color compensation and fusion method is proposed, and the underwater image enhancement algorithm uses a GAN. Both the qualitative and the quantitative analyses achieve good results. However, because the bright-channel color compensation takes the average value of the bright channel as the estimate for the red channel, marine scenes remain challenging: they exhibit not only low contrast and color distortion but also a variety of degradations caused by changes in water turbidity and depth of field.

Data Availability

The experimental data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding this work.

Acknowledgments

This work was supported by a grant from the Brain Korea 21 Program for Leading Universities and Students (BK21 FOUR) MADEC Marine Designeering Education Research Group.