<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Information Control Systems &amp; Technologies, September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Image Compression Research Based On Convolutional Autoencoder</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marchenko</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elena Balalayeva</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olena Piatykop</string-name>
          <email>piatykop_o_ye@pstu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Volodymyr Kukhar</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Pryazovskyi State Technical University</institution>
          ,
          <addr-line>29 Gogolya St, Dnipro, 49000</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Technical University Metinvest Polytechnic, LLC</institution>
          ,
          <addr-line>80 Pivdenne Hwy, Zaporizhzhia, 69106</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>2</volume>
      <fpage>4</fpage>
      <lpage>26</lpage>
      <abstract>
        <p>The article presents the results of the study on image compression algorithms based on neural networks. The study analyses classical compression methods, such as JPEG, PNG, GIF, TIFF and identifies the advantages of neural network methods, in particular the use of an autoencoder, a variational autoencoder, and generative adversarial networks. A comparative analysis of classical compression algorithms, such as JPEG, with new approaches based on neural networks is carried out using the example of an autoencoder. A mathematical model describing the principle of operation for an autoencoder is presented, illustrating how a neural network encodes and restores images using latent space. To achieve the best reconstruction quality, a hybrid loss function comprising three components was employed: perceptual loss based on VGG16, SSIM loss, and MSE loss. A modular software system was developed using the Python programming language to conduct the experiments. The software includes a graphical interface, a compression module for encoding and decoding images using an autoencoder model, and a quality assessment module for calculating the main quality. The study found that traditional image compression methods demonstrate high efficiency, but are more prone to generating artifacts, especially at high compression levels, compared to neural network methods. The research results indicate that the autoencoder model can encode and decode images with minimal loss of quality, on par with JPEG, but is inferior to classical algorithms in speed and compression ratio.</p>
      </abstract>
      <kwd-group>
        <kwd>autoencoder</kwd>
        <kwd>image compression</kwd>
        <kwd>neural networks</kwd>
        <kwd>JPEG</kwd>
        <kwd>compression algorithms 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>compared to traditional methods. While these studies demonstrate the strong potential of neural
networks in compressing images, their practical implementation is often hindered by high
computational requirements, making them inaccessible for many commercial applications.</p>
      <p>The relevance of the research is determined by the need for a deeper understanding of modern
image compression approaches that address the limitations of traditional methods. Neural
networkbased compression represents an innovative approach with the potential to enhance both
compression efficiency and quality. Research in this field is essential for the continued development
of technologies, especially as the volume of digital data continues to grow.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Literature review</title>
      <p>The goal of this study is to assess the effectiveness of image compression algorithms based on neural
networks and compare them with traditional compression methods.</p>
      <p>
        Analysis of Recent Research and Publications. Classical image compression methods (JPEG, PNG,
GIF, and TIFF) have their own strengths and weaknesses. For example, JPEG works well for
photographs but can introduce noticeable artefacts when compressed too much, while PNG
preserves high image quality but results in larger file sizes. The choice of compression method
depends on specific requirements for the image, such as image quality, content size and type. Neural
networks can offer a new approach to image compression [
        <xref ref-type="bibr" rid="ref3 ref4 ref5">3-5</xref>
        ], with their main advantage being
their ability to learn from large datasets, identify patterns and extract the most important
information from an image. They can also be tailored for specific image types, such as medical scans
[
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ] or satellite imagery [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], making them a flexible and versatile tool.
      </p>
      <p>
        One popular approach is an autoencoder, a type of neural network used for encoding and
decoding data [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. It is commonly applied in dimensionality reduction and noise removal and consist
of two main components: 1) encoder, which compresses the input data by identifying and keeping
only the most essential features while discarding noise and irrelevant information; 2) decoder, which
restores the original image from the compressed data as accurately. The main advantages of
autoencoders include the ease of implementation and configuration, as well as adaptability to
different data types. However, their compression quality may not be as high as more advanced
approaches, while their latent space is often linear and has limitations in handling complex data.
      </p>
      <p>
        A variational autoencoder builds on the traditional autoencoder by introducing a probabilistic
approach to feature representation. Instead of using a fixed feature vector in the latent space,
variational autoencoders model the data as a probability distribution [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], allowing for more flexible
and generalized data representations. This flexibility in the latent space enables them to encode more
complex features, generate new information based on the data not previously seen by the network
and effectively handle complex, uneven data distributions. However, variational autoencoders are
computationally more demanding than standard autoencoders and require more complex training.
      </p>
      <p>Generative adversarial networks (GANs) are a class of artificial intelligence algorithms used in
unsupervised learning. They consist of two competing neural networks in a zero-sum game [11]: one
network generates candidate images (generator), while the other (discriminator) evaluates them. The
generator network typically learns to build matches from the latent space to a specific data
distribution, while the discriminator distinguishes between real data and the candidates produced by
are particularly
effective at operating with complex patterns and have a potential in generating new data. However,
they require substantial computational resources, long training times, and can sometimes introduce
artefacts into the generated images.</p>
      <p>Considering the strengths and limitations of the above methods, autoencoders were chosen for
further experimentation due to their simplicity and operational features.</p>
      <p>
        Let us consider the existing image compression software. A study in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] examines a new system
that combines convolutional neural network (CNN) explanations with semantic image compression
in a single,
end-towhile compressing input images for efficient storage or transmission. The method offers an
innovative approach combining neural network transparency with high compression efficiency,
making it especially useful when the resources are limited in terms of data storage and transmission.
However, one drawback of t
for images or classes not included in the training set, leading to inconsistencies in explanations or
reduced compression efficiency. Additionally, the choice of parameters, such as block size, may affect
the trade-off between image quality and compression level.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], researchers propose a new approach for image compression using deep learning neural
networks, optimized from start to finish with consideration of the trade-off between data
transmission rate and distortion. The authors apply non-linear transformations inspired by biological
neuron models, significantly improving the quality of compressed images compared to standard
methods like JPEG and JPEG 2000. The method demonstrates strong improvements in image
compression, especially at low bitrates, making it a promising option for future real-world
applications. However, optimising all the parameters of this model requires considerable time and
computational power, and the use of GDNs and other non-linear transformations makes
implementation more complex compared to traditional algorithms, such as JPEG.
      </p>
      <p>Another study [12] explores a different approach for lossy image compression using GANs, with
the goal of preserving high visual quality of the restored images at low bitrates. The primary idea is
to combine generative models with compression techniques, allowing the preservation of textures
and fine details even when their size is significantly reduced. To improve the quality of the restored
perceptual losses helps to achieve a high level of similarity between the restored and original images.
This method combines advanced neural network and data compression methods to ensure
highquality restored images. However, it has some drawbacks: images with tiny details or text may lose
quality, especially at extremely low bitrates; and implementing GANs for compression requires
significant computational resources during the training, which limits their widespread use on devices
with restricted processing power.</p>
      <p>Thus, the main advantages of neural networks include the preservation of high texture and detail
quality at low bitrates, as well as the ability to work with high-quality images. These methods,
however, require substantial computational resources and may struggle with preserving fine details
and tiny text. In summary, neural networks enable to achieve effective image compression,
improving the balance between file size and visual quality, however, further research is required
before they can be widely adopted.</p>
      <p>The scientific novelty of this study is the establishment of the dependence of compression
efficiency on the architecture of the convolutional autoencoder. This allows us to provide specific
recommendations for further optimization of the model architecture and increasing its efficiency.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Research methodology</title>
      <p>
        The autoencoder model is a complex system of functions that encodes and decodes the input image
using neural networks [
        <xref ref-type="bibr" rid="ref9">9, 13</xref>
        ]. The model aims to find a latent representation of the data that would
minimize information loss during the image restoration process [13, 14].
      </p>
      <p>An autoencoder can be represented as a pair of functions:</p>
      <p>encoder  ( ) =  , which matches the input image  ∈ ℝ × × with the latent space  ∈ ℝ ,
where  ≪  ×  ×  , with  being the height,  representing the width, and  showing the
number of channels;
– decoder  ( ) =  ̂, which restores the image  ̂, from the latent representation  .</p>
      <p>The objective is to determine the sets of parameters (weight and offsets) for the encoder   and
decoder   that minimize the difference between the input image  and the restored image  ̂.</p>
      <p>The encoder performs a series of convolutional operations:
 =  ( ) = 
(
( ,  1) +  1),
(1)
where 
denotes convolution,  1 is the filter weigh matrix,  1 is the offset, and 
activation function.</p>
      <p>After a sequence of convolutions, a latent vector is obtained:</p>
      <p>∈ ℝ ,
where  ≪  ×</p>
      <p>×  .</p>
      <p>The decoder restores the image from the latent space using transposed convolutional layers:
 ̂ =  ( ) = 
(
( ,  2
) +  2),
represents the transposed convolution, and 
in the feature maps;  ( 
images, respectively.</p>
      <p>where  represents the function used to compute features with VGG16;  is the number of values
) and  (</p>
      <p>) are the  -th feature values for the true and predicted
This part of the loss function allows the network to retain high-level semantic details of the images.</p>
      <p>SSIM loss (Structural Similarity Index Loss) assesses the similarity between two images by
considering structural characteristics such as brightness, contrast, and texture [15].</p>
      <p>The formula for calculating SSIM is:

( ,  ) =
(2    +  1)(2</p>
      <p>+  2)
( 2 +  2 +  1)(  2 +   2 +  2)
difference between  and  ̂.</p>
      <p>
        Characteristics of the latent space:
information;
irrelevant details;
highly complex data.
function that confines pixel values to the range [
        <xref ref-type="bibr" rid="ref1">0, 1</xref>
        ].
      </p>
      <p>Model training algorithm:</p>
      <p>The input image  is passed through the encoder and decoder, resulting in  ̂;
The difference between  and  ̂ is used to assess the quality of the restoration;
The gradients are computed and used to update the network weights;
A stochastic gradient descent algorithm or one of its variations is employed to minimize the
the latent space contains fewer parameters than the input data, enabling the compression of
the latent representation contains only the essential features for restoration, discarding
the latent space assumes a linear structure, which may limit the model s ability to handle
The model described outlines the functioning of the autoencoder (1) and illustrates how the neural
network encodes and restores images through the latent space. During this process, some
information is lost, leading to a reduction in the size of the output image. However, if the model is
properly trained, this loss of information does not significantly affect the visual perception of the
image, which allows the process to be regarded as a form of lossy compression. This approach serves
as the foundation for the subsequent research presented in this study.</p>
      <p>To train the model to correctly identify image features and achieve optimal reconstruction
quality, a hybrid loss function is employed. This function combines three components: perceptual
loss based on VGG16, SSIM loss, and MSE loss.</p>
      <p>Perceptual loss leverages intermediate features from the VGG16 neural network, pre-trained on
the ImageNet dataset. It measures the similarity between the true images  
and the predicted
images</p>
      <p>at the feature map level, rather than at the pixel level.</p>
      <p>The formulation is as follows:
 
(3)
(4)
(5)
where  and  represent the image areas being compared;   and   are the mean pixel intensity
values for blocks  and  ;   2and   2 are the intensity dispersions for blocks  and  ;  
covariance between  and  ;  1 and  2 are small constants preventing division by zero, typically
defined as  1 = ( 1 )2 and  2 = ( 2 )2, where  is the dynamic range of pixels (255 for 8-bit
images),  1 and  2 are larger constants typically between 0.01 and 0.03.</p>
      <p>Since SSIM is a similarity measure (values closer to 1 indicate a higher degree of similarity), the
loss function is defined as:</p>
      <p>∈ [0.1] represents the value of similarity between the images.</p>
      <p>This function improves the structural similarity between the input and restored images.
MSE loss (Mean Squared Error Loss) measures the mean-square deviation between the pixels of

1

∑( 
 =1
, −  
, and  
, )2,
(6)
(7)
(8)
the true and predicted images:
of the  -th pixel in the corresponding images.
restored images.
where  is the number of pixels in the image;  
, represent the intensity values
This part of the loss function minimizes the numerical difference between the original and
Hybrid loss function is a linear combination of the three above losses:
 ℎ
 
where  ,  ,  are the importance factor coefficients,  +  +  = 1.</p>
      <p>This loss function enables balancing between numerical precision, structural similarity, and
highlevel features.
4. Results and discussions
•
•
•
diagrams.
autoencoder model;
and SSIM.</p>
      <p>The experiment consisted of comparing the performance of the developed autoencoder model with
classical algorithms. In particular, the compression efficiency, speed of operation, and quality of
reconstructed images, assessed using PSNR and SSIM, were investigated.</p>
      <p>To conduct the experiment, a modular software system (Figure 1) was developed in Python,
comprising of the following components:</p>
      <p>graphical user interface, which facilitates user interaction, enabling the selection of
compression models and working directories, as well as the visualization of the results and
compression module, which handles the encoding and decoding of images using the
quality assessment module, which computes the key evaluation metrics, specifically PSNR</p>
      <p>The development of the software relied on the following third-party libraries:
tensorflow (version 2.17.0), keras, numpy, pillow, matplotlib, scikit-image, ttkbootstrap,
opencvpython, and scipy.</p>
      <p>The convolutional autoencoder model was trained using the following key hyperparameters:
• batch_size=64 - the number of images that are simultaneously fed to the model during one
training step. This parameter value allowed for efficient use of computing resources and
accelerated the training process;
• epochs=40 - the number of epochs during which the model was trained. To prevent
overtraining and optimize the process, the EarlyStopping technique was used with the patience=3
parameter. This allows training to be automatically terminated if the validation loss value does
not improve over three consecutive epochs;
• optimizer='adam' - Adam (Adaptive Moment Estimation) optimizer to minimize the loss
function with standard parameters (learning_rate=0.001, beta_1=0.9, beta_2=0.999 and
epsilon=1e-07). It adaptively adjusts the learning rate for each model parameter separately, which
allows for faster convergence and stability of the learning process.</p>
      <p>The images used in this paper were taken from CelebA-HQ (CelebFaces Attributes Dataset
HighQuality), an extended and improved version of the popular CelebA dataset. This dataset is used to
train models such as generative adversarial networks (GANs) and autoencoders. CelebA-HQ is
available for non-commercial use.</p>
      <p>To perform the experiments, 4 encoder configurations were developed: incorporating 3 layers,
5 layers, 7 layers, and 10 layers of convolution.</p>
      <p>Let us consider the performance of each configuration and compare them with the JPEG method.</p>
      <p>The results obtained using the 3-layer encoder are presented in Figure 2.</p>
      <p>The model achieved an SSIM score of 0.97, which corresponds to a JPEG quality setting of 95 out of
100 (where 100 represents the highest possible image quality). An SSIM value close to 1 indicates a
high degree of structural similarity between the compressed image and its original. The SSIM
distribution is presented in Figure 3.</p>
      <p>The PSNR for JPEG compression is 41.11 dB, while the autoencoder achieves 27.09 dB. The higher
PSNR value for JPEG suggests that, from a mathematical standpoint, JPEG-compressed image retains
greater fidelity to the original compared to the autoencoder-compressed image. However, while
PSNR is an important quality parameter, it does not always correlate with perceptual visual quality.
The distribution of PSNR is illustrated in Figure 3 (b).</p>
      <p>The JPEG compression ratio (4.58) is nearly 4 times higher than that achieved by the model (1.11).
Therefore, the autoencoder currently exhibits limited compression efficiency. This suggests that the
model has learned to effectively encode and decode images with no significant information loss,
providing a positive impact on the quality of the output images, while reducing the degree of
compression. The compression ratio distribution is shown in Figure 3 (c).</p>
      <p>JPEG compression significantly outperforms the autoencoder in terms of processing speed,
requiring 92.07 seconds compared to 1519.46 seconds. This result is expected, as neural network
algorithms generally demand more computational resources. A visual comparison of the output
images is shown in Figure 4 and Figure 5.
Visually, both images are similar to the original, with no noticeable artefacts. These results indicate
that while the autoencoder achieves high-quality image restoration as indicated by SSIM, it still lags
behind JPEG in terms of compression efficiency and processing speed.</p>
      <p>Next, we evaluate the performance of the 5-layer autoencoder and compare it to JPEG (Figure 6).</p>
      <p>This model achieves an SSIM of 0.93, which approximates to a JPEG quality setting of 75/100. This
represents a slight decrease in image quality compared to the 3-layer model (Figure 7).</p>
      <p>The 5-layer model also performs worse than JPEG in terms of PSNR (25.78 vs. 36.37); however, the
gap between the two is narrower (Figure 7).</p>
      <p>A notable characteristic of the JPEG method is its ability to adjust the trade-off between the
quality and compression. This means that as the SSIM decreases, the compression ratio increases, as
evident from the experiment results.</p>
      <p>The 5-layer autoencoder demonstrates a slightly better compression ratio compared to the 3-layer
model (1.11 vs. 1.18); however, the difference is minimal when compared to JPEG. The compression
ratio distribution is shown in Figure 7.</p>
      <p>The visual comparison is presented in (a) (b) (c)
Figure 8 and (a) (b)
Figure 9.
The images still closely resemble their original, though a slight difference in colour becomes
apparent.</p>
      <p>In comparison to the 3-layer model, there is a modest reduction in both SSIM and PSNR values.
However, a small increase in the compression ratio is observed. This indicates that the compression
process performed by the autoencoder is lossy, with the compression ratio dependent on the quality
of the original.</p>
      <p>We now turn to the performance of the 7-layer autoencoder model (Figure 10).</p>
      <p>An SSIM of 0.85 corresponds to a JPEG setting of 20/100.</p>
      <p>A noticeable decline in SSIM indicates potential overfitting in the model. With an excessive
from them. Despite this, the restored images maintain clear structure, especially when compared to
dent. However, the restored images show a near-complete
absence of green colour (Figure 11 (a, b)), and some finer details begin to blur (Figure 11 (c)).
(a)
(b)
(c)
with more than 5 convolutional layers is not advisable.</p>
      <p>Next, we examine a case of significant overfitting with the 10-layer model (Figure 12). The SSIM
of 0.80 approximates to the JPEG setting of 13/100.</p>
      <p>Due to significant overfitting, the model generates artefacts, as illustrated in Figure 13. Overall, the
restored images frequently display facial features or even entire faces in areas where they should not
be present.</p>
      <p>Comparative characteristics of 3 layers, 5 layers, 7 layers, 10 layers models and JPEG method with
settings of 95/100, 75/100, 20/100, 13/100 is shown in Table 1.
3, 5, 7, 10 layers of convolution). The graphs also show similar results of the JPEG method with the
corresponding settings (quality values 95/100, 75/100, 20/100, 13/100).
(c) (d)
Figure 14: Graphs of the dependence of changes in image recognition results for encoder models
with different numbers of layers and the JPEG method with different quality settings: PSNR (a), SSIM
(b), Compression ratio (c),Time per image (d).</p>
      <p>The findings of this research indicate that, at present, neural networks are not yet able to outperform
traditional compression methods, suggesting that this area requires further exploration. The example
of the basic autoencoder illustrates that while neural networks can reconstruct images with adequate
quality, they still fall short of the compression efficiency and computational performance offered by
classical methods. Furthermore, the study demonstrated that overly deep models tend to
underperform in this context. Consequently, for compressing homogeneous images of size 128×128
pixels, shallow neural networks with 2 3 layers are more suitable.</p>
      <p>The primary challenges at this stage include suboptimal data compression and a significant
demand for computational resources. Addressing the first issue will require further model
optimisation and the development of a loss function that can train the model to perform the
tradeoff between compression and quality. The second issue points to the need for the adoption of
advanced model compression techniques, such as quantisation and neuron and weight pruning.</p>
    </sec>
    <sec id="sec-4">
      <title>5. Conclusions</title>
      <p>In summary, neural networks can reconstruct images with adequate quality, they still fall short of
the compression efficiency and computational performance offered by classical methods.</p>
      <p>This study developed an autoencoder model for image compression. The experiments
demonstrated that the proposed model successfully preserves key structural features of the images,
while achieving significantly lower compression ratios when compared to JPEG.</p>
      <p>The experiments determined the maximum acceptable number of convolutional layers for the
model. For images of size 128×128 pixels, the maximum acceptable number of convolutional layers
was found to be 5, with 2 3 layers being the most effective configuration.</p>
      <p>The analysis of the experimental data highlighted two main challenges for the neural
networkbased approach: a low compression ratio and high resource requirements. Optimising the model
architecture could enhance compression efficiency. Further improvements can be made through
model optimisation and compression. The results indicate that, without addressing the
computational cost, neural network-based compression methods cannot replace traditional
approaches.</p>
      <p>To optimize the architecture, it is planned to explore alternative architectures, such as variational
autoencoders (VAEs), to create a more efficient and stable latent space. It will be advisable to study
hybrid models that combine different types of neural networks, and use dilated convolutions to
increase the receptive field without additional computational costs.</p>
      <p>To reduce the size of the model and accelerate the encoding and decoding process, a promising
direction will be the study of quantization, pruning, and knowledge distillation methods.</p>
      <p>Further research in this direction will allow creating more efficient and compact solutions for
image compression based on neural networks.</p>
    </sec>
    <sec id="sec-5">
      <title>Declaration on Generative AI</title>
      <p>The authors have not employed any Generative AI tools.
[11] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville,
Yo. Bengio, Generative Adversarial Nets, Advances in Neural Information Processing Systems
3(11) (2014). doi: 2672 2680 10.1145/3422622.
[12] F. Mentzer, G. D. Toderici, M. Tschannen, E. Agustsson, High-fidelity generative image
compression, Advances in Neural Information Processing Systems 33 (2020). doi:
10.48550/arXiv.2006.09965.
[13] A. K. Naveen, S. Thunga, A. Murki, M. A. Kalale, S. Anil, Autoencoded Image Compression for
Secure and Fast Transmission, in: Proceedings of 2024 IEEE International Conference on
Computer Vision and Machine Intelligence (CVMI), 2024. pp. 1-6. doi:
10.48550/arXiv.2407.03990.
[14] D. Bank, N. Koenigstein, R. Giryes, Autoencoders, in: Rokach, L., Maimon, O., Shmueli, E. (Eds),
Machine Learning for Data Science Handbook, Springer, Cham, 2023, pp. 353 374. doi:
10.1007/978-3-031-24628-9_16.
[15] V. Mudeng, M. Kim, S. Choe, Prospects of Structural Similarity Index for Medical Image
Analysis, Applied Sciences 12(8) (2022) 3754. doi: 10.3390/app12083754.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>X.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Shihao</surname>
          </string-name>
          ,
          <article-title>Neural Image Compression and Explanation</article-title>
          , IEEE Access 8
          <article-title>(</article-title>
          <year>2020</year>
          )
          <fpage>214605</fpage>
          -
          <lpage>214615</lpage>
          . doi:
          <volume>10</volume>
          .1109/ACCESS.
          <year>2020</year>
          .
          <volume>3041416</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Abu-El-Haija</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Johnston</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ballé</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shrivastava</surname>
          </string-name>
          , G. Toderici,
          <article-title>End-to-end Optimized Image Compression, ICIP (</article-title>
          <year>2020</year>
          ). https://arxiv.org/abs/
          <year>2007</year>
          .11797.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>H. T.</given-names>
            <surname>Sadeeq</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. H.</given-names>
            <surname>Hameed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Abdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nashwan</surname>
          </string-name>
          ,
          <string-name>
            <surname>Image Compression Using Neural Networks</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          <string-name>
            <surname>Review</surname>
          </string-name>
          .
          <source>International Journal of Online and Biomedical Engineering</source>
          <volume>17</volume>
          (
          <year>2021</year>
          )
          <fpage>135</fpage>
          -
          <lpage>153</lpage>
          . doi:
          <volume>10</volume>
          .3991/ijoe.v17i14.
          <fpage>26059</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bhattacharjee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          ,
          <article-title>An Efficient Image Compression Algorithm Using Neural Networks</article-title>
          , in: T.
          <string-name>
            <surname>Choudhury</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Koley</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Nath</surname>
            , JS. Um,
            <given-names>A.K.</given-names>
          </string-name>
          Patidar (Eds.),
          <article-title>Geo-Environmental Hazards using AI-enabled Geospatial Techniques and Earth Observation Systems</article-title>
          .
          <source>Advances in Geographic Information Science</source>
          , Springer, Cham,
          <year>2024</year>
          . https://doi.org/10.1007/978-3-
          <fpage>031</fpage>
          - 53763-9_
          <fpage>15</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.</given-names>
            <surname>Mishra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <surname>Rajat Kumar Singh</surname>
          </string-name>
          ,
          <article-title>Deep Architectures for Image Compression: A Critical Review</article-title>
          ,
          <source>Signal Processing 191</source>
          (
          <year>2022</year>
          )
          <article-title>108346</article-title>
          . https://doi.org/10.1016/j.sigpro.
          <year>2021</year>
          .
          <volume>108346</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Guo</surname>
          </string-name>
          , T. Han,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ju</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Xu</surname>
          </string-name>
          , H. Liu,
          <source>Medical Image Compression Based on Variational Autoencoder</source>
          , Mathematical Problems in Engineering (
          <year>2022</year>
          )
          <article-title>7088137</article-title>
          . https://doi.org/10.1155/
          <year>2022</year>
          /7088137
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Fettah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Menassel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gattal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Gattal</surname>
          </string-name>
          ,
          <article-title>Convolutional Autoencoder-Based medical image compression using a novel annotated medical X-ray imaging dataset</article-title>
          ,
          <source>Biomedical Signal Processing and Control</source>
          <volume>94</volume>
          (
          <year>2024</year>
          )
          <article-title>106238</article-title>
          . https://doi.org/10.1016/j.bspc.
          <year>2024</year>
          .
          <volume>106238</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Guerrisi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Schiavon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Del Frate</surname>
          </string-name>
          ,
          <article-title>On-Board Image Compression using Convolutional Autoencoder: Performance Analysis and Application Scenarios</article-title>
          ,
          <source>in: Proceedings of 2023 IEEE International Geoscience and Remote Sensing Symposium 2023 IEEE International Geoscience and Remote Sensing Symposium</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>1783</fpage>
          -
          <lpage>1786</lpage>
          . doi:
          <volume>10</volume>
          .1109/IGARSS52108.
          <year>2023</year>
          .
          <volume>10281562</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F.</given-names>
            <surname>Berahmand</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. S.</given-names>
            <surname>Daneshfar</surname>
          </string-name>
          , Yu.Li Salehi,
          <article-title>Xu Yue, Autoencoders and their applications in machine learning: a survey</article-title>
          ,
          <source>Artificial Intelligence Review</source>
          <volume>57</volume>
          (
          <issue>2</issue>
          ) (
          <year>2024</year>
          ). https://doi.org/10.1007/s10462-023-10662-6.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Kingma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Welling</surname>
          </string-name>
          ,
          <article-title>An introduction to variational autoencoders Foundations and Trends</article-title>
          ® in
          <source>Machine Learning</source>
          <volume>12</volume>
          (
          <issue>4</issue>
          ) (
          <year>2019</year>
          )
          <fpage>307</fpage>
          -
          <lpage>392</lpage>
          . doi:
          <volume>10</volume>
          .1561/2200000056.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>