<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Khabarovsk, Russia</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Clustering of Polar Vortex States Using Convolutional Autoencoders</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mikhail A. Krinitskiy</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yulia A. Zyulyaeva</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sergey K. Gulev</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Shirshov Institute of Oceanology, Russian Academy of Sciences</institution>
          ,
          <addr-line>Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <volume>1</volume>
      <fpage>6</fpage>
      <lpage>19</lpage>
      <abstract>
        <p>A profound understanding of the stratospheric wintertime dynamics and its climate changes are important for improving seasonal forecast skill. The primary goal of the research of the wintertime Arctic stratospheric polar vortex (PV hereafter) is defining its states and their clustering. Manual classification is a highly time-consuming task suffering of researcher subjectivity. We apply deep learning methods that let us cluster the PV states based on their spatial structure. We designed the particular kind of neural networks called variational convolutional autoencoder with the sparsity constraint (SpCVAE). We applied the hierarchical agglomerative clustering algorithm to the states pf PV described by their embedded representation generated by SpCVAE. 96 dimensional embedded representation was found to be optimal with high samples reconstruction quality. The best number of clusters was chosen based on "elbow rule" and topic-specific reasoning. The approach applied let us automatically distinguish weak PVs of "displacement" and "split" types, as well as to isolate several strong vortex states of different shift directions. These results are only obtainable when one considers the spatial structure of the PV. We have constructed the calendar of the PV states based on the clustering result. Clustered events of weak PVs were examined and demonstrated good correspondence with the calendar of sudden stratospheric warmings that have been built manually. This result is now the basis for the research of the stratosphere-troposphere interaction for existing and future climate scenarios.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Nowadays, skillful numerical weather prediction is limited by about 10 days due to the chaotic nature of
atmospheric dynamics [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Skillful seasonal forecasting typically relies on the predictability of slow-varying
components of the climate system, such as sea surface temperature, sea ice, snow cover, and soil moisture. For
instance, the predictability of El Niño Southern Oscillation (ENSO) phenomenon is a remarkable example o f high
skills of the seasonal forecast system [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ]. However, recent studies demonstrated that the maximum seasonal
forecast skills have not yet been achieved, and pointed to the stratosphere as a potential source for enhancing
seasonal predictability [
        <xref ref-type="bibr" rid="ref1 ref4">1, 4</xref>
        ].
      </p>
      <p>
        Before [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] the stratosphere was considered playing a passive role in the stratosphere-troposphere coupling, that
is, it does not influence troposphere dynamics. Baldwin and Dunkerton showed [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] that the strength of the polar
vortex affects the main features of the paths of the cyclones propagation. Fro m that time, interest in the
stratospheric dynamics and its climatic changes have constantly been growing.
      </p>
      <p>Polar stratosphere became extremely cold during the polar night, and meridional temperature gradient becomes
strong, which leads to the formation of the polar vortex. Planetary waves propagating from the troposphere to the
stratosphere disturb and sometimes destroy polar vortex. These events are known as Sudden Stratospheric
Warming events (SSW), as the temperature near the pole dramatically increase ( up to 40° per 4 - 7 days) when the
vortex is destroyed. Two types of SSW are classified nowadays: “displacement” with the center of the vortex
shifted significantly towards the equator, and “split” with the vortex split into two vortices. There are periods of
the extremely strong vortex as well. Variability of the PV intensity is the most influential factor of intraseasonal
variability in the winter stratosphere. During springtime as the polar day is coming to high latitudes, temperature
gradient decrease, so the polar vortex disappears. Summertime variability of the stratosphere dynamics is low, and
there is no source of the stratosphere-troposphere coupling.</p>
      <p>
        It has been shown that weakenings of the polar vortex precede the shift of the storm tracks (main p ath of
midlatitude cyclones propagation) to the south, which may cause cold outbreaks in the North Atlantic - Europe region
[
        <xref ref-type="bibr" rid="ref10 ref6 ref7 ref8 ref9">6–10</xref>
        ]. These anomalies may act in the troposphere up to 2 months [
        <xref ref-type="bibr" rid="ref11 ref6">6, 11</xref>
        ]. Amplitudes of these anomalies are
comparable to the effect of the ENSO [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Therefore, one can extend and improve long-term weather forecast [
        <xref ref-type="bibr" rid="ref12 ref13 ref14 ref15 ref16">12–
16</xref>
        ]. However, relatively little attention is paid to strong polar vortex events compared to SSW. One of the main
reasons for that is the difficulty of their identification classification.
      </p>
      <p>
        There are a few metrics for the describing states of the PV [
        <xref ref-type="bibr" rid="ref17 ref3">3, 17</xref>
        ]. Widely used key features of PV for applying
different types of analysis are the aggregated and diagnostic parameters like maximum pressure anomaly, zonal
mean zonal winds, etc. These parameters do not preserve spatial characteristics like PV center shift from the North
Pole, vortex shape parameters or various anomalies duration.
      </p>
      <p>
        Over last decades machine learning methods demonstrated spectacular results in research of climatic changes of
atmospheric circulation [
        <xref ref-type="bibr" rid="ref18 ref19 ref20 ref21 ref22">18–22</xref>
        ]. Researchers mostly rely on state-of-the-art clustering methods. Using the
resulting atmosphere states grouping one can assess atmospheric circulation characteristics and trends within each
cluster. Most of this kind of works are focused on troposphere states research [
        <xref ref-type="bibr" rid="ref18 ref19 ref20 ref22">18–20, 22</xref>
        ]. However, machine
learning techniques are shown recently to be fruitful for stratosphere states clustering [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Mostly Kohonen
selforganizing maps [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] is applied in these works for clustering. However, its ability to preserve the topology of a
dataset is rarely used. Hierarchical agglomerative clustering is the less frequently used method [
        <xref ref-type="bibr" rid="ref18 ref21">18, 21</xref>
        ]. For this
type of clustering, the source data of geophysical fields are usually a ggregated following a researcher sense of a
particular operation ability to preserve the informational content. As a result, each PV state is represented with
vectors. This procedure leads to the loss of crucial information about the spatial configuration of objects being
researched.
      </p>
      <p>
        In our work, we focused on polar vortex states research with the use of geopotential height at 10hPa level (see
section 2.1 “Data and preprocessing”). We applied hierarchical agglomerative clustering method on the embedded
representations of PV states that are generated by variational convolutional autoencoder with the sparsity
constraint (hereafter SpCVAE) [
        <xref ref-type="bibr" rid="ref24 ref25">24, 25</xref>
        ]. SpCVAE as a feature extractor and a tool for dimensionality reduction
preserves features of PV spatial configuration, yet it is capable of decreasing the computational costs of clustering.
Purposes of a SpCVAE here are dimensionality reduction and extraction of significant information based on the
whole PV states dataset. Being a neural network trained end-to-end the SpCVAE avoids manual feature
engineering. Thus there is no need to rely on a researcher`s sense of the importance of PV aggregated parameters.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Data and Methods</title>
      <sec id="sec-2-1">
        <title>Data and Preprocessing</title>
        <p>
          We analyzed geopotential heights (HGT) and potential vorticity (PVt) fields at 10 hPa level from JRA-55
(Japanese 55-year Reanalysis) in this study [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ]. We considered wintertime period December-February (DJF) for
1958 - 2014 years. The spatial resolution of the data is 1,25° х 1,25°, the upper level of the model is 0.1 hPa, which
is crucial for the analysis of stratospheric processes. JRA-55 shown to be in good coherence with all modern
reanalysis data (S-RIP [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ]). The main advantage of JRA-55 is the extended period from the 1958 year in
comparison with ERA-Interim (European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis
data) and MERRA (NASA Modern Era Reanalysis for Research and Applications) that start from 1979.
        </p>
        <p>Source data for describing PV states were considered to be field values to the north from 40 °N.</p>
        <p>At the preprocessing stage, timestep-defined snapshots of HGT and PVt fields were projected using North-polar
Lambert azimuthal projection and interpolated to form a two-dimensional flat matrix of size 256x256. For each
date of a year, we calculated the climatological median and subtracted it from each snapshot of this date (e.g., from
all snapshots within January 25th of each year). Since we use the North-polar Lambert azimuthal projection, only
central rounded part of each sample is informative, so during all calculations, we used the mask   (Fig. 1b).</p>
        <p>We further normalized these snapshots, so all their values are limited and take values between 0.0 and 1.0:
  −min 
  = max  − m,in  , (1)</p>
        <p>,  ,
where  denotes the whole dataset snapshots,  is a particular snapshot,  and  are  matrix indices, min( ) and
max( ) are calculated taking the mask   into account. This kind of normalization applied to both HGT and PVt
datasets separately. With this preprocessing procedure, the PV states dataset is represented with two fields
containing 21476 matrices of size 256 x 256 which values fit the range between 0.0 and 1.0. Some examples of this
dataset are shown in Fig. 1a.
________________________________________________________________________________________________
(a)
(b)
∈ ℋ where ℋ is a hidden representations space, and  ̃ is the
where  is the number of pixels of input examples. In
→  ̃, where  is an input example matrix,  ∈
our study,  = 2 ∗ 256 ∗ 256 since HGT and PVt projected examples are matrices 256x256. The transformations
 → 
and</p>
        <p>→  ̃ are referred hereafter as encoder and decoder respectively. An autoencoder is trained with the loss
function defined according to the similarity definition suitable to the problem. Mean squared error is a widely used
loss function that is suitable for most tasks that imply the processing of geophysical fields. In a case of limited source
data values, one may normalize them accordingly to use binary cross-entropy loss (BCE) (2). We normalized source
data, so its values are limited (see Section 2.1) therefore we use BCE loss ℒ
:
ℒ
( ,  ) = − ∑
 =0 ∑

 =0   (  ln ℱ(  )) ,
(2)</p>
        <p>
          It was shown [
          <xref ref-type="bibr" rid="ref24 ref25 ref28">24, 25, 28</xref>
          ] that successful training of a neural network implies tuning its weights the way that the
trainable part of the network extracts latent parameters distribution of the training dataset. With the trained CAE, the
embedded representation [
          <xref ref-type="bibr" rid="ref29 ref30">29, 30</xref>
          ] of input samples preserve enough information for the network to be able to
reconstruct it with the appropriate quality. We use this feature of CAEs to perform dimensionality reduction with
minimum loss of information about latent parameters variability and the spatial structure of PV states.
        </p>
        <p>
          We applied commonly used techniques of convolutional neural networks quality improvement called Transfer
Learning (hereafter TL) [
          <xref ref-type="bibr" rid="ref28 ref31 ref32 ref33 ref34 ref35">28, 31–35</xref>
          ] and Fine Tuning (hereafter FT) [
          <xref ref-type="bibr" rid="ref36 ref37">36, 37</xref>
          ]. In practice, TL implies the construction
of a new neural network based on a subset of layers of the network that was previously trained on a huge dataset, e.g.,
ImageNet [
          <xref ref-type="bibr" rid="ref38">38</xref>
          ]. Moreover, these layers are set to be “frozen,” that is, their weights are not optimized during training.
This approach is fruitful when one uses a dataset which statistical characteristics are similar to ImageNet. In practice,
it means that the new dataset images should contain visual patterns similar to ones that are frequently met in
ImageNet. TL application significantly decreases the computational costs of new
models training. Fine Tuning
approach implies turning off the “frozen” state for some top layers of the transferred neural network. With this
approach, one can tune the whole CAE taking into account the peculiar properties of the dataset.
        </p>
        <p>
          We applied the TL technique while building the encoder part of the CAE. We used pre-trained VGG-16 [
          <xref ref-type="bibr" rid="ref39">39</xref>
          ] as a
transferred network. We used the convolutional part of VGG-16 and a set of new fully-connected layers attached to it.
VGG-16 convolutional sub-network is denoted as “convolutional core” in fig. 2.
        </p>
        <p>
          We also applied several
regularizations to prevent overfitting. Particularly we used the dropout [
          <xref ref-type="bibr" rid="ref40">40</xref>
          ] approach and L2 regularization which
penalizes high-value weights.
        </p>
        <p>In our model, the VGG-16 output is reshaped to a vector which is then the input for the encoder fully-connected
part. This fully-connected part consists of three layers FC1, FC2, and FC3, which are alternating with dropout layers.
FC3 is an intermediate one between encoder and decoder. The output of FC3 is the hidden representation of the input
sample and the input vector for the decoder.</p>
        <p>Autoencoders usually tend to be symmetric. With this approach, we constructed the decoding part to be mirrored
to the encoder. All the weights of the decoder are trainable. Following the best practices of composing convolutional
autoencoders, we used two-dimensional upsampling layers which mirror max-pooling layers of the encoder.
Upsampling here is an operation of repeating the layer`s input rows and columns by the specified number of times.
Decoder outputs are the reconstructed HGT and PVt fields of input example which has to be similar to the input in a
sense defined by the loss function of the network which is BCE loss (2).</p>
        <p>
          The dimensionality constraint mentioned above is the dimensionality of ℋ . The common approach of the
clustering involving autoencoders relies on their capability of projecting the input examples to the hidden
representation space ℋ. This transformation was shown to be trained so that the examples which are close to each
other in ℋ are similar [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ]. However, the key feature of a dataset should be the opposite for the reliable, stable, and
reproducible clustering, that is, similar examples should be located close to each other in ℋ . The ordinary
undercomplete CAE does not guarantee this property of the projection  → ℋ. This issue might be addressed with
the variational autoencoder (VAE) [
          <xref ref-type="bibr" rid="ref41">41</xref>
          ] which was shown to produce continuous latent variables space ℋ. That is,
with VAE involved, similar examples are located close to each other in ℋ. However, since the distribution of the
features of  is normal in case of VAE, the clustering cannot produce valuable results in the generated feature space
ℋ. In our study, this issue is addressed with the constraint of sparsity, that is, features of the vector  are forced to be
Bernoulli-distributed. With this constraint, the hidden representation vectors  tend to be sparse, that is, only a few
features are non-zero for a particular example  . The undercomplete CAE with the mentioned constraints imposed is
referred hereafter as sparse variational convolutional autoencoder (SpCVAE). Its structure is presented in Fig. 2. As
shown in Fig. 2, the fields of the input example are processed separately, similar to the approach applied in [
          <xref ref-type="bibr" rid="ref42">42</xref>
          ].
        </p>
        <p>
          In our work, the only hyperparameter of the proposed SpCVAE is the number of nodes of the FC3 layer. This
number at the same time is the number of features of the hidden representation  (see Fig. 2) and the dimensionality
of ℋ (hereafter  ). There is a trade-off between reconstruction quality and the  . We use the multiscale
structural similarity ( [
          <xref ref-type="bibr" rid="ref43">43</xref>
          ]) as a measure for reconstruction quality. For optimization reasons, we use the
metric (1 −  ) with the rule “less is better” (Fig. 3a). We have conducted the research of the reconstruction
quality versus the  (see Fig. 3a). There is a reasonable value of  = 96 since after this value the
reconstruction quality stops improving significantly. There are more candidates for the best choice of  ,
however, only starting with the  = 96 the clustering results become stable and reproducible. Taking this result
into account, we further used the SpCVAE with 96 neurons of layer FC3. We have trained this SpCVAE using the
data described in section 2.1. We use then the encoder output in the inference mode of the trained SpCVAE as PV
states representation of reduced dimensionality.
2.3
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Hierarchical Agglomerative Clustering</title>
        <p>
          We applied Lance-Williams hierarchical agglomerative clustering [
          <xref ref-type="bibr" rid="ref44 ref45 ref46">44–46</xref>
          ] to define groups of stable PV states.
This method is frequently used for atmosphere and stratosphere states clustering [
          <xref ref-type="bibr" rid="ref18 ref19 ref20 ref21 ref22">18–22</xref>
          ]. We applied this clustering
method to PV states objects described with low-dimensional hidden representations generated by SpCVAE (see Fig.
2). We considered the Euclidean metric as a distance between objects in this feature space. We used Ward minimal
inter-cluster distance [
          <xref ref-type="bibr" rid="ref47">47</xref>
          ] as a criterion for clusters union. Ward inter-cluster distance between clusters U and V is the
following:
  ( ,  ) =
‖ ‖+‖ ‖
‖ ‖‖ ‖  2 (∑ ‖ ‖
  , ∑ ‖ ‖
  ) ,
where   and   are embedded representation vectors for objects assigned to clusters  and  respectively,  denotes
vectors, ‖ ‖ and ‖ ‖ are elements number of clusters 
and  . Hierarchical
agglomerative clustering algorithm for a set of objects   ,  = 1 …  is represented with pseudocode:
1.
        </p>
        <p>= 1, initialize the starting set  1 – the universal set of one-element clusters {{ 1},{ 2},…,{  }},
Ward inter-cluster distances calculated using (3) are equal to halved element-to-element squared
Euclidean distances,
2. for each</p>
        <p>= 2 …  repeat:</p>
        <p>search for a pair of most close clusters in terms of Ward inter-cluster distances (3):
b. unite  and  , exclude  and  from   , add the united cluster to   :
( ,  ) = argmin  ( ,  ) .</p>
        <p>, ∈ 

=</p>
        <p>∪  ,
  +1 = (  \{ ,  }) ∪  .
c. for each 
∈   +1, calculate inter-cluster distances to   ( , 
) using (3).</p>
        <p>Agglomerative hierarchical clustering procedure
with</p>
        <p>
          Ward inter-cluster distance definition (3) is the one
demonstrated most valuable results in a set of synthetic clustering problems [
          <xref ref-type="bibr" rid="ref48">48</xref>
          ]. With this procedure, the only
hyperparameter is the target clusters number  . We use the empirical “elbow rule” to choose the best clusters
number. Additionally, we considered topic-specific reasoning: we wanted the clustering method to be capable of
discriminating weak PV states of types “displacement” and “split” yet to be capable of discriminating various shifted
states that were demonstrated recently to be present [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
        </p>
        <p>(  ) =
 (  ) =</p>
        <p>1
‖  ‖</p>
        <p>1
‖ ‖−‖  ‖
∑
  ∈  , ≠  (  ,   )
∑</p>
        <p>∈ \   (  ,   ) ,
 (  ) =</p>
        <p>(  )− (  )
max( (  ), (  ))
 ( ) =
‖1‖ ∑  (  ) .
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(a)
(b)
mean distance between   and all the objects of all other clusters:</p>
        <p>We considered mean Silhouette score (hereafter 
) as a measure for clustering quality and used it for “elbow
rule”. We calculated</p>
        <p>using Euclidean metric of the hidden representation feature space ℋ . For each  -th
object of the  -th cluster   ∈   :  (  ) is the mean distance between   and all other objects of cluster   ,  (  ) is the
distance in our study. With these notations</p>
        <p>for one object   is given by:
where  is the whole dataset, ‖ ‖ is the number of its elements,   is the cluster that   is assigned to, ‖  ‖ is the
number of its elements,  (  ,   ) is the function defining the distance between   and   .  (  ,   ) is the Euclidean
and mean</p>
        <p>is given by:</p>
        <p>for each   is the measure of its similarity to the cluster   and yet its dissimilarity to all the other elements
outside   . So the higher</p>
        <p>, the more   is similar to   and the less similar to other clusters. Therefore, mean
Silhouette score (10) is considered as a measure of clustering quality with the rule “higher is better”. Even though the
maximum mean</p>
        <p>
          is achieved with two clusters, more reasoning should be involved when one is choosing the
number of clusters. First, there should be observed “split” and “displacement” SSW events. There are also should be
observed at least one strong pole-centered state, and some other shifted states which were presented recently [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. For
each number of clusters more than 3 we inspected maps of mean geopotential heights at 10hPa level. We have chosen
the minimal number of clusters that let us discriminate weak states of PV of types “displacement” and “split”. This
discrimination is observed starting from  = 7. With this reasoning, the group “L” of clustering results (see Fig. 3b)
should not be considered as an option. There is also group “N” of clustering results which are characterized by low
mean Sscore, that is, low clustering quality. The results within the “M” group are characterized by too high clusters
number or low clustering quality. In our study, the group “region of interest” is considered as a group of promising
clustering results (Fig. 3b). The numbers of clusters which produces high average  within this group are
12 and 13. In our study, we use  = 12.
        </p>
        <p>Summarizing the proposed method for clustering states of PV here is its general structure:
1. Prepare and preprocess PV states data (HGT and PVt fields),
2. Construct sparse variational convolutional autoencoder (fig. 2), train this SpCVAE on prepared PV states
dataset,
3. Apply dimensionality reduction using trained SpCVAE. Representation of the reduced dimensionality is
the encoder output for each PV state presented to SpCVAE as input example,
4. Apply hierarchical agglomerative clustering,
5. Choose the best hidden representation dimensionality  and best clusters number  based on the
metrics of examples reconstruction  , clustering quality  , stability and reproducibility of
clustering, and additional problem-specific reasoning (Fig. 3).
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results and Discussion</title>
      <p>
        We applied the approach presented in Section 2 to the dataset described in Section 2.1, “Data and preprocessing.”
PV states were clustered using  = 12 number of clusters. We calculated the map of mean geopotential heights at
10hPa level for each cluster. These maps are shown in Fig. 4. We also selectively inspected individual PV states
represented by HGT fields. Visual inspection of these examples shows that PV states grouped by the proposed
method are similar to each other within each cluster. SSW events are clearly seen in the composites in Fig. 4: cluster
2 for “split” type and clusters 1 and 3 for “displacement” type.
dates of the “split” and “displacement” SSWs obtained by Charlton and Polvani for period 1958 – 2002 [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ], and
non-classified SSWs from [
        <xref ref-type="bibr" rid="ref50">50</xref>
        ] for period 2003 - 2013 are shown. Almost all expert-defined SSW events are clearly
colocated in time with the segments of clusters 1-3. Most of “split” SSW events are co-located with the cluster 2 or
with the segment consisting of states associated with two clusters including cluster 2. The only one SSW event in
2002 missing corresponding states of clusters 1-3 is a subject for further research.
      </p>
      <p>
        Central dates of the SSWs are defined as the dates when zonal-mean zonal winds at 10hPa and 60°N fall below
zero m/s (became easterly). In Fig. 5b, we present zonal wind averaged along 60°N for each cluster. In this figure,
clusters 1-3 are distinctively characterized by winds close to zero or even easterly winds. This behavior is consistent
with the nature and definition of SSW events [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Since there was no expert-level knowledge involved during
clustering, we may consider the proposed method to be capable of objective weak vortex clustering.
      </p>
      <p>
        It is also clear that clusters 10 and 12 represent strong vortex centered on the pole, and only a slight shift is
observed for cluster 12. In Fig. 5a the frequency histogram is shown for the clusters obtained in this study. Cluster 10
is the most frequent state of PV, which is consistent with the current understanding of the nature of PV. Clusters 4-9
and 11 represent shifted PV states of different intensity, which may be estimated by the zonal mean zonal wind at
60°N shown in Fig. 5b. In accordance with the recent study [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], there are different shift directions: towards the
Atlantic (clusters 7 and 11), towards Eurasia (clusters 4-6 and 8) and North America (cluster 9).
(a)
(b)
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>We propose a sparse convolutional variational autoencoder as a dimensionality reduction tool. With this model, we
extracted valuable features of PV states initially represented by geopotential height and potential vorticity at 10hPa level.
Using the representation of reduced dimensionality, we applied the Lance-Williams hierarchical agglomerative
clustering with Ward inter-cluster distance definition. This method for the first time is capable of discriminating weak
PV states of types “displacement” and “split,” which is crucial for the analysis of the stratosphere-troposphere
interactions. The proposed method is also capable of classifying stable states of strong PV characterizing by different
directions of its center shift. This classification is found to be physically valid and consistent with recent studies. The
presented clustering method the first time provides an opportunity to analyze the influence of the strong PV
characterized by various shift directions on characteristics of tropospheric circulation.</p>
      <p>The proposed clustering method provides an opportunity of researching climatic changes of wintertime stratosphere,
which is crucial for improving seasonal forecast skill and assessing the long-term variability of the climate system.</p>
      <p>Results of this work can be used as a basis for new stratosphere-troposphere interactions research for existing and
future climate scenarios.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>We thank Joshua Studholme for helpful discussions and suggestions on improving the manuscript. We thank Japan
Meteorological Agency (JMA) for making JRA-55 data available.</p>
      <p>
        This research was supported in through computational resources provided by the Shared Facility Center “Data
Center of FEB RAS” (Khabarovsk) [
        <xref ref-type="bibr" rid="ref51">51</xref>
        ].
      </p>
      <p>This research was supported by the Russian Ministry of Science and Higher Education (agreement №
075-022018-189 (14.616.21.0102), project ID RFMEFI61618X0102).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. National Research Council Assessment of Intraseasonal to Interannual
          <source>Climate Prediction and Predictability, ISBN 978-0-309-15183-2</source>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>A.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polvani</surname>
            ,
            <given-names>L.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deser</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Separating the stratospheric and tropospheric pathways of El NiñoSouthern Oscillation teleconnections</article-title>
          .
          <source>Environmental Research Letters</source>
          ,
          <volume>9</volume>
          ,
          <fpage>024014</fpage>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Polvani</surname>
            ,
            <given-names>L.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>A.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richter</surname>
            ,
            <given-names>J.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deser</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Distinguishing stratospheric sudden warmings from ENSO as key drivers of wintertime climate variability over the North Atlantic and Eurasia</article-title>
          .
          <source>Journal of Climate</source>
          ,
          <volume>30</volume>
          ,
          <fpage>1959</fpage>
          -
          <lpage>1969</lpage>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Kirtman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pirani</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The state of the art of seasonal prediction: Outcomes and recommendations from the</article-title>
          <source>First World Climate Research Program Workshop on Seasonal Prediction. Bulletin of the American Meteorological Society</source>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dunkerton</surname>
          </string-name>
          , T.J.:
          <article-title>Propagation of the Arctic Oscillation from the stratosphere to the troposphere</article-title>
          .
          <source>Journal of Geophysical Research: Atmospheres</source>
          ,
          <volume>104</volume>
          ,
          <fpage>30937</fpage>
          -
          <lpage>30946</lpage>
          . (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dunkerton</surname>
          </string-name>
          , T.J.:
          <article-title>Stratospheric harbingers of anomalous weather regimes</article-title>
          .
          <source>Science</source>
          ,
          <volume>294</volume>
          ,
          <fpage>581</fpage>
          -
          <lpage>584</lpage>
          . (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>D.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shuckburgh</surname>
            ,
            <given-names>E.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Norton</surname>
            ,
            <given-names>W.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gillett</surname>
            ,
            <given-names>N.P.</given-names>
          </string-name>
          :
          <source>Weather from the stratosphere? Science</source>
          ,
          <volume>301</volume>
          ,
          <fpage>317</fpage>
          -
          <lpage>319</lpage>
          . (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Furtado</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tziperman</surname>
          </string-name>
          , E.:
          <article-title>Warm Arctic, cold continents: A common pattern related to Arctic sea ice melt, snow advance, and extreme winter weather</article-title>
          .
          <source>Oceanography</source>
          ,
          <volume>26</volume>
          ,
          <fpage>150</fpage>
          -
          <lpage>160</lpage>
          . (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kidston</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scaife</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hardiman</surname>
            , S.C., Mitchell,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butchart</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>L.J.:</given-names>
          </string-name>
          <article-title>Stratospheric influence on tropospheric jet streams, storm tracks and surface weather</article-title>
          .
          <source>Nature Geoscience</source>
          ,
          <volume>8</volume>
          ,
          <fpage>433</fpage>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kolstad</surname>
            ,
            <given-names>E.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breiteig</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scaife</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>The association between stratospheric weak polar vortex events and cold air outbreaks in the Northern Hemisphere</article-title>
          .
          <source>Quarterly Journal of the Royal Meteorological Society</source>
          ,
          <volume>136</volume>
          ,
          <fpage>886</fpage>
          -
          <lpage>893</lpage>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hitchcock</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simpson</surname>
            ,
            <given-names>I.R.:</given-names>
          </string-name>
          <article-title>The downward influence of stratospheric sudden warmings</article-title>
          .
          <source>Journal of the Atmospheric Sciences</source>
          ,
          <volume>71</volume>
          ,
          <fpage>3856</fpage>
          -
          <lpage>3876</lpage>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stephenson</surname>
            ,
            <given-names>D.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>D.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dunkerton</surname>
            ,
            <given-names>T.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Charlton</surname>
            ,
            <given-names>A.J.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>O</given-names>
            <surname>'neill</surname>
          </string-name>
          , A.:
          <article-title>Stratospheric memory and skill of extended-range weather forecasts</article-title>
          .
          <source>Science</source>
          ,
          <volume>301</volume>
          ,
          <fpage>636</fpage>
          -
          <lpage>640</lpage>
          . (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Scaife</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karpechko</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brookshaw</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eade</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gordon</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>MacLachlan</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martin</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dunstone</surname>
          </string-name>
          , N.:
          <article-title>Seasonal winter forecasts and the stratosphere</article-title>
          .
          <source>Atmospheric Science Letters</source>
          ,
          <volume>17</volume>
          ,
          <fpage>51</fpage>
          -
          <lpage>56</lpage>
          . (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Sigmond</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scinocca</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharin</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shepherd</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Enhanced seasonal forecast skill following stratospheric sudden warmings</article-title>
          .
          <source>Nature Geoscience</source>
          ,
          <volume>6</volume>
          ,
          <fpage>98</fpage>
          . (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scaife</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eade</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Knight</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          :
          <article-title>Seasonal to decadal prediction of the winter North Atlantic Oscillation: emerging capability and future prospects</article-title>
          .
          <source>Quarterly Journal of the Royal Meteorological Society</source>
          ,
          <volume>142</volume>
          ,
          <fpage>611</fpage>
          -
          <lpage>617</lpage>
          . (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>D.W.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baldwin</surname>
            ,
            <given-names>M.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wallace</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          :
          <article-title>Stratospheric Connection to Northern Hemisphere Wintertime Weather: Implications for Prediction</article-title>
          .
          <source>J. Climate</source>
          ,
          <volume>15</volume>
          ,
          <fpage>1421</fpage>
          -
          <lpage>1428</lpage>
          . (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>A.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seidel</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hardiman</surname>
            ,
            <given-names>S.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Butchart</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birner</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Match</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Defining sudden stratospheric warmings</article-title>
          .
          <source>Bulletin of the American Meteorological Society</source>
          ,
          <volume>96</volume>
          ,
          <fpage>1913</fpage>
          -
          <lpage>1928</lpage>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18. Cheng,
          <string-name>
            <given-names>X.</given-names>
            ,
            <surname>Wallace</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.M.</surname>
          </string-name>
          :
          <article-title>Cluster analysis of the Northern Hemisphere wintertime 500-hPa height field: Spatial patterns</article-title>
          .
          <source>Journal of the Atmospheric Sciences</source>
          ,
          <volume>50</volume>
          ,
          <fpage>2674</fpage>
          -
          <lpage>2696</lpage>
          . (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Feldstein</surname>
            ,
            <given-names>S.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Intraseasonal and interdecadal jet shifts in the Northern Hemisphere: The role of warm pool tropical convection and sea ice</article-title>
          .
          <source>Journal of Climate</source>
          ,
          <volume>27</volume>
          ,
          <fpage>6497</fpage>
          -
          <lpage>6518</lpage>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Horton</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johnson</surname>
            ,
            <given-names>N.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swain</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rajaratnam</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diffenbaugh</surname>
            ,
            <given-names>N.S.:</given-names>
          </string-name>
          <article-title>Contribution of changes in atmospheric circulation patterns to extreme temperature trends</article-title>
          .
          <source>Nature</source>
          ,
          <volume>522</volume>
          ,
          <fpage>465</fpage>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Kretschmer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coumou</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Agel</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barlow</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tziperman</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cohen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>More-persistent weak stratospheric polar vortex states linked to cold extremes</article-title>
          .
          <source>Bulletin of the American Meteorological Society</source>
          ,
          <volume>99</volume>
          ,
          <fpage>49</fpage>
          -
          <lpage>60</lpage>
          . (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feldstein</surname>
            ,
            <given-names>S.B.</given-names>
          </string-name>
          :
          <article-title>Detecting ozone-and greenhouse gas-driven wind trends with observational data</article-title>
          .
          <source>Science</source>
          ,
          <volume>339</volume>
          ,
          <fpage>563</fpage>
          -
          <lpage>567</lpage>
          . (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Kohonen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The self-organizing map</article-title>
          .
          <source>Proceedings of the IEEE</source>
          ,
          <volume>78</volume>
          ,
          <fpage>1464</fpage>
          -
          <lpage>1480</lpage>
          . (
          <year>1990</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Marc'Aurelio Ranzato</surname>
            ,
            <given-names>F.-J.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boureau</surname>
            ,
            <given-names>Y.-L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>LeCun</surname>
          </string-name>
          , Y.:
          <article-title>Unsupervised learning of invariant feature hierarchies with applications to object recognition</article-title>
          ., , Vol.
          <volume>127</volume>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Rumelhart</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinton</surname>
            ,
            <given-names>G.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williams</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          :
          <article-title>Learning representations by back-propagating errors</article-title>
          .
          <source>Nature</source>
          ,
          <volume>323</volume>
          ,
          <fpage>533</fpage>
          -
          <lpage>536</lpage>
          . (
          <year>1986</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Kobayashi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ota</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harada</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ebita</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moriya</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Onoda</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Onogi</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kamahori</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kobayashi</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Endo</surname>
          </string-name>
          , H.: The JRA-
          <volume>55</volume>
          reanalysis:
          <article-title>General specifications and basic characteristics</article-title>
          .
          <source>Journal of the Meteorological Society of Japan. Ser</source>
          . II,
          <volume>93</volume>
          ,
          <fpage>5</fpage>
          -
          <lpage>48</lpage>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Fujiwara</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wright</surname>
            ,
            <given-names>J.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manney</surname>
            ,
            <given-names>G.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>L.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Anstey</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birner</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gerber</surname>
            ,
            <given-names>E.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harvey</surname>
            ,
            <given-names>V.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hegglin</surname>
            ,
            <given-names>M.I.</given-names>
          </string-name>
          :
          <article-title>Introduction to the SPARC Reanalysis Intercomparison Project (S-RIP) and overview of the reanalysis systems</article-title>
          .
          <source>Atmospheric Chemistry and Physics</source>
          ,
          <volume>17</volume>
          ,
          <fpage>1417</fpage>
          -
          <lpage>1452</lpage>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Oquab</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bottou</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laptev</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sivic</surname>
          </string-name>
          , J.:
          <article-title>Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks</article-title>
          .
          <source>In Proceedings of the Proceedings of the IEEE Conference on Computer Vision</source>
          and Pattern Recognition, , pp.
          <fpage>1717</fpage>
          -
          <lpage>1724</lpage>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Hinton</surname>
            ,
            <given-names>G.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salakhutdinov</surname>
            ,
            <given-names>R.R.</given-names>
          </string-name>
          :
          <article-title>Reducing the dimensionality of data with neural networks</article-title>
          .
          <source>Science</source>
          ,
          <volume>313</volume>
          ,
          <fpage>504</fpage>
          -
          <lpage>507</lpage>
          . (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Nousi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tefas</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Self-supervised autoencoders for clustering and classification</article-title>
          .
          <source>Evolving Systems</source>
          (
          <year>2018</year>
          .)
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Caruana</surname>
          </string-name>
          , R.:
          <article-title>Learning Many Related Tasks at the Same Time with Backpropagation</article-title>
          .
          <source>Advances in neural information processing systems</source>
          ,
          <volume>8</volume>
          . (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Collobert</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weston</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A unified architecture for natural language processing: Deep neural networks with multitask learning</article-title>
          .
          <source>In Proceedings of the Proceedings of the 25th international conference on Machine learning, ACM</source>
          , pp.
          <fpage>160</fpage>
          -
          <lpage>167</lpage>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Dauphin</surname>
            ,
            <given-names>G.M.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Glorot</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rifai</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bengio</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goodfellow</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lavoie</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muller</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Desjardins</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>WardeFarley</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vincent</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Unsupervised and Transfer Learning Challenge: a Deep Learning Approach</article-title>
          .
          <source>In Proceedings of the Proceedings of ICML Workshop on Unsupervised and Transfer Learning</source>
          , pp.
          <fpage>97</fpage>
          -
          <lpage>110</lpage>
          . (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>S.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          :
          <article-title>A Survey on Transfer Learning</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>22</volume>
          ,
          <fpage>1345</fpage>
          -
          <lpage>1359</lpage>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Pratt</surname>
            ,
            <given-names>L.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mostow</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kamm</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kamm</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Direct Transfer of Learned Information Among Neural Networks</article-title>
          .
          <source>In Proceedings of the AAAI</source>
          , Vol.
          <volume>91</volume>
          , pp.
          <fpage>584</fpage>
          -
          <lpage>589</lpage>
          . (
          <year>1991</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Maclin</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shavlik</surname>
            ,
            <given-names>J.W.:</given-names>
          </string-name>
          <article-title>Combining the predictions of multiple classifiers: Using competitive learning to initialize neural networks</article-title>
          .
          <source>In Proceedings of the Proceedings of the 1995 International Joint Conference on AI</source>
          , Citeseer: Montreal, Quebec, Canada, pp.
          <fpage>524</fpage>
          -
          <lpage>531</lpage>
          . (
          <year>1995</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Schmidhuber</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Deep learning in neural networks: An overview</article-title>
          .
          <source>Neural networks</source>
          ,
          <volume>61</volume>
          ,
          <fpage>85</fpage>
          -
          <lpage>117</lpage>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dong</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>L.-J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fei-Fei</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Imagenet: A large-scale hierarchical image database</article-title>
          .
          <source>In Proceedings of the Computer Vision and Pattern Recognition</source>
          ,
          <year>2009</year>
          .
          <article-title>CVPR 2009</article-title>
          . IEEE Conference on, Ieee, pp.
          <fpage>248</fpage>
          -
          <lpage>255</lpage>
          . (
          <year>2009</year>
          ,
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Simonyan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zisserman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Very Deep Convolutional Networks for Large-Scale Image Recognition</article-title>
          .
          <source>arXiv:1409</source>
          .1556 [cs] (
          <year>2014</year>
          .)
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <string-name>
            <surname>Srivastava</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krizhevsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salakhutdinov</surname>
          </string-name>
          , R.:
          <article-title>Dropout: a simple way to prevent neural networks from overfitting</article-title>
          .
          <source>The Journal of Machine Learning Research</source>
          ,
          <volume>15</volume>
          ,
          <fpage>1929</fpage>
          -
          <lpage>1958</lpage>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41.
          <string-name>
            <surname>Kingma</surname>
            ,
            <given-names>D.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Welling</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Auto-encoding variational bayes</article-title>
          .
          <source>arXiv preprint arXiv:1312.6114</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          42.
          <string-name>
            <surname>Krinitskiy</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verezemskaya</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grashchenkov</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tilinina</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gulev</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lazzara</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krinitskiy</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verezemskaya</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grashchenkov</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tilinina</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , et al.
          <article-title>Deep Convolutional Neural Networks Capabilities for Binary Classification of Polar Mesocyclones in Satellite Mosaics</article-title>
          .
          <source>Atmosphere</source>
          <year>2018</year>
          ,
          <volume>9</volume>
          ,
          <fpage>426</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          43.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simoncelli</surname>
            ,
            <given-names>E.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bovik</surname>
            ,
            <given-names>A.C.</given-names>
          </string-name>
          :
          <article-title>Multiscale structural similarity for image quality assessment</article-title>
          .
          <source>In Proceedings of the The Thrity-Seventh Asilomar Conference on Signals, Systems Computers</source>
          ,
          <year>2003</year>
          , Vol.
          <volume>2</volume>
          , pp.
          <fpage>1398</fpage>
          -
          <lpage>1402</lpage>
          Vol.
          <volume>2</volume>
          . (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          44.
          <string-name>
            <surname>Milligan</surname>
            ,
            <given-names>G.W.</given-names>
          </string-name>
          :
          <article-title>Ultrametric hierarchical clustering algorithms</article-title>
          .
          <source>Psychometrika</source>
          ,
          <volume>44</volume>
          ,
          <fpage>343</fpage>
          -
          <lpage>346</lpage>
          . (
          <year>1979</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          45.
          <string-name>
            <surname>Murtagh</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Contreras</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Algorithms for hierarchical clustering: an overview</article-title>
          .
          <source>Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery</source>
          ,
          <volume>2</volume>
          ,
          <fpage>86</fpage>
          -
          <lpage>97</lpage>
          . (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          46.
          <string-name>
            <surname>Murtagh</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Contreras</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Algorithms for hierarchical clustering: an overview</article-title>
          , II.
          <source>Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery</source>
          ,
          <volume>7</volume>
          ,
          <fpage>e1219</fpage>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          47.
          <string-name>
            <surname>Jr</surname>
            ,
            <given-names>J.H.W.</given-names>
          </string-name>
          :
          <article-title>Hierarchical Grouping to Optimize an Objective Function</article-title>
          .
          <source>Journal of the American Statistical Association</source>
          ,
          <volume>58</volume>
          ,
          <fpage>236</fpage>
          -
          <lpage>244</lpage>
          . (
          <year>1963</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          48.
          <string-name>
            <surname>Mandel</surname>
            ,
            <given-names>I.D.</given-names>
          </string-name>
          :
          <article-title>Klasternyj analiz</article-title>
          .
          <source>Finansy i statistika</source>
          ,
          <volume>176</volume>
          . (
          <year>1988</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          49.
          <string-name>
            <surname>Charlton</surname>
            ,
            <given-names>A.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polvani</surname>
            ,
            <given-names>L.M.:</given-names>
          </string-name>
          <article-title>A new look at stratospheric sudden warmings. Part I: Climatology and modeling benchmarks</article-title>
          .
          <source>Journal of Climate</source>
          ,
          <volume>20</volume>
          ,
          <fpage>449</fpage>
          -
          <lpage>469</lpage>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          50.
          <string-name>
            <surname>Butler</surname>
            ,
            <given-names>A.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sjoberg</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seidel</surname>
            ,
            <given-names>D.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosenlof</surname>
            ,
            <given-names>K.H.:</given-names>
          </string-name>
          <article-title>A sudden stratospheric warming compendium</article-title>
          .
          <source>Earth System Science Data</source>
          ,
          <volume>9</volume>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          51.
          <string-name>
            <surname>Sorokin</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Makogonov</surname>
            ,
            <given-names>S.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Korolev</surname>
            ,
            <given-names>S.P.</given-names>
          </string-name>
          :
          <article-title>The Information Infrastructure for Collective Scientific Work in the Far East of Russia</article-title>
          .
          <source>Sci. Tech. Inf. Proc.</source>
          ,
          <volume>44</volume>
          ,
          <fpage>302</fpage>
          -
          <lpage>304</lpage>
          . (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>