=Paper=
{{Paper
|id=Vol-1649/102
|storemode=property
|title=Statistical Modelling in Climate Science 
|pdfUrl=https://ceur-ws.org/Vol-1649/102.pdf
|volume=Vol-1649
|authors=Nikola Jajcay, Milan Paluš
|dblpUrl=https://dblp.org/rec/conf/itat/JajcayP16
}}
==Statistical Modelling in Climate Science ==
<pdf width="1500px">https://ceur-ws.org/Vol-1649/102.pdf</pdf>
<pre>
ITAT 2016 Proceedings, CEUR Workshop Proceedings Vol. 1649, pp. 102–109
http://ceur-ws.org/Vol-1649, Series ISSN 1613-0073, c 2016 N. Jajcay, M. Paluš


                                        Statistical modelling in climate science

                                                      Nikola Jajcay1,2 and Milan Paluš1
        1   Dept. of Nonlinear Dynamics and Complex Systems, Institute of Computer Science, Academy of Sciences of the Czech Republic
                         2 Dept. of Atmospheric Physics, Faculty of Mathematics and Physics, Charles University in Prague


       Abstract: When it comes to modelling in atmospheric and             the model errors [3] and these are intrinsic. The problem
       climate science, the two main types of models are taken             with initial errors is usually tackled by considering an en-
       into account – dynamical and statistical models. The for-           semble of model forecasts (instead of just one realization -
       mer ones have a physical basis: they utilize discretized dif-       integration from single initial state), starting with slightly
       ferential equations with a set of conditions (boundary con-         different initial conditions. The model errors are intrinsi-
       ditions + present state as an initial condition) and model          cally connected with the exponential error growth emerg-
       the system’s state by integrating the equations forward in          ing from the chaotic behaviour related to nonlinearities in
       time. Models of this type are currently used e.g. as a nu-          discretized equations [4]. This limits the predictability of
       merical weather prediction models. The statistical models           such GCMs to 6-10 days maximum (e.g. [5]).
       are considerably different: they are not based on physical
       mechanisms underlying the dynamics of the modelled sys-
       tem, but rather derived from the analysis of past weather
       patterns. An example of such a statistical model based              1.1   Statistical models
       on the idea of linear inverse modelling, is examined for
       modelling the El Niño – Southern Oscillation phenomenon
       with a focus on modelling cross-scale interactions in the           The second kind of models used in climate science are sta-
       temporal sense. Various noise parameterizations and the             tistical models. In their design, they are considerably dif-
       possibility of using a multi-variable model is discussed            ferent than the dynamical models in the sense that they
       among other characteristics of the statistical model. The           are not based on physical mechanism underlying the dy-
       prospect of using statistical models with low complexity            namics of the modelled system, but rather derived from
       as a surrogate model for statistical testing of null hypothe-       the analysis of past weather patterns. Probably the most
       ses is also discussed.                                              used concept is that of inverse stochastic model [6], where
                                                                           the model is designed, then estimated using past data and,
                                                                           finally, stochastically integrated forward in time to obtain
       1     Modelling in climate science                                  the prediction. The disadvantages connected to this type of
                                                                           models consist of the selection of variables that capture the
       Climate models, which rely on the use of quantitative               system we are trying to model. Other possible issue could
       methods to simulate interactions in the climate system, are         be the non-stationarity of the modelled system - since the
       one of the most important tools to predict and asses future         statistical model does not involve the underlying physi-
       climate projections or to study the climate of the past. In         cal mechanisms, just the interaction between subsystems
       general, two types of models are mainly used: dynamical             (ignoring hidden variables), the model estimated on some
       models and statistical models. The base for a dynamical             subset of the past data may not correctly capture all possi-
       model is a set of discretized differential equations which          ble states of the system. In other words, the training period
       are integrated forward in time from the present state, pos-         of the past data used to estimate the statistical model may
       ing as an initial condition. The most prominent example of          not capture the full phase space of the modelled system.
       the usage of dynamical models is without doubt a general
       circulation model (GCM hereafter). It employs a mathe-                 The motivation for building a statistical model for par-
       matical model of circulation of the planetary atmosphere            ticular phenomenon, apart from its forecasting, would be
       and oceans, therefore it uses the Navier-Stokes equations           to scale down the complexity of the problem. When we
       on a rotating sphere (describing a motion of viscous fluid)         find some e.g. nonlinear interactions in the observed data,
       with thermodynamic terms for energy sources and sinks.              and we are interested in uncovering the mechanisms, con-
       The above described model is used in numerical weather              structing a models of different complexity and seeking
       prediction, to infer the reanalysis datasets of the past cli-       such interactions in them would help to expose the mech-
       mate and for future climate projections in climate model            anisms and shed some light on the problem.
       intercomparison projects CMIP3 [1] and CMIP5 [2].                      In the following sections, the inverse stochastic model
          The uncertainties of the forecast arisen from the GCM            for forecasting the El Niño - Southern Oscillation (ENSO
       models are usually classified into two types: the first one is      hereafter) phenomenon is built following [7], with the fo-
       related to the initial errors (errors in determining the “true”     cus on various noise parametrizations and possible use of
       present state of the climate), while the second one is due to       multiple variables.
Statistical Modelling in Climate Science                                                                                                 103


      Figure 1: ENSO phenomenon, its phases and mechanisms: (left) neutral, (center) positive and (right) negative. Figures
      taken from [10].


      2    Data-based ENSO model                                       (Fig. 1 right) is acting to strengthen the Walker circula-
                                                                       tion, to move the area of persistent precipitation even more
      The ENSO phenomenon exhibits strong interannual cli-             westward, the differences in surface pressure is now larger
      mate signal and has a great economic and societal impact.        and the thermocline is even more tilted. The ENSO tends
      It originates from the coupled ocean-atmosphere dynam-           to naturally oscillate between these three phases without a
      ics of the tropical Pacific [8], but has a strong influence on   distinct period (there is no distinct peak in ENSO signal’s
      circulation and air-sea interaction also outside the tropical    spectrum) and the reasons why are still largely unknown.
      belt through teleconnections associated with it [9].                The important aspect of ENSO is that its positive phase
         The ENSO phenomenon expresses itself as a sea surface         - El Niño is generally characterized by a larger magnitude
      temperature (SST hereafter) anomaly and exists in three          than its negative phase - La Niña. This statistical skewness
      distinct phases - the neutral, positive (El Niño) and nega-      is one of the indicators that, at least to some extent, the dy-
      tive (La Niña). The basic physical mechanisms for each of        namics of ENSO involves nonlinear processes [11]. At the
      the phases are depicted in Fig. 1. The normal state of the       same time, the most detailed numerical dynamical mod-
      equatorial Pacific (Fig. 1 left) is warm SST in the western      els seem to severely underestimate this nonlinearity [12],
      basin, near Australia and cold SST in the eastern basin,         hence the quality of the forecast is not satisfactory.
      near the coast of Peru. Above the warm water in the west,           From the reviews of statistical models for ENSO fore-
      the deep convection takes place, where warm and moist air        casting before 2000 [13] it was clear, that majority of mod-
      is ascending to the border of troposphere, creating an area      els were still linear, but lately the nonlinear models are
      of low atmospheric pressure and area of persistent precip-       getting more attention (e.g. [14]). In the following, we de-
      itation. From the upper part of the troposphere, the air is      scribe easy-to-interpret nonlinear model for ENSO fore-
      moving eastward and then it descends already as cold and         casting.
      dry, creating an area of high atmospheric pressure above
      the eastern equatorial Pacific. From this basin, the air is      2.1   Inverse models
      blowing westward on the surface, in agreement with the           The concept of inverse stochastic models are used as the
      trade winds, finishing the circulation loop known as the         starting point in developing the ENSO model. Let x(t) be
      Walker circulation. The easterly surface air flow triggers       the state vector of anomalies, so x(t) = X(t) − X, where
      the oceanic surface current to flow poleward, effectively        X(t) is the climate state vector (could be multi- or univari-
      removing water from the surface, thus the water needs to         ate climate observations e.g. temperature, pressure etc.
      be replaced and this is due to the upwelling, where in the       or a PCA time series from eigen-decomposition of some
      equatorial area, the water is upwelled from roughly 50 me-       climate field) and X is its time-mean. The evolution of
      ters depth to the surface. Since the thermocline (a border       anomalies could be expressed as
      between cold deep ocean and warm surface ocean) is lo-
      cated below 50 meters in the west, the upwelled water is                                ẋ = Lx + N(x)                      (1)
      warm, but in the eastern Pacific the thermocline level is
                                                                       where L is a linear operator, N represents the nonlinear
      above the 50 meters, thus the upwelled water is cold, cre-
                                                                       terms and dot denotes time derivative.
      ating the cold SST in the east and warm SST in the west.
                                                                          The simplest type of inverse models is linear inverse
         The warm phase of ENSO (Fig. 1 center) creates a warm
                                                                       models (LIM, [6]). By assuming, in eq. (1), that N(x)dx ≈
      SST anomaly in the eastern Pacific, acting to weaken the
                                                                       Tx dt + dr(0) , where T is the matrix describing linear feed-
      Walker circulation, to move the area of persistent precip-
                                                                       backs of unresolved (hidden) processes on x and dr(0) is a
      itation eastward, to diminish the difference between east-
                                                                       white-noise process, eq. (1) could be written as
      ern and western Pacific surface pressures and to level off
      the thermocline. Reversely, the negative phase of ENSO                      dx = B(0) xdt + dr(0) , B(0) = L + T.           (2)
104                                                                                                                                 N. Jajcay, M. Paluš

      The matrix B(0) and the covariance matrix of the noise                          The eqs. (4) and (5) describe a wide variety of processes
      Q ≡ hr(0) r(0)T i can be directly estimated from the ob-                     in a fashion that explicitly accounts for the modeled pro-
      served statistics of x by multiple linear regression [15].                   cess x feeding back on the noise statistics. The linear mul-
      The state vector x, or predictor-variable vector, consists of                tilevel model is obtained by assuming Ai ≡ 0 and c(0) ≡ 0
      amplitudes of corresponding principal components (PCA                        in eq. (4). Details of the methodology and further discus-
      analysis [16] yields spatial patterns - empirical orthogonal                 sion could be found in [7].
      functions and its respective time series - principal com-                       It is well known, that the extreme ENSO events tend to
      ponents), while the vector of response variables contains                    occur in boreal winter. From several ways to include this
      their tendencies ẋ.                                                         phase locking to the annual cycle, the alternative approach
                                                                                   used here is to include seasonal dependence in the dynam-
                                                                                   ical part of the first level. Namely, we assume the matrix
      2.2   Nonlinear multilevel model
                                                                                   B(0) and vector c(0) to be periodic, with period T = 12
      The assumptions of linear, stable dynamics and of additive                   months:
      white-noise used to construct LIMs are only valid to cer-                        B(0)    = B0 + Bs sin(2πt/T ) + Bc cos(2πt/T ),
      tain degree of approximation. In particular, the stochastic                        (0)
      forcing dr(0) typically involves serial correlations, and, in                     c      = c0 + cs sin(2πt/T ) + cc cos(2πt/T )       (6)
      addition, the matrices B(0) and Q obtained from the data                     In this case, the whole record is used to estimate four
      exhibit substantial dependence on the lag, that was used to                  seasonal-dependent coefficients. The model is trained
      fit them [17]. The two modifications of the basic inverse                    in the leading EOF (empirical orthogonal function)
      model, that address both nonlinearity and serial correla-                    space [16] of tropical Pacific SST anomalies. The opti-
      tions are taken into account, as in [18].                                    mal number of state-vector components and the degree of
          The first modification is obtained by assuming polyno-                   nonlinearity has to be assessed by cross-validation. The
      mial, rather than linear form of N(x) in eq. (1), in par-                    parameters in this paper were used as in [7].
      ticular, a quadratic dependence. The ith component Ni (x)
      could be written as
                                                                                 3   Results
                                            (0)        (0)
                 Ni (x) ≈ xT Ai x + ti x + ci dt + dri          (3)
                                                                                   In this section, the brief results are presented of how the
      The matrices Ai represent the blocks of a third-order ten-                   statistical model is able to simulate the ENSO signal. The
                                  (0)
      sors, while the vectors bi = li + ti are the rows of the                     skill of the model is determined in the sense of basic linear
      matrix B = L + T (as in eq. (2)). These objects, as well
                 (0)                                                               ENSO metrics such as the amplitude of the ENSO signal,
      as components of the vector c(0) , are estimated by multiple                 the seasonality (since the seasonality is important aspect
      polynomial regression [19].                                                  of ENSO dynamics) and finally, the power spectrum of
                                                                                   ENSO signal. The model is employed as described in the
         The second modification, considering the serial correla-
                                                                                   previous section, the matrices and vectors are estimated
      tions in residual forcing, is due to the multilevel structure
                                                                                   from the previous data and then the model is integrated to
      of our model. In particular, consider the ith component of
                                                                                   obtain the time series of same length as the training data.
      the first, main level of the inverse stochastic model
                                                                                   Since the model is stochastic (forced by a white noise),
                                              
                                     (0)   (0)        (0)                          we employed an ensemble of 20 members. Each member
                  dxi = xT Ai x + bi + ci dt + dri ,            (4)
                                                                                   is integrated with slightly different initial conditions and
                                                                                   these members are referred to as realizations.
      where x = {xi } is the state vector and matrices Ai , vec-
            (0)                         (0)                                           The basic ENSO metric is its amplitude, which could
      tors bi and the components ci of the vector c(0) as well                     be characterized by the standard deviation of SST anoma-
                             (0)
      as the components ri of the residual forcing vector r(0)                     lies averaged over Nino3.4 box (bounded by 5◦ S - 5◦ N
      are determined by the least squares. The additional model                    and 120◦ W - 170◦ W). In Fig. 2 we can see the ENSO
      level is added to express the known increments dr(0) as a                    amplitude as derived from the Nino3.4 index [20] (thick
      linear function of an extended state vector [x, r(0) ]. We                   black line), along with 20 realizations from the data-based
      estimate this level’s residual forcing again by the least                    ENSO model, both linear and quadratic (gold for linear,
      squares. More levels are added the same way, until the                       red for quadratic).
      Lth level’s residual, r(L+1) , becomes white in time, and its                   As can be seen, the linear model slightly overestimates
      lag-0 correlation matrix converges to constant, hence                        the ENSO amplitude, while the quadratic model slightly
                                                                                   underestimate the ENSO amplitude. From the spread of
              (0)        (1)                    (1)
            dri     =   bi [x, r(0) ]dt + ri dt,                                   the ensemble members we could infer that the model is
              (1)        (2)                          (2)                          sensitive to initial conditions and the forcing. Still, the
            dri   =     bi [x, r(0) , r(1) ]dt + ri dt,
                                                                                   ensemble averages for both models are within reasonable
              ...                                                                  distance from the data borderline, therefore in this aspect
              (L)        (L+1)                                  (L+1)
            dri     =   bi       [x, r(0) , . . . , r(L) ]dt + ri       dt   (5)   the model performs adequately.
Statistical Modelling in Climate Science                                                                                                                                105

                               0.95
                                                          ENSO amplitude                                tive models are more flat in this area of frequencies. In
                               0.90
                                                                                                        the higher frequencies (around annual frequency and less)
                                                                                                        the power spectra are in agreement. In general, the spectra
                               0.85
                                                                                                        of modelled time series could be said to copy the actual
      STD of SSTA -- Nino3.4


                               0.80                                                                     Nino3.4 time series. The power spectra were computed
                               0.75                                                                     using the Welch method [21].
                               0.70
                                                                                                                10 8
                               0.65


                               0.60
                                                                                                                10 7
                                    linearan linear
                                    linear model 1
                                    linear model 2
                                    linear model 3
                                    linear model 4
                                    linear model 5
                                    linear model 6
                                    linear model 7
                                  linearar model 98
                                  linear model 10
                                  linear model 11
                                  linear model 12
                                  linear model 13
                                  linear model 14
                                  linear model 15
                                  linear model 16
                                  linear model 17
                                  linear model 18
                                  linear model 19
                                                l 20


                                                                     quadratic mod .
                                                                     quadratic model 1
                                                                     quadratic model 2
                                                                     quadratic model 3
                                                                     quadratic model 4
                                                                     quadratic model 5
                                                                     quadratic model 6
                                                                     quadratic model 7
                                                                            tic mo el 8
                                                                    quadratic moddel 9
                                                                    quadratic model 10
                                                                    quadratic model 11
                                                                    quadratic model 12
                                                                    quadratic model 13
                                                                    quadratic model 14
                                                                    quadratic model 15
                                                                    quadratic model 16
                                                                    quadratic model 17
                                                                    quadratic model 18
                                                                          tic mo el 19
                                                                                 del 20
                                                                     quadra ean quad
                                    line model


                                         mode
                                       me


                                                                           m


                                                                    quadra
                                                                                                                10 6


                                                                                                        power
      Figure 2: ENSO amplitude as standard deviation of SST
                                                                                                                10 5
      anomalies in data (black line) and in 20 realizations of lin-
      ear (gold) and quadratic (red) model.
                                                                                                                10 4

         Other metric connected with ENSO amplitude is its sea-
      sonality. As written above, the ENSO phenomenon ex-                                                       10 3
                                                                                                                             10 -1                        10 0
      hibits seasonal changes in variance, with elevated variance                                                                         period [1/yr]
      in winter months and lower variance in spring and summer
      months. This can be also seen in Fig. 3, where the monthly
                                                                                                        Figure 4: ENSO power spectra estimated using the Welch
      variance is plotted for the data and for both models. Both
                                                                                                        method in data (black curve) and in 20 realizations of lin-
      models are capable of modelling higher variance in winter
                                                                                                        ear (gold) and quadratic (red) model. Thicker lines repre-
      months and drop in variance through spring and summer,
                                                                                                        sent the mean over 20 realizations in the respective model.
      although the difference in variance is higher in data than in
      both models. Still, the ensemble averages are reasonably
      close to the data.
                                                                                                        4         Noise parametrization in the model
                                1.2
                                                                                                        The statistical model, once estimated, is integrated for-
                                1.1
                                                                                                        ward in time and forced by a noise - usually a realization
                                1.0                                                                     of spatially correlated random process. In the most in-
                                                                                                        tuitive and basic case, the last level residuals’ covariance
                                0.9                                                                     matrix is estimated and decomposed using Cholesky fac-
        STD of SSTA


                                0.8
                                                                                                        torization yielding a lower triangular matrix R. When the
                                                                                                        model is integrated, the random realization of white noise
                                0.7                                                                     is multiplied by the matrix R, yielding spatially corre-
                                                                                                        lated white noise which is used as a random forcing in the
                                0.6
                                                                                                        model. The results for quadratic and linear ENSO models
                                0.5                                                                     from the previous section were obtained using this simple
                                                                                                        noise parametrization, and the question is whether looking
                                0.4                                                                     deeper into the residuals’ structure could aid the model’s
                                 Jan


                                       b

                                              r

                                                    r

                                                          y
                                                              Jun

                                                                    Jul

                                                                            g

                                                                                   p

                                                                                         t

                                                                                               v

                                                                                                   c


                                                                                                        performance.
                                                                                       Oc
                                            Ma

                                                  Ap


                                                                                                   De
                                                        Ma


                                                                                             No
                                       Fe


                                                                          Au

                                                                                Se


                                                                month


      Figure 3: ENSO seasonality as standard deviation per                                              4.1            Dependence on the system’s state
      month in data (black curve) and in 20 realizations of linear                                      First refinement for the noise parametrization arises from
      (gold) and quadratic (red) model. Thicker lines represent                                         the concept of modelling climate processes which exhibit
      the mean over 20 realizations in the respective model.                                            low-frequency variability (LFV). In this method, we find
                                                                                                        and select noise samples, snippets, from the past noise
         The last metric taken into account was the power spec-                                         (residuals) which have forced the system during short time
      trum of Nino3.4 time series. The spectrum for the Nino3.4                                         intervals that resemble the LFV phase just preceding the
      data and both linear and quadratic model realizations can                                         currently observed state, and then use these snippets (or in-
      be seen in Fig. 4. The main peak in data occurs at roughly                                        formation contained in them) to drive the current state into
      5 year period, but still the ensemble averages for respec-                                        the future. For full methodology and discussion, see [22].
106                                                                                                                                                        N. Jajcay, M. Paluš

         The found past noise snippets can be used in two differ-                               0.90
                                                                                                                            ENSO amplitude
      ent ways. The first one (as used in [22]) seeks various snip-
      pets from the past observations and then directly uses them
                                                                                                0.85
      to force the model as an ensemble. When e.g. we find 4


                                                                       STD of SSTA -- Nino3.4
      intervals which resemble the LFV phase, we integrate the
      model 4 times using all 4 noise snippets directly and than                                0.80

      average over them. The second version (as used in our
      study) is to find, say, 100 samples of the past noise clos-
                                                                                                0.75
      est to the current state of the system, cluster them together
      and create covariance matrix from them. Afterwards, the


                                                                                                       linlienaerar
                                                                                                       linear 1
                                                                                                       linear 2
                                                                                                       linear 3
                                                                                                       linear 4
                                                                                                       linear 5
                                                                                                       linear 6
                                                                                                       linear 7
                                                                                                       line 8
                                                                                                     linearar 9
                                                                                                     linear 10
                                                                                                     linear 11
                                                                                                     linear 12
                                                                                                     linear 13
                                                                                                     linear 14
                                                                                                     linear 15
                                                                                                     linear 16
                                                                                                     linear 17
                                                                                                     linear 18
                                                                                                     linear 19
                                                                                                                20


                                                                                                                                       conditan cond.
                                                                                                                                       conditioned 1
                                                                                                                                       conditioned 2
                                                                                                                                       conditioned 3
                                                                                                                                       conditioned 4
                                                                                                                                       conditioned 5
                                                                                                                                       conditioned 6
                                                                                                                                       conditioned 7
                                                                                                                                       con ioned 8
                                                                                                                                      conditditioned 9
                                                                                                                                      conditioned 10
                                                                                                                                      conditioned 11
                                                                                                                                      conditioned 12
                                                                                                                                      conditioned 13
                                                                                                                                      conditioned 14
                                                                                                                                      conditioned 15
                                                                                                                                      conditioned 16
                                                                                                                                      conditioned 17
                                                                                                                                      conditioned 18
                                                                                                                                      conditioned 19
                                                                                                                                                   20
      Cholesky decomposition is used to obtain the matrix R and


                                                                                                                                             ioned
                                                                                                   mean
      finally, the random white noise realization is multiplied by


                                                                                                                                         me
      the matrix R. Using this matrix, the spatial covariance of                                 1.2
      the forcing is dependent on the current state of the sys-
      tem. In both noise parametrizations, the current system                                    1.1
      state could be estimated in multiple ways: either using
      correlation of the SSA time series, or using the Euclidean                                 1.0
      distance in the subspace spanned by first few EOFs.
         As can be seen in Fig. 5, although the amplitude statis-


                                                                        STD of SSTA
                                                                                                 0.9
      tics are not substantially shifted, the transient from high-
      variance winter period to low-variance spring and summer                                   0.8

      are better captured by the later model, with noise forcing
                                                                                                 0.7
      conditioned on system’s state. The power spectra for both
      models are practically the same (not shown).
                                                                                                 0.6


      4.2   Seasonal dependence of the forcing                                                   0.5
                                                                                                  Jan


                                                                                                          b

                                                                                                                r

                                                                                                                      r

                                                                                                                            y
                                                                                                                                Jun

                                                                                                                                      Jul

                                                                                                                                              g

                                                                                                                                                     p

                                                                                                                                                           t

                                                                                                                                                                 v

                                                                                                                                                                     c
                                                                                                                                                         Oc
                                                                                                              Ma

                                                                                                                    Ap


                                                                                                                                                                     De
                                                                                                                          Ma


                                                                                                                                                               No
                                                                                                         Fe


                                                                                                                                            Au

                                                                                                                                                  Se
      Although the seasonal dependence of the model is cap-                                                                       month
      tured in model’s dynamics by fitting the seasonally depen-
      dent matrices B(0) and c(0) (recall eq. (6)), our analysis       Figure 5: ENSO amplitude (upper) and seasonality (bot-
      showed, that the last level’s residuals still exhibit season-    tom) in data (black curve) and in 20 realizations of linear
      ally dependent amplitude. To address this issue, we com-         (gold) and linear with conditioned noise on the system’s
      puted the standard deviations for each month from the last       state (red) model. Thicker lines represent the mean over
      level’s residuals, then fitted the 5 harmonics of the annual     20 realizations in the respective model.
      cycle to capture the seasonal dependence, removed this
      dependence from the residuals, then estimated covariance
      matrix and subsequently the matrix R and finally gener-           The two latter modifications bring just a slight improve-
      ated spatially correlated white noise realization which was      ments into ENSO metrics (not shown), but could have
      multiplied back by the requisite seasonal amplitude to ac-       more substantial advancements in modelling different at-
      count for the seasonally dependent amplitude of the forc-        mospheric phenomena.
      ing. The fitted harmonics of the annual cycle were selected
      as
                                                                       5                               Synchronization and causality in the
            Pi = cos(2πit/T ) + sin(2πit/T ), i = 1, . . . , 5   (7)                                   observed and modelled data
      and then regressed on the seasonally varying standard de-
      viation of the last level’s residuals.                           Better understanding of the complex dynamics of the at-
                                                                       mosphere and climate is one of the challenges for con-
                                                                       temporary science. Considering the climate system as
      4.3   Using extended covariance matrix
                                                                       a complex network of interacting subsystems [23] is a
      The last modification to the noise is to use the extended co-    new paradigm bringing new data analysis methods help-
      variance matrix instead of lag-0 covariance matrix. When         ing to detect, describe and predict atmospheric phenom-
      evaluating system’s state we do not take just the state clos-    ena [24]. A crucial step in constructing climate networks
      est to the current state of the model, but, say 5 consecu-       is inference of network links between climate subsys-
      tive months and construct the extended matrix out of this        tem [25]. Directed links determine which subsystems in-
      snippet. Then the matrix is decomposed using Cholesky            fluence other subsystems, i.e. uncover the drivers of at-
      factorization and used as a spatial correlation matrix R is      mospheric phenomena. Inference of causal relationships
      random forcing generation.                                       from climate data is an intensively developing research
Statistical Modelling in Climate Science                                                                                                   107

      field, e.g. [26, 27]. Typically, a causal relation is sought
      between different variables or modes of atmospheric vari-
      ability.
         Paluš [28] has open another view at the complexity
      of atmospheric dynamics by uncovering causal relations
      or information flow between dynamics on different time
      scales in the same variable. Recently, phase-phase and
      also phase-amplitude interactions between dynamics on
      different temporal scales were observed in the ENSO dy-
      namics (captured by the Nino3.4 index) using the ap-
      proach as in [28]. Shortly, we use the continuous wavelet
      transform to the time series for particular time scales to
                                                                           Figure 6: Phase synchronization (left) and phase-
      obtain the instantaneous phase and amplitude of the oscil-
                                                                           amplitude causality (right) in Nino3.4 time series. Shown
      latory mode as
                                                                           is the significance (over 95th percentile against 500 Fourier
                  ψ(t) =       s(t) + iŝ(t) = A(t)e(iφ (t)) ,      (8)    transform surrogates) of k-nearest neighbours estimate of
                                                                           mutual information and conditional mutual information.
                                       ŝ(t)
                  φ (t) =      arctan        ,                      (9)
                                       s(t)
                               q                                           to uncover the mechanisms of these interactions and shed
                  A(t) =          s2 (t) + ŝ2 (t).                (10)    more light onto the dynamics of ENSO in general. We
                                                                           constructed the ENSO model and repeated the above anal-
      Then the time series of phase and / or amplitude are used
                                                                           ysis to modeled ensemble of the Nino3.4 time series.
      to study the interactions. We adopt measures from infor-
      mation theory, namely mutual information and conditional
      mutual information, where the mutual information could
      be expressed as

                                                   p(x, y)
                 I(X;Y ) = ∑ ∑ p(x, y) log                 ,       (11)
                             x∈X y∈Y              p(x)p(y)

      where p(·) is the probability distribution or joint probabil-
      ity distribution and X and Y are our time series of either
      phase or amplitude derived from the ENSO SST data. Fi-
      nally, the measures we are interested in could be written
      as:                                                                  Figure 7: Phase synchronization (left) and phase-
                                                                           amplitude causality (right) in modeled Nino3.4 time series
         • phase synchronization – I(φ1 (t); φ2 (t)),                      by the data-based model. Shown is the aggregate of 5 real-
         • phase-phase       causality     –    I(φ1 (t); φ2 (t + τ) −     izations of k-nearest neighbours estimate of mutual infor-
           φ2 (t)|φ2 (t)),                                                 mation and conditional mutual information. Significance
                                                                           against 500 Fourier transform surrogate data.
         • phase-amplitude causality –                 I(φ1 (t); A2 (t +
           τ)|A2 (t), A2 (t − η), A2 (t − 2η)),                               As seen from the analysis of modelled data (Fig. 7),
                                                                           the main phase synchronization bands (annual cycle with
                                                                           quasi-biennal cycle and combination frequencies) are also
      5.1   Interactions in the data
                                                                           captured by the modelled data, while the phase - ampli-
      As can be seen from Fig. 6, in ENSO dynamics captured                tude interactions are not very well captured. This might
      by the Nino3.4 index, the synchronization of annual cycle            arise from the low complexity of the model, or the absence
      with quasi-biennal and combination frequencies (frequen-             of some nonlinear interactions in the model design (apart
      cies that arise from the interactions between annual and             from quadratic).
      the most prominent ENSO period) is observed. Also, the
      4-6 year cycle of phase in ENSO dynamics influence the
      quasi-biennal range of the amplitude time series.                    6   Modelling surrogate data with statistical
                                                                               model
      5.2   Interactions in the model
                                                                           Surrogate data (or analogous data) is a method to generate
      Our goal was to simulate the nonlinear cross-scale inter-            synthetic data set (time series) that preserve some of the
      actions in the model. This is important since it might help          statistical properties, while omitting the others. One way
108                                                                                                                      N. Jajcay, M. Paluš

      of using them, is to test statistical significance by contra-
      diction. This involves posing a null hypothesis describing
      some kind of a process and then generating an ensemble
      of surrogate data according to null hypothesis using Monte
      Carlo methods. One of the most used technique for gener-
      ating surrogate data is the Fourier transform surrogate [29]
      (FT surrogates), which preserve the linear correlations in
      the data (periodogram or spectrogram, including autocor-
      relation) of the time series, but omits any other interactions
      in them.
         As an example, consider two intertwined Lorenz sys-
      tems, where one of them drives the other. Now, using the
                                                                       Figure 8: Phase synchronization (left) and phase-
      time series in one dimension, say the x dimension from
                                                                       amplitude causality (right) in modelled Nino3.4 time se-
      both Lorenz systems, we can use some method for detect-
                                                                       ries by the data-based model. Shown is the aggregate of
      ing causality, e.g. conditional mutual information between
                                                                       5 realizations of k-nearest neighbours estimate of mutual
      the two time series of two Lorenz systems. We get the
                                                                       information and conditional mutual information. Signifi-
      value of conditional mutual information, but this is still
                                                                       cance against 500 surrogate time series created with data-
      not enough to interpret it in the means of whether there
                                                                       based model.
      is a causal relationship between them or the result arose
      by chance. For this purpose, we construct an ensemble of
      Fourier transform surrogate data (which qualitatively pre-       minimum. This way, we can get better idea of the statisti-
      serves properties of the time series, but allows no causal       cal significance of the interactions between subsystems, in
      relationship between them) and repeat the analysis using         particular the nonlinear ones, since we are testing against
      the very same method on this ensemble and finally com-           the model with just linear interactions.
      pare the value for actual data with the histogram of values
      obtained from the ensemble of surrogate data. When the           7   Conclusions
      value from the data exceeds some percentile (e.g. 95th ) of
      the surrogate data distribution, we say that the causal re-      Statistical modelling in climate science is continuously
      lationship is significant in comparison with e.g. 500 FT         getting more attention, since their usage is not limited to
      surrogates.                                                      forecast some of the phenomena of interest (like ENSO),
         When studying nonlinear cross-scale interactions in           but could also be used to infer some of the statistical prop-
      time series using the above method, the statistical test in-     erties and relationships among different subsystems. Since
      volves creating an ensemble of surrogate, synthetic time         the statistical models live in phase space of particularly re-
      series and repeat the analysis for the whole ensemble.           duced dimensionality, when we could observe the inter-
      Then we computed the percentile, where the observed in-          actions of interest, the identification of their sources will
      teractions could not arose by random chance. Of course,          become more feasible.
      one could use Fourier transform method to generate the              We showed that the statistical model with the right set-
      surrogate time series, effectively posing a null hypothe-        tings, which were selected based on careful inspection of
      sis of a linear process which has the same spectrum to           the modelled system, could generate synthetic time series
      that of an observed data. On the other hand, one can cre-        of interest, copying the desired properties of the system -
      ate a more sophisticated null hypothesis by exploiting the       both linear and nonlinear statistics. Since the stochastic-
      options of a data-based model: when one consider just a          ity is the important aspect of the data-based model, var-
      linear model, omit the dynamical seasonal dependence in          ious parametrization techniques exist to correctly model
      B(0) and c(0) terms (as in eq. (6)) and use the simplest noise   the system’s external forcing. Finally, the possibility of
      parametrisation (just consider the spatial covariance struc-     usage of the low complexity model as surrogate data was
      ture), the model will omit the nonlinear interactions and        discussed, showing advantages of usage of such technique
      could pose as a surrogate data model copying the basic           to infer statistical significance.
      statistical properties of a modelled time series. This way,         The outlook for future work combines various differ-
      the analysis would show whether the cross-scale interac-         ent paths which appeared. One direction would be fo-
      tions are arising from the seasonal dependent dynamics, or       cusing on statistical modelling itself, experimenting with
      from nonlinear (e.g. quadratic) interactions between sub-        various variable model, with input time series and their
      systems and so on.                                               preprocessing and so on and so forth. Other direction
         When comparing Fig. 6 (testing against 500 Fourier            would be connecting the statistical models with dynami-
      transform surrogates) and Fig. 8 (testing against 500 data-      cal ones, in the sense, that statistical models could be used
      based model surrogates), the significant interactions are        for parametrization of e.g. sub-grid phenomena (micro-
      virtually the same, expect in the latter, the “fluctuations”     physics of clouds, local convection etc.) in large coupled
      (or they might be false positives as well) are attenuated to     atmospheric-oceanic models.
Statistical Modelling in Climate Science                                                                                                     109

      References                                                          [19] McCullagh, P., and J. A. Nelder: Generalized Linear Mod-
                                                                              els. Chapman and Hall (1989)
                                                                          [20] Rayner N. A., D. E. Parker, E. B. Horton, C. K. Folland,
      [1] Meehl, G. A., C. Covey, T. Delworth, M. Latif, B. McA-              L. V. Alexander, D. P. Rowell, E. C. Kent and A. Kaplan:
          vaney, J. F. B. Mitchell, R. J. Stouffer, and K. E. Taylor:         Global analyses of sea surface temperature, sea ice, and
          The WCRP CMIP3 multi-model dataset: A new era in cli-               night marine air temperature since the late nineteenth cen-
          mate change research. A Bull. Amer. Meteor. Soc. 88 (2007)          tury. J. Geophys. Res. 108 (2003) 4407
          1383–1394                                                       [21] Welch, P. D.: The use of Fast Fourier Transform for the
      [2] Taylor, K.E., R.J. Stouffer and G.A. Meehl: An Overview             estimation of power spectra: A method based on time aver-
          of CMIP5 and the experiment design. A Bull. Amer. Meteor.           aging over short, modified periodograms. IEEE Trans. Audio
          Soc. 93 (2012) 485–498                                              AU-15 (1967) 70–73
      [3] Bjerknes, V.: Dynamic meteorology and hydrology, Part           [22] Chekroun, M. D., D. Kondrashov and M. Ghil: Predicting
          II. Kinematics. Gibson Bros., Carnegie Institute, New York.         stochastic systems by noise sampling, and application to the
          (1911)                                                              El Niño-Southern Oscillation. P. Natl. Acad. Sci. USA 108
      [4] Lorenz, E. N.: Deterministic nonperiodic flow. J. Atmos. Sci.       (2011) 11766–11771
          20 (1963) 130–141                                               [23] A. A. Tsonis and P. J. Roebber: The architecture of the
      [5] Van den Dool, H. M.: Long-range weather forecasts through           climate network. Physica A: Statistical Mechanics and its
          numerical and empirical methods. Dyn. Atmos. Oceans 20              Applications, 333 (2004) 497–504
          (1994) 247–270                                                  [24] S. Havlin, D. Y. Kenett, E. Ben-Jacob, A. Bunde, R. Cohen,
      [6] Penland, C.: Random forcing and forecasting using princi-           H. Hermann, J. Kantelhardt, J. Kertész, S. Kirkpatrick, J.
          pal oscillation pattern analysis. Mon. Weat. Rev. 117 (1989)        Kurths, et al.: Challenges in network science: Applications
          2165–2185                                                           to infrastructures, climate, social systems and economics.
      [7] Kravtsov, S., D. Kondrashov, and M. Ghil: Multilevel regres-        The European Physical Journal Special Topics, 214 (2012)
          sion modeling of nonlinear processes: Derivation and appli-         273–293
          cations to climate variability. J. Climate 18 (2005) 4404—      [25] M. Paluš, D. Hartman, J. Hlinka, and M. Vejmelka: Dis-
          4424                                                                cerning connectivity from dynamics in climate networks.
      [8] Philander, S. G. H.: El Niño, La Niña, and the Southern             Nonlinear Processes in Geophysics 18 (2011) 751–763
          Oscillation. Academic Press (1990)                              [26] Ebert-Uphoff and Y. Deng: Causal discovery for climate re-
      [9] Alexander, M. A., I. Bladé, M. Newman, J. R. Lanzante, N.-          search using graphical models. Journal of Climate 25 (2012)
          C. Lau, and J. D. Scott: The atmospheric bridge: The in-            5648–5665
          fluence of ENSO teleconnections on air–sea interaction over     [27] Y. Deng and I. Ebert-Uphoff: Weakening of atmospheric
          the global oceans. J. Climate 15 (2002) 2205–2231                   information flow in a warming climate in the community cli-
      [10] WIKIPEDIA . ORG:          El Niño–Southern Oscillation             mate system model. Geophysical Research Letters 41 (2014)
          https://en.wikipedia.org/wiki/El_Ni%C3%B1o%                         193–200
          E2%80%93Southern_Oscillation, downloaded June 27,               [28] Paluš, M.: Multiscale atmospheric dynamics: Cross-
          2016.                                                               frequency phase-amplitude coupling in the air temperature.
      [11] Ghil, M. and A. W. Robertson: Solving problems with                Phys. Rev. Lett. 112 (2014) 1–5
          GCMs: General circulation models and their role in the cli-     [29] Theiler, J., S. Eubank, A. Longtin, B. Galdrikian, and J.
          mate modeling hierarchy. Academic Press, (2000) 285–325             Doyne Farmer: Testing for nonlinearity in time series: The
      [12] Hannachi, A., D. B. Stephenson, and K. R. Sperber:                 method of surrogate data. Physica D 58 (1992) 77–94
          Probability-based methods for quanfifying nonlinearity in
          the ENSO. Clim. Dyn. 20 (2003) 241–256
      [13] Ghil, M. and N. Jiang: Recent forecast skill for the El Niño
          / Southern Oscillation. Geophys. Res. Lett. 25 (1998) 171-
          –174
      [14] Timmermann, A., H. U. Voss, and R. Pasmanter: Empir-
          ical dynamical system modeling of ENSO using nonlinear
          inverse techniques. J. Phys. Oceanogr. 31 (2001) 1579–1598
      [15] Wetherill, G. B.: Regression Analysis with Applications.
          Chapman and Hall (1986)
      [16] Hannachi, A., I. T. Jolliffe and D. B. Stephenson: Empirical
          orthogonal functions and related techniques in atmospheric
          science: A review. Int. J. Climatol. 27 (2007) 1119–1152
      [17] Penland, C. and M. Ghil: Forecasting Northern Hemi-
          sphere 700-mb geopotential height anomalies using empir-
          ical normal modes. Mon. Wea. Rev. 121 (1993) 2355—2372
      [18] Kondrashov, D., S. Kravtsov, A. W. Robertson and M. Ghil:
          A Hierarchy of Data-Based ENSO Models. J. Climate 18
          (2005) 4425–4444

</pre>