<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Predicting Ramp Events with a Stream-based HMM framework</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Carlos A. Ferreira</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joa˜ o Gama</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>V´ıtor S. Costa</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Audun Botterud</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Argonne National Laboratory</institution>
          ,
          <addr-line>Argonne, IL</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>CRACS-INESC TEC and FC - University of Porto</institution>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>INESC TEC and FE - University of Porto</institution>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>LIAAD-INESC TEC and FEP - University of Porto</institution>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>LIAAD-INESC TEC and ISEP - Polytechnic Institute of Porto</institution>
          ,
          <country country="PT">Portugal</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Vladimiro Miranda</institution>
        </aff>
      </contrib-group>
      <fpage>28</fpage>
      <lpage>32</lpage>
      <abstract>
        <p>The motivation for this work is the study and prediction of wind ramp events occurring in a large-scale wind farm located in the US Midwest. In this paper we introduce the SHREA framework, a stream-based model that continuously learns a discrete HMM model from wind power and wind speed measurements. We use a supervised learning algorithm to learn HMM parameters from discretized data, where ramp events are HMM states and discretized wind speed data are HMM observations. The discretization of the historical data is obtained by running the SAX algorithm over the first order variations in the original signal. SHREA updates the HMM using the most recent historical data and includes a forgetting mechanism to model natural time dependence in wind patterns. To forecast ramp events we use recent wind speed forecasts and the Viterbi algorithm, that incrementally finds the most probable ramp event to occur. We compare SHREA framework against Persistence baseline in predicting ramp events occurring in very short-time horizons.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Ramping is one notable characteristic in a time series associated with
a drastic change in value in a set of consecutive time steps. Two
properties of a ramping event i.e. slope and phase error, are important
from the point of view of the System Operator (SO), with
important implications in the decisions associated with unit commitment or
generation scheduling. Unit commitment decisions must prepare the
generation schedule in order to smoothly accommodate forecasted
drastic changes in wind power availability [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. In this paper we
present SHREA a novel stream-based framework that predicts
ramping events in short term wind power forecasting.
      </p>
      <p>The development of the SHREA framework is the answer to the
three main issues available in ramp event forecasting. How can we
describe and get insights on the wind power, and wind speed,
timedependent dynamic and use this description to predict short-time
ahead ramp events? How can we combine real valued historical wind
power and speed measurements and Numerical Weather Predictions
(NWP), specially wind speed predictions, to output reliable real-time
predictions? How can we continuously adapt SHREA to
accommodate different natural weather regimes yet producing reliable
predictions?</p>
      <p>
        To answer these questions we designed a stream-based framework
that continuously learns a discrete Hidden Markov Model (HMM)
and uses it to generate predictions. To learn and update the HMM the
SHREA framework uses a supervised strategy whereas the HMM
parameters are estimated from historical data, the state transitions
probabilities are estimated from wind power measurements and the
emission probabilities, at each state, are estimated from wind speed
observations. To estimate the state probability transitions, first, we
combine a ramp filter, a derivative alike filter, and a user-defined
threshold to translate the real-valued wind power time series into
a labeled time-series, coding three different types of ramp events:
ramp-up, no-ramp and ramp-down. Then, the transitions occurring
in this labeled time series are used to estimate the transitions of the
Markov process hidden in the HMM, i.e., to model the transitions
between the three states associated with the three types of ramp events.
To learn the HMM emission probabilities, first we combine a ramp
filter and the SAX algorithm [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] to translate the wind speed
measurements signal into a string. Next we use both the wind power labeled
time series and the wind speed string to estimate the emission
probabilities at each state. The estimative is obtained by counting the string
symbols, coding wind speed variations, associated with a given state/
ramp event.
      </p>
      <p>When we analyze wind power historical data we observe both
seasonal weather regimes and short-time ahead dependence of the recent
past wind power/speed measurements. Thus, to accommodate these
issues, in SHREA we included a strategy that forgets old weather
regimes and continuously updates the HMM with the most recent
measurements, both wind power measurements and wind speed
measurements.</p>
      <p>
        To generate ramp event predictions occurring in short-time ahead
window we use the wind speed forecast, obtained from a major NWP
provider, and the current HMM. First, we run a filter over the wind
speed forecast signal to obtain a signal of wind speed variations.
Next, we run the SAX algorithm to translate the resulting real-valued
time series into a string. Then, we run the Viterbi algorithm [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] to
obtain the most likely sequence of ramp events. We could use the
Forward-Backward algorithm [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] usually used to estimate the
posterior probability but we would be using long time ahead, thus
unreliable, wind speed forecasts to predict current ramp events.
      </p>
      <p>It is important to observe that wind speed measurements and
forecasts, mainly short time horizon predictions, are approximately
equally distributed over time. Moreover, the wind power output of
each turbine is related to wind speed measurements.</p>
      <p>
        In this work we run the SHREA framework to describe and predict
very short-time ahead ramp events occurring in a large-scale wind
farm located in the US Midwest. We present a comparison against
the Persistence model that is known to be hard to beat in short-time
forecasts [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>Despite the difficulty of the ramp forecasting problem, in this work
we make the following contributions: Develop a stream-based
framework that predicts ramp events and generates both descriptive and
cost-effective models; Introduce a forgetting mechanism so that we
can learn a HMM using only the most recent weather regimes; Use
wind speed forecasts as observations of a discrete HMM to predict
short-time ahead ramp events.</p>
      <p>In the next Section we introduce the ramp event forecast problem.
In Section 3 we present a detailed description of our framework.
In Section 4 we present and discuss the obtained results. Last, we
present some conclusions and present future research directions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Ramp Event Definition and Related Work</title>
      <p>
        One of the main problems in ramp forecasting is how to define a
ramp. In fact, there is no standard definition [
        <xref ref-type="bibr" rid="ref3 ref7 ref8">7, 3, 8</xref>
        ] and almost
all existing literature report different definitions, depending, for
instance, on the location or on the farm’s size.
      </p>
      <p>
        The authors in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] define several relevant
characteristics for ramp definition, characterization and identification: to define
a ramp event, we have to determine values for its three key
characteristics: direction, duration and magnitude (see Figure 1). With
respect to direction there are two basic types of ramps: the upward
ones (or ramp-ups), and the downward ones (or ramp-downs). The
former, characterized by an increase of wind power, result from a
rapid raise of wind speeds, which might (not necessarily) be due to
low-pressure systems, low-level jets, thunderstorms, wind gusts, or
other similar weather phenomena. Downward ramps are due to a
decrease in wind power, which may occur because of a sudden
depletion of the pressure gradient, or due to very high wind speeds, that
lead wind turbines to reach cut-out limits (typically 22-25m/s) and
shut down, in order to prevent the wind turbine from damage [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In
order to consider a ramp event, the minimum duration is assumed to
be 1 hour in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], although in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] these events lie in intervals of 5 to
60 minutes. The magnitude of a ramp is typically represented by the
percentage of the wind farm’s nominal power - nameplate.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] the authors studied the sensitivity of two ramp definitions
to each one of the two parameters introduced above: ramp amplitude
ranging from 150 to 600MW and ramp duration values varying
between 5 and 60 minutes. The definition that we present and use in
this work is similar to the one described in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. It is more
appropriate to use in real operations since it does not considers a time-ahead
point to identify a ramp event.
      </p>
      <p>Definition 1 A ramp event is considered to occur at time point t, the
end of an interval, if the magnitude of the increase or decrease in the
power signal is greater than the threshold value, the Pref :
| P (t) − P (t − Δt)| &gt; Pref</p>
      <p>
        The parameter Δt is related to the ramp duration and defines the
size of the time interval considered to identify a ramp. In [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] some
results are presented that relate this parameter to the type and
magnitude of identified ramps. The Pref parameter is usually defined
according to the specific features of the wind farm site and, usually,
is defined as a percentage of the nominal wind power capacity or a
specified amount of megawatts.
      </p>
      <p>
        A comprehensive analysis of ramp modeling and prediction may
be found in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
1
2
3
4
      </p>
      <sec id="sec-2-1">
        <title>Algorithm 1: SHREA: a stream-based ramp predictor</title>
        <p>input : Three time series: PT , wind power measurements; OT , wind speed measurements; and JT , wind
Speed forecasts; a, the forecast horizon; Pref , threshold to identify ramp events; Δt, the ramp
definition parameter; W, the PAA parameter that specifies the amount of signal aggregation; σ, a
forgetting factor
output: A sequence of predictions Qrd . . . Qrd+a for each period/window d = 1, . . .
fcoor ueanchtpTeriimod/ewPindeorwidoddos ← 0; flag ← 0; Acount ← 0; Bcount ← 0;
countT imeP eriods + +
Preprocessing
Psd ← fitSpline(Pd), Osd ← fitSpline(Od), Jsd ← fitSpline(Jd)
Pfd ← rampDef(Psd, Δt), Ofd ← rampDef(Osd, Δt), Jfd ← rampDef(Jsd, Δt)
Ld ← label(Pfd , Pref ); // Label Data
Odn ← znorm(Ofd ), Jdn ← znorm(Jfd )
Osdtr ← SAX(PAA(Odn)), OFsdtr ← SAX(PAA(Jdn))
Learn Supervised HMM
π ←
(δ(Ld(r) = rampDown), δ(Ld(r) = noramp), δ(Ld(r) = rampUp))
λd(A, B, π) ← LearnHMM(Osdtr (1, . . . , r), Ld(1, . . . , r), Acount, Bcount)
Predict Ramp Events using the learned HMM
Qrd . . . Qr+a ← V iterbi(λ, OFsdtr(r + 1, . . . , r + a))</p>
        <p>d
λd(A, B, π) ← updateHMM(Osdtr(r + 1, . . .), Ld(r + 1, . . .))
Forgetting mechanism
if (countTimePeriods==σ) then</p>
        <p>Acaouuxnt ← Acount; Bcaouuxnt ← Bcount; flag ← 1
if (countTimePeriods mod σ == 0 &amp; flag==1) then</p>
        <p>Acount ← Acount − Acaouuxnt; Bcount ← Bcount − Bcaouuxnt
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Methodology developed to Forecast Ramps</title>
      <p>In this section we present SHREA framework, a stream-based
framework that uses a supervised learning strategy to obtain a HMM.
SHREA continuously learns a discrete HMM on a fixed size
nonoverlapping moving window and, at each time period, uses the
updated HMM to predict ramp events. We introduce a forgetting
mechanism to forget old wind regimes and to accommodate weather global
changes. The SHREA architecture has three main steps (see
algorithm pseudo-code in Algorithm 1): preprocessing phase, where a
ramp filter and the SAX algorithm are used to translate real valued
signals into events/strings; learning phase, where a supervised
strategy is used to learn a HMM; and prediction phase, where the Viterbi
algorithm is used to forecast ramp events. In the following lines we
describe each one of these phases.
3.1 Preprocessing In the preprocessing phase we translate the
real-valued points occurring in a given time period d, i.e. occurring
inside a non-overlapping fixed size window, into a discrete
timeseries suitable to be used at HMM learning and prediction time. First,
we fit a spline to both the wind power and wind speed measurements
time series obtaining, respectively, two new signals, Psd and Os . We
d
run the same procedure over J time series, a wind speed forecast, and
d
obtain Js . We fit splines to the original data to remove high
frequencies that can be considered noisy data. Second, we run ramp
definition one, presented above in Section 2, to filter the three smoothed
signals and obtain three new signals: Pf , Ofd and Jf . These signals
d d
are wind power and speed variations, derivative alike signals, suitable
to identify ramp events. Third, we use a user-defined power
variation threshold, the input parameter Pref value, to translate the wind
power signal Pfd into a labeled time series Ld(1, . . . , r + a), where 1
is the first point of the time window, r is the forecast launch time and
a is the time horizon. We map each wind power variation into one of
three labels/ramp events: ramp-up, ramp-down and no-ramp. These
three labels will be the three states of our HMM and the transitions
will be estimated using the points of the Ld time series.</p>
      <p>
        At this point we already have the data needed to estimate the
transitions of the Markov process hidden in the HMM process. Now we
need to transform wind speed data into a format suitable to estimate
emission probabilities of the discrete HMM that we are learning. We
combine Piecewise Aggregate Approximation (PAA) and SAX
algorithms [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] to translate the wind speed variations into symbolic time
series, more precisely. Thus, we normalize the two wind speed
signals and obtain Odn and Jdn signals. Odn will be used to estimate the
HMM emission probabilities and the Jdn will be used as the ahead
observations that will be used to predict ramp events. Next, we run the
PAA algorithm in each one of these signals to reduce complexity and,
again, obtain smoothed signals. The degree of signal compression is
the W PAA parameter that is a user-defined parameter of SHREA.
This parameter is related with time point aggregation. Next, we run
the SAX algorithm to map each PAA signal into string symbols. This
way we obtain two discrete signals Ostr and Jsdtr. After the
preprod
cessing phase we have two discrete time series, Ld and Osdtr that will
be used to learn the HMM state transitions and emissions
probabilities, respectively.
3.2 Learn a Discrete HMM Here we explain how do we learn the
HMM in the time period d, and then how we update it in time.
      </p>
      <p>
        In the HMM that we learn, compactly written λ(A, B, π), the
state transitions, the A parameter, are associated with wind power
measurements and the emissions probabilities, the B parameter, are
associated with wind speed measurements. In Figure 2 we show a
HMM learned by SHREA at the end of the 2010 winter. To estimate
these two parameters we use the ramp labels, Ld(1, . . . , r), and the
d
wind speed mesurements signals, Ostr(1, . . . , r), and run the
wellknown and straightforward supervised learning algorithm described
in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. To estimate the transition probabilities between states, the
three-way matrix A, we count the transitions between symbols
observed in Ld(1, . . . , r) and compute the marginals to estimate the
probabilities. To estimate the emission probabilities for each state,
the matrix B, we count, for each state, the observed frequency of
each symbol and then use state marginals to compute the
probabilities. This way, we obtain the maximum likelihood estimate of both
the transitions and the emission probability matrices.
      </p>
      <p>We now explain how to update the model in the time. We
design our framework to improve over the time with the arriving of
new data. At each time period d SHREA is fed with new data and
the HMM parameters are updated to include the most recent
historical data. At each time period d we update the HMM parameters by
counting the state transitions and state emissions coded in the
current vectors Osdtr(1, . . . , r) and Ld(1, . . . , r), obtaining the number
of state transitions and emissions at each HMM state, the Acount and
Bcount. Then, we compute the marginal probabilities of each matrix
and obtain the updated HMM, the model λd(Ad, Bd, πd) that will be
used to predict ramp events. The learned HMM, λd, will be used to
predict ramp events occurring between r and r + a. In the next time
period (i.e. the next fixed sized time window) we will update the λd
HMM, using this same strategy but including also the transitions and
emissions of the time period d that were not used to estimate λd, i.e.,
we update Acount and Bcount with the wind measurements of the
time period d occurring after d’s launch time and before d + 1 period
launch time, the r point. By using this strategy we continuously
update the HMM to include both the most recent data and all old data.
By using this strategy, and with the course of time, the HMM can
become less sensitive to new weather regimes. Thus we introduce a
forgetting strategy to update the HMM using only the most recent
measurements and forgetting the old data. This strategy relies on a
threshold that specifies the number of time periods to include in the HMM
estimation. This forgetting parameter, σ, is a user-defined value that
can be set by experienced wind power technicians. Considering that
at time period d we have read σ time periods and that we backup the
current counts into Acaouuxnt and Bcaouuxnt temporary matrices. After
reading 2σ time periods we will use the following forgetting
mechanism: Ac2oσunt = Acount − Acaouuxnt and Bc2oσunt = Bc2oσunt − Bcaouuxnt.</p>
      <p>2σ
Then, we reset Acaouuxnt and Bcaouuxnt equal to the updated Acount and
2σ
Bc2oσunt matrices, respectively. Next, to predict ramp events occurring
in the time periods following 2σ, we will update and use the HMM
parameters obtained from the Ac2oσunt and Bc2oσunt to forecast ramp
events. Every time we read a number of time periods that equals a
multiple of σ we apply this forgetting mechanism using the updated
auxiliary matrices.</p>
      <sec id="sec-3-1">
        <title>3.3 Predict Ramp Events using the learned HMM In this step</title>
        <p>we use the HMM learned in time period d, the λd, and the string
d
Jstr, obtained from wind speed forecasts, to predict ramp events for
the time points ranging from r to r + a. Remember that r is the
prediction launch time and a is the forecast horizon.</p>
        <p>
          To obtain the ramp event predictions we run the Viterbi
algorithm [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. We feed this algorithm with Jstr and λd and get the state
d
d d
predictions (the ramp events) Qr+1, . . . , Qr+a for the time points
r + 1, . . . , r + a of time period d. Saying it in other way we obtain
predictions for the points occurring in a non overlapping time
window starting at r and with length equal to a. We will obtain the most
likely sequence of states that best explains the observations, i.e., we
d d
will obtain a sequence of states Qr+1, . . . , Qr+a that maximizes the
probability P (Qr+1, . . . , Qr+a| Jr+1, . . . , Jrd+a, λd).
        </p>
        <p>d d d</p>
        <p>Regarding the π parameter, we introduce a non classical approach
to estimate this parameter. We defined this strategy after observing
that it is almost impossible to beat a ramp event forecaster that
predicts the ramp event occurring one step ahead to be the current
observed ramp event. Thus, we set π to be a distribution having zero
probability for all events except the event observed at launch time,
the r time point. In the pseudo code we write π ← (δ(Ld(r) ==
rampDown), δ(Ld(r) == noramp), δ(Ld(r) == rampU p),)
where δ is a Dirac delta function defined by δ(x) = 1, if x is T RU E
and δ(x) = 0, if x is F ALSE.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimental Evaluation</title>
      <p>In this section we describe the configurations, the metrics
and the results that we obtain in our experimental evaluation.
b c</p>
      <sec id="sec-4-1">
        <title>Table</title>
        <p>Costs
1:</p>
        <sec id="sec-4-1-1">
          <title>Misclassification</title>
          <p>ed down
itcd no
re up
P</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.1 Experimental Configuration Our goal is to predict ramp</title>
        <p>
          events in a large-scale wind farm located in the US Midwest. To
evaluate our system we collected historical data and, to make
predictions, use wind speed power predictions (NWP) for the time
period ranging between 3rd of June 2009 and 16th of February 2010.
Each turbine in the wind farm has a Supervisory Control and Data
Acquisition System (SCADA) that registers several parameters,
including the wind power generated by each turbine and the measured
wind speed at the turbine, the latter are 10 minute spaced point
measurements. In this work we consider a subset of turbines and
compute, for each time point, the subset mean wind power output and
the subset mean wind speed, obtaining two time series of
measurements. The wind speed power prediction for the wind farm location
was obtained from a major provider. Every day we get a wind speed
forecast with launch time at 6 am and having 24 hours horizon. The
predictions are 10 minute spaced point forecasts. In this work we
run SHREA to forecast ramp events occurring 30, 60 and 90
minutes ahead, the a parameter. We start by learning a HMM using five
days of data and then use the learned, and updated, HMM to
generate predictions for each fixed size non overlapping time window.
Moreover, we split the day in four periods and run SHREA to learn
four independent HMM models: dawn, period ranging between zero
and six hours; morning, period ranging between six to twelve hours;
afternoon, period ranging between twelve and eighteen hours; nigh,
period ranging between eighteen and midnight. The last four models
were only used to give some insight on the ramp dynamics and were
not used to make predictions. We define a ramp event to be a change
in wind power production higher than 20% of the nominal capacity,
i.e., we set the Pref threshold equal to 20% of the nominal capacity.
Moreover, we run a set of experiments by setting Δt parameter equal
to 1, 2 and 3 time points, i.e., equal to 30, 60 and 90 minutes. We run
SHREA using thirty minute signal aggregation, thus each time point
represents thirty minutes of data. In these experiments we also
consider phase error corrections. Phase errors are errors in forecasting
ramp timing [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. We identify events that occur in a timestamp, t, not
predicted at that time, but predicted instead to occur in one, or two,
time periods immediately before or after t.
        </p>
        <p>Furthermore, as SHREA is continuously updating the HMM, we
set the forgetting parameter σ = 30, i.e., each time the system reads a
new period of 30 days of data, the system forgets 30 days of old data.
The amount of forgetting used in this work results from a careful
study of the wind patterns.</p>
        <p>
          For this configuration we compute and present the Hanssen &amp;
Kuippers Skill Score (KSS) and the Skill Score (SS) [
          <xref ref-type="bibr" rid="ref1 ref6">1, 6</xref>
          ].
Moreover, we compute the expected misclassification costs (EC) using the
formula presented in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. The cost matrix presented in Table 1
defines the misclassification costs. We compare SHREA against a
Persistence baseline algorithm. Despite its simplicity, the predictions of
this model are the same as the last observation, this model is known
to be hard to beat in short-time ahead predictions [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
4.2 Results This work is twofold and here we present and
analyze both the descriptive and predictive performance of the SHREA
framework.
        </p>
        <p>In Figure 2 we present an example of HMM generated by SHREA
in February. This model was learned when running SHREA to
predict 90 minutes ahead events and setting Δt = 2. This HMM has
three states, each state is associated with one ramp type, and each
state emits six symbols, each representing a discrete bin of the
observed wind speed. The lower level of wind speed is associated with
the a character and the higher level of wind speed is associated with
the f character. The labels in the edges show the state emissions and
the state transition probabilities.</p>
        <p>
          The HMM models that we obtained in our experiments uncover
interesting ramp behaviors. If we consider all the data used in these
experiments, when we set Δt = 1 we found that there were
detected 7% more ramp-up events than ramp-down events. When we
set Δt = 3 we get the inverse behavior, we get 4% more
rampdowns than ramp-ups. This behavior is easily explained by the wind
natural dynamics that causes steepest ramp-up events and smooth
ramp-down events. If we analyze independently the four periods of
the day we can say that we have a small number of ramp events,
both ramp-ups and ramp-downs, in the afternoon. If we compute the
mean number of ramps, for all Δt parameters we get approximately
30%(15%) more ramp-up(ramp-down) events at night than in the
afternoon. Overall, we can say that we get more ramp events at night
and, in second place, at the dawn period. Moreover, we can say that
in the summer we get, both for ramp-up and ramp-down events, wind
speed distributions with higher entropy, we get approximately 85%
of the probability concentrated in two observed symbols. Different
from this behavior, in the winter we have less entropy in the wind
speed distribution associated with both types of ramp events. In the
winter we have approximately 91% of the probability distribution
concentrated in the one symbol. The emission probability
distribution of the ramp-down state is concentrated in symbol a and the
emission probability distribution in the ramp-down state is concentrated
in symbol f. These two findings are consistent with our empirical
visual analysis and other findings [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]: Large wind ramps tend to
occur in the winter and usually there is a rapid wind speed increase
followed by a more gradual wind speed decrease. These findings are
also related with the average high temperature in the summer and
with the stable temperatures registered during the afternoons.
Considering the Δt parameter, we can say that the number of ramps,
both ramp-ups and ramp-downs, increase with the Δt parameter. In
general, we observe large ramps only when we compare time points
that are 20 to 30 minutes apart.
        </p>
        <p>As is illustrated in Figure 2 we identified a large portion of
selfloops, especially ramp-up to ramp-up transitions in the winter nights.
The percentage of self-loops range between 12%, when we run
SHREA with Δt = 1, and 55% when we set Δt = 3. This self-loop
transition shows that we have a high percentage of ramp events
having a magnitude of at least 40% of the nameplate, two times the Pref
threshold. Furthermore, in the winter we get a higher proportion of
ramp-up to ramp-down and ramp-down to ramp-up transitions than
in the summer. This is especially clear at the dawn and night periods.
This phenomena can be related with the difference in the average
temperatures registered in these time periods.</p>
        <p>Before presenting the forecast performance, it must be said that
the quality of ramp forecasting depends a great deal on the quality of
meteorological forecasts. Moreover, as the HMMs represent
probability distributions it is expected that SHREA will be biased to predict
no-ramp events. Typically SHREA over predicts no-ramp events but
makes less severe errors. This biased behavior of SHREA is an
acceptable feature since it is better to forecast a no-ramp event when
we observe a ramp-down(ramp-up) event than predicting a
rampup(ramp-down) event. In real wind power operations (see Table 1)
the cost of the later error is several times larger than the former
errors.</p>
        <p>In Table 2 we present the mean (inside brackets we present the
associated standard deviation) KSS, SS and Expected Cost metrics that
we obtained when running SHREA, and the reference model, to
predict ramp events occurring in the last hundred days of the evaluation
period.</p>
        <p>Before presenting a detailed discussion of the obtained results, we
must say that, if we consider the same Δt parameter, in all
experiments we obtained better, or equal, results than the baseline
algorithm, the Persistence algorithm. Moreover, we must say that when
we generate predictions for the 30 minute horizon (one time point
ahead, since we use 30 minutes aggregation) we get the same results
as the Persistence model. This phenomena is related with the strategy
that we used to define the HMM initial state distribution. Remember
that we set the HMM π parameter equal to the last state observed.</p>
        <p>As expected, the KSS results worsen with the increase of the time
horizon. It is well known that the forecast reliability/fit worsens as
the distance from the forecast launch time increases. Moreover we
can say that we obtained better KSS values for the morning period
than in the other three periods of the day. For lack of space we do not
present a detailed description of the results that we obtain when we
run SHREA to predict ramp events occurring in each one of the four
periods of the day. This can be related with the wind speed forecasts
launch time. The wind speed forecast that we use in this work is
updated every day at 6 am.</p>
        <p>The analysis of the Δt parameter shows that the mean KSS
values increase with the increase in the Δt value. Again, this can be
explained by the wind patterns, typically the wind speed increases
smoothly during more than 30 minutes. In Table 2 we can see clearly
that SHREA performance improves with the increase in Δt
parameter. We observe the same behavior when inspecting the results that
we obtained by running the Persistence algorithm. Concerning the
SS, we can see that we obtain improvements over the Persistence
forecast that ranges between 0% and 16%.</p>
        <p>Concerning the phase error technique, we get important
improvements for the two phase error parameter values considered in this
study. The amount of improvement that we obtained by considering
the phase error can be valuable in real time operations. The
technicians can prepare the wind farm to deal with a nearby ramp event.
In Table 2 we present the results without considering the phase error
technique, phE = 0, and considering one time point (30 minutes),
phE = 1, and two time points (60 minutes), phE = 2, phase errors
corrections.</p>
        <p>We also introduce a misclassification cost analysis framework that
can be used to quantify the management decisions. We define a
misclassification cost scenario (see Table 2) and show that SHREA
produces valuable predictions. In this real scenario, SHREA
generates significant lower operational costs and better operational
performance than the baseline model.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future Work</title>
      <p>In this work we obtained some insights on the intricate mechanisms
hidden in the ramp event dynamics and obtain valuable forecasts for
very short-time horizons. For instance, we can now say that steepest
and large wind ramps tend to occur more often in the winter.
Moreover, typically there is a rapid wind speed increase followed by a
more gradual wind speed decrease. Overall, with the obtained HMM
models we both obtained insights on the wind ramp dynamics and
generate accurate predictions that prove to be cost beneficial when
compared against a Persistence forecast method.</p>
      <p>The performance of SHREA is heavily dependent on the wind
speed forecasts quality. Thus, in a near future we hope to get
special purpose NWP suitable to detect ramp events and having more
frequent daily updates. Moreover, we will study multi-variate HMM
emissions to include other NWP parameters like wind direction and
temperature.</p>
      <p>Acknowledgments: This manuscript has been created by
UChicago Argonne, LLC, Operator of Argonne National Laboratory
(“Argonne“). Argonne, a U.S. Dep. of Energy Office of Science
laboratory, is operated under Contract No. DE AC02-06CH11357. The
authors also acknowledge EDP Renewables, North America, LLC.
This work was is also funded by the ERDF - through the COMPETE
programme and by National Funds through the FCT Project KDUS.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>K.T.</given-names>
            <surname>Bradford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.L.</given-names>
            <surname>Carpenter</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Shaw</surname>
          </string-name>
          , '
          <article-title>Forecasting southern plains wind ramp events using the wrf model at 3-km'</article-title>
          ,
          <source>in AMS Student Conference</source>
          , (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C.</given-names>
            <surname>Ferreira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gama</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Miranda</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Botterud</surname>
          </string-name>
          , '
          <article-title>A survey on wind power ramp forecasting'</article-title>
          ,
          <source>in Report ANL/DIS 10-13</source>
          , Argonne National Laboratory, (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>U.</given-names>
            <surname>Focken</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Lange</surname>
          </string-name>
          , '
          <article-title>Wind power forecasting pilot project in alberta'</article-title>
          , Oldenburg, Germany: energy &amp; meteo
          <string-name>
            <surname>systems</surname>
            <given-names>GmbH</given-names>
          </string-name>
          , (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J.</given-names>
            <surname>Freedman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Markus</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Penc</surname>
          </string-name>
          , '
          <article-title>Analysis of west texas wind plant ramp-up and ramp-down events'</article-title>
          ,
          <source>in AWS Truewind</source>
          , LLC, Albany,
          <string-name>
            <surname>NY</surname>
          </string-name>
          , (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>B.</given-names>
            <surname>Greaves</surname>
          </string-name>
          , J. Collins,
          <string-name>
            <given-names>J.</given-names>
            <surname>Parkes</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Tindal</surname>
          </string-name>
          , '
          <article-title>Temporal forecast uncertainty for ramp events'</article-title>
          ,
          <source>Wind Engineering</source>
          ,
          <volume>33</volume>
          (
          <issue>11</issue>
          ),
          <fpage>309</fpage>
          -
          <lpage>319</lpage>
          , (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.W.</given-names>
            <surname>Hanssen</surname>
          </string-name>
          and
          <string-name>
            <given-names>W.J.A.</given-names>
            <surname>Kuipers</surname>
          </string-name>
          , '
          <article-title>On the relationship between the frequency of rain and various meteorological parameters'</article-title>
          , Mededelingen van de Verhandlungen,
          <volume>81</volume>
          , (
          <year>1965</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Kamath</surname>
          </string-name>
          , '
          <article-title>Understanding wind ramp events through analysis of historical data'</article-title>
          ,
          <source>in IEEE PES Transmission and Distribution Conference and Expo</source>
          , New Orleans, LA,
          <string-name>
            <surname>United</surname>
            <given-names>States</given-names>
          </string-name>
          , (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kusiak</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Zheng</surname>
          </string-name>
          , '
          <article-title>Prediction of wind farm power ramp rates: A data-mining approach'</article-title>
          ,
          <source>J. of Solar Energy Engineering</source>
          ,
          <volume>131</volume>
          , (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Keogh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lonardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Chiu</surname>
          </string-name>
          , '
          <article-title>A symbolic representation of time series, with implications for streaming algorithms'</article-title>
          ,
          <source>in 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery</source>
          , San Diego, CA, (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>C.</given-names>
            <surname>Monteiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bessa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Miranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Botterud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and G. Conzelmann, '
          <article-title>Wind power forecasting: State-of-the-art 2009'</article-title>
          ,
          <source>in Report ANL/DIS 10-1</source>
          ,
          <string-name>
            <given-names>Argonne</given-names>
            <surname>National</surname>
          </string-name>
          <string-name>
            <surname>Laboratory</surname>
          </string-name>
          , (
          <volume>209</volume>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>C. W.</given-names>
            <surname>Potter</surname>
          </string-name>
          , E. Grimit, and
          <string-name>
            <given-names>B.</given-names>
            <surname>Nijssen</surname>
          </string-name>
          , '
          <article-title>Potential benefits of a dedicated probabilistic rapid ramp event forecast tool'</article-title>
          , IEEE, (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.R.</given-names>
            <surname>Rabiner</surname>
          </string-name>
          , '
          <article-title>A tutorial on hidden markov models and selected applications in speech recognition'</article-title>
          ,
          <source>Proceedings of the IEEE</source>
          ,
          <volume>77</volume>
          (
          <issue>2</issue>
          ), (
          <year>1989</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Srinivasan</surname>
          </string-name>
          , '
          <article-title>Note on the location of optimal classifiers in ndimensional roc space'</article-title>
          ,
          <source>in Oxford University Technical Report PRGTR-2-99</source>
          , Oxford, England, (
          <year>1999</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>