<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Efficient load profiling and forecasting in large electric power systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Imre Lendák</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tomáš Horváth</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Data Science and Engineering Department, Faculty of Informatics, Eötvös Loránd University</institution>
          ,
          <addr-line>Budapest</addr-line>
          ,
          <country country="HU">Hungary</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The goal of this paper is to present an efficient load forecasting algorithm for large electric power systems. It uses a combination of nearest neighbor-based load profile clustering and rule-based load forecasting. The load data was sliced into daily load curves, which were K-Means-clustered, thereby compressing data and simplifying the solution. K-Means was chosen in the proof of concept phase and will be substituted with more precise solutions later. In the forecasting phase the daily load profile is predicted based on the forecast date, day type (e.g. weekday or weekend) and historical consumption data for similar days in the past. The solution was tested on a large dataset consisting of one year-long, 5-minute measurement data in a 1900-power-line system. The solution showed excellent performance in both the training and forecast phases. It produced meaningful forecasts even when the input data contained significant amounts of anomalies. An additional advantage of the presented solution is that it can be used for medium and long-term forecasting with limited and/or missing input data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>The challenge to accurately predict the power flows in
today’s large electric power systems receives ample
attention. Numerous papers are published in specialized smart
grid journals with the promise of being able to predict
the electricity consumption of single households or their
groups. Others develop solutions which predict the power
flows in electric power transmission systems, which span
over large geographic regions, e.g. sizable parts of
continental Europe or the USA. Yet another group of scientists
works on data compression algorithms, with the intention
to lower the communication and storage costs incurred in
modern smart grids.</p>
      <p>Within this setting, we start from the idea that the flows
on the power lines in electric power transmission systems
have some form of periodicity. More specifically, we will
theorize that the configurations of these large systems does
not change frequently, and under the same load
conditions the flows will be similar on the power lines for
similar days, e.g. for Wednesdays in July the load will most
probably be very similar under the same loading
conditions. Therefore we propose a 2-phase load forecasting
algorithm, consisting of a daily load profile clustering and
a load forecasting phase.</p>
      <p>The following sections of this document contain more
detailed description of each of the above steps.
2</p>
    </sec>
    <sec id="sec-2">
      <title>State-of-the-art</title>
      <p>The body of electric load forecasting knowledge is very
large, with numerous papers published in all major
domain-specific journals and conferences. As an
extensive review of all relevant solutions would not be feasible
due to the page limits, we will only refer to those research
results, which specifically focus on time series clustering,
smart meter big data management and the combination of
solutions from these domains used in load forecasting.
2.1</p>
      <sec id="sec-2-1">
        <title>Time series analysis</title>
        <p>
          Reference [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] presents two novel time series
clustering methods, namely k-shape and k-MultiShapes (k-MS),
which rely on scalable iterative refinement procedures
based on shape-based distances (SBD). The authors claim
that their solution(s) achieve similar results to dynamic
time warping, which at a lower computational cost.
kShape is quoted as a suitable and novel solution for
creating homogeneous and well-separated clusters of time
series data. The positive characteristics of k-Shape are
domain independence, accuracy and efficiency [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ].
        </p>
        <p>
          Reference [33] describes a convolutional neural
network-based time series classification solution, in which
the time series features are automatically learned instead
of handpicking. The authors describe the process of
data preparation, filtering, and the structure of the used
network. The authors of reference claim that
semisupervision can boost time series clustering performance
[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Data compression</title>
        <p>
          Reference [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ] contains an application-oriented review of
smart meter data analysis solutions. Three main
application areas are identified, namely load analysis, load
forecasting, and load management. This is a rare reference
which addresses the data privacy and security aspect of
the analyzed solutions as well. The most important
motivation behind data compression in smart metering are
reduced congestion of communication channels used for
data transmission, storage overhead, as well as improved
data mining efficiency. Reference [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ] presents a
comprehensive study on smart meter big data compression
solutions. The authors of reference [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] present a
featurebased, load data compression method for smart metering
infrastructures. The solution is not lossless. The authors
claim it is efficient, with little reconstruction error. The
solution was validated on the Irish Smart Metering Trial
Data. The authors of reference [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] present lossless
compression algorithms for power system operational data.
        </p>
        <p>
          The authors of reference [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] use K-SVD sparse
representation technique. In the dictionary learning phase, they
decompose load profiles into linear combinations of
several partial usage patterns (PUPs). In the sparse coding
phase, a linear support vector machine (SVM) is used to
classify load profiles as residential or small and
mediumsized enterprises (SMEs). The authors claim that their
solution outperforms k-means, the discrete wavelet
transform (DWT), principal component analysis (PCA), as well
as piecewise aggregate approximation (PAA).
        </p>
        <p>
          The solution presented in reference [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] utilizes
deepstacked auto-encoders in electric load data compression
and classification.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Load classification and forecasting</title>
        <p>
          A more general review of smart meter data intelligence
is provided in references [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] and [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. The authors of
references [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ][
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] and [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ] explore state-of-the-art
machine learning approaches in load forecasting. They
review more than 50 research papers and group their
contributions into single and hybrid computational
intelligencebased approaches. They perform a qualitative analysis
based on accuracy and prove the superiority of hybrid
solutions. Various short-term load forecasting techniques
were compared as early as 1989 in reference [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ].
Various machine learning-based short-term load forecasting
techniques ranging from moving averages to deep
neural networks are addressed in references [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ][
          <xref ref-type="bibr" rid="ref5">5</xref>
          ][
          <xref ref-type="bibr" rid="ref6">6</xref>
          ][
          <xref ref-type="bibr" rid="ref8">8</xref>
          ][
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]
[
          <xref ref-type="bibr" rid="ref23">23</xref>
          ][
          <xref ref-type="bibr" rid="ref24">24</xref>
          ]. Smart meter forecasting from one minute to
oneyear horizons is presented in reference [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Electricity
price and demand forecasting is tackled by the authors in
[
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. Bus load forecasting is addressed in reference [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
Reference [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] presents a smart meter data
characterization method based on the Gaussian mixture (GM) model.
The authors claim that compared to other state-of-the-art
solutions, theirs offers significantly better fitting for
meter data. Reference [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] describes a hybrid clustering and
classification technique in short-term energy consumption
forecasting.
        </p>
        <p>
          Reference [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] proposes to use clustering in
bottomup, short-term load forecasting. The authors cluster load
curves by using wavelets to measure similarity and thereby
create super-consumer profiles. The solution was
implemented in R and is freely available. The authors of
reference [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] analyze four years of measurements represented
as time series collected at 245 HV/MV substations. They
use the stationarity property of the estimated models to
identify daily customer profiles.
        </p>
        <p>
          The authors of reference [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] analyze annual load
curves of households and create annual and weekly load
profiles. They also show how additional features of
household affect annual consumption and random variation in
household energy consumption. Reference [
          <xref ref-type="bibr" rid="ref32">32</xref>
          ] presents
an analysis of the daily consumption data of 300
residential customers in China. The authors identify four types
of monthly usage patterns and 9 abnormal users, with
significantly different electricity use patterns. They prove that
more than 80% of households have a similar monthly
electricity usage pattern.
        </p>
        <p>
          The authors of references [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ] used k-Shape for
building energy usage pattern analysis and tested their solution
on real-life data measured in ten institutional buildings.
Reference [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] goes even further, by using ML techniques
to guess the lifestyles of energy consumers based on their
consumption patterns.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Problem definition</title>
      <p>It is necessary to develop a load forecasting solution which
is capable to predict loads in extremely large, Europe-wide
electric power transmission systems consisting of
thousands of power lines. The input data will consist of
historical measured loads with a sampling rate of 5 minutes
available for at least the last 1-year period. This data will be
referred to as dynamic data, due to its frequency of change.
Due to data privacy limitation, the input data will not
contain the complete static data model of the system under
consideration. This means that there will be no data
provided to build a mathematical graph consisting of the
busbars (vertices) and power lines (edges) connecting them.</p>
      <p>It is expected that the prediction horizon will be 5
minutes ahead. The solution should be extensible and be
capable to provide acceptable mid- and long-term forecasts
(1 day or 1 week ahead) as well. Additionally, it is
necessary for the solution to handle temporary unavailability of
significant amounts of measurements, when those will not
be provided in a timely manner by one or more countries
and/or companies in the geographical area under
consideration. Optionally, the solution should be able to
incorporate weather forecast and other freely available 3rd party
data and thereby increase the accuracy of its outputs. The
relevance of such data might vary, as the extent of data
anonymization required will not allow the forecasting tool
access to the geographical location of system resources
(i.e. power lines).
4</p>
    </sec>
    <sec id="sec-4">
      <title>Solution</title>
      <p>We suppose that the power flows measured on the power
lines in large electric power systems show some level of
regularity and can be therefore classified into load
profiles. Based on this assumption, we propose to create a
hybrid forecasting solution which consists of two phases.
In the first phase the electric power flow data is clustered
into daily load profiles. In the second phase we
experiment with various forecasting algorithms to predict the
daily load profile for each power line based on historical
data. This means that instead of predicting the expected
values of power flows 5 minutes in the future, we predict
the load profile for an entire day in advance.</p>
      <p>This solution addresses most of the more complex
requirements listed above, namely it is expected that it can
handle missing data and create forecasts for multiple days
ahead. More specifically, it tolerates the absence of
significant amounts of short-term historical data and still
produces meaningful forecasts based on medium- or
longterm historical data (e.g. data older than a week or month).
Similarly, medium- or long-term (7 days or more ahead)
forecasts are feasible with this type of solution.</p>
      <p>In the following sections we propose the load profile
generation and forecasting steps.
4.1</p>
      <sec id="sec-4-1">
        <title>Load profile generation</title>
        <p>The power flow data is sliced into daily (24h) load profiles
consisting of measured flow values sampled every 5
minutes. The sliced daily load profiles are normalized. The
amplitude of each daily load profile is memorized. The
normalized daily flows are clustered, separately for each
power line, i.e. a set of representative load profiles is
calculated for each power line. For each year, month, day of
the week and power line we memorize the load profile and
amplitude.</p>
        <p>The daily load profile clustering introduces some error,
but significantly improves algorithm performance if it is
not necessary to re-calculate the centroids too often. As
the main idea is that the daily load profiles will be similar,
this should not be an issue, i.e. we should re-calculate
the representative daily load profiles relatively rarely, e.g.
once in 15 or 30 days.
4.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Load forecasting</title>
        <p>Our improved baseline load forecasting solutions is
rulebased. In the predict phase it looks up the most likely
historical load profile and amplitude for each of the power
lines. They are used to calculate the predicted load profile
for the entire prediction day and extract as many samples
as required. This means that we will predict for a whole
day, i.e. 288 future values with a 5-minute sampling
interval.</p>
        <p>Predictions spanning two calendar days are somewhat
challenging as in their case it is necessary to either select
two load profiles and stitch them together; or to select the
‘end’ and the ‘start’ of the same load profile. In the actual
solution we perform the latter, simpler solution, i.e. use a
single daily profile to cover both calendar days before and
after midnight.</p>
        <p>Load profile prediction The load profile selection is
rulebased, and it is performed in the following steps (listed by
priority):
1. if there are (past) values for the same (year, weekday,
power line) tuple within the last two months, then
select one;
2. if there are (past) values for the same (year-1, month,
weekday, power line) tuple, then select one;
3. if there are historical values for (year – (2:N), month,
weekday, power line) tuples, then select the most
likely one – N is configurable and defaults to 15; or
4. if none of the above is found, do a random load profile
selection.</p>
        <p>Amplitude calculus Amplitudes are chosen as averages
of historical values for similar days in the past. This part
of the solution is also rule-based, and its steps/choices
are very similar to the curve selection algorithm presented
above (also listed by priority of choice):
1. choose non-zero amps for the same weekday within
the last two calendar months and average them;
2. average last year’s amplitudes for the same month
and weekday combination;
3. average the values in the longer-term history up to M
years in the past, where M is configurable and
defaults to 15; or
4. choose a default amplitude, which was for simplicity
set to a (configurable) scalar.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiments</title>
      <p>The input dataset was loaded from Heterogeneous Data
Format (HDF), version 5. We used HDFView version 3
for data exploration. The training data consisted of
historical power flows for 1935 power lines over a
July-toJuly one-year period expanding over two calendar years.
The training data consisted of 3-month-long power flows
over a July-September period immediately after the
training data. The training data was split into 68 data slices
ranging in length between one hour and a couple of
dayslong. Both training and testing power flows were sampled
every 5 minutes.</p>
      <p>We implemented the solution in Python version 2. We
used the scikit-learn library for K-Means and other
necessary ML algorithms. Data visualization during
experimentation was performed with matplotlib.
5.1</p>
      <sec id="sec-5-1">
        <title>Load profiles</title>
        <p>The number of representative load profiles (i.e. centroid
count) was set to 10, which was shown to be a sufficient
during data exploration. The clustering was performed by
K-Means, which was chosen due to its efficiency. Each
load profile was a curve consisting of 288 numerical
values.</p>
        <p>We re-trained the model if a sufficiently long time
period expired since the last training performed. For
simplicity we performed full re-train is runs from scratch. This
design decision was acceptable as the clustering phase for
the 1-year period and 1935 power lines took up to 20
minutes on a personal computer with an Intel i7 CPU, 8 GB of
RAM and SSD.</p>
        <p>Example 1-week input load profiles are shown in
Figure ??. Example load profiles detected for the above
single power in its corresponding 1-year-long flow data with
K-Means and cluster number 5 are shown in Figure ??.
Note that the cluster number of five was used here only to
illustrate the clustering results in a visually pleasing
diagram. Otherwise, during most experiments cluster
number 10 was used. The introduction of these load profiles
allowed us to significantly reduce the solution space, i.e.
to replace 365 (or more if the dataset spans a longer time
period) daily curves with a small set of load profiles. Daily
loads in the historical data were essentially represented
with tuples consisting of a load profile identifier, daily
amplitude multiplier (as the curves were normalized in the
(1, 1) range), year, month and weekday. We used weekdays
as based on past experience, and the related works, we
theorized that load profiles will be quite similar in a certain
day of the week in each month of the calendar year, e.g.
customer electricity use is usually similar on each
Saturday in July if the weather is good.</p>
        <p>As explained earlier, we reduced the size of the
historical dataset by introducing the load profiles and thereby
simplified the forecasting task. The original, vast dataset
consisted of 288 daily flow measurements with a 5-minute
sampling rate, collected for 365 days and 1916 power lines
(i.e. 288x365x1916 = 201,409,920 values). With the load
profiles we reduced the dataset to a tuple consisting of
the power line identifier, date, load profile identifier and
amplitude, i.e. the multiplier with which the load
profile is multiplied to obtain the ‘original’ load
measurements. This meant that instead of 288 floating point
values for each of the day and power line combination, we
received a tuple consisting of the above four elements, i.e.
4x365x1916 = 2,797,360, which was a data reduction by
72 times, i.e. almost two orders of magnitude.</p>
        <p>We randomly selected one power line and created a plot
of the assigned load profile identifiers with K-Means over
a 60-day long period, starting with a Thursday (i.e.
weekday identifier 3). The resulting diagram can be seen in
Figure ??. The diagram covers a 60-day period during the
July-August period. We can see in the diagram that for
almost all weekends (for Saturdays 2, 9, 16, and Sundays,
3, 10, 17, etc.) the representative load profile was with ID
= 0. Additionally, we can see that load profile 4 was very
frequent for weekdays.</p>
        <p>The power flow amplitudes for the same power line and
period is shown in Figure ??. We can clearly identify an
anomalous period around day 30, similarly as in the load
profile diagram above. Such periods are usually related
to periods with different weather conditions (e.g. colder,
rainy days with less use or air-conditioning) and/or
configuration changes in the power system. In this diagram
we might identify dips in amplitudes during the weekends
(day 0 is a Thursday), but there is no other clear regularity
identifiable via visual inspection.
5.2</p>
      </sec>
      <sec id="sec-5-2">
        <title>Load forecasting</title>
        <p>We transformed the date information into a tuple of three
values, namely year, month and weekday. With this
modification, the inputs fed into the load forecasting code were
tuples representing historical daily loads in the following
format:
(line id, year, month, day of the week, profile id, amp)
We implemented the rule-based, baseline algorithm as
described above. The forecasting code expected the
following inputs:</p>
        <sec id="sec-5-2-1">
          <title>Forecast date, e.g. May 30th, 2019;</title>
        </sec>
        <sec id="sec-5-2-2">
          <title>Power line identifier.</title>
          <p>The code transformed the dates into the (year, month, day
of the week) sub-tuples and subsequently looked up the
most similar historical data as explained in section IV/B.
The load forecasting code returned two values, namely the
‘expected’ load profile identifier and amplitude for the
prediction day.</p>
          <p>Experiment I: Short-term forecasting We compared the
results of our load forecasting solution to the persistence
model, which simply predicts that the N future values will
be equal to the N historical values preceding them. The
persistence model implementation used for testing was
limited to short-term forecasts for the next N values
immediately following the last time period received in the
training data, i.e. it did not support time gaps between the
latest training data and the forecasting period.</p>
        </sec>
        <sec id="sec-5-2-3">
          <title>Length</title>
        </sec>
        <sec id="sec-5-2-4">
          <title>1 hour(s) 2 hour(s) 4 hour(s)</title>
        </sec>
        <sec id="sec-5-2-5">
          <title>Persistence model Our model 47.72 53.44</title>
          <p>We experimented with N=(12, 24, 48), i.e. a 1, 2 and
4 hour short-term prediction horizons. One execution of
our prediction code took around 30 seconds to execute,
regardless of the selected interval length. The time to
execute the same prediction task on the persistence model was
quite similar, which meant that most of the time was spent
on creating the resulting datasets.</p>
          <p>The RMSE errors calculated with the persistence and
our model are shown in Table ??. The error was
calculated between the prediction values and measured values
extracted from the adapt/test data.</p>
          <p>Experiment II: Mid-term forecasting As explained
above, the proposed solution is capable to produce
meaningful mid and long-term forecasts with limited historical
data availability. We tested the algorithm on the following
three prediction tasks:</p>
        </sec>
        <sec id="sec-5-2-6">
          <title>1 day ahead, i.e. predict loads for the next day.</title>
          <p>1 week ahead, i.e. predict the power flows for the day
one week in future compared to the last training data
item.</p>
        </sec>
        <sec id="sec-5-2-7">
          <title>1 month ahead.</title>
          <p>As the persistence model used during this research did
not have built-in support for these types of prediction
tasks, we measured the RMSE for the proposed model
only. Table ?? contains the results of our measurements.</p>
          <p>We decided to further explore the resulting predictions
(i.e. load forecasts) by visually comparing the predicted
load curves to the actual measurements received as part of
the test/adapt data. A randomly selected load flow
prediction and real daily load values are shown in Figure ??.</p>
          <p>The values predicted one week in the future (in red
color) are in a similar value range, i.e. they do not have an
inverted sign or exceedingly different amplitudes. Not
surprisingly the anomalous zero value in the (real) measured
value is not predicted by the presented load forecasting
algorithm.</p>
          <p>In Figure ?? we present the resulting daily load curve
for the same power line as in the previous example.</p>
          <p>The relatively low accuracy levels can be improved by
implementing a more accurate load profile clustering
technique, instead of the relatively coarse K-Means, with a
different distance measure. Such changes would allow the
authors to find the most relevant daily load profiles.
6</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>This paper describes a power flow prediction algorithm,
which relies on analyzing historical (power) flow
information and creating a configurable number of
representative daily load profiles for each power line. Predictions
are based on high performance look-ups – a single load
profile index is selected for the target prediction day, for
which a (daily) load profile is calculated. The algorithm is
rule-based as opposed to creating a neural network-based
or other machine learning solution. It can be tweaked
further, and one might expect from it to produce deterministic
results for the target electric power system under
consideration. It might be used as a baseline solution, against
which less deterministic, machine-learning solutions can
be compared and measured.</p>
      <p>The main advantages of the presented algorithm are its
training and prediction performance. It analyzes the
historical flow information and creates configurable numbers
of representative daily load profiles for each power line.
Predictions are based on high performance look-ups – a
single load profile index is selected for the prediction day,
a predicted (daily) flow curve is calculated for the whole
calendar day for which the prediction is initiated. The
predicted values are ’stitched’ (i.e. amplitudes are
augmented) to the actual input flows and the requested number
of predicted samples is returned. The solution can make
predictions based on (very) limited information and
handle gaps in input data. It is also capable to quickly predict
the most likely and meaningful flows for extended future
periods, i.e. instead of covering only a couple of hours
immediately after the last input (training) data received, it
is capable to predict a day, week or even longer periods
ahead. Prediction accuracy will obviously vary as a
function of the amount of historical data, i.e. if there are
historical flow values measured multiple years in the past, then
we expect to obtain higher accuracy. This was not shown
in our experiments though, as the training data available
covered only one calendar year.</p>
      <p>The solution was tested on dataset consisting of power
flows collected over a one year, July-to-July period. The
test data covered the July to September period
immediately following the training data. The system under
consideration consisted of 1916 power lines. The accuracy
of the proposed model was compared to the persistence
model. We showed that the RMSE was quite similar in
short-term forecasting tasks (1 to 4 hours) to the
persistence model. We measured the accuracy of our algorithm
in mid-term load forecasting scenarios, whose length was
set to be 1 day, 1 week, as well as 1 and 3 months in the
future.</p>
      <p>The main disadvantage of the algorithm is its reliance
on historical flow information only, i.e. it does not take
auxiliary information into consideration. Additionally, the
algorithm does not specifically cover national holidays,
which often fall on weekdays and result in weekend-like
load profiles - this missing element is relevant if the
algorithm is used for state-level forecasting, but has lower
relevance in continent-wide (e.g. Europe) load
forecasting scenarios, in which the impact of national holidays
is lower. Further tuning and optimization of the load
profile classification and curve/amplitude selection
algorithms might further improve performance and accuracy.
(2018) 861–868.
[33] Ze¸bik, M., Korytkowski, M., Angryk, R., Scherer, R.:
Convolutional Neural networks for time series classification. In
International Conference on Artificial Intelligence and Soft
Computing. Springer, Cham. (2017, June) 635–642.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Alahakoon</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Smart electricity meter data intelligence for future energy systems: A survey</article-title>
          .
          <source>IEEE Transactions on Industrial Informatics</source>
          .
          <volume>12</volume>
          (
          <issue>1</issue>
          ) (
          <year>2016</year>
          )
          <fpage>425</fpage>
          -
          <lpage>436</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Amjady</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          (
          <year>2001</year>
          ).
          <article-title>Short-term hourly load forecasting using time-series modeling with peak load estimation capability</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          ,
          <volume>16</volume>
          (
          <issue>3</issue>
          ),
          <fpage>498</fpage>
          -
          <lpage>505</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Amjady</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          :
          <article-title>Short-term bus load forecasting of power systems by a new hybrid method</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          .
          <volume>22</volume>
          (
          <issue>1</issue>
          ) (
          <year>2007</year>
          )
          <fpage>333</fpage>
          -
          <lpage>341</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Auder</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cugliari</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goude</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Poggi</surname>
            ,
            <given-names>J. M.:</given-names>
          </string-name>
          <article-title>Scalable clustering of individual electrical curves for profiling and bottom-up forecasting</article-title>
          .
          <source>Energies</source>
          .
          <volume>11</volume>
          (
          <issue>7</issue>
          ) (
          <year>2018</year>
          )
          <year>1893</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Shortterm load forecasting with deep residual networks</article-title>
          .
          <source>IEEE Transactions on Smart Grid</source>
          . (
          <year>2018</year>
          )
          <fpage>1</fpage>
          -
          <lpage>1</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J. F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>W. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>C. M.:</given-names>
          </string-name>
          <article-title>Analysis of an adaptive time-series autoregressive moving-average (ARMA) model for short-term load forecasting</article-title>
          .
          <source>Electric Power Systems Research</source>
          .
          <volume>34</volume>
          (
          <issue>3</issue>
          ) (
          <year>1995</year>
          )
          <fpage>187</fpage>
          -
          <lpage>196</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Dau</surname>
            ,
            <given-names>H. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Begum</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keogh</surname>
          </string-name>
          , E.:
          <article-title>Semi-supervision dramatically improves time series clustering under dynamic time warping</article-title>
          .
          <source>In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management</source>
          . (
          <year>2016</year>
          , October)
          <fpage>999</fpage>
          -
          <lpage>1008</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benoit</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foggia</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bésanger</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wurtz</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Neural network-based model design for short-term load forecast in distribution systems</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          .
          <volume>31</volume>
          (
          <issue>1</issue>
          ) (
          <year>2016</year>
          )
          <fpage>72</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Espinoza</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joye</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Belmans</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Moor</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Shortterm load forecasting, profile identification, and customer segmentation: a methodology based on periodic time series</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          .
          <volume>20</volume>
          (
          <issue>3</issue>
          ) (
          <year>2005</year>
          )
          <fpage>1622</fpage>
          -
          <lpage>1630</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Fallah</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shojafar</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Conti</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shamshirband</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Computational intelligence approaches for energy load forecasting in smart energy management grids: state of the art, future challenges, and research directions</article-title>
          .
          <source>Energies</source>
          .
          <volume>11</volume>
          (
          <issue>3</issue>
          ) (
          <year>2018</year>
          )
          <fpage>596</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Hagan</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Behr</surname>
            ,
            <given-names>S. M.:</given-names>
          </string-name>
          <article-title>The time series approach to short term load forecasting</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          .
          <volume>2</volume>
          (
          <issue>3</issue>
          ) (
          <year>1987</year>
          )
          <fpage>785</fpage>
          -
          <lpage>791</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ye</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Electric Load Data Compression and Classification Based on Deep Stacked Auto-Encoders</article-title>
          .
          <year>Energies</year>
          .
          <volume>12</volume>
          (
          <issue>4</issue>
          ) (
          <year>2019</year>
          )
          <fpage>653</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Kuusela</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Norros</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reittu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piira</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Hierarchical Multiplicative Model for Characterizing Residential Electricity Consumption</article-title>
          .
          <source>Journal of Energy Engineering</source>
          .
          <volume>144</volume>
          (
          <issue>3</issue>
          ) (
          <year>2018</year>
          )
          <fpage>04018023</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Kwac</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flora</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rajagopal</surname>
          </string-name>
          , R.:
          <article-title>Lifestyle segmentation based on energy consumption data</article-title>
          .
          <source>IEEE Transactions on Smart Grid</source>
          .
          <volume>9</volume>
          (
          <issue>4</issue>
          ) (
          <year>2018</year>
          )
          <fpage>2409</fpage>
          -
          <lpage>2418</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Golab</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Golab</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ilyas</surname>
            ,
            <given-names>I. F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Smart meter data analytics: Systems, algorithms, and benchmarking</article-title>
          .
          <source>ACM Transactions on Database Systems (TODS)</source>
          .
          <volume>42</volume>
          (
          <issue>1</issue>
          ) (
          <year>2017</year>
          )
          <article-title>2</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Massidda</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marrocu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Smart meter forecasting from one minute to one year horizons</article-title>
          .
          <source>Energies</source>
          .
          <volume>11</volume>
          (
          <issue>12</issue>
          ) (
          <year>2018</year>
          )
          <fpage>3520</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Moghram</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rahman</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>Analysis and evaluation of five short-term load forecasting techniques</article-title>
          .
          <source>IEEE Transactions on power systems. 4</source>
          (
          <issue>4</issue>
          ) (
          <year>1989</year>
          )
          <fpage>1484</fpage>
          -
          <lpage>1491</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Motamedi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zareipour</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosehart</surname>
          </string-name>
          , W. D.:
          <article-title>Electricity price and demand forecasting in smart grids</article-title>
          .
          <source>IEEE Transactions on Smart Grid</source>
          .
          <volume>3</volume>
          (
          <issue>2</issue>
          ) (
          <year>2012</year>
          )
          <fpage>664</fpage>
          -
          <lpage>674</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>Paparrizos</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gravano</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Fast and accurate timeseries clustering</article-title>
          .
          <source>ACM Transactions on Database Systems (TODS)</source>
          .
          <volume>42</volume>
          (
          <issue>2</issue>
          ) (
          <year>2017</year>
          )
          <article-title>8</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <surname>Paparrizos</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gravano</surname>
          </string-name>
          , L.:
          <article-title>k-shape: Efficient and accurate clustering of time series</article-title>
          .
          <source>In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data</source>
          . (
          <year>2015</year>
          , May)
          <fpage>1855</fpage>
          -
          <lpage>1870</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Raza</surname>
            ,
            <given-names>M. Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khosravi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A review on artificial intelligence based load demand forecasting techniques for smart grid and buildings</article-title>
          .
          <source>Renewable and Sustainable Energy Reviews</source>
          .
          <volume>50</volume>
          (
          <year>2015</year>
          )
          <fpage>1352</fpage>
          -
          <lpage>1372</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>Sarkar</surname>
            ,
            <given-names>S. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kundu</surname>
            ,
            <given-names>P. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sarkar</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Development of lossless compression algorithms for power system operational data</article-title>
          .
          <source>IET Generation</source>
          , Transmission Distribution.
          <volume>12</volume>
          (
          <issue>17</issue>
          ) (
          <year>2018</year>
          )
          <fpage>4045</fpage>
          -
          <lpage>4052</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luh</surname>
            ,
            <given-names>P. B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheung</surname>
            ,
            <given-names>K. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guan</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Michel</surname>
            ,
            <given-names>L. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venkata</surname>
            ,
            <given-names>S. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>M. T.</given-names>
          </string-name>
          :
          <article-title>An efficient approach to short-term load forecasting at the distribution level</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          .
          <volume>31</volume>
          (
          <issue>4</issue>
          ) (
          <year>2016</year>
          )
          <fpage>2526</fpage>
          -
          <lpage>2537</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>Taylor</surname>
            ,
            <given-names>J. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McSharry</surname>
            ,
            <given-names>P. E.</given-names>
          </string-name>
          :
          <article-title>Short-term load forecasting methods: An evaluation based on european data</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          .
          <volume>22</volume>
          (
          <issue>4</issue>
          ) (
          <year>2007</year>
          )
          <fpage>2213</fpage>
          -
          <lpage>2219</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>Tong</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xia</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          :
          <article-title>Smart metering load data compression based on load feature identification</article-title>
          .
          <source>IEEE Transactions on Smart Grid</source>
          .
          <volume>7</volume>
          (
          <issue>5</issue>
          ) (
          <year>2016</year>
          )
          <fpage>2414</fpage>
          -
          <lpage>2422</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <surname>Torabi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hashemi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saybani</surname>
            ,
            <given-names>M. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shamshirband</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mosavi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A Hybrid clustering and classification technique for forecasting short-term energy consumption</article-title>
          .
          <source>Environmental Progress Sustainable Energy</source>
          .
          <volume>38</volume>
          (
          <issue>1</issue>
          ) (
          <year>2019</year>
          )
          <fpage>66</fpage>
          -
          <lpage>76</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <surname>Tripathi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>An efficient data characterization and reduction scheme for smart metering infrastructure</article-title>
          .
          <source>IEEE Transactions on Industrial Informatics</source>
          .
          <volume>14</volume>
          (
          <issue>10</issue>
          ) (
          <year>2018</year>
          )
          <fpage>4300</fpage>
          -
          <lpage>4308</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xia</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Sparse and redundant representation-based smart meter data compression and pattern extraction</article-title>
          .
          <source>IEEE Transactions on Power Systems</source>
          .
          <volume>32</volume>
          (
          <issue>3</issue>
          ) (
          <year>2017</year>
          )
          <fpage>2142</fpage>
          -
          <lpage>2151</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hong</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Review of smart meter data analytics: Applications, methodologies, and challenges</article-title>
          .
          <source>IEEE Transactions on Smart Grid</source>
          .
          <volume>10</volume>
          (
          <issue>3</issue>
          ) (
          <year>2019</year>
          )
          <fpage>3125</fpage>
          -
          <lpage>3148</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <surname>Wen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Compression of smart meter big data: A survey</article-title>
          .
          <source>Renewable and Sustainable Energy Reviews</source>
          .
          <volume>91</volume>
          (
          <year>2018</year>
          )
          <fpage>59</fpage>
          -
          <lpage>69</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ning</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deb</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheong</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S. E.</given-names>
          </string-name>
          , ...
          <string-name>
            <surname>Tham</surname>
          </string-name>
          , K. W.:
          <article-title>k-Shape clustering algorithm for building energy usage patterns analysis and forecasting model accuracy improvement</article-title>
          .
          <source>Energy and Buildings</source>
          .
          <volume>146</volume>
          (
          <year>2017</year>
          )
          <fpage>27</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ren</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Identifying household electricity consumption patterns: A case study of Kunshan, China</article-title>
          .
          <source>Renewable and Sustainable Energy Reviews. 91</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>