<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Truly Scalable Data Series Similarity Search</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Karima Echihabi Supervised by: Prof. Themis Palpanas and Prof. Houda Benbrahim IRDA, Rabat IT Center</institution>
          ,
          <addr-line>ENSIAS, Mohammed V Univ</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Data series are widely used in numerous domains. Analyzing this data is important for a variety of real-world applications and has been extensively studied over the past 25 years. At the core of the analysis task lies a classic algorithm called similarity search. A number of approaches have been proposed to support similarity search over massive data series collections. The results of two comprehensive data series experimental evaluations form the foundations of our future work, which will lead to the development of a novel index that can eciently support both exact and approximate data series similarity search, as well as progressive query answering with bound guarantees. Based on the insights gained from the exhaustive study of the related work, the new index will surpass the state-the-art approximate and exact techniques both in performance and accuracy.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        A data series is a sequence of ordered real values1. Data
series are omnipresent in various domains from science and
engineering to business and medicine [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]. The
proliferation of IoT technologies is also heavily contributing to the
explosive growth of data series collections to the order of
terabytes [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]. A common subroutine to data series
analytical tasks is a classic algorithm called similarity search.
Therefore the research community has extensively studied
the development of ecient similarity search algorithms for
data series. The similarity search algorithm for data series
returns the set of candidate data series in a collection that
is similar to a given query series. This algorithm is often
reduced to the nearest neighbor problem where data series
are represented as data points in multidimensional space and
their (dis)similarity is evaluated using a distance function.
1The order attribute can be angle, mass, frequency, position,
or time [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ]. When the order is based on time, the sequence
is called a time series. The terms data series, time series
and sequence are used interchangeably.
      </p>
      <p>Proceedings of the VLDB 2019 PhD Workshop, August 26th, 2019. Los
Angeles, California. Copyright (C) 2019 for this paper by its authors.
Copying permitted for private and academic purposes.</p>
      <p>Although a data series can be represented as a vector
in high dimensional space, conventional vector-based
approaches are not adapted for two reasons: (a) they cannot
scale to thousands of dimensions; and (b) they do not exploit
the correlation among dimensions typical for data series.</p>
      <p>Similarity search methods can either return exact or
approximate answers. Exact methods are costly while
approximate methods o↵er better eciency at the expense of losing
some accuracy. We call approximate the methods that do
not provide any guarantees on the results ng-approximate,
and those that provide guarantees on the approximation
error, -✏-approximate methods, where ✏ is the approximation
error and , the probability that ✏ will not be exceeded.</p>
      <p>
        A plethora of similarity search methods have been
published by the community including techniques designed for
generic vectors [
        <xref ref-type="bibr" rid="ref14 ref20 ref23 ref24 ref25 ref36 ref44 ref46 ref5 ref7">24, 23, 7, 14, 46, 20, 25, 44, 5, 36</xref>
        ] and those
specific to data series [
        <xref ref-type="bibr" rid="ref10 ref15 ref27 ref3 ref30 ref33 ref34 ref40 ref41 ref42 ref43 ref45 ref47 ref49 ref9">3, 41, 27, 42, 47, 34, 33, 40, 15, 9, 43,
45, 10, 49, 30</xref>
        ].
      </p>
      <p>
        This work aims to propose a novel index that will support
progressive query answering with probabilistic guarantees.
We describe the related work, succinctly report the results of
our extensive experimental evaluation of exact methods [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ],
and give a glimpse of some very interesting results from an
ongoing experimental study focused on approximate
methods [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Finally, we present our future work directions.
      </p>
      <p>
        In our work, we focus on the problem of whole matching
similarity search in collections with a very large number of
data series, i.e., similarity search that produces approximate
or exact results, by calculating distances on the whole (not
a sub-) sequence. This is a very popular problem that lies
at the core of several other algorithms, and is important for
many applications in various domains in the real world [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
2.
      </p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORK</title>
      <p>Similarity search involves finding a set of data series in
a collection that are similar to a query according to some
definition of sameness. A common abstraction is to consider
the query and candidate data series as points in a metric
space and evaluating the sameness (or di↵erence) using the
euclidean distance.</p>
      <p>
        To develop ecient similarity search algorithms on
massive datasets, two major costs need to be optimized: the cost
of accessing data on disk (I/O) and the cost of comparing
the query to candidates (CPU cost). Typically, the first cost
is reduced by using summarization techniques that map the
high-dimensional data to a lower-dimensional space, while
the second cost is optimized with sophisticated data
structures and search algorithms. Several similarity search
methods have been proposed in the literature supporting either
exact search [
        <xref ref-type="bibr" rid="ref20 ref23 ref27 ref42 ref46 ref7">23, 7, 46, 20, 27, 42</xref>
        ], approximate search [
        <xref ref-type="bibr" rid="ref24 ref25 ref36 ref44 ref5">24,
25, 44, 5, 36</xref>
        ], or both [
        <xref ref-type="bibr" rid="ref10 ref14 ref30 ref34 ref43 ref45 ref47 ref49">14, 43, 45, 10, 49, 47, 30, 34</xref>
        ].
      </p>
      <p>
        In the following section, we provide a succinct
description of the state-of-the-art similarity search methods and
the summarizations techniques on which they are based.
1. Summarization Techniques. The Discrete Fourier
Transform (DFT) [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] decomposes a data series into
frequency coecients, a subset of which represents a
summarization of the data series.
      </p>
      <p>
        The Discrete Haar Wavelet Transform (DHWT) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
transforms a data series using Haar wavelets into a
hierarchical representation.
      </p>
      <p>
        Random projections map the raw high dimensional data
into a lower dimensional space using a random matrix while
preserving pairwise distances within a distortion
threshold [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ].
      </p>
      <p>
        The Piecewise Aggregate Approximation (PAA) [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] and
Adaptive Piecewise Constant Approximation (APCA) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]
techniques divide a data series into segments (of equal and
arbitrary length, respectively) and approximate each
segment with the mean of the points that belong to it. The
Extended Adaptive Piecewise Approximation (EAPCA) [
        <xref ref-type="bibr" rid="ref45">45</xref>
        ]
method enhance APCA by approximating each segment
with the standard deviation in addition to the mean.
      </p>
      <p>
        Symbolic Aggregate Approximation (SAX) [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] first
approximates data series using PAA, then discretizes the PAA
values into a compact binary representation.
      </p>
      <p>
        Symbolic Fourier Approximation (SFA) [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ] first
transforms a data series into DFT coecients, which are then
approximated using a succinct symbolic approximation.
      </p>
      <p>
        Optimized Product Quantization (OPQ) [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] applies a
linear transformation on the data series to decorrelate it, then
applies on it a product quantizer [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ].
2. Exact Similarity Search Methods. Below, we briefly
describe algorithms that produce exact results.
      </p>
      <p>
        The R*-tree [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] is a height-balanced spatial index that
organizes data into a hierarchy of nested overlapping
rectangles. Search returns all entries in leaves whose rectangle
contains the query.
      </p>
      <p>
        The M-tree [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] is a multidimensional index for metric
space which partitions the data using hyper-spheres based
on their relative distances. The search algorithm uses the
triangular inequality to prune data.
      </p>
      <p>
        The VA+file [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] is an improvement of the VA-file [
        <xref ref-type="bibr" rid="ref46">46</xref>
        ].
It creates a filter file containing summarizations of the raw
data. Search uses the filter file to prune candidates based
on a lower bounding distance. For eciency reasons, we
modified the VA+file to use the DFT transform instead of
the Karhunen–Lo`eve transform (KLT).
      </p>
      <p>
        Stepwise [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] represents the data space in a multi-level
hierarchical representation using DHWT summarizations.
Search transforms a query into DHWT and filters out
candidate based on upper and lower bounding distances.
      </p>
      <p>
        The SFA method [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ] builds a trie with SFA
summarizations of the data. During query answering, a lower bounding
distance is used to prune out candidates.
      </p>
      <p>
        The UCR Suite [
        <xref ref-type="bibr" rid="ref42">42</xref>
        ] is an optimized sequential scan
algorithm for exact matching that we consider as a baseline for
performance comparisons.
      </p>
      <p>
        The DSTree [
        <xref ref-type="bibr" rid="ref45">45</xref>
        ] index dynamically segments data using
EAPCA. During search, it uses a lower bounding distance
to prune the search space.
      </p>
      <p>
        iSAX2+ [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] is a bulk-loading index based on SAX.
During search, a query is represented with SAX symbols and
the candidates contained in non-pruned leaves are further
refined using the euclidean distance. Pruning uses a lower
bounding distance.
      </p>
      <p>
        ADS+ [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ] is the first adaptive data series index based on
SAX. It starts with a minimal tree structure containing only
summarizations, and then adds in the raw data to leaves
during query answering. It supports a number of search
options, in particular SIMS, a skip-sequential algorithm.
      </p>
      <p>In addition to exact search, the MTree, SFA trie, DSTree,
iSAX2+ and ADS+ also support ng-approximate search.
3. Approximate Similarity Search Methods. We now
present the three most popular approximate methods.</p>
      <p>
        SRS [
        <xref ref-type="bibr" rid="ref44">44</xref>
        ] belongs to the family of LSH methods, inspired
by the randomized algorithm in [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. It uses random
projections to build a scalable index and supports approximate
search with theoretical guarantees.
      </p>
      <p>
        IMI [
        <xref ref-type="bibr" rid="ref21 ref5">21, 5</xref>
        ] is an inverted index based on OPQ. A query is
answered by returning all points corresponding to the
corresponding entries in the inverted index. Returned results
are ng-approximate.
      </p>
      <p>
        HNSW [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ] is an in-memory neighborhood graph
exploiting the Voronoi Diagram and the Delaunay Triangulation.
A greedy search returns the best candidates with high
empirical accuracy but no formal theoretical guarantees.
3.
      </p>
    </sec>
    <sec id="sec-3">
      <title>PROPOSED WORK</title>
      <p>
        We propose a novel data series index that can answer
progressive similarity search queries with strong probability
guarantees. In addition to this unique functionality, it also
leverages the strengths and addresses the weaknesses of the
state-of-the-art approximate and exact techniques.
Completed Work. We outline the main contributions
in [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]: (i) we provide a formal problem definition for
data series similarity search, unifying conflicting
terminology from di↵erent research communities; (ii) we present a
survey of the state-of-the-art data series similarity search
techniques (Table 1 summarizes each technique according
to our definitions); and (iii) we conduct an extensive
experimental evaluation for the eciency of data series exact
similarity search.
      </p>
      <p>The study assessed ten state-of-the-art methods under the
same experimental framework. To guard against
implementation bias, we used a large number of comparison criteria
including implementation-independent ones, and we
reimplemented from scratch four methods that were not
available in C/C++. Our implementations largely outperform
the original ones both in time and space, thus enriching the
landscape of data series similarity search methods.</p>
      <p>
        In an e↵ort to make our results reproducible and support
future research in the area, we share with the community a
public archive containing all source codes, datasets, queries,
results, and plots [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>Based on the results of our study, we draw up
recommendations to help users decide the best approach for their
problem. Figure 1 shows a decision matrix given a typical
hardware and query workload. The VA+file is particularly
well-suited for long series in-memory while for shorter series,
the DSTree is the best contender on disk, and iSAX2+ is the
winner in-memory.</p>
      <p>ADS+
s DSTree
ex iSAX2+
ed M-tree
In R*-tree</p>
      <p>SFA trie</p>
      <p>VA+file
r UCR Suite
tehO SMtepAwSiSse</p>
      <p>We present an elaborate discussion based on the deep
insights gained about the data series similarity search
problem. The main points are the following:</p>
      <p>
        1. Unexpected results confirmed the importance of careful
parameter tuning, hardware setup, implementation
framework, and workload selection. In particular, Stepwise and
ADS+ performed below our expectations while our
optimized implementations of the DSTree and the VA+file
helped bring them back to the spotlight. Moreover, our
carefully crafted experiments identified optimal parameters
that were di↵erent than the ones published in the original
papers. Another important finding was that, unlike what
was originally believed [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], the tightness of the lower bound
of a given method does not alone predict its performance.
In fact, it is of paramount importance to consider other
factors such as the hardware platform and the the clustering
quality of a an index.
      </p>
      <p>2. A better understanding of current approaches helped
pinpoint interesting avenues for improvement, in particular
identifying the methods that would most benefit from
modern hardware. For instance: (i) the DSTree is a very good
candidate for parallelization as its index building is over 85%
CPU cost; (ii) the performance of Stepwise can be
significantly improved with a redesign of the physical storage and
the use of modern hardware as the total cost of query
answering is 50%-98% CPU; (iii) ADS+ can be enhanced with
the use of asynchronous I/O to overcome the expensive
random I/O incurred with each skip.</p>
      <p>3. Although index building with iSAX-based indexes is
much faster than with the DSTree, the latter achieves a
better clustering as it adapts to the data distribution.</p>
      <p>4. Choosing between an index scan and a serial scan is
not a trivial decision. In fact, access path selection is an
optimization problem that depends on a variety of factors
including hardware, query pruning ratio, data
characteristics, the accuracy of a summarization and the ecacy of the
clustering provided by an index.</p>
      <p>
        Work In Progress. Our current work involves an
experimental study that evaluates data series approximate
similarity search, both in-memory and on-disk [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. It di↵ers
from other experimental studies which focused on the
eciency of exact search [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], the accuracy of dimensionality
reduction techniques and similarity measures for
classification tasks [
        <xref ref-type="bibr" rid="ref16 ref29 ref6">29, 16, 6</xref>
        ], or in-memory data [
        <xref ref-type="bibr" rid="ref31 ref4">31, 4</xref>
        ].
      </p>
      <p>
        Our results show that some strikingly simple
modifications to existing exact methods enable them to answer
✏-approximate queries and have excellent empirical
performance. In fact, extensive experiments on large synthetic and
real datasets, including the two largest real datasets publicly
available, demonstrate that the extended techniques
compete in memory and outperform on disk the state-of-the-art
approximate techniques from the vector indexing
community both in accuracy and eciency. Figure 2 summarizes
results for 100-NN queries on an in-memory 25GB data
collection extracted from the Sift dataset [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. We observe that
our modifications to iSAX2+ enable it to outperform the
popular ng-approximate search approaches HNSW and IMI
(Faiss implementation) and the state-of-the-art LSH method
SRS, in terms of combined indexing and query answering
costs. We measure accuracy using MAP, a popular metric
in the information retrieval literature [
        <xref ref-type="bibr" rid="ref37 ref8">37, 8</xref>
        ].
      </p>
      <p>
        Future Work. Inspired by the insights gained from our
two experimental studies on the the inner workings of the
di↵erent indexing approaches and the e↵ectiveness of their
design choices, the key future direction for our work is the
design and development of a novel data series index that
will outperform the state-of-the-art approximate and exact
techniques. The new index will also support progressive
query answering [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] with probability guarantees, so as to
further enable interactive exploration tasks on very large
data series collections.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. CONCLUSIONS</title>
      <p>
        The goal of this thesis is to develop a new index that
will support approximate and exact search, and progressive
query answering with probability guarantees. The first step
in this direction was to thoroughly assess the
state-of-theart [
        <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
        ]. We describe the lessons learned from two
extensive experimental evaluations and outline our future work.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Lernaean</given-names>
            <surname>Hydra</surname>
          </string-name>
          <article-title>Archive</article-title>
          . http://www.mi.parisdescartes.fr/ ~themisp/dsseval/,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>[2] TEXMEX Datasets for Approximate Nearest Neighbor Search</article-title>
          . http://corpus-texmex.
          <source>irisa.fr/</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Faloutsos</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Swami</surname>
          </string-name>
          .
          <article-title>Ecient similarity search in sequence databases</article-title>
          . pages
          <fpage>69</fpage>
          -
          <lpage>84</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Aumu</surname>
          </string-name>
          <article-title>¨ller, E. Bernhardsson, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Faithfull</surname>
          </string-name>
          .
          <article-title>Annbenchmarks: A benchmarking tool for approximate nearest neighbor algorithms</article-title>
          .
          <source>In SISAP</source>
          , pages
          <fpage>34</fpage>
          -
          <lpage>49</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Babenko</surname>
          </string-name>
          and
          <string-name>
            <given-names>V.</given-names>
            <surname>Lempitsky</surname>
          </string-name>
          .
          <article-title>The inverted multi-index</article-title>
          .
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          ,
          <volume>37</volume>
          (
          <issue>6</issue>
          ):
          <fpage>1247</fpage>
          -
          <lpage>1260</lpage>
          ,
          <year>June 2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Bagnall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lines</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bostrom</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Large</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Keogh</surname>
          </string-name>
          .
          <article-title>The great time series classification bake o↵: a review and experimental evaluation of recent algorithmic advances</article-title>
          .
          <source>Data Min. Knowl. Discov.</source>
          ,
          <volume>31</volume>
          (
          <issue>3</issue>
          ):
          <fpage>606</fpage>
          -
          <lpage>660</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>N.</given-names>
            <surname>Beckmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.-P.</given-names>
            <surname>Kriegel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Schneider</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Seeger. The R</surname>
          </string-name>
          *
          <article-title>-tree: an ecient and robust access method for points and rectangles</article-title>
          . In ICMD, pages
          <fpage>322</fpage>
          -
          <lpage>331</lpage>
          . ACM,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C.</given-names>
            <surname>Buckley</surname>
          </string-name>
          and
          <string-name>
            <given-names>E. M.</given-names>
            <surname>Voorhees</surname>
          </string-name>
          .
          <article-title>Evaluating evaluation measure stability</article-title>
          .
          <source>In SIGIR</source>
          , pages
          <fpage>33</fpage>
          -
          <lpage>40</lpage>
          . ACM,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Camerra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Shieh</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Keogh</surname>
          </string-name>
          .
          <source>iSAX 2</source>
          .
          <article-title>0: Indexing and Mining One Billion Time Series</article-title>
          . In ICDM, pages
          <fpage>58</fpage>
          -
          <lpage>67</lpage>
          . IEEE Computer Society,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Camerra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Shieh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Rakthanmanon</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Keogh</surname>
          </string-name>
          .
          <article-title>Beyond one billion time series: indexing and mining very large time series collections with iSAX2+</article-title>
          . Knowl. Inf. Syst.,
          <volume>39</volume>
          (
          <issue>1</issue>
          ):
          <fpage>123</fpage>
          -
          <lpage>151</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>K.</given-names>
            <surname>Chakrabarti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Keogh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mehrotra</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Pazzani</surname>
          </string-name>
          .
          <article-title>Locally adaptive dimensionality reduction for indexing large time series databases</article-title>
          .
          <source>ACM TODS</source>
          ,
          <volume>27</volume>
          (
          <issue>2</issue>
          ):
          <fpage>188</fpage>
          -
          <lpage>228</lpage>
          ,
          <year>June 2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>K.-P.</given-names>
            <surname>Chan</surname>
          </string-name>
          and
          <string-name>
            <given-names>A. W.-C.</given-names>
            <surname>Fu</surname>
          </string-name>
          .
          <article-title>Ecient time series matching by wavelets</article-title>
          .
          <source>In ICDE</source>
          , pages
          <fpage>126</fpage>
          -
          <lpage>133</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ciaccia</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Patella. PAC Nearest Neighbor</surname>
          </string-name>
          <article-title>Queries: Approximate and Controlled Search in High-Dimensional and Metric Spaces</article-title>
          .
          <source>In ICDE</source>
          , pages
          <fpage>244</fpage>
          -
          <lpage>255</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ciaccia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Patella</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Zezula</surname>
          </string-name>
          .
          <article-title>M-tree: An Ecient Access Method for Similarity Search in Metric Spaces</article-title>
          .
          <source>In VLDB</source>
          , pages
          <fpage>426</fpage>
          -
          <lpage>435</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>R.</given-names>
            <surname>Cole</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. E.</given-names>
            <surname>Shasha</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>Fast window correlations over uncooperative time series</article-title>
          .
          <source>In SIGKDD</source>
          , pages
          <fpage>743</fpage>
          -
          <lpage>749</lpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>H.</given-names>
            <surname>Ding</surname>
          </string-name>
          , G. Trajcevski,
          <string-name>
            <given-names>P.</given-names>
            <surname>Scheuermann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Keogh</surname>
          </string-name>
          .
          <article-title>Querying and mining of time series data: experimental comparison of representations and distance measures</article-title>
          .
          <source>PVLDB</source>
          ,
          <volume>1</volume>
          (
          <issue>2</issue>
          ):
          <fpage>1542</fpage>
          -
          <lpage>1552</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>K.</given-names>
            <surname>Echihabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Zoumpatianos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Benbrahim</surname>
          </string-name>
          .
          <article-title>The Lernaean Hydra of Data Series Similarity Search: An Experimental Evaluation of the State of the Art</article-title>
          .
          <source>PVLDB</source>
          ,
          <volume>12</volume>
          (
          <issue>2</issue>
          ):
          <fpage>112</fpage>
          -
          <lpage>127</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>K.</given-names>
            <surname>Echihabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Zoumpatianos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Benbrahim</surname>
          </string-name>
          .
          <article-title>The Return of the Lernaean Hydra: An Experimental Evaluation of Data Series Approximate Similarity Search</article-title>
          .
          <source>Under Submission</source>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>C.</given-names>
            <surname>Faloutsos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ranganathan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Y.</given-names>
            <surname>Manolopoulos</surname>
          </string-name>
          .
          <article-title>Fast subsequence matching in time-series databases</article-title>
          .
          <source>In SIGMOD</source>
          , pages
          <fpage>419</fpage>
          -
          <lpage>429</lpage>
          , New York, NY, USA,
          <year>1994</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>H.</given-names>
            <surname>Ferhatosmanoglu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Tuncel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A. El</given-names>
            <surname>Abbadi</surname>
          </string-name>
          .
          <article-title>Vector Approximation Based Indexing for Non-uniform High Dimensional Data Sets</article-title>
          .
          <source>In CIKM</source>
          , pages
          <fpage>202</fpage>
          -
          <lpage>209</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>T.</given-names>
            <surname>Ge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Ke</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Sun</surname>
          </string-name>
          .
          <article-title>Optimized product quantization</article-title>
          .
          <source>IEEE Trans. Pattern Anal. Mach</source>
          . Intell.,
          <volume>36</volume>
          (
          <issue>4</issue>
          ):
          <fpage>744</fpage>
          -
          <lpage>755</lpage>
          , Apr.
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gogolou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tsandilas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Bezerianos</surname>
          </string-name>
          .
          <article-title>Progressive similarity search on time series data</article-title>
          . In BigVis, in conjunction with EDBT/ICDT,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>A.</given-names>
            <surname>Guttman. R-Trees</surname>
          </string-name>
          :
          <article-title>A Dynamic Index Structure for Spatial Searching</article-title>
          .
          <source>In SIGMOD</source>
          , pages
          <fpage>47</fpage>
          -
          <lpage>57</lpage>
          ,
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>P.</given-names>
            <surname>Indyk</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Motwani</surname>
          </string-name>
          .
          <article-title>Approximate nearest neighbors: Towards removing the curse of dimensionality</article-title>
          .
          <source>In STOC</source>
          , pages
          <fpage>604</fpage>
          -
          <lpage>613</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>H.</given-names>
            <surname>Jegou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Douze</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Schmid</surname>
          </string-name>
          .
          <article-title>Product quantization for nearest neighbor search</article-title>
          .
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          ,
          <volume>33</volume>
          (
          <issue>1</issue>
          ):
          <fpage>117</fpage>
          -
          <lpage>128</lpage>
          ,
          <year>Jan 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>W.</given-names>
            <surname>Johnson</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Lindenstrauss</surname>
          </string-name>
          .
          <article-title>Extensions of Lipschitz mappings into a Hilbert space</article-title>
          .
          <source>In CONM</source>
          , volume
          <volume>26</volume>
          of Contemporary Mathematics, pages
          <fpage>189</fpage>
          -
          <lpage>206</lpage>
          .
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kashyap</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Karras</surname>
          </string-name>
          .
          <article-title>Scalable knn search on vertically stored time series</article-title>
          . In C. Apt,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ghosh</surname>
          </string-name>
          , and P. Smyth, editors,
          <source>KDD</source>
          , pages
          <fpage>1334</fpage>
          -
          <lpage>1342</lpage>
          . ACM,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>E.</given-names>
            <surname>Keogh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Chakrabarti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pazzani</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Mehrotra</surname>
          </string-name>
          .
          <article-title>Dimensionality reduction for fast similarity search in large time series databases</article-title>
          .
          <source>KAIS</source>
          ,
          <volume>3</volume>
          (
          <issue>3</issue>
          ):
          <fpage>263</fpage>
          -
          <lpage>286</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>E.</given-names>
            <surname>Keogh</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Kasetty</surname>
          </string-name>
          .
          <article-title>On the need for time series data mining benchmarks: A survey and empirical demonstration</article-title>
          .
          <source>Data Min. Knowl. Discov.</source>
          ,
          <volume>7</volume>
          (
          <issue>4</issue>
          ):
          <fpage>349</fpage>
          -
          <lpage>371</lpage>
          , Oct.
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>H.</given-names>
            <surname>Kondylakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Dayan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Zoumpatianos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          .
          <article-title>Coconut: A scalable bottom-up approach for building data series indexes</article-title>
          .
          <source>PVLDB (11)6</source>
          , pages
          <fpage>677</fpage>
          -
          <lpage>690</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <source>Approximate Nearest Neighbor Search on High Dimensional Data - Experiments</source>
          , Analyses, and
          <source>Improvement (v1.0)</source>
          . CoRR, abs/1610.02455,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Keogh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lonardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B. Y.</given-names>
            <surname>Chiu</surname>
          </string-name>
          .
          <article-title>A symbolic representation of time series, with implications for streaming algorithms</article-title>
          .
          <source>In SIGMOD</source>
          , pages
          <fpage>2</fpage>
          -
          <lpage>11</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>M.</given-names>
            <surname>Linardi</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          . Scalable, variable
          <article-title>-length similarity search in data series: The ulisse approach</article-title>
          .
          <source>PVLDB</source>
          ,
          <volume>11</volume>
          (
          <issue>13</issue>
          ):
          <fpage>2236</fpage>
          -
          <lpage>2248</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>M.</given-names>
            <surname>Linardi</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          . ULISSE:
          <article-title>ULtra compact Index for Variable-Length Similarity SEarch in Data Series</article-title>
          . In ICDE, pages
          <fpage>1356</fpage>
          -
          <lpage>1359</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>M.</given-names>
            <surname>Linardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Keogh</surname>
          </string-name>
          .
          <article-title>Matrix profile x: Valmod - scalable discovery of variable-length motifs in data series</article-title>
          .
          <source>In SIGMOD</source>
          , pages
          <fpage>1053</fpage>
          -
          <lpage>1066</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>Y. A.</given-names>
            <surname>Malkov</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Yashunin</surname>
          </string-name>
          .
          <article-title>Ecient and robust approximate nearest neighbor search using hierarchical navigable small world graphs</article-title>
          .
          <source>CoRR, abs/1603.09320</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <surname>C. D. Manning</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Raghavan</surname>
          </string-name>
          , and H. Schu¨tze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          .
          <article-title>Data series management: The road to big sequence analytics</article-title>
          .
          <source>SIGMOD Record</source>
          ,
          <volume>44</volume>
          (
          <issue>2</issue>
          ):
          <fpage>47</fpage>
          -
          <lpage>52</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          .
          <article-title>Big sequence management: A glimpse of the past, the present, and the future</article-title>
          .
          <source>In SOFSEM</source>
          , volume
          <volume>9587</volume>
          <source>of LNCS</source>
          , pages
          <fpage>63</fpage>
          -
          <lpage>80</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>B.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Fatourou</surname>
          </string-name>
          . ParIS:
          <article-title>The Next Destination for Fast Data Series Indexing and Query Answering</article-title>
          .
          <source>IEEE BigData</source>
          , pages
          <fpage>791</fpage>
          -
          <lpage>800</lpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>D.</given-names>
            <surname>Rafiei</surname>
          </string-name>
          .
          <article-title>On similarity-based queries for time series data</article-title>
          .
          <source>In ICDE</source>
          , pages
          <fpage>410</fpage>
          -
          <lpage>417</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [42]
          <string-name>
            <given-names>T.</given-names>
            <surname>Rakthanmanon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. J. L.</given-names>
            <surname>Campana</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mueen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. E. A. P. A.</given-names>
            <surname>Batista</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. B.</given-names>
            <surname>Westover</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Zakaria</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Keogh</surname>
          </string-name>
          .
          <article-title>Searching and mining trillions of time series subsequences under dynamic time warping</article-title>
          .
          <source>In KDD</source>
          , pages
          <fpage>262</fpage>
          -
          <lpage>270</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>P.</given-names>
            <surname>Sch</surname>
          </string-name>
          <article-title>¨afer and M. Ho¨gqvist. Sfa: A symbolic fourier approximation and index for similarity search in high dimensional datasets</article-title>
          .
          <source>In EDBT</source>
          , pages
          <fpage>516</fpage>
          -
          <lpage>527</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.</given-names>
            <surname>Lin</surname>
          </string-name>
          . SRS:
          <article-title>Solving c-approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index</article-title>
          .
          <source>PVLDB</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <article-title>A dataadaptive and dynamic segmentation index for whole matching on time series</article-title>
          .
          <source>PVLDB</source>
          ,
          <volume>6</volume>
          (
          <issue>10</issue>
          ):
          <fpage>793</fpage>
          -
          <lpage>804</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>R.</given-names>
            <surname>Weber</surname>
          </string-name>
          , H.
          <article-title>-</article-title>
          <string-name>
            <surname>J. Schek</surname>
            , and
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Blott</surname>
          </string-name>
          .
          <article-title>A Quantitative Analysis and Performance Study for Similarity-Search Methods in HighDimensional Spaces</article-title>
          . In VLDB, pages
          <fpage>194</fpage>
          -
          <lpage>205</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [47]
          <string-name>
            <surname>D.-E. Yagoubi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Akbarinia</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Masseglia</surname>
            , and
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Palpanas</surname>
          </string-name>
          .
          <article-title>DPiSAX: Massively Distributed Partitioned iSAX</article-title>
          . pages
          <fpage>1135</fpage>
          -
          <lpage>1140</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [48]
          <string-name>
            <surname>C.-C. M. Yeh</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Ulanova</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Begum</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>H. A.</given-names>
          </string-name>
          <string-name>
            <surname>Dau</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Zimmerman</surname>
            ,
            <given-names>D. F.</given-names>
          </string-name>
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mueen</surname>
            , and
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Keogh</surname>
          </string-name>
          .
          <article-title>Time series joins, motifs, discords and shapelets: a unifying view that exploits the matrix profile</article-title>
          .
          <source>DAMI</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>41</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          [49]
          <string-name>
            <given-names>K.</given-names>
            <surname>Zoumpatianos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Idreos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          .
          <article-title>ADS: the adaptive data series index</article-title>
          . VLDB J.,
          <volume>25</volume>
          (
          <issue>6</issue>
          ):
          <fpage>843</fpage>
          -
          <lpage>866</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>