<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Computational aspects around preference queries</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Karim Alami supervised by Sofian Maabout</string-name>
          <email>karim.alami@u-bordeaux.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Univ. Bordeaux</institution>
          ,
          <addr-line>CNRS, LaBRI, UMR 5800, F-33400 Talence</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>Preference queries present two main challenges: diculty for users to express their preferences and the computational complexity. For skyline queries, the preferences can be on attributes, e.g., some user may look for the best flights regarding price and number of stops, and others may look for the best flights regarding number of stops and duration. In addition, preference can be expressed as a (partial) order on attributes domains, e.g., some user may prefer flight company A over B while another one may have the opposite preference. For top-k queries, users define a score function to rank objects, e.g., users who give more importance to price could define the following score function: price ⇤ 2 + duration of the flight. In general, several rounds are required before converging towards a satisfying answer where at each round, more precise preferences are given by the user. This is due to the diculty to figure out the precise formulation of the user's preferences. Therefore, a more or less high number of queries need to be evaluated. Our research work aims to make these queries answering faster through dedicated index structures and precomputed views. The main challenges when adopting this strategy are (i) lightweight memory consumption and (ii) fast maintenance process. Our first step was NSC, an index structure that optimizes skyline queries. However, the structure was designed for a static context making it unsuitable when data can be inserted/deleted. We redesigned NSC to cope with dynamic data and in some cases, we proposed further approaches when the structure is not suitable. In this paper, we summarize our previous contributions and present some perspective research regarding the link between regret minimization queries and what we did so far.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>Preference queries aim to retrieve points among a set of
points that can be considered interesting regarding the
preferences of the user over a set of parameters. They are
ecient tools which reduce the amount of data returned to the
user ,and consequently, avoids him an endless comparison of
data.</p>
      <p>
        The information retrieval regarding user preferences have
historically been a ranking problem, i.e., given a set of key
words, retrieve the elements that ”best” match these
keywords, we call it often ”top-k” query . Its adaptation to
relational databases has been addressed in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Users are
requested to put weights on the attributes in order to select
points that have the higher values on attributes with the
higher weights. Top-k queries gives the advantage of
controlling the size of the output, however the selection of the
weights remain very hard. There has been several works to
simplify this process, e.g., selection of a range of weights,
elicitation and regret minimizing, among others. Authors in
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] proposed the skyline operator as an alternative to
ranking queries. A skyline query returns a set of points which are
not dominated by any other point of the dataset. A point x
dominates a point y i↵ x is better or equal on all attributes
and strictly better on at least one attribute. The skyline
query provides the advantage to not rely on a score function
however, the size of the output is not controllable and it
requires a quadratic computational time regarding the size
of the dataset. Many studies have been done to optimize
the skyline query execution time either by optimizing the
number of comparisons or by indexation.
      </p>
      <p>
        In previous work [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], the structure NSC has been presented
as an index to optimize skyline queries. Its main idea
consists in comparing every record r of a dataset to all
remaining records r0 and store the subspaces where r0 dominates r
in a form of pair of subspaces compare(r, r0) = hX|Y i where
X represents the attributes where r0 is strictly better than
r and Y represents the attributes where r and r0 are equal.
Now given a subspace Z, a record r belongs to Sky(Z), i.e.
the skyline over the subspace Z, if and only if there does
not exist a pair hX|Y i associated to r that covers Z, i.e.
Z ✓ XY and Z 6= Y . We denote by cover(hX|Y i) the
set of subspaces covered by hX|Y i. For example, consider
a dataset with attributes A, B and C. The pair hAB|Ci
covers the subspaces {A, B, AB, AC, BC, ABC}. The time
complexity of NSC is quadratic wrt the size of the dataset as
well as the space complexity. However, not every pair should
be kept. Let P airs(r) be the set of pairs associated to r, this
set can be minimized by computing a subset Q ✓ P airs(r)
such that cover(Q) = cover(P airs(r)), i.e. the set of
subspaces covered by P airs(r) are covered by Q as well. We say
that Q is an equivalent subset of P airs(r), Q ⌘ P airs(r).
The minimization problem is NP-Hard and a polynomial
greedy approximate algorithm has been proposed.
      </p>
      <p>
        In the literature, works that optimize skyline queries can
be divided in three groups, works that (i) design fast
algorithms without precomputation, (ii) design index
structures (iii) materialize the results. We note BSkyTree [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] the
state of the art algorithm to process skyline queries without
precomputing. We note HashCube [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], a bitmap like index
structure which associates a 2d Boolean vector v to every
record r where d is the number of attributes. v[i] is set i↵ r
belongs to Sky(Z) such that Z is encoded by i. HashCube
is highly ecient for skyline query answering, however,
authors only proposed an algorithm to construct the structure
from scratch in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. No obvious maintenance procedure is
noticed. Materialized skyline views are very time and
memory consuming as the number of skylines wrt to a dataset
is exponential to the number of dimension. However, they
provide the best query answering performance.
      </p>
      <p>
        The experiments performed in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] assess the construction
time and memory performance of NSC against its
competitors as well as query execution time. However, the structure
was designed for a static context. An insertion/deletion of a
record or a change in an attribute domain order may require
rebuilding the structure from scratch.
      </p>
      <p>For this thesis, we studied the ability of NSC to deal with
updates. Precisely, we addressed its maintenance in case
of (i) dynamic data, i.e., points are removed and inserted at
any time and (ii) streaming data, i.e., append-only database
and sliding window model. We redesigned the structure to
cope with each context. Presently, we are working on skyline
queries optimization over a dataset with nominal attributes
and dynamic order. We found that NSC is not suitable for
this context, hence we established a novel approach based on
views. Finally, we aim to investigate relationship between
multidimensional skylines and regret minimization queries.</p>
    </sec>
    <sec id="sec-2">
      <title>SKYLINE QUERIES OPTIMIZATION IN</title>
    </sec>
    <sec id="sec-3">
      <title>PRESENCE OF UPDATES</title>
      <p>In the following, we present the adaptation of NSC for,
first, dynamic data, i.e., insertion/deletion of one or multiple
records at any time, then, streaming data, i.e., insertion of
records at regular intervals. We studied separately both
contexts because they present di↵erent challenges.
2.1</p>
    </sec>
    <sec id="sec-4">
      <title>Dynamic Data</title>
      <p>In the case of dynamic data, i.e., insertion/deletion of one
or multiple records at any time, NSC should be updated in
accordance to the new dataset. The baseline approach is to
build NSC structure from scratch, however this approach is
costly.</p>
      <p>
        Related work pointed out that dealing with deletion is
harder than insertion. We note the work [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] which
addressed the maintenance of a materialized skyline view in
case of deletion. For NSC, deletion is a hard task as well.
Let r be the deleted record then for every record r, we
remove from P airs(r) the pair p computed wrt r .
However, p may have exclusively covered some pairs that have
been deleted from P airs(r) during the minimization
process. Recovering these pairs requires recomputing P airs(r)
from scratch. We propose a solution that detects when
P airs(r) should be recomputed. For every distinct pair p
in P airs(r), we set a counter of how many records are the
source of p. Hence, we recompute P airs(r) only if p is
in P airs(r) and its counter set to 1. In theory, this
approach could be seen as building the structure from scratch
at each deletion, however, in practice, only a small fraction
of the dataset requires their set of pairs to be recomputed.
In terms of time complexity, let the size of the dataset be n.
Let I be the set of records for which their respective set of
pairs requires to be recomputed. Identifying I takes O(n)
time and recomputing P airs(r) 8 r 2 I takes O(|I| · n).
Regarding memory consumption, NSC size is augmented by a
constant due to the additional counter.
      </p>
      <p>Now for insertions, let r+ be a record to be inserted then
we compute P airs(r+). Regarding the existing records in
the dataset, let r be one, we compute p+ = compare(r, r+).
However, P airs(r) is already a minimal set of pairs, hence
we address the problem: should P airs(r) [ compare(r, r+)
be minimized now? or should the minimization process be
triggered after a number of pairs appended? In absence of
an incremental greedy algorithm, we propose a linear
algorithm wrt the size of P airs(r), called min inclusion, which
takes every pair p 2 P airs(r) and compare it to p+. The
pair covered by the other one is discarded. This algorithm
does not provide any approximation guarantee. However,
in practice, min inclusion allows a good compression ratio
wrt the greedy algorithm.
2.2</p>
    </sec>
    <sec id="sec-5">
      <title>Streaming Data</title>
      <p>
        We consider the following stream model. Records are
streamed at regular interval time ✓ . They have a validity
period of size ! which can also be seen as a sliding window
over the dataset. Hence, records are considered outdated !
timestamps after their arrival time. The semantic of
continuous queries [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] states that the query result should be
available and accurate at each time. However, this condition
is hardly attainable, especially for skyline queries. We point
out two main diculties for continuous skyline queries. Let
Qsky be a skyline query, then (i) a record r belonging to
Ans(Qsky) at a timestamp t may leave Ans(Qsky) at
timestamp t0 &gt; t if a streamed record r0 dominates r. And (ii) a
record r can join Ans(Qsky) at timestamp t0 later than its
arrival time if skyline records that dominate r get outdated.
We note the contribution of [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Authors propose to
maintain a skyline set DBSky and a set of potentially skyline
records DBRest, i.e., records which may join DBSky in the
future. They propose an algorithm to maintain these sets
together with an event list recording timestamps where a
record in DBRest may join DBSky. Still, this approach
makes a big number of records comparison which makes it
non scalable wrt dimensionality and data cardinality.
Moreover it maintains a single skyline query, hence maintaining
several skylines may require an exponential space wrt to the
number of attribute. We note as well the approach presented
in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] which is based on R-trees. Regarding NSC structure,
the approaches presented for dynamic data are not suitable
for two main reasons: (i) the timestamp where a record is
no more valid is known and (ii) the frequency of streaming
is often high. Generally, real time data processing is hard
in streaming context. Batch processing mode provides a
larger time to process data, however, the query result is not
accurate with the current dataset. We proposed a
framework called M SSD which handles three data structure, (i)
a bu↵er B, (ii) a main dataset R and (iii) NSCt, a redesigned
version for NSC. Incoming records are first bu↵ered during
an interval of time of size k, then inserted into the main
dataset R. NSCt only indexes records in R, hence, queries
evaluated through NSCt, consider only records in R. We
sacrifice the accuracy of the queries, nonetheless, we ensure
a fast maintenance that allows faster query answering than
state of the art works. Next we explain the main novelties
for NSCt. We organize the set of pairs of a record r as a
sequence of buckets of pairs P airs(r) = [Buck1, . . . , Buckm],
where each bucket corresponds to the pairs computed wrt a
batch of data. We adopt this approach for the following
purpose. Let b1 and b2 be two batches such that their respective
timestamp are T S(b1) and T S(b2), and T S(b1) &lt; T S(b2).
As we consider a sliding window data model with an
interval of size ! , records in b1 get outdated before records in
b2. Therefore, let r be a record, the bucket computed wrt b1
is located before the bucket computed wrt b2 in P airs(r).
During maintenance, Buck1, i.e. the oldest bucket, is
simply discarded as it contains pairs computed wrt an outdated
batch. Moreover, during the minimization process, a bucket
Bucki is minimized only wrt successor buckets. We
formalized the problem of buckets minimization for NSCt and
proved its NP-Hardness by a reduction to MSC (Minimum
Set Cover) problem. We provided a greedy approximate
algorithm as well. We present in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] early experiments we
performed. It shows that despite the maintenance time, NSCt
answers much more queries during the batch interval than
BSkyTree [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
    </sec>
    <sec id="sec-6">
      <title>SKYLINE QUERIES OPTIMIZATION IN</title>
    </sec>
    <sec id="sec-7">
      <title>PRESENCE OF DYNAMIC ORDER</title>
      <p>
        In many real world applications, users are allowed to
express their preferences over the values of nominal attributes.
In such case, we say that the attribute domain has a dynamic
order. For example, on a movie platform website, users want
the best rated movies but have preferences over the genre as
well. Also, on a flight booking website, users are interested
in cheap and fast flights but may have preferences over the
airline companies. To ease the comprehension, we consider
datasets with only one nominal attribute A and some
number of numerical attributes. A user preference over A is a
partial order that can be written as a set of (a1, a2) such
that a1, a2 2 A and a1 is preferred over a2. In such
scenario, NSC is not suitable to answer skyline queries as it
is constructed depending on a given partial order over A.
Hence it should be updated every time a query with a
different partial order is issued, or it may require to construct
NSC for each possible partial order over A. In the
literature, there is two major approaches to handle the problem
of answering skyline queries over datasets with attributes
having dynamic order, (i) algorithms which, given a query
Q, maps a nominal attribute into a set of virtual numerical
attributes in accordance to the user preference Q.R. Then,
a skyline query is processed over the transformed dataset
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. And (ii) answers the given query through a set of
cached views, each computed wrt a partial order [
        <xref ref-type="bibr" rid="ref10 ref16">16, 10</xref>
        ].
These works adopt refinement and chain product
decomposition techniques in order to evaluate an issued query through
materialized views. Let Q be an issued query and let Q0 be
a view then Ans(Q) ✓ Ans(Q0) if Q0.R ✓ Q.R. We say
that Q is a refinement of Q0. Now let {Q1, . . . , Qn} a set of
views such that Qi.R 8 i 2 [1 . . . n] is a chain, i.e., the
values of A are totally ordered, then Ans(Q) = Si2 [1...n] Qi if
Q.R = Ni2 [1...m] Qi.R. We say that {Q1, . . . , Qn} is a chain
product decomposition of Q. We adopted a novel approach
which theoretically and experimentally provided better
results than the above approaches. We define the single partial
order spo over an attribute domain Dom(A) as a partial
order where only two values in Dom(A) are comparable and all
other values are incomparable. Given a query Q with user
preference Q.R, Ans(Q) is the intersection of the skylines
wrt spos composing Q.R. Every spo skyline query is
considered a view whether it is materialized or not. Our
experiments showed that answering a query through non
materialized spo views is even faster than online algorithms, mainly
because skyline algorithms are highly weakened by
dimensionality added by mapping a nominal attribute to several
numerical attributes. Moreover, this approach, namely the
single partial order decomposition, presents the advantage of
easily selecting the right views in order to evaluate a given
query compared to the chain product decomposition which
is an NP Hard Problem. Regarding the memory
consumption, let |dom(A)| = m, there exists m! total orders wrt
dom(A), hence m! views to store for chain product
decomposition approach. For refinement approach, the higher the
number of views stored, the faster will be query answering.
Note that there exists an exponential number of partial
orders wrt m. Our approach requires 2 · Cm2 views as for every
two values, two views are stored. This can be further
optimized by selecting only a subset of spo views to materialize.
We addressed the following problem: given a workload Q
and an integer k, select a set of views of size at most k to
materialize such that the cost of answering the queries in
Q through the views is minimum. We are working on the
proof of the Hardness of this problem. Finally we extended
the work to the case where datasets have several nominal
attributes.
4.
      </p>
    </sec>
    <sec id="sec-8">
      <title>REGRET MINIMIZING QUERIES AND</title>
    </sec>
    <sec id="sec-9">
      <title>MULTIDIMENSIONAL SKYLINES</title>
      <p>In this section, we mainly present state of the art of regret
optimization queries and we place some questions that we
plan to investigate during this thesis.</p>
      <p>Skyline and Top-k queries share the same objective which
is selecting the best elements. However, on one hand, skyline
queries return a non constrained size of result by relying on
just the dominance relation between elements, and on the
other hand, Top-k queries require a score function from the
user to restrain the result size to k.</p>
      <p>
        Recently [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] presented “regret minimizing set” to avoid
the limitations of skyline and Top-k queries by not requiring
a score function while bounding the result size. The main
idea is to select a representative subset S of a dataset T . Let
f be a score function, k be an integer, then let fk(T ) be the
score of the kth ranked point using f . The regret of a subset
S wrt f is f1(T ) f1(S) and the regret ratio is f1(Tf1)(Tf)1(S) .
Given a family of functions F , the problem here is to find
S of size r which minimizes the maximal regret ratio among
all f 2 F . A greedy approximate algorithm to solve this
problem has been proposed in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Consequently, [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]
proposed a relaxation, namely the k-regret minimizing set: the
regret of S considered here is max(fk(T ) f1(S), 0).
Moreover, they proved the NP-hardness of the regret
minimization problem, with or without the relaxation. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] addressed
the query evaluation optimization. They proposed a linear
time dynamic programming algorithm based on the skyline
set for a two-dimensional dataset. In addition, for datasets
with more than 3 dimensions, they proposed an approach
where they discretise the family of linear function F , into
a set of function F wrt a parameter given by the user.
This discretisation allows a controlled approximation of the
optimal regret ratio. A notable contribution to the regret
minimization line of research is given by [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. They provide
a reformulation of a regret representative set. Let S ⇢ T .
Then S is a (k, ✏)-set i↵ 8 f 2 F , fk(Tfk) (Tf)1(S)  ✏. They
formulate two regret minimizing set problems, (i) by size
minimization, i.e., find the smallest (k, ✏)-set, and (ii) by
regret minimization, compute the (k, ✏)-set of size r and ✏ is
minimum.
      </p>
      <p>Our previous work addressed two aspects of skyline queries:
index structure to optimize multidimensional queries and its
ecient maintenance upon updates. We aim to extend this
material to optimize regret minimization queries. We plan
to investigate more deeply the relationship between skyline
sets and regret minimizing sets (RMS). Let T be a dataset
over a set of dimensions D = {D1, . . . , Dd}. One question
that could be interesting to investigate is to find a
relationship between the regret minimization sets when considering
subspaces of D. More precisely, let X ✓ Y ✓ D and S(X),
resp. S(Y ), be the RMS wrt X, resp. Y . How these two
sets do compare? Which situations make them comparable?
Which functions families make them comparable? How can
the RMS’s wrt all subspaces can be computed eciently?
Consider the RMS frequency of a tuple to be the number
of RMS’s it belongs to, then retrieving the k most frequent
tuples could be seen as a way to define the ‘best” k tuples.
Let t 2 T . Is the knowledge we have about the skylines to
which t belongs, can give us hints about its belonging to an
RMS? Let the skyline frequency of a tuple be the number
of skylines it belongs to. Let L be the set of tuples that
are at least l skyline frequent, what approximation, if any,
gives the restriction of regret minimizing set computation
wrt L compared to the whole dataset? In case of a positive
answer, it would be interesting to see how to adapt NSC to
optimize the calculation of the set L and consequently, an
approximate regret minimization set.</p>
    </sec>
    <sec id="sec-10">
      <title>Acknowledgements</title>
      <p>Experiments presented in this paper were carried out
using the PlaFRIM experimental testbed, supported by Inria,
CNRS (LABRI and IMB), Universit´e de Bordeaux,
Bordeaux INP and Conseil R´egional d’Aquitaine
(see https://www.plafrim.fr/)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P. K.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sintos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Suri</surname>
          </string-name>
          .
          <article-title>Ecient algorithms for k-regret minimizing sets</article-title>
          .
          <source>In SEA</source>
          <year>2017</year>
          , June 21-23,
          <year>2017</year>
          , London, UK, pages
          <volume>7</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          :
          <fpage>23</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Alami</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Maabout</surname>
          </string-name>
          .
          <article-title>Multidimensional skylines over streaming data</article-title>
          .
          <source>In International Conference on Database Systems for Advanced Applications</source>
          , pages
          <fpage>338</fpage>
          -
          <lpage>342</lpage>
          . Springer,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Asudeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nazi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Zhang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Das</surname>
          </string-name>
          .
          <article-title>Ecient computation of regret-ratio minimizing set: A compact maxima representative</article-title>
          .
          <source>In Proceedings of the 2017 ACM SIGMOD Conference</source>
          , pages
          <fpage>821</fpage>
          -
          <lpage>834</lpage>
          . ACM,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K. S.</given-names>
            <surname>Bøgh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Sidlauskas</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Assent.</surname>
          </string-name>
          <article-title>Hashcube: A data structure for space and query ecient skycube compression</article-title>
          .
          <source>In Proceedings of CIKM conference</source>
          , pages
          <fpage>1767</fpage>
          -
          <lpage>1770</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>K. S.</given-names>
            <surname>Bøgh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Sidlauskas</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Assent.</surname>
          </string-name>
          <article-title>Template skycube algorithms for heterogeneous parallelism on multicore and GPU architectures</article-title>
          .
          <source>In Proc. of SIGMOD Conference</source>
          , pages
          <fpage>447</fpage>
          -
          <lpage>462</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bo</surname>
          </string-name>
          ¨rzso¨nyi,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kossmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Stocker</surname>
          </string-name>
          .
          <article-title>The skyline operator</article-title>
          .
          <source>In Proc. of ICDE conf.</source>
          , pages
          <fpage>421</fpage>
          -
          <lpage>430</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chaudhuri</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Gravano</surname>
          </string-name>
          .
          <article-title>Evaluating top-k selection queries</article-title>
          .
          <source>In VLDB</source>
          , volume
          <volume>99</volume>
          , pages
          <fpage>397</fpage>
          -
          <lpage>410</lpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Chester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Thomo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Venkatesh</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Whitesides</surname>
          </string-name>
          .
          <article-title>Computing k-regret minimizing sets</article-title>
          .
          <source>Proceedings of the VLDB Endowment</source>
          ,
          <volume>7</volume>
          (
          <issue>5</issue>
          ):
          <fpage>389</fpage>
          -
          <lpage>400</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hanusse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kamnang-Wanko</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Maabout</surname>
          </string-name>
          .
          <article-title>Computing and summarizing the negative skycube</article-title>
          .
          <source>In Proc. of CIKM Conference</source>
          , pages
          <fpage>1733</fpage>
          -
          <lpage>1742</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Y.-L. Hsueh</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.-C. Lin</surname>
          </string-name>
          , and
          <string-name>
            <surname>C.-C. Chang</surname>
          </string-name>
          .
          <article-title>An ecient indexing method for skyline computations with partially ordered domains</article-title>
          .
          <source>IEEE Transactions on Knowledge &amp; Data Engineering</source>
          , pages
          <fpage>963</fpage>
          -
          <lpage>976</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lee</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.-W.</given-names>
            <surname>Hwang</surname>
          </string-name>
          .
          <article-title>Scalable skyline computation using a balanced pivot selection technique</article-title>
          .
          <source>Information Systems</source>
          ,
          <volume>39</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>21</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Morse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Patel</surname>
          </string-name>
          , and
          <string-name>
            <surname>W. I. Grosky.</surname>
          </string-name>
          <article-title>Ecient continuous skyline computation</article-title>
          .
          <source>Information Sciences</source>
          ,
          <volume>177</volume>
          (
          <issue>17</issue>
          ):
          <fpage>3411</fpage>
          -
          <lpage>3437</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Nanongkai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Sarma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Lipton</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Xu</surname>
          </string-name>
          .
          <article-title>Regret-minimizing representative databases</article-title>
          .
          <source>Proceedings of the VLDB Endowment</source>
          ,
          <volume>3</volume>
          (
          <issue>1</issue>
          -2):
          <fpage>1114</fpage>
          -
          <lpage>1124</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tao</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Papadias</surname>
          </string-name>
          .
          <article-title>Maintaining sliding window skylines on data streams</article-title>
          .
          <source>IEEE Trans. Knowl</source>
          . Data Eng.,
          <volume>18</volume>
          (
          <issue>2</issue>
          ):
          <fpage>377</fpage>
          -
          <lpage>391</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>D. B. Terry</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Goldberg</surname>
            ,
            <given-names>D. A.</given-names>
          </string-name>
          <string-name>
            <surname>Nichols</surname>
            , and
            <given-names>B. M.</given-names>
          </string-name>
          <string-name>
            <surname>Oki</surname>
          </string-name>
          .
          <article-title>Continuous queries over append-only databases</article-title>
          .
          <source>In Proceedings of the 1992 ACM SIGMOD</source>
          , pages
          <fpage>321</fpage>
          -
          <lpage>330</lpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>R. C.-W. Wong</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>A. W.-C.</given-names>
          </string-name>
          <string-name>
            <surname>Fu</surname>
            , and
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Wang</surname>
          </string-name>
          .
          <article-title>Online skyline analysis with dynamic preferences on nominal attributes</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>21</volume>
          (
          <issue>1</issue>
          ):
          <fpage>35</fpage>
          -
          <lpage>49</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>P.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <surname>O</surname>
          </string-name>
          ¨.
          <article-title>Egecioglu, and</article-title>
          <string-name>
            <given-names>A. El</given-names>
            <surname>Abbadi</surname>
          </string-name>
          . Deltasky:
          <article-title>Optimal maintenance of skyline deletions without exclusive dominance region generation</article-title>
          .
          <source>In Proceedings of ICDE Conference</source>
          , pages
          <fpage>486</fpage>
          -
          <lpage>495</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Mamoulis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Cheung</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Kao</surname>
          </string-name>
          .
          <article-title>Ecient skyline evaluation over partially ordered domains</article-title>
          .
          <source>Proceedings of the VLDB Endowment</source>
          ,
          <volume>3</volume>
          (
          <issue>1</issue>
          -2):
          <fpage>1255</fpage>
          -
          <lpage>1266</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>