<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>An Eficient Diversity-Aware Method for the Empty-Answer Problem</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yuto Ikeda</string-name>
          <email>ikeda.yuto@ist.osaka-u.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chuan Xiao</string-name>
          <email>chuanx@ist.osaka-u.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Makoto Onizuka</string-name>
          <email>onizuka@ist.osaka-u.ac.jp</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Osaka University</institution>
          ,
          <addr-line>1-5 Yamadaoka, Suita, Osaka 565-0871</addr-line>
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Workshop Proce dings</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>This study tackles the empty-answer problem in database queries, where no results meet all user-specified conditions, though some may satisfy individual ones. We explore query relaxation, which removes certain conditions for results, and a record search method that uses user preferences to evaluate records. In particular, we emphasize the importance of diversity in the results to better match user preferences, which has been ignored in existing approaches. To address this, we introduce the use of Maximal Marginal Relevance (MMR) - a ranking function balancing query relevance and record diversity - for query relaxation, proposing a method that searches for diverse record sets while maintaining many conditions. Experiments with real-world datasets demonstrated that the proposed method significantly increases search speed (up to 300 times faster) while maintaining high MMR scores, indicating an efective balance between eficiency and result diversity.</p>
      </abstract>
      <kwd-group>
        <kwd>empty-answer problem</kwd>
        <kwd>query relaxation</kwd>
        <kwd>maximal marginal relevance</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In various applications, setting desired conditions and
searching for data is a fundamental operation. Ideally,
searches should yield a small number of records that match
the specified conditions. However, the number of records
retrieved can vary significantly depending on the user’s
answer problem) or no records at all (the empty-answer
problem) [1]. Whereas the many-answer problem can
be addressed by preseting the top- results (e.g., with a
LIMIT clause), the empty-answer problem is more
challenging. The causes of the empty-answer problem can be
categorized into two scenarios: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) records satisfy each
condition individually but not collectively due to
multiple conditions, and (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) the conditions are invalid, such as
searching for records that do not meet pre-set constraints
in the data. In this paper, we target the empty-answer
problem and address the first scenario, where potentially
all records in the database fall within the search scope.
      </p>
      <p>To solve the empty-answer problem, existing methods
are broadly classified into two approaches. The first is the
tions set by the user are reduced to solve the empty-answer
problem. The second is the ranking method [5], which
translates user preferences into a function to evaluate
records. It efectively reflects user preferences, especially
when users can design the ranking function.</p>
      <p>In the user’s record set, the relevance to the query and
the diversity within the recommended set are both
imporis key to capturing user preferences [6]. While various
approaches have been proposed, none have focused on
the diversity of the record set post-solving the
emptyanswer problem in the database field. Meanwhile,
Maximal Marginal Relevance (MMR) [7], a ranking function
balancing query relevance and record diversity, has been
proposed and widely used for information retrieval.</p>
      <p>In this study, we aim to solve the empty-answer problem
by considering diversity and utilizing MMR as the ranking
function. We formalize MMR for relational database
applications and propose a method for quickly finding a record
set that maximizes MMR, Also, We devise a series of
relaxed query search and record search techniques tailored
to this objective. They broaden results by removing some
functions, respectively. In addition, we utilize
cardinalprocess. Experiments with real-world datasets show that
our method significantly increases search speed (up to
more than 300 times faster) while maintaining high MMR
scores and outperforming baseline approaches.
query relaxation method [2, 3, 4], where conjunctive condi- conditions and evaluate records based on user-defined
DOLAP 2024: 26th International Workshop on Design, Optimization, ity estimation techniques to further optimize the search
nEvelop-O
LGOBE
(M. Onizuka)
(M. Onizuka)
Languages and Analytical Processing of Big Data, co-located with
∗Corresponding author.</p>
      <p>https://sites.google.com/site/chuanxiao1983 (C. Xiao); http://</p>
    </sec>
    <sec id="sec-2">
      <title>2. Preliminaries</title>
      <p>0000-0001-5559-8300 (M. Onizuka)
CEUR</p>
      <p>CEUR
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License that this query is composed of  conjunctive conditions,
Attribution 4.0 International (CC BY 4.0).</p>
      <p>We denote the query provided by the user as  , and assume
CEUR</p>
      <p>ceur-ws.org
with each condition represented as   for 0 ≤  ≤  − 1 .</p>
      <p>Then, the query can be expressed as:  =</p>
      <p>−1
⋀
=0  .</p>
      <p />
      <p>Next, we define the set of conditions of the query  as
  , and the power set of   as   (</p>
      <p>). A relaxed query
 ′, derived from  , is formulated using a proper subset
 ⊆   (
 ) ⧵ {  } as follows:
 ′ = ⋀   .</p>
      <p>∈</p>
      <p>Since the number of elements in   (
are 2 − 2 candidates for  ′, excluding  itself. We denote
the total number of records in the database as  , the total
number of columns in a record as  , and the number of
records the user wishes to obtain as a query result as  .</p>
      <p>
        )is 2 − 1, there
(
        <xref ref-type="bibr" rid="ref1">1</xref>
        )
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed Method</title>
      <sec id="sec-3-1">
        <title>3.1. Base Algorithm</title>
        <p>
          MMR. MMR is a combined score comprising (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) the rele- tween every pair of records (as per Equation 4), a task
vance of the recommended record set to the user-specified
query, and (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) the diversity within the recommended
record set. When the recommended record set is defined
as  ′, MMR is defined as follows [ 7]:
 (, 
′) =  (, 
′) +   (
′
)
relevance and diversity.
where  is a parameter that adjusts the balance between
        </p>
        <p>We define relevance and diversity as follows:
 (, 
 (
′) =
′) = {
1</p>
        <p>∑
 ∈ ′
1</p>
        <p>−1
∑
=0  (  ,  )</p>
        <p>min, ′∈ ′,≠ ′ ( (,</p>
        <p>if (
′
)
) if (
′) = 1
′) ≠ 1</p>
        <p>condition</p>
        <p>, and 0 otherwise. (
where  (  ,  ) returns 1 if the recor′d  satisfies the query</p>
        <p>)denotes # records in
 ′ Intuitively, Equation 3 represents the average ratio of
the number of matching conditions to the total number
of conditions for each record in  ′. Additionally, the</p>
        <p>function in Equation 4 defines the distance between
records, and any distance function can be applied. In this
study, the Manhattan distance is used for numerical data,
and the Hamming distance for categorical or binary data.</p>
        <p>Problem Definition.</p>
        <p>The problem is defined as follows:
given a query  that yields an empty-answer for single
table dataset  , the objective is to identify a subset of records
 ′ ⊆  , consisting of  records, where k is the number
that user designated, that maximizes  (,</p>
        <p>′).</p>
        <p>MMR is a metric designed for evaluating a set of records.</p>
        <p>To identify the recommended record set that maximizes
MMR, it is necessary to calculate and compare the
distances between all pairs of the  records. This process
incurs (</p>
        <p>)-time, which becomes impractical,
particularly for large values of  . As a result, methods have been
proposed to search for the recommended record set in a
greedy manner [8] for the sake of eficiency.
 (, 
 (,</p>
        <p>
          (
(
          <xref ref-type="bibr" rid="ref2">2</xref>
          )
(
          <xref ref-type="bibr" rid="ref3">3</xref>
          )
        </p>
        <p>To identify a recommended record set that maximizes the
MMR score, it is necessary to calculate the distance
bethat is computationally intensive. Consequently, existing
methods [8, 9] employ heuristic approaches to find
approximate answers, including the state-of-the-art method [9]
which incurs a time complexity of ()</p>
        <p>to find these
approximate answers.</p>
        <p>
          Our approach also utilizes a heuristic method to
eficiently find approximate solutions. Considering that a
query can be viewed as an abstraction or specification of
its resulting records, we introduce the concept of
querylevel MMR as a preprocessing step for record-level MMR.
generate multiple relaxed queries from the original query,
aiming to maximize query-level MMR (as shown in the
top right corner of the figure). Subsequently, we acquire
the results of these relaxed queries (depicted at the bottom
(
          <xref ref-type="bibr" rid="ref4">4</xref>
          ) right corner) and select a recommended record set from
these results that maximizes the record-level MMR.
        </p>
        <p>We define query-level MMR by substituting a
recommended record set ( ′) with a relaxed query set ( ′) in</p>
        <sec id="sec-3-1-1">
          <title>Equation 2 as follows:</title>
          <p>′) =  (, 
′) +   (</p>
          <p>′).</p>
          <p>Additionally, we introduce metrics for query relevance
and diversity. Query relevance calculates the similarity
between the original query and a relaxed query set, while
query diversity measures the diversity within the relaxed
query set. These metrics are defined as follows:
′) =
1</p>
          <p>∑
  ′∈ ′ ()</p>
          <p>(
′) =  ″, ′∈ ′
min
, ′≠ ″ (
′
)
(
′ ∩  ″)
′ ∪  ″).</p>
          <p>
            (
            <xref ref-type="bibr" rid="ref5">5</xref>
            )
(
            <xref ref-type="bibr" rid="ref6">6</xref>
            )
(
            <xref ref-type="bibr" rid="ref7">7</xref>
            )
          </p>
          <p>denotes the number of conditions in a query  .</p>
          <p>
            Our method comprises two stages: (
            <xref ref-type="bibr" rid="ref1">1</xref>
            ) searching for
relaxed queries that maximize query-level MMR and
obtaining their results; (
            <xref ref-type="bibr" rid="ref2">2</xref>
            ) selecting a recommended record
set that maximizes record-level MMR from these results.
          </p>
          <p>By relaxing the original query provided by the user,
records that satisfy the relaxed query are considered as
the user’s query while contributing to the diversity of the
plexity of the proposed method is expressed as:
candidates for the recommended record set. These candi- between the values of each column.
date records should ideally satisfy more conditions from
If the cardinality of   is   (0 ≤   ≤ 1), the time
comrecommended record set. To achieve this, relaxed queries
are selected to maximize query-level MMR.</p>
          <p>As discussed in Section 2, there are 2 − 2 potential
candidates for relaxed queries. Directly searching through all
these candidates is impractical. In our proposed method,
we record pairs of conditions used in previous relaxed
queries and determine conditions greedily one by one for
new relaxed queries. This approach ensures diversity in
the records derived from relaxed queries by varying the
conditions within each group, and it can be achieved with
a time complexity of (</p>
          <p>2).</p>
          <p>For specific processing, initially, from the conditions in
the original query, we select one condition that has been
used the least in previous relaxed query groups for the
new relaxed query. For subsequent conditions, when the
set of already determined conditions is   ′, the condition
that minimizes ∑∈  ′
(,</p>
          <p>) is chosen. Here,
is a function that records the frequency of condition
pairs appearing in relaxed queries from past iterations. For
instance, if a user’s query contains conditions  1,  2,  3, and
the first recommended record search included  1,  2 in its
relaxed query, then (</p>
          <p>1,  2) = 1 and (</p>
          <p>In case of multiple conditions minimizing the 
tion, priority is given to the condition least used in prior
relaxed query groups. Finally, records satisfying all query
conditions just before the relaxed query yields no results
are considered as candidates. After determining the
relaxed query, all condition pairs in this query are recorded.</p>
          <p>1,  3) = 0.</p>
          <p>function. Starting from the second condition, evaluation is
conducted only on the record set that meets all previously
established conditions. This strategy significantly reduces
the number of records evaluated for each condition.</p>
          <p>After identifying candidates for the recommended
record set, we search for the record that maximizes the
MMR from that set. In doing so, we maintain the minimum
distance between the records selected in the recommended
record set to reduce computational costs. However, unlike
existing methods, our proposed method does not conduct a
full search of records, rendering the avoidance of duplicate
calculations for additional record candidates infeasible.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Complexity Analysis</title>
        <p>The exact time complexity of the proposed method is
influenced by the proportion of records satisfying each
condition in the query and the dependency relationship
between the sets of records satisfying multiple conditions,
which makes it challenging to calculate precisely. We
therefore consider the time complexity under the
general assumption that there is no dependency relationship
When searching for a relaxed query, records are
proWe revisit the proposed method. When determining the
gressively narrowed down with each determined
condiconditions of a query, the method does not explicitly
ad((
2 +  +</p>
        <p>−1
∑
ing conditions for the relaxed query and recording the
conditions used in  ′, and the second term pertains to the
complexity of calculating MMR for the record set obtained
from  ′ and determining the recommended record set. The
number of additional records is not factored in here. The
third term relates to the complexity of searching  ′. The
sum inside represents the number of records evaluated
for each condition.</p>
        <p>Considering these parameters, typically  and  are up
to 100 or smaller, while  is often larger. Therefore, when
, ,  ≪ 
, the time complexity approximates to:
(
−1</p>
        <p>
          ∑ (∏   )).
=0 =0
(
          <xref ref-type="bibr" rid="ref8">8</xref>
          )
        </p>
        <sec id="sec-3-2-1">
          <title>Equation 8 is  , we have</title>
          <p>For estimating complexity, consider a scenario where
the selection rate is identical for all conditions. If   in
(
−1</p>
          <p>∑ (∏   )) ≤ (
=0 =0
∞
=0
∑(  ) = (</p>
          <p>
            1
1 − 
)
(
            <xref ref-type="bibr" rid="ref9">9</xref>
            )
          </p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Further Optimizations</title>
        <p>dress scenarios involving multiple relevant conditions.
The primary objectives for obtaining a relaxed query, as
mentioned earlier, are to maintain high diversity among
queries and to retain as many conditions from the user’s
query as possible. In scenarios where conditions have
the same priority in the co-occurrence matrix, selecting
any of these conditions would similarly uphold the
diversity among queries. Consequently, when prioritizing
these conditions, the focus should be on obtaining a
relaxed query that preserves more conditions from the user’s
query. For optimization, we propose to select from
conditions with equal priority in the co-occurrence matrix
those likely to yield more records after evaluation.</p>
        <p>
          Methods for exploring the cardinality of conditions
include: (
          <xref ref-type="bibr" rid="ref1">1</xref>
          ) direct evaluation of the condition to calculate the
exact cardinality, and (
          <xref ref-type="bibr" rid="ref2">2</xref>
          ) utilizing cardinality estimation
for an approximate value. Each of these methods presents
its own advantages and disadvantages. Direct evaluation
provides precise cardinality but may lead to longer
execution times, especially when evaluating a few conditions
over a large number of records. In contrast, cardinality
estimation ofers more consistent execution times regardless
        </p>
        <p>Dataset. We follow the existing research on query
relaxation [4] to use the Cars dataset [13], which was released
by Mottin et al. After removing duplicate records, this
dataset comprises 128,443 records with 31 columns, all
containing boolean values. We employed 167 of queries.
These are the queries used in existing research [4] and
these are consists of 4 − 10 conditions.</p>
        <p>Competitors. In addition to the method proposed in
Section 3 (proposed base method) and the enhanced approach
described in Section 3.3 (proposed optimized method), we
included two comparison methods in our experiments: a
greedy method targeting the entire dataset (greedy) and
a random selection method (random). The threshold for
switching from cardinality estimation to direct execution
in the proposed optimized method is set at 10.
Environment. All experiments were performed on a
MacOS Ventura 13.2.1 machine equipped with an Apple M2
CPU and 24 GB of main memory. For the implementation
of all algorithms in the experiments, Python 3.8 was used.
We employed PostgreSQL for data storage, ensuring a
unified experimental environment between the proposed
base method and the exhaustive search method. The
reported execution times exclude the dataset loading times.
To implement the cardinality estimation in the proposed
optimized method, DeepDB [10] was utilized. The model
used in this experiment was prepared beforehand.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5. Conclusion</title>
      <p>In this study, we formulated an evaluation metric that
considers both diversity and relevance in the field of databases.
We proposed a method that searches for a record set that
can be presented quickly compared to existing methods,
solving the empty-answer problem. We further improved
the method by employing cardinality estimation. In
addition, we conducted an empirical evaluation on a dataset
and queries used in previous research, confirming that our
method achieves significant speed improvements while
maintaining accuracy.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This work is supported by JSPS Kakenhi 22H03903,
23H03406, 23K17456, and CREST JPMJCR22M2.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Basu Roy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. Das</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          <string-name>
            <surname>Nambiar</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Mohania</surname>
          </string-name>
          ,
          <article-title>Minimum-efort driven dynamic faceted search in structured databases</article-title>
          ,
          <source>in: CIKM</source>
          ,
          <year>2008</year>
          , pp.
          <fpage>13</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>N.</given-names>
            <surname>Koudas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Tung</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Vernica</surname>
          </string-name>
          ,
          <article-title>Relaxing join and selection queries</article-title>
          ,
          <source>in: VLDB</source>
          ,
          <year>2006</year>
          , pp.
          <fpage>199</fpage>
          -
          <lpage>210</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Mishra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Koudas</surname>
          </string-name>
          ,
          <article-title>Interactive query refinement</article-title>
          ,
          <source>in: EDBT</source>
          ,
          <year>2009</year>
          , pp.
          <fpage>862</fpage>
          -
          <lpage>873</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D.</given-names>
            <surname>Mottin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marascu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. B.</given-names>
            <surname>Roy</surname>
          </string-name>
          , G. Das,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Velegrakis</surname>
          </string-name>
          ,
          <article-title>A holistic and principled approach for the empty-answer problem</article-title>
          ,
          <source>The VLDB Journal</source>
          <volume>25</volume>
          (
          <year>2016</year>
          )
          <fpage>597</fpage>
          -
          <lpage>622</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Agrawal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chaudhuri</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. Das</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Gionis</surname>
          </string-name>
          ,
          <source>Automated ranking of database query results</source>
          ,
          <source>CIDR</source>
          (
          <year>2003</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaminskas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bridge</surname>
          </string-name>
          , Diversity, serendipity, novelty, and
          <article-title>coverage: a survey and empirical analysis of beyond-accuracy objectives in recommender systems</article-title>
          ,
          <source>ACM Transactions on Interactive Intelligent Systems</source>
          <volume>7</volume>
          (
          <year>2016</year>
          )
          <fpage>1</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Carbonell</surname>
          </string-name>
          , J. Goldstein,
          <article-title>The use of mmr, diversitybased reranking for reordering documents and producing summaries</article-title>
          ,
          <source>in: SIGIR</source>
          ,
          <year>1998</year>
          , pp.
          <fpage>335</fpage>
          -
          <lpage>336</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>I.</given-names>
            <surname>Catallo</surname>
          </string-name>
          , E. Ciceri,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fraternali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Martinenghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tagliasacchi</surname>
          </string-name>
          ,
          <article-title>Top-k diversity queries over bounded regions</article-title>
          ,
          <source>ACM Transactions on Database Systems</source>
          <volume>38</volume>
          (
          <year>2013</year>
          )
          <fpage>1</fpage>
          -
          <lpage>44</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K.</given-names>
            <surname>Hirata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Amagata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Fujita</surname>
          </string-name>
          , T. Hara,
          <article-title>Solving diversity-aware maximum inner product search eficiently and efectively</article-title>
          , in: CIKM,
          <year>2022</year>
          , pp.
          <fpage>198</fpage>
          -
          <lpage>207</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>B.</given-names>
            <surname>Hilprecht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kulessa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Molina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Kersting</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Binnig</surname>
          </string-name>
          ,
          <article-title>Deepdb: Learn from data, not from queries!</article-title>
          ,
          <source>Proceedings of the VLDB Endowment</source>
          <volume>13</volume>
          (
          <year>2020</year>
          )
          <fpage>992</fpage>
          -
          <lpage>1005</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Liang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kamsetty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Duan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Abbeel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Hellerstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Krishnan</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Stoica</surname>
          </string-name>
          ,
          <article-title>Deep unsupervised cardinality estimation</article-title>
          , arXiv preprint arXiv:
          <year>1905</year>
          .
          <volume>04278</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.</given-names>
            <surname>Ito</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sasaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Onizuka</surname>
          </string-name>
          , Scardina:
          <article-title>Scalable join cardinality estimation by multiple density estimators</article-title>
          ,
          <source>arXiv preprint arXiv:2303.18042</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D.</given-names>
            <surname>Mottin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. B.</given-names>
            <surname>Roy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Marascu</surname>
          </string-name>
          , , G. Das,
          <string-name>
            <given-names>T.</given-names>
            <surname>Palpanas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Velegrakis</surname>
          </string-name>
          ,
          <article-title>A holistic and principled approach for the empty-answer problem</article-title>
          , https://helios2.mi. parisdescartes.fr/~themisp/queryrelaxation,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>