<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>X (R. Eschauzier);</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Prioritization during Traversal-based Query Processing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ruben Eschauzier</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ruben Taelman</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ruben Verborgh</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Workshop</string-name>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Link Traversal, Link Prioritization, Query Optimization</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IDLab, Department of Electronics and Information Systems, Ghent University - imec</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>The decentralization envisioned for the current centralized web requires querying approaches capable of accessing multiple small data sources while complying with legal constraints related to personal data, such as licenses and the GDPR. Link Traversal-based Query Processing (LTQP) is a querying approach designed for highly decentralized environments that satisfies these legal requirements. An important optimization avenue in LTQP is the order in which links are dereferenced, which involves prioritizing links to query-relevant documents. However, assessing and comparing the algorithmic performance of these systems is challenging due to various compounding factors during query execution. Therefore, researchers need an implementation-agnostic and deterministic metric that accurately measures the marginal efectiveness of link prioritization algorithms in LTQP engines. In this paper, we motivate the need for accurately measuring link prioritization performance, define and test such a metric, and outline the challenges and potential extensions of the proposed metric. Our findings show that the proposed metric highlights diferences in link prioritization performance depending on the queried data fragmentation strategy. The proposed metric allows for evaluating link prioritization performance and enables easily assessing the efectiveness of future link prioritization algorithms.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Currently, web user data is stored in centralized data silos, which restricts innovation and poses privacy
concerns [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] Decentralized eforts like Solid distribute user data across multiple personal data stores,
which are highly personal and permissioned, requiring a querying approach that efectively handles
decentralized and permissioned data.
      </p>
      <p>
        Link Traversal-based Query Processing (LTQP) is an integrated querying approach designed to meet
these requirements [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Starting from seed data sources, the LTQP query engine dynamically discovers
new data sources by following hyperlinks found in previously discovered sources.
      </p>
      <p>
        However, LTQP is currently slower compared to centralized alternatives. An optimization avenue
is link prioritization, where the engine dynamically determines the order in which links should be
dereferenced based on their expected relevance to the query [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ].
      </p>
      <p>To optimize LTQP prioritization strategies, researchers need insights into the marginal performance
of proposed algorithms and the theoretically optimal performance these strategies can achieve.</p>
      <p>
        Previous studies have used metrics like result arrival times, total execution time, and other arrival
time-based metrics [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] to assess prioritization strategies [
        <xref ref-type="bibr" rid="ref3 ref4 ref6">3, 4, 6</xref>
        ]. These measures fail to capture how
close implemented prioritization approaches are to the theoretically optimal strategy. Additionally,
these metrics fall short when comparing diferent engine implementations and environments due to
unrelated diferences that may skew link prioritization performance results:
• Programming Languages: Diferent languages can substantially afect engine performance [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ],
hindering purely time-based comparisons of algorithms.
LGOBE
      </p>
      <p>CEUR</p>
      <p>
        ceur-ws.org
• Dereferencing Overhead: Network communication times vary widely [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ], introducing
significant noise into execution times, even for comparisons of the same engine.
• Orthogonal Optimization Strategies: Diferent query optimization [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] strategies might create
diferences in performance independent of prioritization strategies.
      </p>
      <p>This leads to our research questions: How can we compare the marginal algorithmic performance of
prioritization algorithms in an implementation-agnostic manner and How can we define an Oracle baseline
that represents optimal prioritization performance?</p>
      <p>In this paper, we introduce the metric: Relevant Retrieval Ratio ( 3), pronounced r-cubed. This
metric allows for straightforward observation of the algorithmic performance of prioritization
approaches, independent of implementations, and provides a benchmark against the theoretically optimal
prioritization strategy.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Existing Metrics</title>
      <p>
        Existing LTQP and federated querying metrics [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] are insuficient for measuring the marginal
algorithmic performance of link prioritization algorithms:
• Query execution time: Prioritization algorithms are unlikely to improve total query execution
time [
        <xref ref-type="bibr" rid="ref3 ref4">4, 3</xref>
        ]. However, Link prioritization can improve [
        <xref ref-type="bibr" rid="ref3 ref4">4, 3</xref>
        ] arrival times of the first  results.
However, the arrival times are influenced by both prioritization strategies and other unrelated
engine optimizations, obscuring the prioritization algorithm’s efect.
• Dieficiency Metrics : Dieficiency metrics measure the continuous eficiency of query engines
by reflecting the density of produced results over a given time interval [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. This is in contrast to
measuring the first  arrival times, which only measures engine performance at discrete points.
      </p>
      <p>
        However, the same limitations remain, as these metrics are based on result arrival times.
• Number of HTTP Requests: For federated query engines, the number of HTTP requests
represents the eficiency of source selection and join planning during query execution [
        <xref ref-type="bibr" rid="ref12 ref13">12, 13</xref>
        ].
Federated engines often query SPARQL endpoints, allowing them to delegate parts of the query
execution to the server [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. In contrast, in LTQP, the engine uses HTTP requests to download
documents, so the number of requests corresponds to the number of dereferenced documents.
Thus, the number of HTTP requests measures the engine’s ability to prune irrelevant links.
However, it does not indicate the efectiveness of prioritization algorithms, which reorder link
dereference order, rather than pruning links.
      </p>
      <p>The metric proposed in this paper complements existing metrics by measuring the marginal
performance of link prioritization algorithms rather than the overall performance of the LTQP system.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Relevant Retrieval Ratio</title>
      <p>
        The subweb [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] traversed during LTQP can be represented as a directed graph, where an edge from
document A to document B indicates that a URI pointing to document B was first discovered in the
data obtained from document A. Figure 1 shows an example of such a directed graph.
      </p>
      <p>During LTQP query execution, an optimal link dereference order that finds all documents needed
to answer a query exists, as shown in the right-most directed graph in Figure 1. This theoretical
optimum can be considered an oracle link prioritization algorithm with perfect information on the
location of query-relevant documents. The optimal path can always be achieved, as the oracle only
uses links between documents that an engine could also use. An engine should strive to achieve a
traversal path close to this optimal path. As such, we use the ratio between the optimal traversal path
length and the actual traversal path length during query execution to evaluate the performance of
link prioritization algorithms. The optimal traversal path is defined as the shortest path needed to
dereference all documents required for the complete query result and is determined retrospectively.</p>
      <sec id="sec-3-1">
        <title>Traversed node</title>
      </sec>
      <sec id="sec-3-2">
        <title>Relevant document</title>
      </sec>
      <sec id="sec-3-3">
        <title>Seed document</title>
        <p>Given the length of the optimal traversal path and the actual traversal path, we define the  3 metric
 3 = |  | ,
|  |
|  |, |  | &gt; 0
(1)
where |  | is the length of the engine traversal path and |  | is the length of the optimal path. In
this definition, a higher value is better, with  3 = 1 indicating optimal algorithmic link prioritization
performance. Note that when |  | = 0 or |  | = 0, our query has no results and the notion of optimal
link prioritization does not exist. When an engine performs poorly, resulting in smaller diferences in
the metric’s values, taking the log2 can help visualize relative diferences.</p>
        <p>Using the  3 metric, we can compare the algorithmic eficiency of the two traversal algorithms in our
example. Using breadth-first search, the path to dereference the query-relevant document contains six
nodes, while the optimal traversal order contains two. Thus, the  3 metric for breadth-first traversal
equals 0.33. Depth-first visits only three nodes, yielding an  3 value of 0.67. From this analysis, we can
conclude that depth-first traversal outperforms breadth-first traversal on this subweb.</p>
        <p>
          To find the optimal traversal path to dereference all query-relevant documents, we notice that it
is equivalent to solving the directed Steiner tree problem [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] on our directed graph, with the
queryrelevant documents serving as terminals. As such, we can use existing Steiner tree solvers to eficiently
ifnd our optimal traversal path.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>
        Our experiment uses the Discover queries from the SolidBench benchmark [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. To track the required
information for computing the metric during query execution, we use the query engine Comunica [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
To compute the optimal path, we will use a heuristic Steiner tree solver [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] to speed up computations.
Although an exact solver would be ideal, it significantly increases computational complexity for large
subwebs [
        <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
        ].
      </p>
      <p>
        Our experiments compare depth and breadth-first prioritization [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] using  3 and time until the final
query result. We aim to validate the substantiated assumption that Discover queries do not benefit from
improved link prioritization [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
      </p>
      <p>
        Table 1 shows the computed metrics per template instantiation and the diference in the log2 of
the metric value. We find that either method can outperform the other based on the used template.
Furthermore, there is a significant diference in performance depending on the template instantiation,
likely due to the diferent fragmentation strategies used in SolidBench [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Some pods store
queryrelevant data in a single file, while others fragment these into directories, complicating optimal traversal.
      </p>
      <p>Query
D1.0
D1.1
D1.2
D1.3
D1.4
D2.0
D2.1
D2.2
D2.3
D2.4
D3.0
D3.1
D3.2
D3.3
D3.4
D4.0
D4.1
D4.2
D4.3
D4.4</p>
      <p>DF</p>
      <p>The diferences in prioritization performance are not observed in time until the last result; the Pearson
correlation coeficients between  3 and time until the last result are -0.042 and 0.201 for depth and
breadth-first, respectively. These low correlations suggest a weak link between  3 and query execution
time, possibly caused by noise in execution time.</p>
      <p>
        Using the  3, we confirm our previous paper [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] by computing a metric value instead of an
enginespecific in-depth analysis of the query execution progress.
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion</title>
      <p>The current definition and implementation of the Relevant Retrieval Ratio (  3) have several limitations.
First, due to the computational complexity of the Steiner tree problem for graphs and the lack of readily
available exact solvers that work on directed graphs, our implementations use heuristics, thus leading
to potentially suboptimal traversal path lengths. Second, the metric can not be computed when a query
produces no results, either due to a timeout or no results existing for the query. Finally, our metric uses
theoretically optimal paths. In practice, document dereference times can vary, making the theoretically
optimal path potentially suboptimal. In future work, more extensive benchmarking of the metric is
required to validate its efectiveness in measuring prioritization performance. Furthermore, a new
metric that includes a penalty term for HTTP request time would account for real-world uncertainties
in LTQP scenarios. Finally, an  3 metric for the first  results can be defined.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Acknowledgments</title>
      <p>This research was supported by SolidLab Vlaanderen (Flemish Government, EWI and RRF project
VV023/10). Ruben Taelman is a postdoctoral researcher at the Research Foundation – Flanders (FWO).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          ,
          <article-title>A data ecosystem fosters sustainable innovation</article-title>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          ,
          <article-title>The web's data triad</article-title>
          ,
          <year>2024</year>
          . URL: https://ruben.verborgh.org/blog/2024/05/30/ the-webs
          <article-title>-data-triad/.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Taelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          ,
          <article-title>Link traversal query processing over decentralized environments with structural assumptions</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2023</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>O.</given-names>
            <surname>Hartig</surname>
          </string-name>
          , M. T. Özsu,
          <article-title>Walking without a map: Ranking-based traversal for querying linked data</article-title>
          ,
          <source>in: The Semantic Web-ISWC</source>
          <year>2016</year>
          : 15th International Semantic Web Conference, Kobe, Japan,
          <source>October 17-21</source>
          ,
          <year>2016</year>
          , Proceedings,
          <source>Part I 15</source>
          , Springer,
          <year>2016</year>
          , pp.
          <fpage>305</fpage>
          -
          <lpage>324</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Acosta</surname>
          </string-name>
          , M.-E. Vidal,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sure-Vetter</surname>
          </string-name>
          ,
          <article-title>Dieficiency metrics: measuring the continuous eficiency of query processing approaches</article-title>
          ,
          <source>in: The Semantic Web-ISWC</source>
          <year>2017</year>
          : 16th International Semantic Web Conference, Vienna, Austria,
          <source>October 21-25</source>
          ,
          <year>2017</year>
          , Proceedings,
          <source>Part II 16</source>
          , Springer,
          <year>2017</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>19</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Hanski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Taelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          ,
          <article-title>Observations on bloom filters for traversal-based query execution over solid pods (</article-title>
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Fourment</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Gillings</surname>
          </string-name>
          ,
          <article-title>A comparison of common programming languages used in bioinformatics</article-title>
          ,
          <source>BMC bioinformatics 9</source>
          (
          <year>2008</year>
          )
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Christiansen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Jefay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. D.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <article-title>Tuning red for web trafic</article-title>
          ,
          <source>ACM SIGCOMM Computer Communication Review</source>
          <volume>30</volume>
          (
          <year>2000</year>
          )
          <fpage>139</fpage>
          -
          <lpage>150</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K.</given-names>
            <surname>KOBAYASHI</surname>
          </string-name>
          ,
          <string-name>
            <surname>T. KATAYAMA</surname>
          </string-name>
          ,
          <article-title>Analysis and evaluation of packet delay variance in the internet</article-title>
          ,
          <source>IEICE transactions on communications 85</source>
          (
          <year>2002</year>
          )
          <fpage>35</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>O.</given-names>
            <surname>Hartig</surname>
          </string-name>
          ,
          <article-title>Zero-knowledge query planning for an iterator implementation of link traversal based query execution</article-title>
          ,
          <source>in: Extended Semantic Web Conference</source>
          , Springer,
          <year>2011</year>
          , pp.
          <fpage>154</fpage>
          -
          <lpage>169</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>G.</given-names>
            <surname>Montoya</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-E. Vidal</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Corcho</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Ruckhaus</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Buil-Aranda</surname>
          </string-name>
          ,
          <article-title>Benchmarking federated sparql query engines: Are existing testbeds enough?</article-title>
          ,
          <source>in: The Semantic Web-ISWC</source>
          <year>2012</year>
          : 11th International Semantic Web Conference, Boston, MA, USA, November
          <volume>11</volume>
          -
          <issue>15</issue>
          ,
          <year>2012</year>
          , Proceedings,
          <source>Part II 11</source>
          , Springer,
          <year>2012</year>
          , pp.
          <fpage>313</fpage>
          -
          <lpage>324</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>P.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Ge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Özsu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <article-title>Optimizing multi-query evaluation in federated rdf systems</article-title>
          ,
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>33</volume>
          (
          <year>2019</year>
          )
          <fpage>1692</fpage>
          -
          <lpage>1707</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Görlitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Ladwig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schwarte</surname>
          </string-name>
          , T. Tran,
          <article-title>Fedbench: A benchmark suite for federated semantic data query processing</article-title>
          ,
          <source>in: The Semantic Web-ISWC</source>
          <year>2011</year>
          : 10th International Semantic Web Conference, Bonn, Germany,
          <source>October 23-27</source>
          ,
          <year>2011</year>
          , Proceedings,
          <source>Part I 10</source>
          , Springer,
          <year>2011</year>
          , pp.
          <fpage>585</fpage>
          -
          <lpage>600</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Schwarte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Hose</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Schenkel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schmidt</surname>
          </string-name>
          , Fedx:
          <article-title>Optimization techniques for federated query processing on linked data, in: The Semantic Web-ISWC</article-title>
          <year>2011</year>
          : 10th International Semantic Web Conference, Bonn, Germany,
          <source>October 23-27</source>
          ,
          <year>2011</year>
          , Proceedings,
          <source>Part I 10</source>
          , Springer,
          <year>2011</year>
          , pp.
          <fpage>601</fpage>
          -
          <lpage>616</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>O.</given-names>
            <surname>Hartig</surname>
          </string-name>
          , J.-C.
          <article-title>Freytag, Foundations of traversal based query execution over linked data</article-title>
          ,
          <source>in: Proceedings of the 23rd ACM conference on Hypertext and social media</source>
          ,
          <year>2012</year>
          , pp.
          <fpage>43</fpage>
          -
          <lpage>52</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zosin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Khuller</surname>
          </string-name>
          ,
          <article-title>On directed steiner trees</article-title>
          ,
          <source>in: SODA</source>
          , volume
          <volume>2</volume>
          ,
          <year>2002</year>
          , pp.
          <fpage>59</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>R.</given-names>
            <surname>Taelman</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Van Herwegen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Vander</given-names>
            <surname>Sande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          ,
          <article-title>Comunica: a modular sparql query engine for the web</article-title>
          ,
          <source>in: The Semantic Web-ISWC</source>
          <year>2018</year>
          : 17th International Semantic Web Conference, Monterey, CA, USA, October 8-
          <issue>12</issue>
          ,
          <year>2018</year>
          , Proceedings,
          <source>Part II 17</source>
          , Springer,
          <year>2018</year>
          , pp.
          <fpage>239</fpage>
          -
          <lpage>255</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>D.</given-names>
            <surname>Watel</surname>
          </string-name>
          , M.
          <article-title>-A. Weisser, A practical greedy approximation for the directed steiner tree problem</article-title>
          ,
          <source>Journal of Combinatorial Optimization</source>
          <volume>32</volume>
          (
          <year>2016</year>
          )
          <fpage>1327</fpage>
          -
          <lpage>1370</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>F. K.</given-names>
            <surname>Hwang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Richards</surname>
          </string-name>
          ,
          <article-title>Steiner tree problems</article-title>
          ,
          <source>Networks</source>
          <volume>22</volume>
          (
          <year>1992</year>
          )
          <fpage>55</fpage>
          -
          <lpage>89</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A.</given-names>
            <surname>Lucas</surname>
          </string-name>
          ,
          <article-title>Ising formulations of many np problems</article-title>
          ,
          <source>Frontiers in physics 2</source>
          (
          <year>2014</year>
          )
          <fpage>74887</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>R.</given-names>
            <surname>Eschauzier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Taelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verborgh</surname>
          </string-name>
          ,
          <article-title>How does the link queue evolve during traversal-based query processing? (</article-title>
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>