<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards More Intelligent SPARQL Querying Interfaces</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hashim Khan</string-name>
          <email>hashim.khan@uni-paderborn.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Data Science Group, Paderborn University</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Over years, the Web of Data has grown significantly. Various interfaces such as SPARQL endpoints, data dumps, and Triple Pattern Fragments (TPF) have been proposed to provide access to this data. Studies show that many of the SPARQL endpoints have availability issues. The data dumps do not provide live querying capabilities. The TPF solution aims to provide a trade-off between the availability and performance by dividing the workload among TPF servers and clients. In this solution, the TPF server only performs the triple patterns execution of the given SPARQL query. While the TPF client performs the joins between the triple patterns to compute the final resultset of the SPARQL query. High availability is achieved in TPF but increase in network bandwidth and query execution time lower the performance. We want to propose a more intelligent SPARQL querying server to keep the high availability along with high query execution performance, while minimizing the network bandwidth. The proposed server will offer query execution services (can be single triple patterns or even join execution) according to the current status of the workload. If a server is free, it should be able to execute the complete SPARQL query. Thus, the server will offer execution services while avoiding going beyond the maximum query processing limit, i.e. the point after which the performance start decreasing or even service shutdown. Furthermore, we want to develop a more intelligent client, which keeps track of a server's processing capabilities and therefore avoid DOS attacks and crashes.</p>
      </abstract>
      <kwd-group>
        <kwd>Availability</kwd>
        <kwd>Performance</kwd>
        <kwd>Intelligent TPF Server</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The problem we want to address focuses on the trade-off between the availability
and performance in Linked Data interfaces.</p>
      <p>
        A large amount of Linked Data is available on the web and it keeps on
increasing day by day. According to LODStats1, a total of ∼ 150 billion triples
available from 9960 datasets. Querying this massive amount of data in a
scalable way is particularly challenging. SPARQL is the primary query language to
retrieve data from RDF linked datasets [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. The true value of these datasets
becomes apparent when users can execute arbitrary queries over them, to
retrieve precisely those facts they are interested in [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Various interfaces have
been proposed to provide access to this data:
SPARQL endpoints offer a public interface to execute SPARQL queries over
the underlying RDF data hosted by endpoints. In this interface, the client sends
a complete SPARQL query to the server (i.e., SPARQL endpoint). The server
executes the complete query and returns the final results. The responsibility of
query execution is completely on the server side, while the client is idle most of
the time. Here, better performance in query execution is ensured due to the use
of indexes for query plan generation. Also, the network overhead is reduced as
compared to the TFP server because in a case of SPARQL endpoint, only the
final result of the query need to be transferred over the network. However many
of the current SPARQL endpoints suffer from low availability rates due to its
expressiveness towards query execution of all types [
        <xref ref-type="bibr" rid="ref28 ref8">28,8</xref>
        ]. A recent study [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]
shows that more than 60% of endpoints are offline.
      </p>
      <p>
        Triple Pattern Fragments interface tries to tackle the problem of SPARQL
endpoints and aims to provide a trade-off between the availability and
performance. The TPF interface divides the workload among servers and clients. In
this solution, unlike SPARQL endpoints, the TPF server does not execute the
complete SPARQL query. Rather, the server only performs the triple patterns
execution of the given SPARQL query. While the TPF client performs the joins
between the triple patterns to compute the final resultset of the SPARQL query.
Study [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] shows the TPF interface achieves high availability at the expense of
increased bandwidth and slower query execution times. This is because the client
fetches the results of the complete triple patterns of the query and performs local
joins. As such, it is possible that most of the triple pattern results are excluded
after performing joins with the results of another triple pattern. Furthermore,
in the case of TPF, the server selects and fetches the triples according to the
triple patterns given in the query. After that, the server transfers these triples
to the client for further execution, that is, to apply joins on it and to generate
the final results. The approach is inefficient because the server’s only job is to
fetch the triples but not to execute the actual query. Also, transferring a large
number of triples towards the client, makes the network overloaded which is an
obvious reason for its low performance.
      </p>
      <p>Data dumps offer another option to provide access to Linked Data. In this
interface, the data is made publicly available as data dumps. The data dumps
first need to be downloaded and later be processed on local machines to execute
SPARQL queries. But this way of accessing data goes against the real aim of
Semantic Web, i.e. it is not live querying.</p>
      <p>Our Proposal: The main objective of this research is to provide a more
intelligent TPF interface in which the server and client negotiate first, and then
divide the task of query processing, according to the current status of the server.
Thus, it is not hard coded that TPF server will only execute triple patterns and
TPF client will perform the joins between triple patterns. In the proposed
interface, the server can even execute the complete SPARQL query, provided that
current workload is not reaching beyond its processing capabilities. In this
proposed work, we aim to achieve high availability as well as good query runtime
performance.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Preliminaries</title>
      <p>In this section, we formally define key definitions and basic concepts, which are
used in this research proposal and my future PhD work .</p>
      <sec id="sec-2-1">
        <title>Definition 1 (RDF Term, RDF Triple and Data Source). Assume there</title>
        <p>are pairwise disjoint infinite sets I, B, and L (IRIs, Blank nodes, and Literals,
respectively). Then the RDF term RT = I ∪ B ∪ L. The triple (s, p, o) ∈ (I ∪
B) × I × (I ∪ B ∪ L) is called an RDF triple, where s is called the subject, p
the predicate and o the object. An RDF data set or dataset d is a set of RDF
triples d = {(s1, p1, o1), . . . , (sn, pn, on)}.</p>
      </sec>
      <sec id="sec-2-2">
        <title>Definition 2 (Query Triple Pattern and Basic Graph Pattern). By using</title>
        <p>Definition 1 and assume an infinite set V of variables. A tuple tp ∈ (I ∪ L ∪ V ∪
B) × (I ∪ V ) × (I ∪ L ∪ V ∪ B) is a triple pattern. A Basic Graph Pattern is a
finite set of triple patterns.</p>
      </sec>
      <sec id="sec-2-3">
        <title>Definition 3 (Solution Mapping). A mapping μ from V to RT is a partial</title>
        <p>function μ : V → RT where RT = (I ∪ B ∪ L). For a triple pattern pattern
obtained by replacing the variables in tp according to μ. The domain of μ, denoted
by dom(μ), is the subset of V where μ is defined. We sometimes write down
concrete mappings in square brackets, for instance, μ = [?X → a, ?Y → b] is the
mapping with dom(μ) = {?X, ?Y } such that, μ(?X) = a and μ(?Y ) = b.</p>
      </sec>
      <sec id="sec-2-4">
        <title>Definition 4 (Triple Pattern Matching). Let d be a dataset (we call it data</title>
        <p>source as well) with set of RDF as the set of mappings [[tp]]d = {μ : V →
RT | dom(μ) = var(P ) and μ(P ) ⊆ d}. If μ ∈ [[tp]]d, we say that μ is a solution
for tp in d. If a data source d has at least one solution for a triple pattern tp,
then we say d matches tp.</p>
      </sec>
      <sec id="sec-2-5">
        <title>Definition 5 (Linked Data Fragment). According to [28], “a Linked Data</title>
        <p>Fragment (LDF) of a dataset is a resource consisting of those triples of this
dataset that match a specific selector, together with their metadata and
hypermedia controls to retrieve other Linked Data Fragments. We define a specific type
of ldfs that require minimal effort to generate by a server, while still enabling
efficient querying on the client side”.</p>
      </sec>
      <sec id="sec-2-6">
        <title>Definition 6 (Triple Pattern Fragment). According to [28], “a Triple Pat</title>
        <p>tern Fragment (TPF) is a Linked Data Fragment with a triple pattern as selector,
count metadata, and the controls to retrieve any other triple pattern fragment
of the dataset. Each page of a triple pattern fragment contains a subset of the
matching triples, together with all metadata and controls”.</p>
        <p>Now we explain the query execution process in TPF interfaces. For basic
graph patterns (BGPs), the algorithm works as follows:
1. For each triple pattern tpi in the BGP B = {tp1, . . . , tpn}, fetch the first
page φi1of the triple pattern fragment fi for tpi, which contains an estimate
cnti of the total number of matches for tpi. Choose ∈ such that cnt∈ =
min(cnt1, . . . , cntn). f∈ is then the optimal fragment to start with.
2. Fetch all remaining pages of the triple pattern fragment f∈. For each triple
t ∈ LDF , generate the solution mapping μt such that μt(tp∈) = t. Compose
the subpattern Bt = {tp|tp = μt(tpj)∧tpj ∈ B}\{t}. If Bt 6= ∅, find mappings
ΩBt by recursively calling the algorithm for Bt. Else, ΩBt = {μ∅} with μ∅
the empty mapping.
3. Return all solution mappings μ ∈ {μt ∪ μ0|μ0 ∈ ΩBt }.
2.1</p>
      </sec>
      <sec id="sec-2-7">
        <title>Use Case Scenario</title>
        <p>Consider the following SPARQL query to be executed on TPF interface of the
DBpedia TPF 2, to find countries and their capitals of Europe:</p>
        <sec id="sec-2-7-1">
          <title>PREFIX dbo:&lt;http://dbpedia.org/ontology/&gt; PREFIX dct:&lt;http://purl.org/dc/terms/&gt; PREFIX dbr:&lt;http://dbpedia.org/resource/&gt;</title>
        </sec>
        <sec id="sec-2-7-2">
          <title>SELECT ?countries ?city</title>
          <p>WHERE {
?countries dbo:capital ?city . #tp1 = 111
?countries dct:subject dbr:Category:Countries_in_Europe. #tp2 = 1741
}</p>
          <p>In case of TPF server it only performs the triple patterns execution, that
is, it has returned 111 triples which matched with the triple pattern 1 (tp1)
while that of 1741 with triple pattern 2 (tp2). After the server fetches these 111
+ 1741 = 1851 triples in total, the client performs the join between the triple
patterns to compute the final resultset of the SPARQL query, which has only 48
results 3. Fetching so many triples to generate such a small result, clearly shows
that the high availability benefits of the TPF come at the expense of increased
bandwidth and slower query execution times.</p>
          <p>The proposed TPF server will offer query execution services (can be single
triple patterns or even join execution) according to the current status of the
workload. If a server is free, it should be able to execute the complete SPARQL
query. Thus, the server will offer execution services while avoiding reaching the
maximum query processing limit, i.e. service shutdown. This type of service is
possible with coordination between server and client in an intelligent way, which
2 DBpedia TPF: http://fragments.dbpedia.org/
3 As a result of this query, we got 48 number of result rows.
is the aim of this research. In the motivating example, if the server is able to
execute the complete query, then only the final query result (i.e., 48 in total)
will be fetched across the network.
3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Relevancy</title>
      <p>Performance can be defined as the rate at which a task can be completed. In our
case, performance is the number of querying executed by the proposed interface
in a time interval. Availability is the characteristic of a server where it can listen
to client request and respond well in time. Due to the increasing trend of the
Semantic Web, developers of the applications care about a reliable source which
can ensure the provision of linked data in a live queryable form.</p>
      <p>
        High performance with high availability is the requirement of almost every
data consumer. Whenever there is a large amount of data, people will want to
query it - and nothing is more intriguing than the vast amounts of Linked Data
published over the last few years [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Application developers require Linked Data
which should be available and can be queried in an efficient way.
      </p>
      <p>
        As discussed in section 1, existing interfaces offer either good performance or
availability but not both. In this research our aim is to make a trade-off between
performance and availability, which is shown in Fig. 1.
The easiest way to publish the Linked Data knowledge graph is to upload an
archive, usually called Data dumps, in an RDF format such as N-Triples or
N-Quads. Clients download these files through HTTP and use it according to
their needs. Advantage of using Data dump is its universality, i.e. client obtains
whole of the knowledge graph and then makes it available online through any
access point of its choice locally [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. The issue here is that this is not live
querying. Some initiatives have been taken to ease the usage of data dumps. For
example, the LOD Laundromat [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] obtains data dumps from the Web, cleans
up quality issues like serialization, and then republishes the dumps in the same
RDF format. However, this effort still does not handle the issue that data is not
live queryable: clients are still bound to download data dumps and upload them
in a SPARQL-based system before SPARQL queries can be executed.
      </p>
      <p>
        SPARQL endpoint uses the SPARQL protocol, which follows the
First-ComeFirst-Served principle, therefore the queries have to wait in queue for execution
by the server 4.Many triple stores, like Virtuoso [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and Jena TDB [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], offer a
SPARQL interface.While serving queries concurrently by a SPARQL endpoint,
it has to restrict the size of the result-set returned to the end user or to generate
a time-out error message in a case where a query spends too many resources [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
Therefore, we may not get the complete result-set which we should expect from
the query. Also, a characteristic of SPARQL endpoints is that, in addition to
the uncertainty of the total number of requests to be made by different users,
the execution cost per request varies significantly due to the relatively high
expressiveness of SPARQL [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. In 2013 a survey disclosed that most of the
public SPARQL endpoints had an uptime of less than 95% [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ].
      </p>
      <p>In order to tackle these issues, most public LOD providers introduced a
policy for the fair utilisation of the endpoint resources. As an example,
DBpedia administrator shared the specifications relating to public SPARQL endpoint
utilization. According to this specification, a query can have a maximum of 120
seconds execution time, 10000 results and 50 parallel connections per IP
address5. Since every data provider configures the endpoint of their own, thus it
will be difficult for application developers to rely on it.</p>
      <p>
        Another strategy in this regard is to decompose a query into sub-queries and
then to execute it under quotas [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The main shortcomings of this approach are
(i) knowing the quotas configured by the data provider is not always possible
and (ii) these quotas can not be applied to all the SPARQL queries to get the
correct evaluation result.
      </p>
      <p>
        Many other approaches exist to apply queries over Linked Data documents [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
One type of approach makes use of pre-populated index structures [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] and
another focuses on live exploration by the use of traversal-based query
execution [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Typically, these query execution approaches have longer query
execution times compared to SPARQL endpoints, but—unlike data dumps—permit
for live querying. The required bandwidth is generally less than that of the
data dumps, but the efficiency can still be low depending on the type of query.
Moreover, completeness with respect to a knowledge graph cannot be ensured,
and some of the queries are difficult or impossible to evaluate without the use
of an index [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. As an example, queries for patterns with unbound subjects
(e.g., ?s foaf:made &lt;o&gt;) show problems because Linked Data documents, by its
definition are subject-centric.
4 https://www.w3.org/TR/sparql11-overview/
5 https://wiki.dbpedia.org/public-sparql-endpoint
      </p>
      <p>
        The Triple Pattern Fragments (TPF) approach [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] is an implementation
of the Linked Data Fragments (LDF) [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], where the server's only job is to
evaluate triple pattern queries. In this approach, the triple pattern queries can
be executed in bounded time [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], therefore the TPF server does not face a
convoy effect [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. However, as joins have to be performed on the client side, a
huge amount of triples transfer from the server to the client leading to poor
SPARQL query execution performance.
      </p>
      <p>
        Hartig et al. further extended the idea of basic TPF in brTPF [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], where
by the use of well-known bind join strategy [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], it allows clients to attach
intermediate results to TPF requests. The response to such a brTPF request is
expected to contain RDF triples from the underlying dataset, that do not only
match the given triple pattern (as in the case of TPF), but also guarantees the
contribution in join with the given intermediate result. Hence, given the brTPF
interface, it becomes possible to distribute the execution of joins between client
and server. Due to this approach the overall throughput of the client-server
system is significantly reduced.
      </p>
      <p>
        Minier et al. proposed a SPARQL query engine based on Web
preemption [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. It allows SPARQL queries to be suspended by the web server after a
fixed time quantum and resumed upon client request. This approach has solved
the problem of partial result generation but the server may not perform well,
when all the parallel queries are of high cost at a time.
      </p>
      <p>The related work discussed here shows that, for the application developers
to rely on linked open data, there exists a gap between the live availability and
the efficient query execution. Therefore, in this the author aims to fill this gap.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Research Questions</title>
      <p>In order to balance the trade-off between availability and performance, through
intelligent TPF server and client, we have to answer the following research
questions:</p>
      <p>I- How can we define a Workload Threshold (WT) beyond which the server
query execution performance starts decreasing?
II- How to avoid the WT?
III- How the server should communicate with the client to convey its current
status? What services it can provide to the client?
IV- How the client will break the query into blocks according the the current
capabilities of the server?
6</p>
    </sec>
    <sec id="sec-5">
      <title>Hypothesis</title>
      <p>Our hypotheses are directly related to the performance and high availability:
H1. Performance: The null hypothesis pertaining to query runtime
performance states that the proposed approach would not lead to significant
performance improvements with respect to the state-of-the-art triple pattern fragment
interfaces. The alternative hypothesis states that the proposed approach would
lead to a significant performance improvement with respect to the
state-of-theart triple pattern fragment interfaces.</p>
      <p>H2. Availability: The null hypothesis pertaining to server availability states
that the proposed approach would not lead to availability comparable to the
state-of-the-art triple pattern fragment interfaces. The alternative hypothesis
states that the proposed approach would be able to achieve availability,
comparable to the state-of-the-art triple pattern fragment interfaces.
7</p>
    </sec>
    <sec id="sec-6">
      <title>Approach</title>
      <p>The approach proposed in this research work aims to provide a trade-off between
availability and performance by dividing the workload among the servers and
clients. Therefore, it is important to discuss the responsibilities of the client and
server.
7.1</p>
      <sec id="sec-6-1">
        <title>Server’s Responsibility</title>
        <p>The main functions of the proposed server are:
1. Workload Monitoring: As discussed, the proposed server offers services
according to the current workload status. Therefore, the server should be
able to monitor its current status of the workload and avoid reaching the
maximum WT limit.
2. Workload-aware Services: The server provides query execution services
according to the current status of the workload. As such, it is possible that
the server will execute the complete SPARQL query (like SPARQL
endpoints) or only execute individual triple patterns (like TPF), depending on
the current status of the workload. Thus, it is also possible that the query
execution plan is divided among the server and client, i.e., some of the query
joins are performed by the server while others are executed by the client.
3. Coordination: As the proposed server provides adaptive query execution
services, it is important to first negotiate a connection with client and
coordinate the current execution services that are available for the given
connection. The client then needs to decompose the given input query according to
the available services.
7.2</p>
      </sec>
      <sec id="sec-6-2">
        <title>Client’s responsibility</title>
        <p>The main functions of the client are:
1. Coordination: As discussed previously, the client has to take updates of
server's current available services and decompose the given SPARQL query
accordingly.
2. Query Planning The query execution plan will be generated on the client’s
side according to the current available services from the server. The partial
query execution can also be done by the client.</p>
        <p>
          At present, we are working on devising algorithms to perform the above
mentioned responsibilities. The first step is to know the WT for the different
available interfaces. To this end, we are conducting experiments by using Iguana [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], a
benchmark execution framework to analyze the performance of triplestores using
multiple parallel agents with/without updates. The goal of these experiments is
to calculate the maximum WT limit. We are increasing workload by using
multiple querying agents and analyzing the overall throughput until it reaches the
maximum limit. Once, we know the maximum WT limit for the proposed server,
the next step would be to make the server more intelligent, able to avoid reaching
the maximum WT limit.
8
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Evaluation Plan</title>
      <p>In this section, we briefly explain our evaluation plan to compare the
proposed approach with state-of-the-art approaches. Please note that this plan may
changes according to the availability of new benchmarks, performance metrics,
and new solutions for SPARQL query processing.</p>
      <p>
        Benchmark Selection: The first important step in our evaluation is the
selection of the most relevant and representative SPARQL benchmark. To this
end, SPARQL benchmarks such as BSBM [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], LUBM [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], DBpedia SPARQL
benchmarks [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], WatDiv [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], SP2Bench [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], and FEASIBLE [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] are
generally used in state-of-the-art evaluations of SPARQL querying interfaces. Recent
study [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] shows that synthetic benchmarks can be used to test the scalability
of the systems. However, they often fail to contain important SPARQL query
constructs such UNION, FILTER, OPTIONAL etc. Furthermore, the study reveals
that the DBpedia benchmark generated by FEASIBLE [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] framework is the
most diverse SPARQL benchmark on the specified SPARQL features. We plan
to use the most state-of-the-art WatDiv (synthetic) and FEASIBLE (real data
and queries) benchmarks in our evaluation.
      </p>
      <p>
        Metrics: In general, a SPARQL querying benchmark comprises of three main
components: (1) a set of RDF datasets, (2) a set of SPARQL queries, and (3) a set
of performance metrics [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. The first two components are directly provided by
the selected benchmark. There can be many performance metrics as discussed
in [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. We will mostly focus on the metrics relevant to the availability and
performance of the SPARQL querying interfaces.
      </p>
      <p>
        The metrics relevant to the query runtime performances are: (1) query
runtime, (2) Query Mix per Hour (QMpH), (3) Queries per Second (QpS), (4)
Number of intermediate results and (5) Network traffic generated by the
interfaces [
        <xref ref-type="bibr" rid="ref19 ref21 ref6">21,19,6</xref>
        ]. The metrics relevant to the availability of the systems are:
(1) Query processing overhead in terms of the CPU and memory usage (this
also includes the number of disk/memory swaps), (2) Parallelism with/without
Updates which measure the parallel query processing capabilities of the
querying interfaces by simulating workloads from multiple querying agents with and
without dataset updates [
        <xref ref-type="bibr" rid="ref30 ref6 ref9">9,30,6</xref>
        ].
      </p>
      <p>
        Systems: SAGE [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] and Communica [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] are state-of-the-art HDT-based
interfaces which we will compare with the proposed solution. In addition, SPARQL
endpoints interfaces backed by triplestores such as Virtuoso, BlazeGraph, GraphDB
etc. will also be considered in the evaluation.
      </p>
      <p>Best Practices: We will follow the standard best practices such as running
benchmark queries multiple times and presenting their average results, starting
with a warm up phase where certain test queries will be run before starting
the actual evaluation etc. Furthermore, we will use various significance tests
such as T-Test, Wilcoxon signed-rank test etc. to measure the significance of the
performance improvements. To ensure the reproducibility of our results, we will
make sure to provide public access to the complete data, queries, and the results
used in our evaluation. Furthermore, we will create a user manual to help in
reproducing the results.
9</p>
    </sec>
    <sec id="sec-8">
      <title>Reflections</title>
      <p>We believe that the real essence of the Semantic Web will be felt when
semantically Linked Data is available, and applications can easily and reliably access the
very specific part of that data, in an efficient manner. Our proposed approach
takes into account providing not only availability of Linked Data or efficient
execution of the queries, but providing both at the same time, by balancing
trade-offs between the two aspects.
10</p>
    </sec>
    <sec id="sec-9">
      <title>Acknowledgments</title>
      <p>This work has been supported by the German Federal Ministry of Transport
and Digital Infrastructure (BMVI) in the projects LIMBO (no. 19F2029I) and
OPAL (no. 19F2028A), the H2020 Marie Sklodowska-Curie project KnowGraphs
(GA no. 860801), and the German Federal Ministry of Education and Research
(BMBF) within the program ’KMU-innovativ: Forschung fu¨r die zivile
Sicherheit’ in the project SOLIDE (no. 13N14456).</p>
      <p>I am thankful to my PhD supervisors Prof. Dr. Axel-cyrille Ngonga Ngomo
and Dr. Muhammad Saleem.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <source>SIGUCCS '09: Proceedings of the 37th Annual ACM SIGUCCS Fall Conference: Communication and Collaboration</source>
          . ACM, New York, NY, USA (
          <year>2009</year>
          ),
          <fpage>459091</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Aluc¸,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Hartig</surname>
          </string-name>
          ,
          <string-name>
            <surname>O.</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O</given-names>
            ¨zsu, M.T.,
            <surname>Daudjee</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Diversified stress testing of RDF data management systems</article-title>
          .
          <source>In: ISWC</source>
          , pp.
          <fpage>197</fpage>
          -
          <lpage>212</lpage>
          (
          <year>2014</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -11964-9 13, https://doi.org/10.1007/ 978-3-
          <fpage>319</fpage>
          -11964-9_
          <fpage>13</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>C.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umbrich</surname>
          </string-name>
          , J.:
          <article-title>Strategies for executing federated queries in sparql1.1</article-title>
          . In: International Semantic Web Conference (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Beek</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rietveld</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bazoobandi</surname>
            ,
            <given-names>H.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wielemaker</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlobach</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Lod laundromat: A uniform way of publishing other people's dirty data</article-title>
          . In: Mika,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Tudorache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Bernstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Welty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            , Vrandeˇci´c, D.,
            <surname>Groth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ,
            <surname>Janowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Goble</surname>
          </string-name>
          , C. (eds.) The Semantic Web - ISWC
          <year>2014</year>
          . pp.
          <fpage>213</fpage>
          -
          <lpage>228</lpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Linked data - the story so far</article-title>
          .
          <source>Int. J. Semantic Web Inf. Syst</source>
          .
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schultz</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The Berlin SPARQL benchmark</article-title>
          .
          <source>Int. J. Semantic Web Inf. Syst</source>
          .
          <volume>5</volume>
          (
          <issue>2</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>24</lpage>
          (
          <year>2009</year>
          ). https://doi.org/10.4018/jswis.2009040101, https://doi. org/10.4018/jswis.2009040101
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Blasgen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitoma</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Price</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The convoy phenomenon</article-title>
          .
          <source>SIGOPS Oper. Syst. Rev</source>
          .
          <volume>13</volume>
          (
          <issue>2</issue>
          ),
          <fpage>20</fpage>
          -
          <lpage>25</lpage>
          (
          <year>Apr 1979</year>
          ). https://doi.org/10.1145/850657.850659, http://doi.acm.
          <source>org/10</source>
          .1145/850657.850659
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Buil-Aranda</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vandenbussche</surname>
          </string-name>
          , P.Y.:
          <article-title>Sparql webquerying infrastructure: Ready for action</article-title>
          ? In: International Semantic Web Conference. pp.
          <fpage>277</fpage>
          -
          <lpage>293</lpage>
          . Springer (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Conrads</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saleem</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morsey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>A.N.:</given-names>
          </string-name>
          <article-title>IGUANA: A generic framework for benchmarking the read-write performance of triple stores</article-title>
          .
          <source>In: ISWC</source>
          . pp.
          <fpage>48</fpage>
          -
          <lpage>65</lpage>
          . Springer (
          <year>2017</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -68204- 4 5, https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -68204-
          <issue>4</issue>
          _
          <fpage>5</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Erling</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikhailov</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Virtuoso: RDF Support in a Native RDBMS</article-title>
          , pp.
          <fpage>501</fpage>
          -
          <lpage>519</lpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2010</year>
          ), https://doi.org/10. 1007/978-3-
          <fpage>642</fpage>
          -04329-1_
          <fpage>21</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heflin</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>LUBM: a benchmark for OWL knowledge base systems</article-title>
          .
          <source>J. Web Sem</source>
          .
          <volume>3</volume>
          (
          <issue>2-3</issue>
          ),
          <fpage>158</fpage>
          -
          <lpage>182</lpage>
          (
          <year>2005</year>
          ). https://doi.org/10.1016/j.websem.
          <year>2005</year>
          .
          <volume>06</volume>
          .005, https://doi.org/10.1016/j. websem.
          <year>2005</year>
          .
          <volume>06</volume>
          .005
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Haas</surname>
            ,
            <given-names>L.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kossmann</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wimmers</surname>
            ,
            <given-names>E.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Optimizing queries across diverse data sources</article-title>
          .
          <source>In: Proceedings of the 23rd International Conference on Very Large Data Bases</source>
          . pp.
          <fpage>276</fpage>
          -
          <lpage>285</lpage>
          . VLDB '
          <volume>97</volume>
          , San Francisco, CA, USA (
          <year>1997</year>
          ), http: //dl.acm.org/citation.cfm?id=
          <volume>645923</volume>
          .
          <fpage>670995</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.:</given-names>
          </string-name>
          <article-title>An overview on execution strategies for linked data queries</article-title>
          .
          <source>DatenbankSpektrum</source>
          <volume>13</volume>
          (
          <issue>2</issue>
          ),
          <fpage>89</fpage>
          -
          <lpage>99</lpage>
          (
          <year>Jul 2013</year>
          ). https://doi.org/10.1007/s13222-013-0122-1, https://doi.org/10.1007/s13222-013-0122-1
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aranda</surname>
          </string-name>
          , C.B.:
          <article-title>brtpf: Bindings-restricted triple pattern fragments (extended preprint)</article-title>
          .
          <source>CoRR</source>
          (
          <year>2016</year>
          ), http://arxiv.org/abs/1608.08148
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freytag</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          :
          <article-title>Executing sparql queries over the web of linked data</article-title>
          . In: Bernstein,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Karger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.R.</given-names>
            ,
            <surname>Heath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Feigenbaum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Maynard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Motta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Thirunarayan</surname>
          </string-name>
          ,
          <string-name>
            <surname>K</surname>
          </string-name>
          . (eds.) The Semantic Web - ISWC
          <year>2009</year>
          . pp.
          <fpage>293</fpage>
          -
          <lpage>309</lpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Letter</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          , P´erez, J.:
          <article-title>A formal framework for comparing linked data fragments</article-title>
          . In: d'Amato,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Fernandez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Tamma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Lecue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Cudr´</surname>
          </string-name>
          e-Mauroux,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Sequeda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Lange</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Heflin</surname>
          </string-name>
          ,
          <string-name>
            <surname>J</surname>
          </string-name>
          . (eds.) The Semantic Web - ISWC
          <year>2017</year>
          . pp.
          <fpage>364</fpage>
          -
          <lpage>382</lpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Heling</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Acosta</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maleshkova</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sure-Vetter</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Querying large knowledge graphs over triple pattern fragments: An empirical study</article-title>
          . In: Vrandeˇci´c, D.,
          <string-name>
            <surname>Bontcheva</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <article-title>Sua´rez-</article-title>
          <string-name>
            <surname>Figueroa</surname>
            ,
            <given-names>M.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Presutti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Celino</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sabou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaffee</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simperl</surname>
          </string-name>
          , E. (eds.) The Semantic Web - ISWC
          <year>2018</year>
          . pp.
          <fpage>86</fpage>
          -
          <lpage>102</lpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Minier</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skaf-Molli</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Molli</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Sage:
          <article-title>Web preemption for public sparql query services</article-title>
          .
          <source>In: The World Wide Web Conference</source>
          . pp.
          <fpage>1268</fpage>
          -
          <lpage>1278</lpage>
          . WWW '19,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2019</year>
          ). https://doi.org/10.1145/3308558.3313652, http:// doi.acm.
          <source>org/10</source>
          .1145/3308558.3313652
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Morsey</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>A.N.</given-names>
          </string-name>
          :
          <article-title>DBpedia SPARQL benchmark - performance assessment with real queries on real data</article-title>
          .
          <source>In: ISWC</source>
          . pp.
          <fpage>454</fpage>
          -
          <lpage>469</lpage>
          . Springer (
          <year>2011</year>
          ). https://doi.org/10.1007/978-3-
          <fpage>642</fpage>
          -25073-6 29, https: //doi.org/10.1007/978-3-
          <fpage>642</fpage>
          -25073-6_
          <fpage>29</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. P´erez, J.,
          <string-name>
            <surname>Arenas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gutierrez</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Semantics and complexity of sparql</article-title>
          .
          <source>ACM Trans. Database Syst</source>
          .
          <volume>34</volume>
          (
          <issue>3</issue>
          ),
          <volume>16</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          :
          <fpage>45</fpage>
          (Sep
          <year>2009</year>
          ). https://doi.org/10.1145/1567274.1567278
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Saleem</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mehmood</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>A.N.:</given-names>
          </string-name>
          <article-title>FEASIBLE: a feature-based SPARQL benchmark generation framework</article-title>
          .
          <source>In: ISWC</source>
          . pp.
          <fpage>52</fpage>
          -
          <lpage>69</lpage>
          . Springer (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Saleem</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , Sz´arnyas, G.,
          <string-name>
            <surname>Conrads</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bukhari</surname>
            ,
            <given-names>S.A.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mehmood</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngonga</surname>
            <given-names>Ngomo</given-names>
          </string-name>
          ,
          <string-name>
            <surname>A.C.</surname>
          </string-name>
          :
          <article-title>How representative is a sparql benchmark? an analysis of rdf triplestore benchmarks</article-title>
          .
          <source>In: The World Wide Web Conference</source>
          . pp.
          <fpage>1623</fpage>
          -
          <lpage>1633</lpage>
          . WWW '19,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2019</year>
          ). https://doi.org/10.1145/3308558.3313556, http://doi.acm.
          <source>org/10</source>
          .1145/ 3308558.3313556
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>SP2Bench: A SPARQL performance benchmark</article-title>
          .
          <source>In: Semantic Web Information Management - A Model-Based Perspective</source>
          , pp.
          <fpage>371</fpage>
          -
          <lpage>393</lpage>
          (
          <year>2009</year>
          ), https://doi.org/10.1007/978-3-
          <fpage>642</fpage>
          -04329-1_
          <fpage>16</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Taelman</surname>
            , R., Van Herwegen,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vander</surname>
            <given-names>Sande</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Verborgh</surname>
          </string-name>
          , R.:
          <article-title>Comunica: A modular sparql query engine for the web</article-title>
          . In: Vrandeˇci´c, D.,
          <string-name>
            <surname>Bontcheva</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , Sua´rezFigueroa,
          <string-name>
            <given-names>M.C.</given-names>
            ,
            <surname>Presutti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Celino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Sabou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Kaffee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.A.</given-names>
            ,
            <surname>Simperl</surname>
          </string-name>
          , E. (eds.) The Semantic Web - ISWC
          <year>2018</year>
          . pp.
          <fpage>239</fpage>
          -
          <lpage>255</lpage>
          . Springer International Publishing,
          <string-name>
            <surname>Cham</surname>
          </string-name>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karnstedt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harth</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Comparing data summaries for processing live queries over linked data</article-title>
          .
          <source>World Wide Web</source>
          <volume>14</volume>
          (
          <issue>5</issue>
          ),
          <fpage>495</fpage>
          -
          <lpage>544</lpage>
          (
          <year>Oct 2011</year>
          ). https://doi.org/10.1007/s11280-010-0107-z, https://doi.org/10. 1007/s11280-010-0107-z
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Vandenbussche</surname>
          </string-name>
          , P.Y.,
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matteis</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buil-Aranda</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          : Sparqles:
          <article-title>Monitoring public sparql endpoints</article-title>
          .
          <source>Semantic web 8(6)</source>
          ,
          <fpage>1049</fpage>
          -
          <lpage>1065</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Vandenbussche</surname>
          </string-name>
          , P.Y.,
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matteis</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buil-Aranda</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          : Sparqles:
          <article-title>Monitoring public sparql endpoints</article-title>
          .
          <source>Semantic Web</source>
          <volume>8</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>17</lpage>
          (01
          <year>2017</year>
          ). https://doi.org/10.3233/SW-170254
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Verborgh</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Meester</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haesendonck</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Vocht</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sande</surname>
            ,
            <given-names>M.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Colpaert</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mannens</surname>
          </string-name>
          , E., Van De Walle, R.:
          <article-title>Low-cost queryable linked data through triple pattern fragments</article-title>
          .
          <source>In: Proceedings of the 2014 International Conference on Posters &amp;#</source>
          <volume>38</volume>
          ;
          <string-name>
            <surname>Demonstrations</surname>
          </string-name>
          Track - Volume
          <volume>1272</volume>
          . pp.
          <fpage>13</fpage>
          -
          <lpage>16</lpage>
          . ISWC-PD'
          <fpage>14</fpage>
          ,
          <string-name>
            <surname>CEUR-WS</surname>
          </string-name>
          .org, Aachen, Germany, Germany (
          <year>2014</year>
          ), http://dl.acm.org/citation.cfm?id=
          <volume>2878453</volume>
          .
          <fpage>2878457</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Verborgh</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Vander</given-names>
            <surname>Sande</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Hartig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            ,
            <surname>Van Herwegen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>De Vocht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>De Meester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Haesendonck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Colpaert</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          :
          <article-title>Triple pattern fragments: a lowcost knowledge graph interface for the web</article-title>
          .
          <source>JOURNAL OF WEB SEMANTICS 37-38</source>
          ,
          <fpage>184</fpage>
          -
          <lpage>206</lpage>
          (
          <year>2016</year>
          ), http://dx.doi.org/10.1016/j.websem.
          <year>2016</year>
          .
          <volume>03</volume>
          .003
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , et al.:
          <source>BioBenchmark Toyama</source>
          <year>2012</year>
          :
          <article-title>An evaluation of the performance of triple stores on biological data</article-title>
          .
          <source>J. Biomedical Semantics</source>
          <volume>5</volume>
          ,
          <issue>32</issue>
          (
          <year>2014</year>
          ). https://doi.org/10.1186/2041-1480-5-32, https://doi.org/10.1186/ 2041-1480-5-32
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>