<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Knowledge Graph (R)Evolution and the Web of Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Katja Hose</string-name>
          <email>khose@cs.aau.dk</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Aalborg University</institution>
          ,
          <country country="DK">Denmark</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>When querying knowledge over the Web we typically consider the Web of Data to be a static point of reference that is always available and that never changes. However, when actually running queries "in the wild", we encounter a broad range of problems; spanning from the (un)availability of entire knowledge graphs (and their SPARQL endpoints) to outdated references between knowledge graphs and beyond that, we are almost entirely missing out on the availability of previous versions of knowledge graphs and provenance metadata about them. This position paper discusses these issues in context and sketches some of the solutions to mitigate them. In particular, this paper first discusses approaches to keep knowledge graphs available for continuous and scalable querying and afterwards presents an approach that enables community-driven updates so that mistakes can be corrected or missing information can be added. Then, the paper highlights what we can learn from RDF archiving solutions to better support evolving knowledge graphs. And finally, the paper puts these aspects into perspective and provides an outlook to open challenges and future work.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>in science, improved information extraction, new domains, etc.), some knowledge
disappears (outdated information, knowledge provider no longer offers the service, etc.),
erroneous information is corrected (or not), links and identifiers are changing, systems
crash or become temporarily overloaded and unavailable, etc.</p>
      <p>Obviously, there are plenty of different aspects, research questions, and proposed
solutions in this context that are worth pursuing. This paper focuses on a subset of them
and briefly highlights a few particular fundamental research questions and solutions that
we have been working on in my lab. These questions are:
– How can we reduce the query load at an endpoint to keep it responsive and available?
(Section 2)
– How can we keep knowledge graphs available despite the original sources becoming
unavailable? (Section 3)
– How can we correct erroneous information in a community-driven and decentralized
way? (Section 4)
– How can we support an evolving Web of Data? (Section 5)
These are the same questions that I discussed in my keynote titled “How can we fix
the Web of Data?” at the MEPDaW Workshop 2021. To conclude, Section 6 puts these
questions into perspective and provides an outlook to open challenges and future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Load balancing and endpoint availability</title>
      <p>One of the frequently encountered issues that we experience when processing queries
on the Web of Data is that response times heavily depend on the resources currently
available at the source that hosts the queried knowledge graph. Typically, the source
offers access via a SPARQL endpoint that is capable of executing complex and arbitrary
SPARQL queries. Obviously, this can easily become a bottleneck; when expensive
queries are running concurrently, there are only little resources left to answer other
queries. As a consequence, response time and throughput suffer – and in the worst case
– the endpoint might become so overloaded that it crashes and becomes unavailable
until the provider notices and fixes the issue.</p>
      <p>
        One way to counteract this issue is to reduce the load at the server and instead push
some of it to the client that issued the query. Over the years, several approaches have
been proposed that are trying to find a trade-off between the extremes of (a) having only
downloadable dumps and doing everything on the client and (b) SPARQL endpoints
doing everything on the server. Key methods cover (i) partitioning and compression of
the data on the server so that smaller amounts of data have to be accessed and transferred,
(ii) preemption [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] so that not all the results for a query are computed but only a subset
unless the client asks for more results, and (iii) restricting the range of supported queries
on the server to those that can be executed efficiently, e.g., triple patterns [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] or star
patterns [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        WiseKG [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] is the latest approach in this field, builds upon these key methods, and is
able to dynamically allocate query processing tasks to server or client depending on the
current query load on the server, i.e., if the server is overloaded, more tasks are pushed
to the clients that issued the queries, which then have to download portions of the data
and compute joins locally. For this purpose, the server partitions the data into smaller
partitions based on star patterns and similar predicates so that all subgraphs matching
similar star patterns are grouped together and compressed using HDT [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The server
optimizes a query by decomposing it into star-pattern-based subqueries, optimizes the
join order, and decides which subqueries should be executed at the server and which
ones at the client. While there is still room for optimization, e.g., cost model, query
decomposition, data partitioning, statistics, etc., experimental results have shown that
especially the dynamic workload-aware shifting of tasks from server to client increases
WiseKG’s throughput in comparison to other approaches.
      </p>
      <p>
        Until just a few years ago, the standard approach for querying remote knowledge
graphs used to be a SPARQL endpoint. Today, the landscape has broadened up; we
no longer have only endpoints but also other approaches including TPF [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and
WiseKG [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. All of these have different strengths and weaknesses that also depend
on the type of query that we want to execute [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. To optimize and efficiently process
queries in this heterogeneous landscape [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], future work needs to develop efficient
query processing strategies over these heterogeneous interfaces that can exploit their
strengths and avoid their weakness.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Keeping knowledge graphs available despite failing original sources</title>
      <p>
        Another reason why query processing on the Web of Data is often not a reliable service
is that it totally relies on the services offered by the data providers: Web interfaces with
downloadable dumps, SPARQL endpoints, dereferenceable URIs/IRIs, etc. Studies [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]
and monitoring services2 have shown that these Web interfaces, especially SPARQL
endpoints, are often not available, i.e., there is no guarantee that the data or query
interface necessary to answer a query is actually available when needed. The reasons for
this are manifold; for instance, as discussed in the previous section, an endpoint might
be overloaded and become unresponsive. Sometimes endpoint crashes are not detected
for longer periods of time and sometimes endpoints are permanently taken offline, e.g.,
when the grant that funded an academic project ended. The underlying issue is that
maintaining a SPARQL endpoint requires considerable resources in terms of hardware
and computing power but also in terms of human resources and server administration
and maintenance. And since knowledge graphs on the Web typically come as (Linked)
Open Data providing the service for the public does not generate any financial income.
      </p>
      <p>
        To keep the information available, we can rely on file sharing principles and P2P
systems. In particular, unstructured P2P systems are an interesting foundation where –
instead of servers representing endpoints – we have servers representing independent
clients (aka peers or nodes), each sharing some own data as well as copies of data from
other peers. One such system designed for sharing knowledge graphs, PIQNIC [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], first
splits a large dataset into smaller fragments, for example by predicates of the triples, that
are easier to share and process. To ensure that the data remains available, the fragments
are replicated at multiple peers so that a certain number of copies of each dataset are
available in the network. Since there is no global knowledge in P2P systems, each peer
only knows a couple of other peers (neighbors); to keep such a network of peers stable
2 http://www.lodhub.aau.dk/
and connected in the presence of peers joining and leaving the network, peers regularly
exchange information about their neighbors and update their connections. It might, for
instance, be advantageous to have a direct connection to a peer that has related data, e.g.,
in the sense of “joinability” so that the datasets of the peers can be joined in a query to
produce results.
      </p>
      <p>
        For query processing and optimization, it is of course beneficial to have access to
statistics about datasets and information about which datasets are available at which peer.
Each peer then maintains an index containing such information about its neighbors, i.e.,
it captures which fragments (predicates) of a dataset are stored at which neighbors and
what URIs/IRIs these fragments contain [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Such indexes can, for instance, be based
on Bloom Filters and used to estimate the size of a join result based on the degree
of overlapping bits representing URIs/IRIs in the fragments. They can also be used to
decide whether a pair of fragments can produce join results at all so that the join can be
entirely pruned from the query execution plan.
      </p>
      <p>Evaluation results show that such kind of indexes capturing not only fragments but
also contained URIs/IRIs substantially increase query performance. And if we replicate
all fragments at ca. 5% of the peers in the network, then we can lose more than 50%
of the peers until we start even noticing the effect in result completeness. Of course,
query execution time and throughput also benefit from a higher degree of replication
and locality.</p>
      <p>There are still many challenges to explore in future work, incl. complex queries,
alternative types of index structures, and forms of partitioning and allocation. In
particular, workload-aware approaches that can help tune the system for a particular query
workload are interesting and promising areas of future work.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Facilitating community-driven updates</title>
      <p>Another fundamental issue we encounter often is that some knowledge graphs are not
up-to-date; they (i) contain erroneous information, e.g., due to an error in the information
extraction pipeline, (ii) contain outdated information, e.g., the president of a country
changes once in a while or links to other knowledge graphs might no longer be accurate
if the other side changed the URIs/IRIs, or (iii) lack important information, e.g., links
to other sources or the nickname, nationality, etc. of a person.</p>
      <p>
        With the current architecture of the Web of Data, there is no way for consumers to
update a knowledge graph other than contacting the provider hoping that the feedback
will be taken into account. This, however, is a difficult and tedious endeavor. Hence, we
proposed a community-driven architecture and methods giving communities of users
the opportunity to maintain and update knowledge graphs: ColChain [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is a system that
builds upon P2P architectures (as mentioned above) and exploits blockchain technology
to facilitate community-driven updates while keeping track of and enabling query access
to older versions of a knowledge graph.
      </p>
      <p>So, the core idea is to define communities of users and peers that can propose updates
to a knowledge graph and together decide, e.g., by voting using a majority consensus
protocol, whether to accept an update or not. If an update is accepted, it needs to be
propagated throughout the network to update all other copies. During query processing,
an older copy of a knowledge graph can be recovered by going through the materialized
version of the knowledge graph and the change sets of the updates. In this sense, it is
possible to support “time-travel queries” by computing the result for a given query based
on the state of a set of knowledge graphs at a particular point in time.</p>
      <p>Our experimental results show that there is not much overhead for query processing
that is caused by extending the basic P2P approach with the blockchain and community
functionality. Since only the latest version of a knowledge graph is materialized,
answering queries accessing older versions takes a bit longer since the required versions of the
knowledge graph are created on the fly.</p>
      <p>Future work will consider alternative ways of defining communities and voting
strategies, e.g., not giving all peers/users an equal weight in the voting process. In
particular, individual update regimes per knowledge graph might be required since
some publishers might wish to retain some special rights, such as a veto for updates.
Of course, another important area of future work is to consider materializing multiple
versions of a knowledge graph to improve query performance (see also Section 5).
5</p>
    </sec>
    <sec id="sec-5">
      <title>Supporting an evolving Web of Data</title>
      <p>Current approaches for managing and processing RDF knowledge graphs typically
optimize for one of two extremes: either the data is considered to be entirely static or it
comes in a stream setting. There is not much work on use cases in between that could
support evolving knowledge graphs with arbitrary (slow) rates of updates. Well-known
examples of such evolving knowledge graphs are DBpedia, YAGO, and Wikidata; over
the years, they grew by capturing more knowledge, there were changes to the schemas
and ontologies, URI/IRI naming schemes have changed, etc.</p>
      <p>
        One of the basic challenges is to define objective measures that capture the
characteristics of the evolution over time [
        <xref ref-type="bibr" rid="ref16 ref8">8, 16</xref>
        ]. Proposed measures compare consecutive pairs
of revisions and analyze the differences between them. Such measures range from
lowlevel measures (incl. growth, additions, deletions of triples and vocabulary elements) to
high-level measures (incl. affected entities, types, literals, ontologies) trying to capture
the semantics and provide deeper insights of what the changes mean.
      </p>
      <p>
        Another challenge is to support multiple versions of a knowledge graph, which is the
goal of so-called RDF archiving systems [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. These systems are not necessarily part
of the Web of Data (although they could be used as backends for SPARQL endpoints)
but approach the problem from a different angle and support functionalities that
standard approaches for knowledge graph management and querying do not consider. The
straightforward case are queries against a single (current, past) version of a knowledge
graph. But there are other types of interesting queries, e.g., in which revisions of a
knowledge graph does a query produce results, in which revisions were certain entities
added or deleted, or in which version was there a relationship between two entities or
was there ever. These types of queries obviously require comparisons between multiple
revisions, which the research community has not yet much paid much attention to.
      </p>
      <p>Existing RDF archiving systems make some basic design choices, e.g., storing
independent copies per revision, storing snapshots and delta chains, or tagging triples
with timestamps. Some systems then make use of metadata encoding to capture which
triples belong to which revision – this ranges from reification to using provenance
ontologies (PROV-O) to capture arbitrary types of metadata. Another common way is
to repurpose named graphs to model the revisions so that triples become quads where
the fourth column can be used to tag the revision without “breaking” the triple format.
Other approaches simply extend the layout and add more columns. There are also other
important design choices, which involve whether multiple graphs are supported within
the same system, or whether concurrent updates are important.</p>
      <p>
        To truly support (slowly) evolving knowledge graphs on the Web, future work has
to design systems that can serve as backends to access the Web of Data by combining
the obtained insights from triple stores and RDF archives. As a first step, the evolution
measures can be used to identify evolution patterns that can then be used as guidance to
choose and design an appropriate data layout and data structures [
        <xref ref-type="bibr" rid="ref15 ref18 ref19">15, 18, 19</xref>
        ]. Afterwards,
we need to develop query optimizers and efficient query processing strategies for the
special types of queries in this setup.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>This position paper is centered around the question of how to mitigate the current
challenges of the Web of Data and in particular those caused by its evolving knowledge
graphs. Several challenges in this context have been discussed in more detail: (i)
improving the availability of endpoints by better sharing load between server and client,
(ii) lowering the burden of and the dependence on the data provider by introducing
a P2P-style system using replication, (iii) supporting community-driven updates in a
decentralized setup and enabling knowledge graph evolution by introducing blockchain
principles, and (iv) enabling a broader range of functionality over evolving knowledge
graphs by fusing advances in RDF archiving systems and efficient data management and
query processing in triple stores.</p>
      <p>
        To truly support knowledge graph evolution, there are plenty of open research
challenges within these areas that researchers are only just beginning to explore, e.g.,
supporting different types of time-travel queries, scalable query processing and
optimization over heterogeneous RDF interfaces, incorporating different standards of encoding
knowledge graphs and queries (property graphs vs. RDF, Cypher vs. SPARQL) but also
recent developments on SHACL/SheX constraints and shapes in general [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Moreover,
to help users understand and exploit the information contained in complex knowledge
graphs, we need to develop efficient ways for graph exploration [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and extend them to
support evolving graphs. And then, there is also the issue of trust and how to reassure a
user that systems and answers to queries can be trusted. A foundation of trust in this sense
is provenance and metadata management, which covers not only approaches to capture
metadata about triples and knowledge graphs, such as reification and RDF-star [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], but
also approaches to capture workflow provenance and lineage to explain how knowledge
graphs were generated, processed, and integrated, and finally also approaches to provide
explanations on how the system arrived at a certain answer to a query [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
Acknowledgements. This research was partially funded by the Danish Council for
Independent Research (DFF) under grant agreement no. DFF-8048-00051B and the
Poul Due Jensen Foundation.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aebeloe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keles</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montoya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Star Pattern Fragments: Accessing Knowledge Graphs through Star Patterns</article-title>
          . CoRR abs/
          <year>2002</year>
          .09172 (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Aebeloe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montoya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A Decentralized Architecture for Sharing and Querying Semantic Data</article-title>
          .
          <source>In: ESWC. Lecture Notes in Computer Science</source>
          , vol.
          <volume>11503</volume>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>18</lpage>
          . Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Aebeloe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montoya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Decentralized Indexing over a Network of RDF Peers</article-title>
          .
          <source>In: ISWC. Lecture Notes in Computer Science</source>
          , vol.
          <volume>11778</volume>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>20</lpage>
          . Springer (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Aebeloe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montoya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>ColChain: Collaborative Linked Data Networks</article-title>
          .
          <source>In: WWW</source>
          . pp.
          <fpage>1385</fpage>
          -
          <lpage>1396</lpage>
          . ACM / IW3C2 (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Azzam</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aebeloe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montoya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keles</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>WiseKG: Balanced Access to Web Knowledge Graphs</article-title>
          . In: WWW. pp.
          <fpage>1422</fpage>
          -
          <lpage>1434</lpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Linked Data Design Issues</article-title>
          , http://www.w3.org/DesignIssues/LinkedData.html
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Fernández</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martínez-Prieto</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gutiérrez</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arias</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Binary RDF representation for publication and exchange (HDT)</article-title>
          .
          <source>J. Web Semant</source>
          .
          <volume>19</volume>
          ,
          <fpage>22</fpage>
          -
          <lpage>41</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Fernández</surname>
            ,
            <given-names>J.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Knuth</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Evaluating query and storage strategies for RDF archives</article-title>
          .
          <source>Semantic Web</source>
          <volume>10</volume>
          (
          <issue>2</issue>
          ),
          <fpage>247</fpage>
          -
          <lpage>291</lpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Foundations of RDF★ and SPARQL★: An Alternative Approach to StatementLevel Metadata in RDF</article-title>
          . In: AMW (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Hernández</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galárraga</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Computing How-Provenance for SPARQL Queries via Query Rewriting</article-title>
          .
          <source>Proc. VLDB Endow</source>
          .
          <volume>14</volume>
          (
          <issue>13</issue>
          ),
          <fpage>3389</fpage>
          -
          <lpage>3401</lpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Lissandrini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mottin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pedersen</surname>
          </string-name>
          , T.B.:
          <article-title>Knowledge Graph Exploration Systems: are we lost? In: CIDR (</article-title>
          <year>2022</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Minier</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skaf-Molli</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Molli</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>SaGe: Web Preemption for Public SPARQL Query Services</article-title>
          . In: WWW. pp.
          <fpage>1268</fpage>
          -
          <lpage>1278</lpage>
          . ACM (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Montoya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aebeloe</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Towards Efficient Query Processing over Heterogeneous RDF Interfaces</article-title>
          . In: ISWC (Best Workshop Papers).
          <source>Studies on the Semantic Web</source>
          , vol.
          <volume>36</volume>
          , pp.
          <fpage>39</fpage>
          -
          <lpage>53</lpage>
          . IOS Press (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Montoya</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Keles</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Analysis of the Effect of Query Shapes on Performance over LDF Interfaces</article-title>
          .
          <source>In: QuWeDa@ISWC. CEUR Workshop Proceedings</source>
          , vol.
          <volume>2496</volume>
          , pp.
          <fpage>51</fpage>
          -
          <lpage>66</lpage>
          . CEUR-WS.org (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Pelgrin</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galárraga</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Efficient In-memory Indexing for Metadata-augmented RDF</article-title>
          . In: MEPDaW@ISWC. CEUR Workshop Proceedings, CEUR-WS.org (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Pelgrin</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galárraga</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Towards fully-fledged archiving for RDF datasets</article-title>
          .
          <source>Semantic Web</source>
          <volume>12</volume>
          (
          <issue>6</issue>
          ),
          <fpage>903</fpage>
          -
          <lpage>925</lpage>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Rabbani</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lissandrini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Optimizing SPARQL Queries using Shape Statistics</article-title>
          . In: EDBT. pp.
          <fpage>505</fpage>
          -
          <lpage>510</lpage>
          . OpenProceedings.org (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Sagi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lissandrini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pedersen</surname>
            ,
            <given-names>T.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>A Design Space for RDF Data Representations</article-title>
          .
          <source>The VLDB Journal</source>
          (
          <year>2022</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Taelman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mahieu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vanbrabant</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verborgh</surname>
          </string-name>
          , R.:
          <article-title>Optimizing storage of RDF archives using bidirectional delta chains</article-title>
          .
          <source>Semantic Web</source>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Vandenbussche</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umbrich</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matteis</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>C.B.</given-names>
          </string-name>
          :
          <article-title>SPARQLES: Monitoring public SPARQL endpoints</article-title>
          .
          <source>Semantic Web</source>
          <volume>8</volume>
          (
          <issue>6</issue>
          ),
          <fpage>1049</fpage>
          -
          <lpage>1065</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Verborgh</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sande</surname>
            ,
            <given-names>M.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartig</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herwegen</surname>
            ,
            <given-names>J.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vocht</surname>
            ,
            <given-names>L.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meester</surname>
            ,
            <given-names>B.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haesendonck</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Colpaert</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Triple Pattern Fragments: A low-cost knowledge graph interface for the Web</article-title>
          .
          <source>J. Web Semant</source>
          .
          <fpage>37</fpage>
          -
          <issue>38</issue>
          ,
          <fpage>184</fpage>
          -
          <lpage>206</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>