<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Exploring a Text Corpus via a Knowledge Graph?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sapienza Universita di Roma</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>bernasconi</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>mecellag@diag.uniroma</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universita degli Studi di Bari Aldo Moro</institution>
          ,
          <addr-line>ITA</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Semantic enrichment methods may be used to identify relevant entities in textual documents. These extracted entities are part of knowledge graphs and thus linked by semantic relationships. This work explores the idea of navigating the semantic relationships among extracted entities as a way to search a text corpus. A modular software system (including document management, semantic enrichment, data consolidation, and data integration) has been designed, to o er a visual user interface for such navigation on top of an arbitrary corpus of textual documents. The software, called arca, has been used in a real use case: to search in the book catalogue of a publishing house. The evaluation carried out with a set of potential users has shown so far the feasibility and e ectiveness of the approach. Critical issues and potential limitations of the paradigm have also been found and are discussed.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic enrichment Knowledge graph</kwd>
        <kwd>Visual search in- terface</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Searching and exploring a vast text corpus has often arisen as a human need.
Traditionally, a search is based on manually curated metadata classifying
documents, for example, by arguments, authors, etc. In the digital era, the search
moved from physical cabinets to databases. Albeit being a useful paradigm, the
maintenance of metadata is an expensive process, which becomes progressively
more expensive and less reliable with the increase of required detail. E orts of
? This work has been partly supported by projects ARCA (POR FESR Lazio
2014{2020 - Avviso pubblico \Creativita 2020", domanda prot. n.
A0128-201717189) and STORYBOOK (POR FESR Lazio 2014-2020 - Avviso Pubblico \Progetti
di Innovazione Digitale", domanda prot. n. A0349-2020-34437).</p>
      <p>
        Copyright c 2021 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0). This volume is published
and copyrighted by its editors. IRCDL 2021, February 18-19, 2021, Padua, Italy.
transitioning to electronic documents (either created natively as such or
digitized) have helped, enabling the direct text-based search of the content.
Semantic enrichment methods, as named-entity recognition and linking (NERL) [
        <xref ref-type="bibr" rid="ref14 ref18">14,18</xref>
        ],
aim at bridging the semantic gap between raw text and concepts, by associating
words in the documents with entities in a knowledge base, often a knowledge
graph (KG).
      </p>
      <p>
        NERL successfully enabled users to search and analyze text corpora more
e ectively [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. While knowledge extraction methods as NERL are now broadly
used by big players in the industry as well as in academic projects, their usage
by small to medium size organizations is still minimal.
1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Research questions</title>
      <p>For the sake of the analytic approach, we frame our e ort through a set of
research questions. The questions elicited below are relevant to the application of
KG-based approaches for the exploration of text corpora.</p>
      <p>Q1. Would users, exploring a corpus of text, pro t from the semantic navigation
of the associated KG of topics?
Q2. What kind of user interface would e ectively support such navigation?
Q3. What kind of users, scenarios, and tasks would bene t from this interaction
paradigm?
Q4. Do building and maintaining a semantic enrichment and KG creation pipeline
necessarily involve high upfront costs and highly skilled developers?
1.2</p>
    </sec>
    <sec id="sec-3">
      <title>Hypotheses</title>
      <p>To try to reply to the questions above, we designed the presented study.
H1 (relevant to Q1 and Q2). Users will be able to explore a text corpus
through a KG-based user interface which o ers the following main functions: (a)
nd concepts through text search, (b) visually navigate the concepts and their
relationships, and (c) show documents relevant to the selected concept.
H2 (relevant to Q3). Given a corpus of text in a speci c domain, it will
bene t both users with little knowledge of the domain (by supporting
semanticallyrelevant discovery) and domain experts (by enabling topic-oriented visual
organizations of the documents).</p>
      <p>H3 (relevant to Q4). It is feasible to build a ready-to-use complete system,
including both semantic enrichment pipeline and web-based front end, able, with
only some con guration, to be applied to any speci c corpus to enable the
KGbased exploration.
1.3</p>
    </sec>
    <sec id="sec-4">
      <title>Approach</title>
      <p>
        A comprehensive software system has been previously envisioned and proposed
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] to address the research questions. The system, which now has been fully
implemented and evaluated on a speci c use case, is meant to enable the
KGbased exploration of a given text corpus and hence test our hypotheses. The
system is organized according to the following main functions:
{ extraction of entities from a given text corpus;
{ integration between available metadata, extracted entities present in the
text, and data from external knowledge bases;
{ consolidation of the local data in a KG stored in a triple store;
{ search and exploration of the corpus through the navigation of the KG in a
composite user interface.
      </p>
      <p>In order to ensure the whole solution is useful for potential users it has been
implemented and evaluated within a speci c case study: exploring the book
catalog of a medium-sized publishing house specialized mainly in ancient history.
The concrete case study o ered the context for fruitful exchange among the
stakeholders that are often involved in scenarios of information retrieval and
library search: who maintain the corpus (the publisher), who need to search
the corpus (researchers of the eld and interested individuals), who develop the
software solution (in this case the authors of the present study).</p>
      <p>The remaining sections are organized as follows. Section 2 presents related
work about visual information seeking. Section 3 reports the design process
starting from identifying user requirements to the development and implementation
of the nal interface. Section 4 describes the pipeline of the proposed system and
the technologies used during the implementation. Section 5 reports the
evaluation process and analyzes the ndings. Finally, Section 6 discusses future research
directions.
2</p>
      <sec id="sec-4-1">
        <title>Related work</title>
        <p>In this section, relevant literature is brie y surveyed for relevant works, starting
from traditional systems for visual information seeking to tools for semantic
enrichment of unstructured text and visualization/exploration of semantic data
as KGs, both in the general case and the speci c case of a corpus of books.
2.1</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Traditional systems</title>
      <p>
        There has been a large amount of work in the literature about visual
information seeking [
        <xref ref-type="bibr" rid="ref19 ref3">19,3</xref>
        ]. The rst attempts to create a visual search interface, have
been done in the early 1990s [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], where some researchers had applied direct
manipulation principles to search interfaces, creating what they called dynamic
queries [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. These are visual query systems, often based on the query-by-example
paradigm[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]: search interfaces where users can manipulate sliders and other
graphical controls to change search parameters. The results of those changes are
immediately displayed to them in some visualization. A limit of these systems,
for unstructured information like books, is that exploring and ltering by basic
metadata (i.e., author, title, etc.) can be useful, but it is often insu cient.
2.2
      </p>
    </sec>
    <sec id="sec-6">
      <title>Semantic enrichment</title>
      <p>
        There has hence been recently a lot of research on how to attach semantics to
unstructured data [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], through processes like NERL.
      </p>
      <p>
        The GLOBDEF system [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] works with pluggable enhancement modules,
which are dynamically activated to create on-the- y pipelines for data
enhancement. This tool carries out a semantic enrichment challenge but stops there. It
does not include the management of the generated metadata, the integration
with existing KGs and the visual exploration of these data through a unique
visualization.
      </p>
      <p>Apache Stanbol 3 is a set of components able to o er various services for
semantic enrichment, visualization of KG and the management of metadata. It
is advantageous and can be integrated with our system, but on itself, it does not
o er a ready-to-use system.
2.3</p>
    </sec>
    <sec id="sec-7">
      <title>Visualization of semantic data</title>
      <p>
        The extracted semantics can then be extremely useful for exploring the data, but
they are not xed and homogeneous like a set of prede ned metadata. Therefore,
data models and visual user interfaces need to deal with these complex and
heterogeneous data. The Semantic Web [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Linked Data [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] e orts deal with
data modelling, integration, and interaction of this kind of data on the Web.
These e orts lately contributed to the emergence of KGs as a way to organize
complex data-sets integrating multiple sources [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        Many user interfaces for visualization and exploration of KGs exist, and new
ones are being developed every year, especially in the context of Semantic Web
and Linked Data technologies [
        <xref ref-type="bibr" rid="ref12 ref16 ref5">16,12,5</xref>
        ].
      </p>
      <p>
        Metaphactory [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] is a platform for building KG applications, can be easily
integrated into other software infrastructures; for the loading and visualization
of RDF KGs, it uses Ontodia [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], which represents one of the most powerful free
tools [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and includes a range of elements that support a variety of interaction
techniques. The visualization paradigm is based on the idea of loading in the
main panel of the tool the fragment of interest of the entire KG (which can
consist of local data, a remote SPARQL endpoint, or the merge of multiple such
sources). Entities can be found through textual search and then dragged to the
main panel. Connections among entities are shown, and new entities can also be
added by expanding the connections of shown entities.
      </p>
      <p>A customized version of Ontodia is included in the software system presented
in this paper. The customization enables the use of such KG navigation to search
in a text corpus of documents.
2.4</p>
    </sec>
    <sec id="sec-8">
      <title>Exploration of a digital library</title>
      <p>Many tools face the challenge of the exploration of the contents of a digital
library. In particular, two are in the same direction of this work.</p>
      <p>
        Yewno Discover [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] is an integrated system that o ers classi cation and
visual exploration of academic materials to help scholars in their research, but
      </p>
      <sec id="sec-8-1">
        <title>3 see https://stanbol.apache.org/</title>
        <p>is not adaptable and exible to di erent contexts of use, except with ad hoc
adjustments.</p>
        <p>Talk to Books4 is a tool by Google to explore ideas and discover books by
getting quotes that respond to user's queries helping users nd interesting books
that may not be available through keyword search, but does not admit the visual
exploration of the RDF KG which allows users to discover, visually, connections
between concepts and books.
3</p>
        <sec id="sec-8-1-1">
          <title>System requirements</title>
          <p>The speci c use case of the publishing house o ered the opportunity to adopt
a user-centred design approach to identify and re ne the system requirements.
From informal interviews with publishing house representatives and a team of
researchers in the same domain, an initial set of requirements has been identi ed:
{ the user should be able to search entities textually;
{ for an entity, the user should be able to see the relevant books;
{ the user should be able to navigate among entities, following semantic
relationships between them;
{ for a document, the user should be able to access the basic information and
be informed on how to obtain it (buy it from a bookstore, borrow it from a
library, etc.);
{ any user should be able to perform the operations without the need of being
taught how to, following established interaction patterns and metaphors.
4</p>
        </sec>
        <sec id="sec-8-1-2">
          <title>The system</title>
          <p>The software system has been implemented and tested in the context of the
speci c use case, but it is designed for general use. The aim is to o er a
readyto-use package to explore visually any corpus of texts through a specialized KG.
4.1</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>Software modules</title>
      <p>The system is composed of a pipeline to build the KG and a web-based front
end to search the corpus using the KG. The pipeline is composed of three steps:
newly added documents of the corpus enter the pipeline; in the second step,
semantic enrichment services extract information from the documents; in the
third step, the generated data is consolidated locally in a way that it can also
be integrated with additional data provided by external services. RDF is used
to represent all the data items in the pipeline, employing existing vocabularies
and ontologies whenever possible and creating new terms if needed.</p>
      <sec id="sec-9-1">
        <title>4 see https://books.google.com/talktobooks/</title>
        <p>4.2</p>
      </sec>
    </sec>
    <sec id="sec-10">
      <title>The user interface</title>
      <p>The visual user interface is composed of two main components (see Fig. 1). The
rst component contains the visualization and search of the entities contained
in the KG. It is a customized version of the Ontodia workspace (described in
Section 2.3). The second component shows the list of documents associated with
the selected entity, o ering further interaction with them.</p>
    </sec>
    <sec id="sec-11">
      <title>Exploration of the knowledge graph. The knowledge exploration compo</title>
      <p>nent (see part 1 of Fig. 1, left side) has the following features.</p>
      <p>Searching graph entities. The panel on the left can be used to search for
entities in the knowledge graph, corresponding in the use case mainly with entities
from DBpedia. For example, typing \Rome" the user gets all the entities
containing that string in their label. One or more of the returned entities (e.g., the
one corresponding to the city of Rome) may be loaded to the graph navigation
panel through drag-and-drop.</p>
      <p>Knowledge graph navigation. The central panel allows the user to navigate
the KG. Starting from any shown entity, its connections can be expanded (hence
adding the connected entities to the graph), either generically showing them all
or selecting only some connections of a speci c semantic type5 (e.g., birth place).
Furthermore, the connections among shown entities are shown by default, as
they may be of interest. This panel is coordinated with the document list panel
5 By the informal term of semantic connection type we refer here to RDF properties
(described below and shown in part 2 of Fig. 1, right side) so that the latter shows
the list of documents which include, as a topic, the entity currently selected in
the former.</p>
      <p>Documents as entities. Apart from being shown in the document list panel,
documents can also beo explored as entities themselves in the KG exploration.
They are linked to their topics by two types of semantic connections: concept for
any entity found in the text, top concept for the ones recognized as main topics
for that text. This choice enables di erent ways to interact with the system:
{ starting from a document, exploring its topics and then possibly other
documents from them (e.g., in Fig. 1, from the book The Tale of Cupid and
Psyche to the topic Rome and then to the book Scutulata Pavimenta);
{ from shown entities, visualizing which documents are about two or more of
them (e.g., in Fig. 1, the book The Tale of Cupid and Psyche is both about
Rome and, speci cally, about Castel Sant'Angelo).</p>
      <p>Kinds of entities. Di erent colours are used as an aid to distinguish three
broad sets of entities:
{ DBpedia entities not found in the corpus are in blue;
{ DBpedia entities found at least once in the corpus are in green;
{ documents are in red.</p>
      <p>Document list. The document list panel (part 2 of Fig. 1, rigth side), which
can be shown or hidden as needed, shows the list of documents associated with
the entity currently selected in the graph exploration panel, i.e., the documents
whose extracted entities include that one. The documents may be shown ordered
by year of publication or by relevance (if for that document it is a main topic
or just a topic). By clicking on the info button of a book, a modal window with
further information on the document is opened. The information includes the list
of snippets, i.e., all the textual contexts of the document in which the selected
concept has been found.
5</p>
      <sec id="sec-11-1">
        <title>Evaluation</title>
        <p>As anticipated, the system has been applied to the use case of a publishing house
specialized in ancient history. Speci cally, it has been tested on a corpus of 112
books, a subset of the full catalog of the editor. The evaluation has been carried
out with the help of a group of researchers who are experts of ancient history
and speci cally of the topics covered by the set of books. The evaluation of the
system has been divided into three phases. In the rst phase, the researchers, as
domain experts, assessed the quality of the semantic enrichment process. In the
following two phases, the system as a whole has been evaluated through user
tests. In both phases, users interact with the visual user interface, in the rst
phase without given constraints, in the second phase with a set of tasks and a
more structured setup.
5.1</p>
      </sec>
    </sec>
    <sec id="sec-12">
      <title>Quality of entity extraction</title>
      <p>The researchers have read deeply ten books contained in the system, and for
each book have evaluated the quality of:
{ the correspondence of the top concepts with the main topics;
{ the correspondence of the concepts with entities mentioned in the book;
{ the disambiguation of the words.</p>
      <p>The ndings were analyzed qualitatively, as the purpose was to test the
feasibility of the whole system rather than to evaluate the NERL method per se.
The quality has been deemed su cient to be used e ectively. Nevertheless, some
issues emerged. They are described in section 5.4 along with ideas to approach
them.
5.2</p>
    </sec>
    <sec id="sec-13">
      <title>Direct observation of unconstrained navigation</title>
      <p>In the second phase of the evaluation, two users were invited to experiment using
the user interface without any speci c constraints.</p>
      <p>Three types of reactions have been observed:
{ positive surprise in nding and verifying relationships between concepts that
they were already aware of;
{ amazement in nding new unknown relationships;
{ displeasure in not nding expected relationships, due to the lack of content.
Overall, from this rst observation of the user interface use, the results have
been that the navigation was smooth and stimulating in the exploration of the
contents.
5.3</p>
    </sec>
    <sec id="sec-14">
      <title>Task-driven usage and questionnaire</title>
      <p>In the second part of the evaluation, six users participated in a task-oriented
evaluation.</p>
      <p>Before starting with the compilation of the questionnaire, users were invited
to watch four video tutorials in the appropriate section of the interface to
familiarize themselves with the arca commands and functions. They were given
a questionnaire containing task-instructions interspersed with the related
questions, some of them open ended, some of them requiring a number on a Likert
scale from 1 to 5. The questionnaire was divided into four sections: (I) three
basic tasks followed each one by a numeric question on the di culty; (II) ve
identical macro-tasks (decomposed in seven sub-tasks) each one of them
consisting in a set of search-exploration steps starting from a di erent topic, chosen
by the user, interspersed by two questions after each sub-task, one numeric and
one open ended, to evaluate the data quality of the explored results; (III) four
general open ended questions; (IV) other three pairs of questions, one numeric
and one open ended, to evaluate the technical aspects of the interface.</p>
      <p>Using and aggregating Likert scale scores was helpful in quantitatively
summarizing the sentiment of the users, albeit the number of users involved in the
test is too small to look for statistical relevance.</p>
      <p>I To answer the rst section of the questionnaire, users were assigned three
precise activities to perform, and on the basis of the results obtained, it been
assessed the ease of use of the system and the satisfaction obtained by the
exploration of resources. From Figure 2a it can be seen that 100% of the test
considered the navigability of arca from simple to very simple. In Figure 2b
the 87% consider consistent the books related to the selected entities and in
Figure 2c 83% is satis ed with the list of search results connected to a concept.
II In the second section of the questionnaire, instead of assigning speci c tasks,
each of the six users was given the freedom to choose ve di erent concepts
from which to start their own research and ve di erent books to explore at own
choice. For each chosen concept and book, the questions assessed the quality of
the explored search results. From this group of questions, it emerged that for
the users, the obtained results are mostly coherent with what they expect.
III In the third section, four general questions were asked, which are shown
below along with an outline of user responses.</p>
    </sec>
    <sec id="sec-15">
      <title>What are the most useful features of arca?</title>
      <p>Among the most useful features, users that have tested the interface have
identi ed:
{ the possibility of identifying and exploring links between concepts which
makes the research method immersive and stimulating;
{ the search for books starting from concepts and, above all, starting from
other books;
{ the fact that it is an open system and modulated on improvable and
replaceable components;
{ it is a potentially very large container of editorial products;
{ the ability to save search paths and export them as SVG and PNG images;
{ the display of integrated data (texts, images, links);
{ the direct connection with the catalogs of the publishing house;
{ the possibility of verifying the concepts extracted from the corpus of books
through the \info" button of the component containing the books;
{ the division of the concepts extracted from books into concepts and top
concepts.</p>
    </sec>
    <sec id="sec-16">
      <title>What are arca weaknesses?</title>
      <p>Among the weaknesses of arca at the actual stage, users have identi ed:
{ sometimes errors of disambiguation and restitution of entities not always
relevant;
{ sometimes errors in extracting concepts from the text;
{ little information contained in the system to satisfy curiosity in exploring a
larger catalog of books to discover more connections;
{ the system o ers multiple features that must be properly explained to allow
the user a complete browsing experience.</p>
    </sec>
    <sec id="sec-17">
      <title>What features do you think is useful to add to arca to make it a</title>
      <p>better system?
The features that, for the users, could improve the system are:
{ integration with other KGs;
{ improvement of the entity extraction module;
{ improve the order and organization of the connected elements to the entities;</p>
    </sec>
    <sec id="sec-18">
      <title>Add any other notes and thoughts useful to improve arca.</title>
      <p>Users have noted that it might be interesting to implement the following features:
{ o er the possibility to save the search history;
{ create personal bibliographic lists with the results of the searches in a
dedicated space on the board (exportable and downloadable);
{ create a support system that can help the users to know in a more easily and
understandable way, how to take full advantage of all the features of arca
IV In the fourth section, there are questions to assess the technical aspects of
the interface. The Likert scale questions and answers reveal that the loading
times and the organization of the visualization of the results can be improved.
In the motivation following the Likert scale questions, users told that they have
experienced slowdowns in particular in viewing the book catalog, and in the
loading of video tutorials.
The system obtained a more than satisfactory performance in recommending
relevant editorial products and received a high score in terms of usability; the
simplicity of use; user satisfaction with the results shown; consistency of the
contents with the s domain of the publishing house; attractiveness of the system.</p>
      <p>Nonetheless, the evaluation of the quality of the entity extraction (Section
5.1) highlighted some issues that need to be addressed in future versions and
considered in related works, like the wrong resolution of acronyms and
abbreviations and the incorrect disambiguation of entities due to the absence of the
correct entity in the DBpedia KG.</p>
      <p>Another feedback is that some users complained about a relatively small
amount of information in the internal KG (built with the concepts of 112 books
and the related metadata). It is expected that when the catalog of books is
numerically more signi cant, the chance of discovering new information and
connections while browsing the KG will increase.</p>
      <p>Finally, based on initial observations, it has been seen that using the
system at rst glance can be di cult without viewing the video tutorials. As a
lighter alternative to video tutorials, an help component could be implemented
to accompany the users in the rst searches and thus make them independent
in exploiting all the possibilities of exploration that the system o ers.
6</p>
      <p>Conclusions and future work
arca is an innovative system based on the visual semantic search that allows
the exploration of a text corpus. Through a knowledge graph-based navigation,
users can start from any relevant entity and reach other entities related to it,
discovering in which books or articles each entity is present and evaluating which
of these results are useful for their research. The user studies conducted so far
con rm the amenability of the proposed system to domain experts who were
able to perform non-trivial tasks of search and exploration, tasks that would be
more cumbersome to execute with the search tools they are used to. Feedback
gathered from users suggest that the proposed exploration mechanism tends to
amplify the user experience by also o ering opportunities for further study and
discovery of sources, themes, and materials, which have the potential of enriching
the research process with new ideas.</p>
      <p>Through the presented user study, many desiderata have been collected, and
they can be used to guide further development and experimentation in this
context. Furthermore, in order to extend the evaluation to more users, a more
extensive indirect observation study has been planned. In addition to the
questionnaire, the analysis will be further completed with objective data gathered
by tracking users' activity through interaction logs.</p>
      <p>As a potential future direction of research and development, the scope of the
semantic enrichment process could be broadened to other document elements,
such as images and captions, enriching KG exploration.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ahlberg</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shneiderman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Visual information seeking: Tight coupling of dynamic query lters with star eld displays</article-title>
          .
          <source>In: The Craft of Information Visualization</source>
          , pp.
          <volume>7</volume>
          {
          <fpage>13</fpage>
          .
          <string-name>
            <surname>Elsevier</surname>
          </string-name>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ahlberg</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Williamson</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shneiderman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Dynamic queries for information exploration: An implementation and evaluation</article-title>
          . p.
          <volume>619</volume>
          {
          <fpage>626</fpage>
          . CHI '
          <volume>92</volume>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computing Machinery, New York, NY, USA (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Baeza-Yates</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ribeiro-Neto</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , et al.:
          <article-title>Modern information retrieval</article-title>
          , vol.
          <volume>463</volume>
          . ACM press New York (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lassila</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>The Semantic Web</article-title>
          .
          <source>Scienti c American</source>
          <volume>284</volume>
          (
          <issue>5</issue>
          ),
          <volume>34</volume>
          {
          <fpage>43</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Bikakis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sellis</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Exploration and visualization in the web of big linked data: A survey of the state of the art</article-title>
          .
          <source>arXiv preprint arXiv:1601.08059</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Linked data-the story so far</article-title>
          .
          <source>Int. J. on Semantic Web and Information Systems</source>
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <volume>1</volume>
          {
          <fpage>22</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Bolina</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <string-name>
            <given-names>Yewno</given-names>
            <surname>Discover</surname>
          </string-name>
          .
          <source>Nordic Journal of Information Literacy in Higher Education</source>
          <volume>11</volume>
          (
          <issue>1</issue>
          ) (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Ceriani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bernasconi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mecella</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A streamlined pipeline to enable the semantic exploration of a bookstore</article-title>
          .
          <source>In: Digital Libraries: The Era of Big Data and Data Science - 16th Italian Research Conference on Digital Libraries, IRCDL</source>
          <year>2020</year>
          , Bari, Italy, January
          <volume>30</volume>
          -
          <issue>31</issue>
          ,
          <year>2020</year>
          , Proceedings.
          <source>Communications in Computer and Information Science</source>
          , vol.
          <volume>1177</volume>
          , pp.
          <volume>75</volume>
          {
          <fpage>81</fpage>
          . Springer (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Dudas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lohmann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Svatek</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pavlov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Ontology visualization methods and tools: a survey of the state of the art</article-title>
          .
          <source>Knowledge Eng. Review</source>
          <volume>33</volume>
          ,
          <issue>e10</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Ehrlinger</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          , Wo , W.:
          <article-title>Towards a de nition of knowledge graphs</article-title>
          .
          <source>In: SEMANTiCS</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Haase</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Herzig</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kozlov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trame</surname>
          </string-name>
          , J.:
          <article-title>metaphactory: A platform for knowledge graph management</article-title>
          .
          <source>Semantic Web</source>
          <volume>10</volume>
          ,
          <issue>1</issue>
          {
          <fpage>17</fpage>
          (06
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Marie</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gandon</surname>
            ,
            <given-names>F.L.</given-names>
          </string-name>
          :
          <article-title>Survey of linked data based exploration systems</article-title>
          .
          <source>In: IESD@ISWC</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Mouromtsev</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pavlov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Emelyanov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morozov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Razdyakonov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galkin</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The simple web-based tool for visualization and sharing of semantic data and ontologies</article-title>
          . In: International Semantic Web Conference (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Nadeau</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sekine</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A survey of named entity recognition and classi cation</article-title>
          .
          <source>Lingvisticae Investigationes</source>
          <volume>30</volume>
          (
          <issue>1</issue>
          ),
          <volume>3</volume>
          {
          <fpage>26</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Nisheva-Pavlova</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alexandrov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>GLOBDEF: A Framework for Dynamic Pipelines of Semantic Data Enrichment Tools</article-title>
          .
          <source>In: Proc. of MTSR 2018</source>
          . pp.
          <volume>159</volume>
          {
          <fpage>168</fpage>
          . Springer (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Po</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bikakis</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Desimoni</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Papastefanatos</surname>
          </string-name>
          , G.:
          <article-title>Linked data visualization: Techniques, tools, and big data (</article-title>
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Ristoski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>Semantic web in data mining and knowledge discovery: A comprehensive survey</article-title>
          .
          <source>Journal of Web Semantics</source>
          <volume>36</volume>
          ,
          <issue>1</issue>
          {
          <fpage>22</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Shen</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J</given-names>
            ., Han, J
          </string-name>
          .:
          <article-title>Entity linking with a knowledge base: Issues, techniques, and solutions</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>27</volume>
          (
          <issue>2</issue>
          ),
          <volume>443</volume>
          {
          <fpage>460</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Shneiderman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The eyes have it: A task by data type taxonomy for information visualizations</article-title>
          .
          <source>In: The craft of information visualization</source>
          , pp.
          <volume>364</volume>
          {
          <fpage>371</fpage>
          .
          <string-name>
            <surname>Elsevier</surname>
          </string-name>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Zloof</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          :
          <article-title>Query-by-example: A data base language</article-title>
          .
          <source>IBM Syst. J</source>
          .
          <volume>16</volume>
          ,
          <issue>324</issue>
          {
          <fpage>343</fpage>
          (
          <year>1977</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>