<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Understanding through Semantic Annotation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Abiola Paterne Chokki</string-name>
          <email>abiola-paterne.chokki@unamur.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rabeb Abida</string-name>
          <email>rabeb.abida@unamur.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Benoît Frénay</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Benoît Vanderose</string-name>
          <email>benoit.vanderose@unamur.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anthony Cleve</string-name>
          <email>anthony.cleve@unamur.be</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Namur</institution>
          ,
          <addr-line>Rue de Bruxelles 61, 5000 Namur</addr-line>
          ,
          <country country="BE">Belgium</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <abstract>
        <p>Many governments have published their data on the web with the goals of improving transparency and stimulating innovation, among others. In order to achieve these goals, users must be able to discover and understand these Open Government Data (OGD). The use of semantic annotation has been proven in previous studies to be efective in meeting this need. Yet, the process of annotating data remains an open challenge. Although eforts have been made in recent years to simplify this process, there is still a lack of semantic annotation tools that integrate well with OGD portals. To this end, we present ODSAG (Open Data Semantic Annotation and Graph), a chrome extension that can be easily interoperable with any OGD portal, automatically annotates an open dataset and creates graphs from it.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Around the world, many governments have implemented Open Government Data (OGD)
policies to make their data more accessible and usable by the public [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The release of these
data is most often motivated by values that include improving government transparency [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
and stimulating innovation [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. However, there is still a number of barriers that prevent
various OGD initiatives from reaching their full potential [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Among these challenges are
discoverability and understanding [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ]. Indeed, before being able to use the data, consumers
must be able to find data relevant to their needs (discoverability) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Once they discover the
relevant data, they must be able to understand the metadata and content of the data in order
to exploit it, such as for data integration, data cleaning, data mining, machine learning and
knowledge discovery tasks (understanding) [
        <xref ref-type="bibr" rid="ref5 ref7">5, 7</xref>
        ].
      </p>
      <p>
        The use of semantic annotation and Linked Open Data (LOD) has been proven in previous
studies [
        <xref ref-type="bibr" rid="ref6 ref8 ref9">8, 9, 6</xref>
        ] to be eficient in solving the mentioned challenges. Semantic annotation is the
process of assigning semantic tags from Knowledge Graphs (KGs) (e.g., Wikidata, DBpedia)
to data items [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. It mainly consists of the following tasks (see Figure 1): (1) Column Type
Annotation (CTA) which involves assigning a semantic type (e.g., a Wikidata class) to each
nEvelop-O
column (see green color), (2) Cell Entity Annotation (CEA) which involves mapping each cell to
an entity in KG (see magenta color) and (3) Column Property Annotation (CPA) which involves
assigning a property or predicate in KG to the relation between two columns (see blue color).
Although the benefits of semantically annotated data have been widely recognized, there is still
a vast number of datasets without any semantic annotation being published on open data portals
every day, probably because adding semantic annotations to data is a laborious, error-prone,
challenging task for publishers [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and also because the majority of existing tools are not capable
of automating the process and being both interoperable with open data portals. Therefore, our
research question is as follows: ”How to design a tool that supports semantic annotation of OGD? ”
      </p>
      <p>The methodology used to answer the research question is structured in three parts. First,
we identify, through existing tools, a list of requirements that should be included in a tool
that can facilitate the semantic annotation of OGD. Then, we implement these requirements in
the tool called ODSAG (Open Data Semantic Annotation and Graph). ODSAG has been
implemented as a chrome extension that can be easily interoperable with any OGD portal,
automatically annotates open data and creates graphs from the annotated data. Finally, we
evaluate its efectiveness using a COVID dataset available on the Namur, Belgium portal. The
target users of ODSAG are primarily publishers who can use the tool to annotate their data
before publishing them, but can also be extended to data analysts who can use the tool to
self-annotate unannotated datasets found on OGD portals to improve their understanding
before using them.</p>
      <p>The rest of this paper is divided into four main sections. Section 2 explores existing tools
for semantic data annotation. Section 3 presents the requirements identified to support OGD
semantic annotation. Section 4 describes the proposed ODSAG prototype. Section 5 describes
the implementation of ODSAG and demonstrates a use case. Finally, Section 6 provides a
conclusion that summarizes the contributions of this paper and proposes some directions for
future work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Open data on the web still has a high level of heterogeneity, lack of metadata and lack of
interoperability, making it dificult to explore and understand. Unfortunately, many data
producers are not familiar with LOD technologies and are not willing to invest time to integrate
their data with KGs (semantic annotation). To address these gaps, many tools have been
proposed. Reviewing each of them would beyond the scope of this paper. We have focused here
on the most recent or most cited tools in the literature: OpenRefine 2, SemanticBot [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], Odalic
[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], DataGraft [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], MantisTable [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], Mtab [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], JenTab [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Table 1 presents the reviewed
tools according to the following criteria: annotation method (C1), data pre-processing capability
(C2), subject-column detection capability (C3), semantic annotation (C4), technologies based on
(C5), KGs or ontologies used (C6), graph generation capability (C7), export capability (C8) and
interoperability capability with open data portals (C9). Table 1 also provides a comparison with
the proposed tool, ODSAG.
      </p>
      <p>Referring to Table 1, none of the reviewed tools is able to be both interoperable with open
data portals, automatically annotate data, and generate graph from the annotated data. This</p>
      <p>DBpedia</p>
      <p>YAGO</p>
      <p>LOV
Wikidata
Wikidata
DBpedia
Wikidata
DBpedia
Wikidata
DBpedia
Wikidata
DBpedia
Wikidata</p>
      <p>Y
N
N
N
N
N
Y</p>
      <p>Y
Y
Y
N
N
N
N</p>
      <p>C9
Open
Data
Soft
N/A
N/A
N/A
N/A
N/A
All
justifies the need for our ODSAG tool, which, compared to the other tools, can satisfy all these
features.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Identification of Requirements to Support Semantic</title>
    </sec>
    <sec id="sec-4">
      <title>Annotation of Open Data</title>
      <p>Based on the tools reviewed in Section 2, we are able to identify the list of requirements that a
tool might have to support semantic annotation of open data. The identified requirements are a
summary of existing features in the reviewed tools (see Table 2).</p>
      <p>Once the requirements are identified, we implement them in a tool that we will present in
the next section.</p>
    </sec>
    <sec id="sec-5">
      <title>4. ODSAG Prototype</title>
      <p>
        This section describes our proposed ODSAG prototype, which aims to address the shortcomings
mentioned in the previously discussed tools and to incorporate the identified requirements
in Section 3. Instead of starting from scratch, the ODSAG prototype integrates an existing
automatic semantic table annotation tool Mtab [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] into its process and then enhances it (1) to
make it interoperable with any open data portal and (2) to generate graphs from the annotated
data. Figure 2 presents an overview of the prototype which consists of 4 steps: 1) dataset
selection; 2) data pre-processing; 3) semantic annotation and 4) graph generation. The following
paragraphs describe each of these steps in more detail.
      </p>
      <p>
        During the dataset selection step, ODSAG takes as input an URL to access a dataset in
Excel, CSV or JSON format on any open data portal (e.g., CKAN, OpenDataSoft, Socrata, DKAN)
or on the user’s computer (section (a) of Figure 3). Unlike SemanticBot [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], ODSAG allows users
to interact with any open data portal (R1).
      </p>
      <p>During the data pre-processing step, ODSAG removes all numeric columns (except columns
containing postal code, NSI code, etc., which have a match in Wikidata), as well as geographic
columns (R2), because currently most numeric and geographic values in datasets available on
open data portals (e.g., the number of cases column in the COVID data) do not have a match in
the used KG: Wikidata (section (b) of Figure 3).</p>
      <p>
        The semantic annotation step aims at enabling users to automatically annotate an open
dataset (section (c) of Figure 3). This step is divided into two sub-steps: the detection of the
subject column (i.e., the column that is likely to have the most relations with the other columns)
(R3) and the generation of CEA, CPA and CTA (R4). For the subject column detection, we use
the procedure described in MantisTable [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. This process starts by determining the literal
columns (e.g., address, phone number, color, URL) using regular expressions. Once this step is
complete, the system chooses from the remaining columns (called named entity columns) the
subject column based on diferent statistic features, such as average number of words in each
cell, fraction of empty cells in the column, fraction of cells with unique content and distance
from the first named entity column [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. More details on the subject column detection can
be found in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Once the subject column detection step is complete, we rely on Mtab to
automatically generate the CEA, CPA and CTA. The mapping process in Mtab is done in three
steps. Step 1 involves generating Wikidata resources using the Wikidata entity dump and
history revisions, as well as creating two indexes for fuzzy entity search and fuzzy statement
search. In Step 2, the fuzzy entity search is used to find relevant entity candidates for each cell in
the table. Fuzzy statement search is used to handle the ambiguity of the table cells and consists
of using the values of two cells in the same row to determine if there is a statement (relation)
between them. In the end, only the entity candidates for which there is a relationship between
the cells are retained. In Step 3, for each retained entity candidate, the system calculates a
value matching (which depends on the statement similarities of each candidate and the other
cells in that row) and keeps only the entity candidate with the highest value. Once all CEA
are assigned, CPA are retrieved by aggregating all properties of statement candidates in the
same rows, and then using majority voting to select the CPA annotations. For CTA annotations,
we get the direct types from the CEA annotations in a column and vote for the majority types
to get the CTA annotations. More details about each step of MTab can be found in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Our
contribution in this step is that we have combined the strengths of MantisTable and MTab to
perform both sub-steps. MTab does not ofer a methodical subject column detection but has
excellent results for semantic annotation and MantisTable does not ofer excellent results for
semantic annotation like MTab but has a consistent subject column detection.
      </p>
      <p>During the graph generation step, ODSAG automatically creates two graphs (minimal
graph and full graph) from the annotated data (R6). In the minimal graph, only the CTA and
CPA are taken into account. CTA are represented as nodes and CPA as links connecting the
subject column to other nodes (section (d) of Figure 3). This graph allows users to visualize the
relations (CPA) between the subject column and other columns (CTA). The full graph, on the
other hand, is an extended version of the minimal graph and includes all the information of the
annotated data: CEA, CTA and CPA. CTA and CEA are represented as nodes and CPA as links
connecting the subject column (resp. CEA of subject column) with other CTA (resp. CEA of
other CTA) (section (e) of Figure 3). This graph allows users to visualize the relations between
data content items and column names and to discover hidden relations between them. Both
generated graphs are interactive, so users can move the nodes and discover relations between
elements in a more readable way. The nodes in the graph are also clickable, allowing users to
get more details about each entity if needed. Unlike some previous studies presented in Section
2 that generate only the minimal graph, ODSAG generates a full graph that helps users to better
explore the relations between cell values (CEA).</p>
    </sec>
    <sec id="sec-6">
      <title>5. Implementation and Demonstration</title>
      <p>In this section, we briefly explain the implementation of the ODSAG prototype (which source
code is available on GitHub 3 and show a use case of the prototype.</p>
      <p>Regarding the implementation, in order to provide a tool that is easy to install and use and
that is interoperable with any open data portal (R7), we chose to implement: (1) a chrome
extension that users can interact with (frontend) and (2) a django web application that is used
to interact between the chrome extension and Mtab which can be hosted online or locally
(backend).</p>
      <p>Figure 3 illustrates an example of annotation and graphs generated by ODSAG when using
the COVID1 open dataset available on the Namur (Belgium) portal. This portal was chosen as
it is the most advanced portal in Wallonia (Belgium) and access with key stakeholders of this
portal was possible (useful to evaluate later the direct integration of the tool to a portal). In order
to annotate a dataset, the user must first load the extension in the chrome extensions (only for
the first run). Then, as shown in section (a) of Figure 3, he/she has to go to any open data portal,
copy the URL link of the desired open dataset and paste it into the ODSAG URL field, then click
on “Generate” button. The system removes the numeric columns (“Nbre de cas”, “Nombre de cas
minimum”) and geographic columns (“limite communale”, “geo_point_2d”) from the selected
dataset (section (b) of Figure 3). A few seconds later, the system generates the annotated data and
returns it as a table and graphs. Section (c) of Figure 3 shows the annotated data in table form
where the type row includes the semantic tags associated with the name of each column (CTA).
The property row includes the semantic tags associated with the relation between the subject
3https://github.com/chokkipaterne/odsag
column (tx_descr_fr) and the other columns. The entity annotations are in red and located
below the table cell values. For example, the cell values “Anhée” and “Arrondissement de Dinant”
have been respectively mapped to the entities Q545889 “Anhée” and Q93740 “Arrondissement of
Dinant” (CEA), the column tx_descr_fr was associated with the class Q493522 “Municipality of
Belgium” (CTA) and the system detects a property P131 “located in the administrative territory
entity” between the column tx_descr_fr and tx_adm_dstr_descr_fr. Section (d) of Figure 3
shows the minimal graph. The subject column in this graph is represented by a light blue color,
the CTA are represented by a dark blue color and the CPA are represented by the black links.
Section (e) of Figure 3 shows the full graph. The subject column is represented by an orange
color, the CTA are represented by a light blue color, the CEA are represented by a dark blue
color and the CPA are represented by the black links.</p>
    </sec>
    <sec id="sec-7">
      <title>6. Conclusion and Future Work</title>
      <p>The aim of this paper was to facilitate the semantic annotation of open data in order to improve its
discoverability and understanding. To achieve that goal, we first identified a list of 7 requirements
that need to be implemented in a tool to facilitate semantic annotation of open data (see Table
2). Then, we implemented these requirements in a tool called ODSAG (Open Data Semantic
Annotation and Graph) and its efectiveness was evaluated with a COVID data from the Namur
(Belgium) portal.</p>
      <p>This research contributes to theory by proposing a list of requirements that need to be
implemented in a tool to facilitate semantic annotation of open data (see Table 2). It also
provides a comparative table highlighting the strengths and weaknesses of some existing
semantic annotation tools used in the literature (see Table 1). This research also contributes
to practice by implementing the identified requirements in a tool and providing the source
code of the tool. This can be used as a starting point for developers to create their tool to
facilitate semantic annotation of open data or to improve the prototype. However, this research
has two main limitations that will need to be addressed in future work: the non-validation
of the identified requirements and the non-evaluation of the proposed with the stakeholders
(publishers and data analysts).</p>
      <p>In the near future, we plan to validate the identified requirements, to test our prototype with
other datasets and stakeholders, compare it with other existing tools, and extend the prototype
with additional features, such as (1) integrating annotation of numerical and geographical
columns, (2) integrating additional knowledge graphs such as DBPedia, LOV, Geonames and
YAGO to improve the annotation and (3) generating a RDF file of the annotated open dataset
for use in Linked Open Data.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>The research was supported by a CERUNA PhD fellowship from the University of Namur.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Attard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Orlandi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Scerri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <article-title>A systematic review of open government data initiatives'</article-title>
          ,
          <source>Government Information Quarterly, Elsevier Ltd</source>
          <volume>32</volume>
          (
          <year>2015</year>
          )
          <fpage>399</fpage>
          -
          <lpage>418</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J.</given-names>
            <surname>Bertot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Jaeger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Grimes</surname>
          </string-name>
          ,
          <article-title>Using icts to create a culture of transparency: E-government and social media as openness and anti-corruption tools for societies'</article-title>
          ,
          <source>Government Information Quarterly, Elsevier B.V</source>
          <volume>27</volume>
          (
          <year>2010</year>
          )
          <fpage>264</fpage>
          -
          <lpage>271</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>T.</given-names>
            <surname>Davies</surname>
          </string-name>
          , Open Data,
          <article-title>Democracy and Public Sector Reform: A Look at Open Government Data Use from Data, Gov</article-title>
          . Uk,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P.</given-names>
            <surname>Johnson</surname>
          </string-name>
          , P. Robinson, Civic hackathons: Innovation, procurement, or civic engagement?',
          <source>Review of Policy Research</source>
          <volume>31</volume>
          (
          <year>2014</year>
          )
          <fpage>349</fpage>
          -
          <lpage>357</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <article-title>[5] Results of semtab 2021'</article-title>
          , in: CEUR Workshop Proceedings,
          <year>2021</year>
          , p.
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>P.</given-names>
            <surname>Křemen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Nečaský</surname>
          </string-name>
          ,
          <article-title>Improving discoverability of open government data with rich metadata descriptions using semantic government vocabulary'</article-title>
          ,
          <source>Journal of Web Semantics, Elsevier B.V</source>
          <volume>55</volume>
          (
          <year>2019</year>
          )
          <fpage>1</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Polleres</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Umbrich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Figl</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Beno, Perception of key barriers in using and publishing open data', JeDEM - EJournal of EDemocracy and Open Government 9 (</article-title>
          <year>2017</year>
          )
          <fpage>134</fpage>
          -
          <lpage>165</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Beno</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Filtz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kirrane</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. Polleres,</surname>
          </string-name>
          <article-title>Doc2rdfa: Semantic annotation for web documents'</article-title>
          ,
          <source>CEUR Workshop Proceedings</source>
          <volume>2451</volume>
          (
          <year>2019</year>
          )
          <fpage>0</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Heath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          ,
          <article-title>Linked data - the story so far'</article-title>
          ,
          <source>International Journal on Semantic Web and Information Systems</source>
          <volume>5</volume>
          (
          <year>2009</year>
          )
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>B.</given-names>
            <surname>Moreau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Terpolilli</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          <article-title>Serrano-alvarado, A semi-automatic tool for linked data integration'</article-title>
          ,
          <source>in: 18th International Semantic Web Conference (ISWC2019</source>
          ,
          <year>2019</year>
          , p.
          <fpage>4</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T.</given-names>
            <surname>Knap</surname>
          </string-name>
          ,
          <article-title>Towards odalic, a semantic table interpretation tool in the adequate project'</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          ,
          <year>2017</year>
          , p.
          <fpage>26</fpage>
          -
          <lpage>37</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>D.</given-names>
            <surname>Roman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Putlier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Sukhobok</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Berre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Ye</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dimitrov</surname>
          </string-name>
          , Datagraft :
          <article-title>One-stop-shop for open data management'</article-title>
          ,
          <source>Semantic Web Journal</source>
          <volume>9</volume>
          (
          <year>2016</year>
          )
          <fpage>393</fpage>
          -
          <lpage>411</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cremaschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Avogadro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Barazzetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chieregato</surname>
          </string-name>
          ,
          <article-title>Mantistable se: An eficient approach for the semantic table interpretation'</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          , I. Yamada,
          <string-name>
            <given-names>N.</given-names>
            <surname>Kertkeidkachorn</surname>
          </string-name>
          ,
          <year>Semtab 2021</year>
          :
          <article-title>Tabular data annotation with mtab tool'</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>N.</given-names>
            <surname>Abdelmageed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schindler</surname>
          </string-name>
          ,
          <article-title>Jentab meets semtab 2021's new challenges'</article-title>
          ,
          <source>in: CEUR Workshop Proceedings</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <article-title>Efective and eficient semantic table interpretation using tableminer'</article-title>
          ,
          <source>Semantic Web Journal</source>
          <volume>8</volume>
          (
          <year>2017</year>
          )
          <fpage>921</fpage>
          -
          <lpage>957</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>