<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Milestone for Kepler-aSI at SemTab 2022</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Wiem Baazouzi</string-name>
          <email>wiem.baazouzi@ensi-uma.tn</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marouen Kachroudi</string-name>
          <email>marouen.kachroudi@fst.rnu.tn</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sami Faiz</string-name>
          <email>sami.faiz@insat.rnu.tn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Tabular Data, Knowledge Graph, Kepler-aSI, SPARQL</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Université de Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis, Laboratoire de Télédétection et Systèmes</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Université de Tunis El Manar, Faculté des Sciences de Tunis</institution>
          ,
          <addr-line>Informatique Programmation Algorithmique et</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <abstract>
        <p>In this paper, we present our system Kepler-aSI, for the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab 2022). This system is participating for the second time in this challege edition, bringing improvements and new technical aspects. Kepler-aSI analyzes tabular data to be able to detect correct matches in Wikidata and Dbpedia. It should be noted that each data resource, or each round of the challenge imposes a certain number of constraints, requiring advanced techniques. The aforementioned task turns out to be dificult for the machines, which requires an additional efort in order to deploy the congenitive capacity in the matching methods. Kepler-aSI [1, 2, 3, 4] still relies on the SPARQL query to semantically annotate tables in Knowledge Graphs (KG), in order to solve the critical problems of matching tasks. The results obtained during the evaluation phase are encouraging and show the strengths of the proposed system.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        (Comma-Separated Values)). The main reason for the popularity of this format is its simplicity:
many common ofice tools are available to facilitate their generation and use. Tables on the
Web are a very valuable data source. Thus, injecting semantic information into arrays on the
web has the potential to boost a wide range of applications, such as web searching, answering
queries, and building Knowledge Bases (KB). Research reports that there are various issues
with tabular data available on the Web, such as learning with limited labeled data, defining
or updating ontologies, exploiting prior knowledge, and / or scaling up existing solutions.
Therefore, this task is often dificult in practice, due to missing, incomplete or ambiguous
metadata (e.g., table and column names). In recent years, we have identified several works that
can be mainly classified as supervised (in the form of annotated tables to carry out the learning
task) [
        <xref ref-type="bibr" rid="ref10 ref11 ref7 ref8 ref9">7, 8, 9, 10, 11</xref>
        ] or unsupervised (tables whose data is not dedicated to learning) [
        <xref ref-type="bibr" rid="ref11 ref12">12, 11</xref>
        ].
To solve these problems, we propose a global approach named Kepler-aSI, which addresses the
challenge of matching tabular data to knowledge graphs.This method is based on previous
work, which deals with ontology alignment [
        <xref ref-type="bibr" rid="ref13 ref14 ref15 ref16 ref17">13, 14, 15, 16, 17</xref>
        ].
      </p>
      <p>This year’s SemTab challenge difers from the last two sessions 1 2, in that it deals with
Wikidata and Dbpedia. In this challenge, the input is a CSV file, but three diferent challenges
had to be met :
1. CTA : A type of the Wikidata (or eventually Dbpedia) ontology had to be assigned a
class KG to a column (Column-Type Annotation ).
2. CEA : A Wikidata or Dbpedia entity had to be matched to the diferent cells (Cell-Entity</p>
      <p>Annotation).
3. CPA : A KG (Wikidata or Dbpedia) property had to be assigned to the relationship
between two columns (Columns Property Annotation).</p>
      <p>
        Data annotation is a fundamental process in tabular data analysis [
        <xref ref-type="bibr" rid="ref18 ref19">18, 19</xref>
        ], it allows to infer
the meaning of other information. Then deduce the meaning of a tabular Knowledge Graph.
The data we used was based both on Wikidata and Dbpedia. It should be noted that in a broader
context, the data used and manipulated obey the triples format representation : subject ( ), a
predicate ( ) and an object ( ). This notation ensures semantic navigability through the data
and makes all data manipulation more fluid, explicit and reliable. Indeed, Cell Entity Annotation
(CEA) matches a cell to a KG entity. At this level, we have to annotate each individual element
of the subject ( ) and the object ( ). Column Property Annotation (CPA) assigns a KG property
to the relationship between two columns. The task is to find out which property of the two
columns are connected to either Wikidata or Dbpedia. Column Type Annotation (CTA) assigns
connected semantic type to a column.
      </p>
      <p>Our goal is to design a fast and eficient approach to annotate tabular data with entities
from Wikidata or Dbpedia. Our approach combines a multitude of NLP and search and filter
strategies, based on text preprocessing techniques. Experiments carried out in the context of
SemTab 2022 for all tasks have shown encouraging results.</p>
      <sec id="sec-1-1">
        <title>1https://www.cs.ox.ac.uk/isg/challenges/sem-tab/2019/ 2https://www.cs.ox.ac.uk/isg/challenges/sem-tab/2020/</title>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Kepler-aSI approach</title>
      <p>In this section, we will describe in detail the diferent stages of our system, while presenting
some basic notions to highlight the technical issues identified.
2.1. Key notions
• Tabular Data :  is a two-dimensional tabular structure made up of an ordered set of N
rows and M columns, as depicted by Figure ??.   is a row of the table (i = 1 ... N),   is a
column of the table (j = 1 ... M). The intersection between a row   and a column   is   , ,
which is a value of the cell   , . The table contents can have diferent types (string, date,
lfoat, number, etc.).</p>
      <p>– Target Table (S): M × N.
– Subject Cell:  (,0) (i = 1, 2 ... N).
– Object Cell:  (,) (i = 1, 2 ... M),(j = 1, 2 ... N).</p>
      <p>Col0
 1 ⎛  1,0</p>
      <p>⎜⎜ ⋮⋮
  ⎜   ,0
⎜ ⋮
⎜ ⋮
Row ⎝   ,0
…
⋱
⋱
…
⋱
⋱
…</p>
      <p>Col
…
⋱
⋱
  ,
⋱
⋱
…
…
⋱
⋱
…
⋱
⋱
…</p>
      <p>Col
 1, ⎞</p>
      <p>
        ⋮⋮ ⎟⎟
  , ⎟
⋮ ⎟
⋮ ⎟
  , ⎠
• Knowledge Graph : Knowledge Graphs have been in the focus of research since 2012,
resulting in a wide variety of published descriptions and definitions. The lack of a common
core, a fact that is also indicated by Paulheim [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] in 2015. Paulheim listed in his survey
of Knowledge Graph refinement, the minimum set of characteristics that must be present
to distinguish Knowledge Graphs from other knowledge collections, which basically
restricts the term to any graph based knowledge representation. In the online reviewing
[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], authors agreed that a more precise definition was hard to find at that point. This
statement points out the need of a closer investigation and deeper reflection in this area.
Farber et al. defined a Knowledge Graph as an Resource Description Framework (RDF)
graph and stated that the term KG was coined by Google to describe any graph-based
Knowledge Base (KB) [21]. Although this definition is the only formal one, it contradicts
with more general definitions as it explicitly requires the RDF data model. In the following
we present a detailed description of our contribution, namely Kepler-aSI.
2.2. System description
In order to address the above mentioned SemTab challenge tasks, Kepler-aSI is designed
according to the workflow depicted by Figure 2. There are three major complementary modules
which consist in, respectively, Preprocessing, Annotation context and Tabular data to KG
matching. The aforementioned steps are the same for each round, but the changes remain minimal
depending on the variations observed in each case.
      </p>
      <p>As shown in Figure 2 Preprocessing aims to prepare the data inside the considered table.
While Annotation Context, seeks to create a list of terms denoting the same context.</p>
      <sec id="sec-2-1">
        <title>2.2.1. Preprocessing</title>
        <p>It should be noted that the content of each table can be expressed according to diferent types
and formats, namely: numeric, character strings, binary data, date/time, boolean, addresses,
etc. Indeed, with the great diversity of data types, the pre-processing step is crucial. Therefore,
the goal of preprocessing is to ensure that the processing of each table is triggered without
errors. The efort is especially accentuated when the data contains spelling errors. In other
words, these issues must be resolved before we apply our approach. In order to well carry out
this step, we used several techniques and libraries such as (Textblob3, Pyspellchecker4, etc.)
to rectify and correct all the noisy textual data in the considered tables. As an example, we
detect punctuation, parentheses, hyphen and apostrophe, and also stop words by using the
P a n d a s 5 library to remove them. Like a classic treatment in this register, we ended this phase by
transforming all the upper case letters into lower case.</p>
        <sec id="sec-2-1-1">
          <title>3https://textblob.readthedocs.io/en/dev/ 4https://pypi.org/project/pyspellchecker/ 5https://pandas.pydata.org</title>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>2.2.2. Annotation context</title>
        <p>This phase allows to explicitly extract the candidates for the annotation process. The priming is
carried out by a processing columns analysis, which aims to understand and delimit the set of
regular expressions which contains a set of units: the area, the currency, the density, the electric
current, the energy , flow rate, force, frequency, energy eficiency, unit of information, length,
density, mass, numbers, population density, power, pressure, speed, temperature, time, torque,
voltage and volume. This step allows to identify multiple Regextypes using regular expressions
(e.g. numbers, geographic coordinates, address, code, color, URL). Since all values of type text
are selected, preprocessing for natural languages was performed using the l a n g r i d 6 library to
detect 26 languages in our data. By the way, it’s a novelty for this year’s SemTab challenge, i.e.,
which makes the task more dificult with the introduction of natural language barriers. The
l a n g r i d library is a stand-alone language identification tool. It is preformed on a large number
of languages (97 currently). Doing so, correction, data type and language detection is performed.
This can considerably reduce the efort and the cost of executing our approach by avoiding the
massive repetition of these treatments for all the table cells, and this in each subtask.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.2.3. Assigning a semantic type to a column (CTA)</title>
        <p>As depicted by Figure 3, the task is to annotate each entity column with elements from Wikidata
(or possibly Dbpedia) as its type identified during the preprocessing phase.</p>
        <p>Each item is marked with the tag in Wikidata or Dbpedia. This treatment allows semantics
identification. The CTA task can be performed based on Wikidata or Dbpedia APIs which allows
us to search for an item according to its description. The main information collected about
a given entity and used in our approach are: a list of instances (expressed by the i n s t a n c e O f
primitive and accessible by the P31 code), the subclass of (expressed by the s u b c l a s s O f primitive
and accessible by code P279) and overlaps (expressed by the p a r t O f primitive and accessible
by code P361). At this point, we are able to process the CTA task using a SPARQL query. The
SPARQL query is our interrogation mean fed by the main information of the entity which
governs the choice of each data type, since they are a list of instances (P31), of subclasses (P279)
or a part of a class (P361). The result of the SPARQL query may return a single type but for
some cases the result is more than one type, so in this case no annotation is produced for the
CTA task.</p>
      </sec>
      <sec id="sec-2-4">
        <title>2.2.4. Matching a cell to a KG entity (CEA)</title>
        <p>The CEA task aims to annotate the cells of a given table to a specific entity listed on Wikidata
or Dbpedia.</p>
        <p>Figure 4 gathers the CEA task that can be performed based on the same principle of CTA
task. Our approach reuses the results of the CTA task process by introducing the necessary
modifications on the SPARQL query. If the operation returns more than one annotation and
since we are conducting a fuzzy search [22, 23], we run a process based on examining the context
of the considered column, relatively to what was obtained with the CTA task, to overcome the
ambiguity problem.</p>
      </sec>
      <sec id="sec-2-5">
        <title>2.2.5. Matching a property to a KG entity (CPA)</title>
        <p>After having annotated the cell values as well as the diferent types of each of the considered
entities, we will identify the relationships between two cells appearing on the same row via a
property using a SPARQL query, as flagged by Figure 5. Indeed, the CPA task look for annotating
the relationship between two cells in a row via a property. Similarly, this latter task can be
performed in an analogous manner to the CTA and CEA tasks. The only diference in the CPA
task is that the SPARQL query must select both the entity and the corresponding attributes.
The properties are fairly easy to match since we have already determined them during CEA
and CTA task processing.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Kepler-aSI performance and results</title>
      <p>In this section we will present the results of Kepler-aSI for the diferent matching tasks in the 3
rounds of SemTab 2022. These results highlight the strengths of Kepler-aSI with its encouraging
performance despite the multiplicity of issues.
3.1. Round 1
In this first round, and for this version of SemTab 2022, three tasks are presented: CTA-WD,
CEA-WD and CPA-WD. The column type annotation (CTA -WD) assigns a Wikidata semantic
type (a Wikidata entity) to a column. Cell Entity Annotation (CEA-WD) maps a cell to a KG
entity. Annotation should be represented by its full IRI, where case is not sensitive. Each
line should include a column identified by a table ID and a column ID, along with the column
annotation (a Wikidata item). This means that a row must include three fields: ” Table ID”,
”Column ID”, and ”IRI Annotation”, where:
• ”Table ID” is the filename of the table data, but does not include the extension.
• ”Column ID” is the position of the column in the input, starting from 0, i.e., the ID of the
ifrst column is 0.
• ”IRI annotation”: the prefix of h t t p : / / w w w . w i k i d a t a . o r g / e n t i t y / instead of
h t t p s : / / w w w . w i k i d a t a . o r g / w i k i / which is the URL prefix of the Wikidata page.</p>
      <p>When it comes to associating a cell with an entity on the Knowledge Graph, the task is to
annotate each target cell with an entity from Wikidata. Each annotation must contain the
target cell annotation. A cell can be annotated by an entity with the prefix
http://www.wikidata.org/entity/. Each CTA annotation must contain the annotation of a cell identified by a
table identifier, a column identifier and a row identifier. Namely, an annotation must have four
ifelds: ” Table ID”, ”Row ID”, ”Column ID” and ”Entity IRI ”, where:
• ”Table ID”: does not include the filename extension; and removing the .csv extension
from the filename.
• ”Column ID”: is the position of the column in the table file, starting from 0, i.e. the ID of
the first column is 0.
• “Row ID”: is the row position in the table file, starting from 0, i.e. the first row ID is 0.
• ”Entity IRI ”: the prefix of h t t p : / / w w w . w i k i d a t a . o r g / e n t i t y / instead of
h t t p s : / / w w w . w i k i d a t a . o r g / w i k i / which is the URL prefix of the Wikidata page.</p>
      <p>As for the annotation of column properties by Wikidata (CPA-WD), it consists in annotating
the relations between the columns in a table with the properties of Wikidata. Each annotation
must contain the two-column annotation which is itself identified by a table identifier, a first
column identifier and a second column identifier. Namely, a row must have four fields: ” Table
ID”, ”Column ID 1”, ”Column ID 2” and ”Property IRI ”. Each pair of columns must be annotated
by at most one property, as follows:
• ”Table ID” does not include filename extension
• ”Column ID 1”, ”Column ID 2”, is the position of the column in the table file, starting from
0, i.e. the first column ID is 0.
• ”Property IRI ”: prefix of h t t p : / / w w w . w i k i d a t a . o r g / p r o p / d i r e c t / instead
h t t p s : / / w w w . w i k i d a t a . o r g / w i k i / which is the URL prefix of the Wikidata page.
of</p>
      <p>It should be noted that the CTA-WD, CEA-WD and CPA-WD task data contains 3691 tables.
Results are summarized in Table 1:
3.2. Round 2
Round 2 includes 3 main families of tests, the results of which are summarized in Table 2:
• HardTables (HT-WD): represented by 4649 tables;
• ToughTablesR2-WD (2T-WD): represented by 114 tables;
• ToughTablesR2-DBP (2T-DBP): represented by 114 tables.</p>
      <p>APrecision
3.3. Round 3
Round 3 includes 3 main families of tests, metrics are in Table 3:
• GitTables schema: represented by 45 tables;
• GitTables DBP: represented by 6898 tables;
• Bio-Div-Tables: represented by 45 tables.</p>
      <sec id="sec-3-1">
        <title>BiodivTab-CTA-DBP</title>
      </sec>
      <sec id="sec-3-2">
        <title>BiodivTab-CEA-DBP</title>
      </sec>
      <sec id="sec-3-3">
        <title>GitTables-CTA-DBP</title>
      </sec>
      <sec id="sec-3-4">
        <title>GitTables-CTA-SCH</title>
        <p>Round 3</p>
      </sec>
      <sec id="sec-3-5">
        <title>APrecision</title>
        <p>0.781
0.534
AF1
0.731
0.534
—
—</p>
      </sec>
      <sec id="sec-3-6">
        <title>Rank</title>
        <p>3/7
4/7</p>
        <p>In Round 3, we realized that there were significant amounts of entity duplication in our
result. Thus, the pairing process has been improved by adding the following features. First,
spell checking of misspelled sentences was used. However, approaches based on relishing
content duplications can achieve results without column duplication. In order to overcome
duplicate columns, We used Fuzzy matching in pandas to detect duplicate rows (eficiently). In
fact, FuzzyWuzzy is an implementation of edit distance, which would be a good candidate for
constructing a pairwise distance matrix in numpy or similar. To detect ”duplicates” or close
matches, We have to compare each row to the other rows or We will never know if two are
close to each other</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>To conclude, we have presented in this paper the second version of our Kepler-aSI approach.
Our system is participating in the challenge for the third time, it is approaching maturity
and achieving very encouraging performance. We have succeeded in combining several
strategies and treatment techniques, which is also the strength of our system. We boosted the
preprocessing and spellchecking steps that got the system up and running.</p>
      <p>In addition, despite the data size, which is quite large, we managed to get around this
problem by using a kind of local dictionary, which allows us to reuse already existing matches.
Thus, we realized a considerable saving of time, which allowed us to adjust and rectify after
each execution. We also participated in all the tasks without exception, which allowed us to
test our system on all facets, i.e., to identify its strengths and weaknesses.</p>
      <p>In this paper, we presented our contribution to the SemTab2022 challenge, Kepler-aSI.
We tackled the several proposed tasks. Our solution is based on a generic SPARQL query
using the cell contents as a description of a given item. In each round, despite the time
allocated by the organizers running out, we continued the work and the improvements, having
the conviction that each efort counts and brings us closer to the good control of the studied field.</p>
      <p>Demos, SuCCESS) 48 (2016) 1–4.
[21] M. Färber, F. Bartscherer, C. Menne, A. Rettinger, Linked data quality of dbpedia, freebase,
opencyc, wikidata, and yago, Semantic Web 9 (2018) 77–129.
[22] H. Akremi, S. Zghal, Dof: a generic approach of domain ontology fuzzification, Frontiers</p>
      <p>Comput. Sci. 15 (2021) 153322.
[23] H. Akremi, M. G. Ayadi, S. Zghal, To medical ontology fuzzification purpose: Covid-19
study case, Procedia Computer Science 207 (2022) 1027–1036.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>W.</given-names>
            <surname>Baazouzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Faïz</surname>
          </string-name>
          ,
          <article-title>Kepler-asi: Kepler as a semantic interpreter, in: Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab 2020) co-located with the 19ℎ International Semantic Web Conference (ISWC</article-title>
          <year>2020</year>
          ),
          <article-title>Virtual conference (originally planned to</article-title>
          be in Athens, Greece),
          <source>November 5</source>
          ,
          <year>2020</year>
          , volume
          <volume>2775</volume>
          ,
          <year>2020</year>
          , pp.
          <fpage>50</fpage>
          -
          <lpage>58</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>W.</given-names>
            <surname>Baazouzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Faïz</surname>
          </string-name>
          ,
          <article-title>Kepler-asi at semtab 2021, in: Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching co-located with the 20ℎ International Semantic Web Conference (ISWC</article-title>
          <year>2021</year>
          ), Virtual conference,
          <source>October</source>
          <volume>27</volume>
          ,
          <year>2021</year>
          , volume
          <volume>3103</volume>
          ,
          <year>2021</year>
          , pp.
          <fpage>54</fpage>
          -
          <lpage>67</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>W.</given-names>
            <surname>Baazouzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Faiz</surname>
          </string-name>
          ,
          <article-title>Towards an eficient fairification approach of tabular data with knowledge graph models</article-title>
          ,
          <source>in: Proceedings of the 26ℎ Knowledge-Based and Intelligent Information Engineering Systems International Conference KES</source>
          <year>2022</year>
          , volume
          <volume>207</volume>
          ,
          <year>2022</year>
          , pp.
          <fpage>2727</fpage>
          -
          <lpage>2736</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>W.</given-names>
            <surname>Baazouzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Faiz</surname>
          </string-name>
          ,
          <article-title>A matching approach to confer semantics over tabular data based on knowledge graphs</article-title>
          ,
          <source>in: Proceedings of the 11ℎ International Conference on Model and Data Engineering</source>
          , Springer,
          <year>2023</year>
          , pp.
          <fpage>236</fpage>
          -
          <lpage>249</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Jiménez-Ruiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Sutton</surname>
          </string-name>
          ,
          <article-title>Colnet: Embedding the semantics of web tables for column type prediction</article-title>
          ,
          <source>in: Proceedings of the AAAI Conference on Artificial Intelligence</source>
          , volume
          <volume>33</volume>
          ,
          <year>2019</year>
          , pp.
          <fpage>29</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Malyshev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Krötzsch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gonsior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bielefeldt</surname>
          </string-name>
          ,
          <article-title>Getting the most out of wikidata: Semantic technology usage in wikipedia's knowledge graph</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2018</year>
          , pp.
          <fpage>376</fpage>
          -
          <lpage>394</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Pham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Alse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Szekely</surname>
          </string-name>
          ,
          <article-title>Semantic labeling: a domain-independent approach</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2016</year>
          , pp.
          <fpage>446</fpage>
          -
          <lpage>462</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Taheriyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Szekely</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Ambite</surname>
          </string-name>
          ,
          <article-title>Learning the semantics of structured data sources</article-title>
          ,
          <source>Journal of Web Semantics</source>
          <volume>37</volume>
          (
          <year>2016</year>
          )
          <fpage>152</fpage>
          -
          <lpage>169</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>S. K.</given-names>
            <surname>Ramnandan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mittal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Szekely</surname>
          </string-name>
          ,
          <article-title>Assigning semantic labels to data sources</article-title>
          ,
          <source>in: European Semantic Web Conference</source>
          , Springer,
          <year>2015</year>
          , pp.
          <fpage>403</fpage>
          -
          <lpage>417</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Szekely</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Ambite</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Goel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lerman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Muslea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Taheriyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mallick</surname>
          </string-name>
          ,
          <article-title>Semi-automatically mapping structured sources into the semantic web</article-title>
          ,
          <source>in: Extended Semantic Web Conference</source>
          , Springer,
          <year>2012</year>
          , pp.
          <fpage>375</fpage>
          -
          <lpage>390</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Cremaschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>De Paoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Spahiu</surname>
          </string-name>
          ,
          <article-title>A fully automated approach to a complete semantic table interpretation, Future Generation Computer Systems (</article-title>
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <article-title>Efective and eficient semantic table interpretation using tableminer+</article-title>
          ,
          <source>Semantic Web</source>
          <volume>8</volume>
          (
          <year>2017</year>
          )
          <fpage>921</fpage>
          -
          <lpage>957</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          , G. Diallo,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ben</surname>
          </string-name>
          <string-name>
            <surname>Yahia</surname>
          </string-name>
          ,
          <article-title>OAEI 2017 results of KEPLER</article-title>
          , in
          <source>: Proceedings of the 12th International Workshop on Ontology Matching co-located with the 16th International Semantic Web Conference (ISWC</source>
          <year>2017</year>
          ), Vienna, Austria, October
          <volume>21</volume>
          ,
          <year>2017</year>
          , volume
          <volume>2032</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>138</fpage>
          -
          <lpage>145</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. Ben</given-names>
            <surname>Yahia</surname>
          </string-name>
          ,
          <article-title>Dealing with direct and indirect ontology alignment</article-title>
          ,
          <source>J. Data Semant</source>
          .
          <volume>7</volume>
          (
          <year>2018</year>
          )
          <fpage>237</fpage>
          -
          <lpage>252</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          , G. Diallo,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ben</surname>
          </string-name>
          <string-name>
            <surname>Yahia</surname>
          </string-name>
          ,
          <source>KEPLER at OAEI</source>
          <year>2018</year>
          ,
          <source>in: Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference, OM@ISWC</source>
          <year>2018</year>
          , Monterey, CA, USA, October
          <volume>8</volume>
          ,
          <year>2018</year>
          , volume
          <volume>2288</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>173</fpage>
          -
          <lpage>178</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zghal</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <article-title>Ben Yahia, Bridging the multilingualism gap in ontology alignment</article-title>
          ,
          <source>International Journal of Metadata, Semantics and Ontologies</source>
          <volume>9</volume>
          (
          <year>2014</year>
          )
          <fpage>252</fpage>
          -
          <lpage>262</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kachroudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zghal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. Ben</given-names>
            <surname>Yahia</surname>
          </string-name>
          ,
          <article-title>Using linguistic resource for cross-lingual ontology alignment</article-title>
          ,
          <source>International Journal of Recent Contributions from Engineering</source>
          <volume>1</volume>
          (
          <year>2013</year>
          )
          <fpage>21</fpage>
          -
          <lpage>27</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Jiménez-Ruiz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Sutton</surname>
          </string-name>
          ,
          <article-title>Learning semantic annotations for tabular data</article-title>
          , arXiv preprint arXiv:
          <year>1906</year>
          .
          <volume>00781</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>V.</given-names>
            <surname>Efthymiou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Hassanzadeh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rodriguez-Muro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Christophides</surname>
          </string-name>
          ,
          <article-title>Matching web tables with knowledge base entities: from entity lookups to entity embeddings</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2017</year>
          , pp.
          <fpage>260</fpage>
          -
          <lpage>277</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>L.</given-names>
            <surname>Ehrlinger</surname>
          </string-name>
          , W. Wöß,
          <article-title>Towards a definition of knowledge graphs</article-title>
          .,
          <source>SEMANTiCS</source>
          (Posters,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>