<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Milan: Automatic Generation of R2RML Mappings</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sahil Nakul Mathur</string-name>
          <email>mathurs@tcd.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Declan O'Sullivan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rob Brennan</string-name>
          <email>rob.brennang@cs.tcd.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre, School of Computer Science and Statistics, Trinity College</institution>
          ,
          <addr-line>Dublin</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computing, Dublin City University</institution>
          ,
          <addr-line>Dublin</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Milan automatically generates R2RML mappings between a source relational database and a target ontology, using novel multi-level algorithms. It address real world inter-model semantic gap by resolving naming con icts, structural and semantic heterogeneity, thus enabling high delity mapping generation for realistic databases. Despite the importance of mappings for interoperability across relational databases and ontologies, a labour and expertise-intensive task, the current state of the art has achieved only limited automation. The paper describes an experimental evaluation of Milan with respect to the state of the art systems using the RODI benchmarking tool which shows that Milan outperforms all systems in all categories.</p>
      </abstract>
      <kwd-group>
        <kwd>RDB2RDF</kwd>
        <kwd>OBDA</kwd>
        <kwd>Schema and Ontology Matching</kwd>
        <kwd>Mapping Rules</kwd>
        <kwd>Linked Data</kwd>
        <kwd>Automatic Mapping</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The Semantic Web promises easy data integration but in practice, this depends
on the availability of detailed mappings, especially when converting from other
data models like the relational model to RDFs data model [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Schema and
ontology matching research [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] provide tools and methods to identify semantic
correspondences between models such as database schema and OWL ontologies
or Linked Data vocabularies. These correspondences are then validated, usually
by domain experts, and then formalised as mappings which can be expressed
in standard languages such as the W3C R2RML (Relational to RDF Mapping
Language) recommendation 3. However creating mappings is hard; it requires
labour, domain expertise, and knowledge modeling expertise. Hence automated
approaches are extensively studied. Relational database (RDB) to RDF
mappings have additional complexity since the relational and RDF data models do
not exactly align, have di erent expressivity and each emphasizes a di erent
modeling repertoire [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Hence for human validation of RDB to RDF
mappings, additional expertise is required in both RDB and R2RML/RDF modeling.
This, in the enterprise context means the potential gains of automating relational
to RDF mapping generation are considerable.
3 www.w3.org/TR/r2rml/
      </p>
      <p>The research question studied in this paper is To what extent can an
automatic RDB to RDF mapping generation technique, Milan, based on semantic,
lexical and structural analysis of both a source relational database and a target
ontology create complete and accurate R2RML mappings?</p>
      <p>Milan creates correspondences based on heuristic RDB to RDF mapping
patterns we have observed in real databases. It uses a multi-level algorithm to
detect class-table, object property-referential integrity and data property-column
correspondences for realistic, complex relational databases. In order to minimise
human intervention, these correspondences use Levenstein distance-based fuzzy
label matching and identify optimal matches using combinatorial optimization.
R2RML mappings are then generated to express these correspondences.</p>
      <p>
        The contributions of this paper are as follows: (1) Milan, a novel method for
detection of correspondences between relational databases and semantic web
ontologies or vocabularies; and (2) an evaluation of the performance Milan against
the Relational-to-Ontology Data Integration (RODI) benchmark [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and
comparison to state of the art mapping generation techniques.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        This section brie y discusses di erent state of the art systems. Pinkel et al [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
have performed a detailed state of the art review of selected automated systems.
While a number of semi-automatic RDB to RDF mapping generation systems
exist, this paper focuses on fully automatic systems and so limits the scope of
this section accordingly. Automatic systems can be divided into one-stage and
two-stage systems. Two-stage systems such as BootOx[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Automap4OBDA[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]
produce a target-agnostic ontology followed by ontology aligmnment to the
target ontology. One stage systems such as IncMap[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] directly map without the
use of intermediate ontology. Other inter-model mapping tools include
-ontop[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ],MIRROR[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], D2RQ[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and COMA++[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        BootOx [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] applies direct mapping 4 and provides support for di erent OWL
pro les. It enriches the bootstrapped ontology using explicit and implicit database
constraints from the RDB that creates axioms about the classes and properties.
This is followed by ontology alignment using LogMap. BootOx is best at
resolving semantic heterogeneity by using OWL features. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        IncMap [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] exploits mapping patterns to enrich their graph-based data
structure representing RDB and RDF data model. It then uses lexical and structural
analysis to obtain correspondences between RDB and ontology. However, IncMap
addressed limited mapping patterns and doesn't focus on semantic heterogeneity.
      </p>
      <p>
        Automap4OBDA (A4MO) [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] is based on RDBToOnto [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].Unlike IncMap,
A4MO uses ontology learning techniques to extends correspondences between
the data and classes of target ontology. While both BootOx and A4MO enrich
their bootstrapped/putative ontology, only A4MO addresses class hierarchies.
However, A4MO is not as versatile as IncMap in inferring class hierarchies.
A4MO is better than other systems in addressing structural heterogeneity.
      </p>
      <sec id="sec-2-1">
        <title>4 www.w3.org/TR/rdb-direct-mapping/</title>
        <p>The following lists the gaps in the current state of the art systems. They all
fail to detect n : m due to junction table and n : 1 class-table relationships due
to column splitting. They also miss out on many properties matching for tables
that have 1 : 1 relationship with other tables. They don't utilize annotation
properties, resolve datatype property-column label collisions. They also
underperform at addressing semantic heterogeneity. They don't use the expressivity of
OWL such as InverseOf, unionOf to enrich the relational-ontology relationship.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Requirements</title>
      <p>As mentioned above, most real-world relational schema and corresponding
ontologies cannot be related using nave direct mappings as there are vast di
erences in the ways which same concepts are modelled. This section references
examples of di erent data modelling techniques discussed in state of the art.
Based on such observations, literature also provides a comprehensive set of
requirements for automatic mapping systems. These are brie y discussed below.</p>
      <p>
        C.Pinkel et al[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] discusses various modelling repertoire using tables such as
author and reviewer. This is shown in Figure 1 above. A popular set of classi
cations for mapping challenges in relational-to-ontology patterns, which was rst
discussed by Batini et al [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and subsequently discussed by Pinkel et al [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] , are
as follows: (1) naming con icts, (2) structural heterogeneity and (3) semantic
heterogeneity. Table 1, which is adopted from [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] summarizes the requirements
of relational-to-ontology mapping approaches for automated systems. C.Pinkel
et al[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] discusses each of these requirements in detail.
ID
      </p>
      <p>Challenge
Type</p>
      <p>Input-Output</p>
      <p>Relational-to-ontology systems requirement</p>
    </sec>
    <sec id="sec-4">
      <title>Automatic Mapping Generation Algorithm</title>
      <p>Milan has been developed to ll the gaps identi ed by the requirements and in
the state of the art mapping generation systems. Milan comprises of three main
processes and 6 supporting processes that are invoked by the main processes as
needed (Fig. 2). The source relational database and target ontology act as inputs
to Milan and it produces a set of R2RML mappings as an output.</p>
      <p>Label Matching : Milan uses a variant of FuzzyWuzzy5, uses Levenshtein
Distance, producing scores in the range [0; 1]. Milan modi ed FuzzyWuzzy to
add camelCase and special character based tokenization. Label matching
addresses the naming heterogeneity patterns involving token re-ordering using
sort token ratio which reorders the tokens based on lengths. Label matching uses</p>
      <sec id="sec-4-1">
        <title>5 https://github.com/seatgeek/fuzzywuzzy</title>
        <p>
          a speci c variant for Column Splitting (Sec. 4.1), where it lters out the
superclass label token from its subclass or sibling label, thereby improving matching
scores. Combinatorial Optimization: Milan uses the Hungarian algorithm [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] to
solve assignment problem across class-table, data property-column and object
property-FK/PK. For two set of labels (xm; yn), it uses the label scores of x y
to obtain min(m; n) 1:1 matches such that the sum of scores is the maximum.
4.1
        </p>
        <sec id="sec-4-1-1">
          <title>Class-Table Relationship</title>
          <p>The Class-Table Relationship process detects 1 : 1, n : 1, and some 1 : n
classtable relationships using Label Matching, Column Splitting and Combinatorial
Optimization. Column Splitting detects categorical values in columns, thereby
splitting the table based on these values. These values are matched to sub-classes
and sibling classes.</p>
          <p>The class-table matching result is input to the Object Property Foreign
Key/Primary Key process to discover links using Referential Integrity, Label
Matching, Junction Table Detection and Combinatorial Optimization.
Referential Integrity discovers the keys linking two pairs of matching class-tables and the
object properties across the two classes. Junction Table Detection enriches
existing relationships with the inclusion of implicit many-to-many relations across
two or more tables.</p>
          <p>The algorithm rst detects 1:1 followed n : 1 class-table relationships.
Algorithm 1 takes class and table rdfs:label as input. Label matching is used over
the two lists producing a three column matching table Oa b;3. Oa b;3 is then
post-processed by ltering out results with low string match scores based on a
threshold, which is determined experimentally ( = 0:7). This is followed by
ltering out lower match scores due to duplicate occurrences of any class or table
label, represented as uniqueCriteria in Algorithm 1.</p>
          <p>The result of this is a one-one match table. Each 1:1 class-table pair is
considered for column splitting by considering unique elements of all non-key columns.
This is done by gathering all non-key columns (Wm;1) along with sub-classes
and sibling classes(Vn;1) for the given table-class pair. Then, for each column,
the unique element values and their columns are stored in a two column
table (Ta;2). In order to maximize computation e ciency and matching accuracy,
columns having a very large number of unique elements are removed from
consideration. This is done using a threshold which is experimentally determined to
be = 1:2 n. A modi ed variant of Label Matching is then run over this list of
unique values with sub-classes and siblings of a target class against all labels of
unique column values, using score threshold . Data diversity is enhanced by the
overlooking label of target class when considering the sub/sibling class labels.</p>
          <p>Using the Hungarian algorithm over the resultant label match table provides
1:1 column value-class matches, which are optimized by maximizing the sum of
scores.
4.2</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>Object Property-Referential Integrity Relationship</title>
          <p>Algorithm 2 uses rdfs:domain, rdfs:range, owl:UnionOf and owl:inverseOf of
object property 0, Alg. 2,3 along with FK/PK relationship R using
primaryforeign key pairs represented by foreign column names R ( Alg. 2,2). Using the
class-table pair from Sec. 4.1, it obtains the corresponding two pairs of classes
and tables. Algorithm 2 matches the obtained list of object properties with keys
of the relational database using label matching. However, a nave approach will
miss out on detecting complex relationships 3 such as junction tables, one-to-one
table relations, and inverse relationships.</p>
          <p>Alg.2 detects junction tables via pattern mentioned patterns 3 and infers a
many-to-many relationship across the table pair. The label used to describe this
relationship is the junction table's name. Alg.2 also detects 1:1 table-table
mappings and inherits all columns of parent table to the child tables. Lastly, while
relationships in relational databases have a clear sense of directionality,
relationship across the class may not always be uni-directional. Sometimes they also have
a pair of properties which are inverse of each other. This is represented using
rdfs:InverseOf predicate. Milan overcomes this impedance mismatch by
dropping directionality (Alg.2,6) and prioritizing direction (Alg.2,10) in the presence
of alternates.</p>
          <p>Table 2 lists the challenges that Alg.2 resolves by reasoning over rdfs:InverseOf.
Consider case(a) of Table 2, both object property present (OP) and FK/PK
relation (Col) are present for relation C1=T1 ! C2=T2 (1). However in relation (2),
the rdfs:InverseOf(A), X is present, but FK/RK relation is not present. Hence
a has to be inferred, which will be matched to X. In case(b)2, single object
property and FK/PK relations are present in opposite direction. The FK/PK
relation is therefore inferred to match the object property. Case (c)2 is a
speci c case of junction table relationships, where more than one relation exists
across two tables. It could occur that both the object property A &amp; B have
inverses. And corresponding FK/PK relations are divided across di erent
directions. Inferences are made to include missing FK/PK relations for each relation.
The nal matching is performed by label matching and then performing
Hungarian algorithm with labelM atch(P roperty; F K=P Ka) labelM atch(inv :
P roperty; F K=P Kb) an additional constraint for every property and its inverse
pair. This is to prevent a property and its inverse, both to occur as a result of
the Hungarian algorithm.
4.3</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>Datatype Property and Column Relationship</title>
          <p>The Datatype of data property is retrieved by querying its rdfs:label, rdfs:range
and if present its owl:UnionOf. Milan performs label matching across all data
properties and nonkey columns for each class-table pair. In addition to this,
Milan addresses label collisions by segregating datatypes based on W3C datatype
mapping6. The rationale behind this segregation is likes match. i.e a column
having varchar datatype cannot possibly match to a datatype property having
xsd:date as its range. Hence o ering greater accuracy in cases of string
ambiguity. For a given pair of class-table, each segregation of a datatype group e.g
V archar xsd : string, Milan uses Hungarian algorithm to obtain a 1:1 match
based on label matching scores. It is important to note that this algorithm is also
capable of detecting 1 : n relationship for a property-column pair.This is done in
three ways. One, when multiple properties related to each other by ow:sameAs.
Second, owl:UnionOf contains multiple classes for a given datatype property,
hence mapping the same property to multiple tables. Lastly, when detecting
1:1 table-table relationships in 4.2, child tables inherit all columns of the main
table, allowing possible matches to data property.
6 www.w3.org/TR/r2rml/</p>
        </sec>
        <sec id="sec-4-1-4">
          <title>R2RML Mapping Generation</title>
          <p>
            Many papers such as [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ] have discussed R2RML mapping patterns. In addition
to those, Milan has additional features to support 1:1 table inheritance and 1:n
table-class relationship based on column splitting. The SQL queries involve a
JOIN condition. These query templates are trivial hence is left to readers.
5
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Evaluation</title>
      <p>
        The purpose of the evaluation is to test the quality of mappings generated by
Milan in realistic deployment scenarios. Several papers such as Tarasowa et al
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and RODI benchmark [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] have discussed the quality evaluation aspects of
mappings. The evaluation in this paper uses the RODI benchmark and
methodology.
      </p>
      <p>The RODI suite 7 includes a wide range of relational-to-ontology scenarios,
each based on mapping challenges similar to Sec. 3. Each scenario provides an
input database and a target ontology. It requests from systems a complete set of
mappings that enable to execute queries over the target ontology (T-Box). Each
scenario contains a list of SPARQL-SQL query pairs, which are categorized under
various mapping challenges such as class-table, attributes, links. These query
pairs are executed to evaluate if the results from the SQL queries match the
SPARQL queries to the ontology constructed using the mappings provided. It
then calculates the averages of precision and recall of all queries in each scenario,
thereby calculating a tness function for mapping quality. In addition to total
scores, RODI also provides aggregated scores for each category of the query
pairs.</p>
      <p>
        This paper evaluates Milan based on 4 scenarios provided by RODI. The
RODI authors have already performed ontology alignment of output ontologies
with the target ontology for systems like MIRROR and -ontop-, which dont
produce mappings. They have also provided mapping translation for systems
like D2QRQ and COMA++, by translating system speci c mapping language
to R2RML. The hypothesis is that Milan's approach performs better than the
current state of the art in terms of the mapping quality metric used by RODI.
Benchmark Datasets This section describes the four scenarios used to
evaluate Milan. These scenarios are based on real open data such as Conference,
Norwegian Petroleum Directorate [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>Each scenario has varied di culty levels. For instance, the relational schemata
of cmt renamed closely follows modeling patterns from their corresponding
ontologies. cmt structured additionally introduces 1 : n class hierarchy mapping
challenge (R3a,4). conference nofk is void of primary key-foreign key relations
making it tough to detect object property mappings (R3b). npd atomic, which is
based on the NPD dataset is the most challenging scenario of all. 1 : n matches</p>
      <sec id="sec-5-1">
        <title>7 https://github.com/chrpin/rodi</title>
        <p>
          (R3a,4), for both classes and properties are present. 1:n matches as a structural
feature can therefore best be tested in the npd atomic tests scenario [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].The
queries are designed to check on the existence of accurate mappings [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
Results &amp; Analysis Table 4 shows RODI scores for each scenario on every
tested system. Technically de ned as per test F-measures, these scores indicate
the fraction of successfully passed query tests. Milan outperforms current state
of the art systems. Therefore the hypothesis is accepted. Fig. 3 below is a
        </p>
        <p>B.OX IncM. Ontop MIRR. COMA D2RQ A4MO Milan
cmt renamed 0.76
cmt structured 0.41
conference nofk 0.33
npd atomic 0.14
breakdown of the agregated score for cmt renamed, where Milan performs better
across queries under class, data property and object property.</p>
        <p>Milan correctly detected 75/134 class-table, 42/213 data properties-column
and 15/92 object property-FK/PK matches in the npd atomic.</p>
        <p>The impact of Algorithm 1 is evident in the npd atomic dataset. It correctly
detects 56% of the total class-table mappings. The outcome of this algorithm
also detected 12 out of 45 of the 1 : n class-table match is present. Although
cmt structured had many patterns re ecting column splitting, Algorithm 1 was
not so successful here. Additionally, there are cases of column splits which
detected categorical values such "1" and "2", which were either foreign or non-key
column. These were true matches to separate classes but label matching failed
to match these cases.</p>
        <p>The relatively high scores achieved in the object property-referential integrity
relationship are credited to junction table detection, 1:1 relationship matching
followed by property inheritance and inverse property inferencing. It also
detected 1:1 table-table relationships, which comprises of 34% of the
primaryforeign key relations. This increased object property mappings, caused due to
inheritance. However Milan couldn't capture object property mappings when
object properties don't have explicit rdfs:range/domain or owl:unionOf.</p>
        <p>Better results with detecting data property are credited to the use of datatype
partitioning and the Hungarian algorithm with Label Matching for the
classtable pair. In addition to this, the algorithm uses annotation properties such
as rdfs:label and rdfs:comment to match to the column. Additionally, the
performance is enhanced by the inheritance of properties due to 1:1 table-table
detection. However, the strategy of datatype partitioning had its compromises.
The tables of conference nofk scenario has columns which have boolean datatype
while the matching data property is a string. Milan is unable to match these
cases, thereby missing on some of the data properties.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>Milan has been demonstrated to automatically generate R2RML mappings
between a relational database and target ontology with an overall f-measure of
between 0.86 and 0.3 under the RODI benchmark conditions. This measure is
an indicator of the accuracy and completeness of the mappings generated.
Milan has out-performed all the other automatic mapping generation systems that
have been tested to date with RODI. The relative performance of Milan
improves in more complex scenarios like npd atomic, where it performed 30.4%
better than the next best system,A4MO. Similarly, the performance pf Milan
was 26.8% better than the next leading system, IncMap, in the cmt structured
scenario. This demonstrates the relative e ectiveness of a direct mapping
generation approach, as implemented by Milan, compared to the more popular putative
ontology-based mapping generation methods.</p>
      <p>Further work is required to broaden our con dence in this work by evaluating
Milan over additional datasets. In addition this evaluation has helped us to
identify the following potential RDB-RDF mapping patterns or improvements
to Milan: Detecting class-table mappings where multiple tables form one class
in the target ontology; extending the column-based table splitting pattern to
account account for foreign key columns ; and inferring implicit links between
tables based on undeclared foreign keys whose use is observed in the database.
Acknowledgments. This research has received funding from the ADAPT
Centre for Digital Content Technology, funded under the SFI Research Centres
Programme (Grant 13/RC/2106) and co-funded by the European Regional
Development Fund.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Batini</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lenzerini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Navathe</surname>
            ,
            <given-names>S.B.</given-names>
          </string-name>
          :
          <article-title>A comparative analysis of methodologies for database schema integration</article-title>
          .
          <source>ACM computing surveys 18(4)</source>
          ,
          <volume>323</volume>
          {
          <fpage>364</fpage>
          (
          <year>1986</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>D2rq - treating non-RDF databases as virtual RDF graphs</article-title>
          .
          <source>In: In Proceedings of the 3rd International Semantic Web Conference ISWC</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Calvanese</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cogrel</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Komla-Ebri</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kontchakov</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lanti</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rezk</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodriguez-Muro</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xiao</surname>
          </string-name>
          , G.:
          <article-title>Ontop: Answering SPARQL queries over relational databases</article-title>
          .
          <source>Semantic Web</source>
          <volume>8</volume>
          (
          <issue>3</issue>
          ),
          <volume>471</volume>
          {487 (Jan
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Do</surname>
            ,
            <given-names>H.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rahm</surname>
          </string-name>
          , E.:
          <article-title>COMA: A System for Flexible Combination of Schema Matching Approaches</article-title>
          .
          <source>In: Proceedings of the 28th International Conference on Very Large Data Bases</source>
          . pp.
          <volume>610</volume>
          {
          <fpage>621</fpage>
          . VLDB '02,
          <string-name>
            <given-names>VLDB</given-names>
            <surname>Endowment</surname>
          </string-name>
          , Hong Kong,
          <string-name>
            <surname>China</surname>
          </string-name>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Jimenez-Ruiz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          :
          <article-title>Logmap: Logic-based and scalable ontology matching</article-title>
          .
          <source>In: International Semantic Web Conference</source>
          . pp.
          <volume>273</volume>
          {
          <fpage>288</fpage>
          . Springer (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Jimnez-Ruiz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharlamov</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zheleznyakov</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pinkel</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skjveland</surname>
            ,
            <given-names>M.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thorstensen</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mora</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>BootOX: Practical Mapping of RDBs to OWL 2</article-title>
          . In: The Semantic Web - ISWC
          <year>2015</year>
          . pp.
          <volume>113</volume>
          {
          <fpage>132</fpage>
          . Lecture Notes in Computer Science, Springer, Cham (Oct
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Medeiros</surname>
            ,
            <given-names>L.F.d.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Priyatna</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corcho</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>MIRROR: Automatic R2rml Mapping Generation from Relational Databases</article-title>
          . In:
          <article-title>Engineering the Web in the Big Data Era</article-title>
          . pp.
          <volume>326</volume>
          {
          <fpage>343</fpage>
          . Lecture Notes in Computer Science, Springer, Cham (Jun
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Munkres</surname>
          </string-name>
          , J.:
          <article-title>Algorithms for the Assignment and Transportation Problems</article-title>
          .
          <source>Journal of the Society for Industrial and Applied Mathematics</source>
          <volume>5</volume>
          (
          <issue>1</issue>
          ),
          <volume>32</volume>
          {38 (Mar
          <year>1957</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Pinkel</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Binnig</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jimenez-Ruiz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharlamov</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwarte</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heupel</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kraska</surname>
          </string-name>
          , T.:
          <article-title>IncMap: A Journey towards Ontology-based Data Integration</article-title>
          .
          <source>Gesellschaft fr Informatik</source>
          ,
          <source>Bonn</source>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Pinkel</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Binnig</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jimnez-Ruiz</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kharlamov</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>May</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sasa</surname>
            <given-names>Bastinos</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Skjveland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.G.</given-names>
            ,
            <surname>Solimando</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Taheriyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Heupel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          :
          <article-title>RODI: Benchmarking relational-to-ontology mapping generation quality</article-title>
          .
          <source>Semantic Web</source>
          <volume>9</volume>
          (
          <issue>1</issue>
          ),
          <volume>25</volume>
          {52 (Jan
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Seleng</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laclav</surname>
            <given-names>k</given-names>
          </string-name>
          , M.,
          <string-name>
            <surname>Balogh</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hluchy</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Rdb2onto: Approach for creating semantic metadata from relational database data</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Sequeda</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Priyatna</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villazn-Terrazas</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Relational Database to RDF Mapping Patterns</article-title>
          .
          <source>In: Proceedings of the 3rd International Conference on Ontology Patterns -</source>
          Volume
          <volume>929</volume>
          . pp.
          <volume>97</volume>
          {
          <fpage>108</fpage>
          . CEUR-WS.org, Germany (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Euzenat</surname>
          </string-name>
          , J.: Ontology Matching:
          <article-title>State of the Art and Future Challenges</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ),
          <volume>158</volume>
          {
          <fpage>176</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Sicilia, ., Nemirovski, G.:
          <article-title>AutoMap4obda: Automated Generation of R2rml Mappings for OBDA</article-title>
          .
          <source>In: Knowledge Engineering and Knowledge Management</source>
          . pp.
          <volume>577</volume>
          {
          <fpage>592</fpage>
          . Lecture Notes in Computer Science, Springer, Cham (Nov
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Tarasowa</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lange</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Measuring the quality of relational-to-rdf mappings</article-title>
          .
          <source>In: International Conference on Knowledge Engineering and the Semantic Web</source>
          . pp.
          <volume>210</volume>
          {
          <fpage>224</fpage>
          . Springer (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>