<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Mastro Protégé plug-in for OBDA</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Giacomo RONCONI</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco RUZZI</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Valerio SANTARELLI</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Domenico Fabio SAVO</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>OBDA Systems</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Sapienza University of Rome</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>Ontology-based data access (OBDA) is a recent approach for accessing data where an ontology is connected to autonomous, and generally pre-existing, data repositories through mappings, so as to provide a high-level, conceptual view over such data. Mastro is a Java tool for OBDA developed at Sapienza University of Rome and at the startup OBDA Systems, which is able to manage an OBDA specification where the ontology is specified in DL-Lite. In this work, we present the Mastro plug-in for the popular ontology editor Protégé. By means of this plug-in, the users can specify and manage full OBDA specifications and execute SPARQL queries posed over the ontology level to access data stored in the underlying data sources.</p>
      </abstract>
      <kwd-group>
        <kwd />
        <kwd>Ontology-based Data Access</kwd>
        <kwd>DL-Lite</kwd>
        <kwd>Mastro</kwd>
        <kwd>Protégé</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Ontology-based Data Access (OBDA) is a recent paradigm for accessing data sources
through the mediation of a concentual domain view, given in terms of an ontology [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
OBDA features a three-level architecture composed of the ontology, which provides a
formal description of the domain of interest, the data sources used in organizations for
storing their information, and the mapping used to specify the semantic relationships
between the ontology layer and the data sources. Hence, OBDA can be seen as a form of
information integration, where the ontology, which is expressed in a logic-based language,
replaces the global schema [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Currently the two most popular systems for OBDA are
Mastro [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and Ontop [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Mastro is developed by OBDA Systems1 and Sapienza University of Rome, and has
been used in recent years in numerous projects with important business partners from the
private and public sectors, for example, see [
        <xref ref-type="bibr" rid="ref1 ref13">1,13</xref>
        ]. Mastro comes with its own
commercial application, the Mastro Studio system [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and now, in this paper, we introduce the
Mastro plug-in for the popular Protégé [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] editor for OWL ontologies.
      </p>
      <p>
        Ontologies in Mastro are specified through languages belonging to the DL-Lite [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
family of lightweight Description Logics, and support mappings in the standard R2RML
format [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], and also in their native mapping language. Data sources are seen as relational
databases and can be accessed through SPARQL queries over the ontology, exploiting
the query answering services provided by Mastro.
      </p>
      <p>
        We demonstrate the Mastro Protégé plug-in through the Ontology of
Multidimensional Research Assessment, or Sapientia [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], an OBDA specification developed
and currently used within a project funded by Sapienza University of Rome which
models aspects of assessing research activities and their impact on human knowledge and the
economic system. For example, it deals with inter-relationships between research
activities, between research activities and people’s personal knowledge, and between research
activities and other missions of individuals and institutions. The ontology is composed
of fourteen modules, which formalize a wide array of activities, ranging from teaching,
publishing, and research, to funding and preservation.
      </p>
      <p>Users of the plug-in can inspect, query and edit the Sapientia OBDA specification
through the plug-in. The ontology is managed through the features offered by Protégé,
while the plug-in’s Mappings and SQL Views tabs illustrate how the ontology is linked
to the data sources through the mapping assertions, and how to edit or create new
mappings and views; the SPARQL query tab and its execution log allow various querying
functionalities; and lastly, the plugin allows to approximate OWL 2 into DL-Lite.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Overview of the Plug-in</title>
      <p>
        Mastro’s Protégé plug-in provides a full-fledged environment to define an OBDA
specification for Mastro, and to access its query answering service. Along with these core
features, the plug-in is equipped with other functionalities among which are: (i) the
approximation module that is based on the semantic approach presented in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for
approximating OWL 2 ontologies in DL-Lite, the language supported by Mastro. This allows
to load OWL 2 ontologies in Protégé, while using its DL-Lite representation for query
answering with Mastro; (ii) in order to comply with the current recognized standards of
the Semantic Web, Mastro is able to import R2RML mapping into its proprietary format
and viceversa, and to export the results of the queries over the ontology into RDF.
      </p>
      <p>The components of Mastro’s plug-in are organized in five sub-tabs under the main
Mastro tab, one item in the “Reasoner” menu, and one “Mastro” menu item which
provides the features described above. Below we briefly describe each component.
Configuration. The Configuration sub-tab is used to create, open, and save a
mapping specification, and to define the jdbc parameter connections to the source relational
database, and to a database which Mastro uses handle query executions and to store all
their information, e.g., results, execution time, ontology and mapping rewritings.
SQL Views. This sub-tab is used to create and inspect the views in the specification, and
to specify assertions over them, i.e., inclusion assertions, disjointness assertions, and key
dependency assertions. Each view is defined by its name and the SQL query code.
Mappings. The Mappings sub-tab, shown in Figure 1, is used to create and inspect
the ontology predicate mappings in the specification, and to define the IRI templates
that Mastro uses to build the answers from the data in the data source. Each ontology
predicate mapping is identified by an ID, and contains the ontology predicate that is
being mapped, a conjunctive query over the SQL views, and, optionally, a description.
SPARQL Query. The SPARQL Query sub-tab, shown in Figure 2, allows to define and
execute a SPARQL query through Mastro over the ontology. Once the query has been
executed, the user can consult its results and its ontology and mapping rewritings, and
export the results in RDF or CSV formats. The user can also save the query into a catalog,
which is managed through a file that can be loaded, exported, and edited. Additionally,
Mastro saves all query executions into a log, allowing the user to visualize those of the
selected query.</p>
      <p>SPARQL Query Execution Log. This sub-tab provides information regarding all
executions of SPARQL queries over an OBDA specification. The data shown in the log can
be exported, and the queries re-executed.</p>
      <p>The items in the Mastro menu show the details of the approximation performed by
the semantic approximation module (Approximation details), allow to access a
configuration panel (Mastro properties), to export Mastro’s mappings in R2RML (Export
mappings to R2RML), and to consult a tutorial of the Mastro plug-in (Help).
Acknowledgments. Work supported by MIUR under the SIR project “MODEUS”
grant n. RBSI14TQHQ.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N.</given-names>
            <surname>Antonioli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Castanò</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Civili</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Coletta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Grossi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Savo</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Virardi</surname>
          </string-name>
          .
          <article-title>Ontology-based data access: The experience at the italian department of treasury</article-title>
          .
          <source>In Proceedings of CAISE</source>
          <year>2013</year>
          , València, Spain, volume
          <volume>1017</volume>
          <source>of CEUR Workshop Proceedings</source>
          , pages
          <fpage>9</fpage>
          -
          <lpage>16</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Cogrel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Komla-Ebri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kontchakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lanti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rezk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rodriguez-Muro</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Xiao.</surname>
          </string-name>
          <article-title>Ontop: Answering SPARQL queries over relational databases</article-title>
          .
          <source>Semantic Web</source>
          ,
          <volume>8</volume>
          (
          <issue>3</issue>
          ):
          <fpage>471</fpage>
          -
          <lpage>487</lpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rodriguez-Muro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ruzzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Savo</surname>
          </string-name>
          .
          <article-title>The Mastro system for ontology-based data access</article-title>
          .
          <source>Semantic Web J.</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          ):
          <fpage>43</fpage>
          -
          <lpage>53</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          .
          <article-title>Tractable reasoning and efficient query answering in description logics: The DL-Lite family</article-title>
          .
          <source>JAR</source>
          ,
          <volume>39</volume>
          (
          <issue>3</issue>
          ):
          <fpage>385</fpage>
          -
          <lpage>429</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>C.</given-names>
            <surname>Civili</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Console</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lepore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mancini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ruzzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Santarelli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Savo</surname>
          </string-name>
          . MASTRO STUDIO:
          <article-title>Managing ontology-based data access applications</article-title>
          .
          <source>PVLDB</source>
          ,
          <volume>6</volume>
          :
          <fpage>1314</fpage>
          -
          <lpage>1317</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Console</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mora</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Santarelli</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Savo</surname>
          </string-name>
          .
          <article-title>Effective computation of maximal sound approximations of description logic ontologies</article-title>
          .
          <source>In Proc. of ISWC</source>
          , volume
          <volume>8797</volume>
          <source>of LNCS</source>
          , pages
          <fpage>164</fpage>
          -
          <lpage>179</lpage>
          . Springer,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>C.</given-names>
            <surname>Daraio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Leporelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Naggar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bonaccorsi</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Bartolucci</surname>
          </string-name>
          .
          <article-title>The advantages of an ontology-based data management approach: openness, interoperability and data quality</article-title>
          .
          <source>Scientometrics</source>
          ,
          <volume>108</volume>
          (
          <issue>1</issue>
          ):
          <fpage>441</fpage>
          -
          <lpage>455</lpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Das</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sundara</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          .
          <article-title>R2RML: RDB to RDF mapping language</article-title>
          .
          <source>W3C Recommendation</source>
          , W3C, Sept.
          <year>2012</year>
          . Available at http://www.w3.org/TR/r2rml/.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>F.</given-names>
            <surname>Di Pinto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Mancini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ruzzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Savo</surname>
          </string-name>
          .
          <article-title>Optimizing query rewriting in ontology-based data access</article-title>
          .
          <source>In Proc. of EDBT</source>
          , pages
          <fpage>561</fpage>
          -
          <lpage>572</lpage>
          . ACM Press,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J. H.</given-names>
            <surname>Gennari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Musen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. W.</given-names>
            <surname>Fergerson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. E.</given-names>
            <surname>Grosso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Crubézy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Eriksson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. F.</given-names>
            <surname>Noy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S. W.</given-names>
            <surname>Tu</surname>
          </string-name>
          .
          <article-title>The evolution of Protégé: an environment for knowledge-based systems development</article-title>
          .
          <source>International Journal of Human-computer studies</source>
          ,
          <volume>58</volume>
          (
          <issue>1</issue>
          ):
          <fpage>89</fpage>
          -
          <lpage>123</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          .
          <article-title>Data integration: A theoretical perspective</article-title>
          .
          <source>In Proc. of PODS</source>
          , pages
          <fpage>233</fpage>
          -
          <lpage>246</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Calvanese</surname>
          </string-name>
          , G. De Giacomo,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Rosati</surname>
          </string-name>
          .
          <article-title>Linking data to ontologies</article-title>
          .
          <source>J. on Data Semantics</source>
          , X:
          <fpage>133</fpage>
          -
          <lpage>173</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Savo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lembo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lenzerini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Poggi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rodríguez-Muro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Romagnoli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ruzzi</surname>
          </string-name>
          , and
          <string-name>
            <surname>G. Stella.</surname>
          </string-name>
          <article-title>MASTRO at work: Experiences on ontology-based data access</article-title>
          .
          <source>In Proc. of DL</source>
          , volume
          <volume>573</volume>
          <source>of CEUR, ceur-ws.org</source>
          , pages
          <fpage>20</fpage>
          -
          <lpage>31</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>