<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>SEMANTiCS</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>NFDI4DSO: Towards a BFO Compliant Ontology for Data Science</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Harald Sack</string-name>
          <email>harald.sack@fiz-kalrsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Genet Asefa Gesese</string-name>
          <email>genet-asefa.gesese@fiz-kalrsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jörg Waitelonis</string-name>
          <email>Joerg.Waitelonis@fiz-Karlsruhe.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zongxiong Chen</string-name>
          <email>zongxiong.chen@fokus.fraunhofer.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sonja Schimmler</string-name>
          <email>sonja.schimmler@fokus.fraunhofer.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <string-name>Data Science, Artificial Intelligence, Ontology, Knowledge Graph, NFDI4DS</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FIZ Karlsruhe, Leibniz Institute for Information Infrastructure</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Fraunhofer FOKUS</institution>
          ,
          <addr-line>Berlin</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Karlsruhe Institute of Technology</institution>
          ,
          <addr-line>KIT</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>20</volume>
      <fpage>17</fpage>
      <lpage>19</lpage>
      <abstract>
        <p>The NFDI4DataScience (NFDI4DS) project aims to enhance the accessibility and interoperability of research data within Data Science (DS) and Artificial Intelligence (AI) by connecting digital artifacts and ensuring they adhere to FAIR (Findable, Accessible, Interoperable, and Reusable) principles. To this end, this poster introduces the NFDI4DS Ontology, which describes resources in DS and AI and models the structure of the NFDI4DS consortium. Built upon the NFDICore ontology and mapped to the Basic Formal Ontology (BFO), this ontology serves as the foundation for the NFDI4DS knowledge graph currently under development.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>Netherlands
(H. Sack)</p>
      <p>
        https://tinyurl.com/3cx37b9x (G. A. Gesese); https://shorturl.at/UwDND (J. Waitelonis);
resources, including individuals, organizations, projects, data portals, and more. NFDICore
provides mappings to a broad range of standards across diferent domains, such as the Basic Formal
Ontology (BFO) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and Schema.org [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to advance knowledge representation, data exchange,
and collaboration across diverse domains. To address domain-specific research questions for
each consortium, NFDICore follows a modular architecture. Examples for modular extensions
include the NFDI4Culture ontology module CTO3[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and the NFDI-MatWerk ontology module
MWO4, which are specifically designed for the cultural heritage and materials science domains,
respectively. In this paper, we present an ontology named NFDI4DSO for the data science
domain as a domain-specific modular extension of NFDICore.
      </p>
      <p>NFDI4DataScience (NFDI4DS)5 is one of the NFDI consortia and its project aims to enhance
the accessibility and interoperability of research data in the domain of Data Science (DS) and
Artificial Intelligence (AI). Data Science (DS) is a multidisciplinary field combining diferent
aspects of mathematics, statistics, computer science, and domain-specific knowledge to extract
meaningful insights from diverse data sources. DS and Artificial Intelligence (AI) involve various
artifacts, e.g., datasets, models, ontologies, code repositories, execution platforms, repositories,
etc. The project achieves this by linking digital artifacts and ensuring their FAIR (Findable,
Accessible, Interoperable, and Reusable) accessibility, thereby fostering collaboration across
various DS and AI platforms. To this end, the NFDI4DS Ontology (NFDI4DSO) is built.</p>
    </sec>
    <sec id="sec-3">
      <title>2. The NFDI4DataScience Ontology (NFDI4DSO)</title>
      <p>
        As mentioned earlier, NFDI4DSO is created in a modular fashion, building upon NFDICore.
Similar to NFDICore, the NFDI4DSO ontology is developed using a bottom-up, iterative,
usercentered approach. NFDICore comprises 51 classes, 55 object properties, 8 data properties,
18 annotation properties, and 5 SWRL rules [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] (for details refer to NFDICore documentation6).
In NFDI4DSO, in addition to what is provided in NFDICore, 42 classes, 38 object properties,
9 data properties, and 8 SWRL rules are added. The NFDI4DSO ontology not only describes
various data science artifacts but also provides information about the resources of the NFDI4DS
Consortium, such as personas, consortium members, spokespersons, and task area leads. AS
in NFDICore, the classes introduced in NFDI4DSO are also mapped to the top-level ontology
BFO and also other ontologies such as schema.org, the FaBiO ontology [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], and the Conference
Ontology7.
      </p>
      <p>NFDI4DSO contains various kinds of classes such as processes, roles, and independent
continuants. For instance, Figure 1 depicts how NFDI4DSO represents the relationship between the
independent continuant nfdi4dso:SonjaSchimmler and her spokesperson role
nfdi4dso:SpokespersonRole by mapping it to BFO. By using roles and processes, NFDI4DSO enables a detailed
representation of the relationship between diferent entities enhancing the ontology’s level
of expressivity. On the other hand, to support easier integration and use of less complex
relations, shortcuts are also introduced to simplify the ontology by implementing easy-to-use</p>
      <sec id="sec-3-1">
        <title>3https://gitlab.rlp.net/adwmainz/nfdi4culture/knowledge-graph/culture-ontology</title>
        <p>4https://git.rwth-aachen.de/nfdi-matwerk/ta-oms/mwo
5https://www.nfdi4datascience.de/
6https://ise-fizkarlsruhe.github.io/nfdicore/
7http://www.scholarlydata.org/ontology/doc/#toc
direct shortcut properties, which can be expanded to fully-fledged BFO-compliant complex path
expressions. For instance, in Figure 1, the shortcut relation nfdi4dso:spokesperson is provided
and its corresponding SWRL8 rule is given below.</p>
        <p>Person(?p) ∧ Consortium(?c) ∧ SpokespersonRole(?sr) ∧ Leading(?l) ∧ participates in(?p, ?l) ∧
participates in(?c, ?l) ∧ has role(?p, ?sr) ∧ realised in(?sr, ?l) → spokesperson(?c, ?p)
Ontology Implementation The Protégé ontology editor 9 for the OWL-based
formalization of terminological knowledge has been used to develop and implement NFDI4DSO.
Widoco10 has been used to create an enriched and customized documentation of the
ontology automatically. The stable version of the ontology NFDI4DSO v1.0.0 is available at
https://github.com/ISE-FIZKarlsruhe/NFDI4DS-Ontology/tree/main and the latest development
version is at https://github.com/ISE-FIZKarlsruhe/NFDI4DS-Ontology/tree/develop-1.0.1.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>3. NFDI4DSO in Use</title>
      <p>The NFDI4DSO is designed to form the foundation of the NFDI4DS Knowledge Graph
(NFDI4DSKG), which is currently under development. The NFDI4DS-KG consists of two main components:
the Research Information Graph (RIG) and the Research Data Graph (RDG). RIG includes
metadata about the NFDI4DS consortium’s resources, persons, and organizations, while the
RDG encompasses content-related index data from the consortium’s heterogeneous data sources.</p>
      <sec id="sec-4-1">
        <title>8https://ise-fizkarlsruhe.github.io/NFDI4DS-Ontology/#d4e7620 9https://protege.stanford.edu/ 10https://github.com/dgarijo/Widoco</title>
        <p>RIG serves as the backend for the NFDI4DS web portal, facilitating interactive access and
management of this data. Both RIG and RDG will be accessible and searchable via the NFDI4DS
Registry platform. Additionally, the NFDI4DS consortium plans to collaborate with other NFDI
consortia to further integrate domain-specific knowledge into the RDG seamlessly. Currently,
the first version of the NFDI4DS-KG 11 with RIG is publicly available. For example, to view the
list of co-spokespersons of the NFDI4DS Consortium, you can either navigate through the data
using SHMARQL12, as depicted in Figure 2 or query it using SPARQL, as shown in Figure 3.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Conclusion and Future Work</title>
      <p>This paper presents the NFDI4DS Ontology and its use for the NFDI4DS-KG that is currently
under-development. The ontology facilitates the representation and interoperability of data
science artifacts within and outside of NFDI4DS. NFDI4DSO is built on top of the NFDICore
ontology and mapped to BFO and other ontologies. In the future, there is a plan to perform
extensive ontology evaluation using competency questions based on the persona definitions
from the NFDI4DS consortium.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This publication was written by the NFDI consortium NFDI4DataScience in the context of
the work of the association German National Research Data Infrastructure (NFDI) e.V.. NFDI
is financed by the Federal Republic of Germany and the 16 federal states and funded by the
Federal Ministry of Education and Research (BMBF) – funding code M532701 / the Deutsche
Forschungsgemeinschaft (DFG, German Research Foundation) - project number
NFDI4DataScience (460234259).
11https://nfdi.fiz-karlsruhe.de/4ds/sparql, https://nfdi.fiz-karlsruhe.de/4ds/shmarql
12https://shorturl.at/eNb5e</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>H.</given-names>
            <surname>Sack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Schrade</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Bruns</surname>
          </string-name>
          , E. Posthumus,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tietz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Norouzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Waitelonis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Fliegl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Söhn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tolksdorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Steller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Azócar</given-names>
            <surname>Guzmán</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Fathalla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Zainul</given-names>
            <surname>Ihsan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Hofmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sandfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fritzen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Laadhar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schimmler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mutschke</surname>
          </string-name>
          ,
          <article-title>Knowledge Graph Based RDM Solutions: NFDI4Culture-NFDI-MatWerk-NFDI4DataScience</article-title>
          ,
          <source>in: 1st Conf. on Research Data Infrastructure</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J. N.</given-names>
            <surname>Otte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Beverley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ruttenberg</surname>
          </string-name>
          , BFO: Basic formal ontology, Applied ontology (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R. V.</given-names>
            <surname>Guha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Brickley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Macbeth</surname>
          </string-name>
          , Schema. org:
          <article-title>evolution of structured data on the web</article-title>
          ,
          <source>Communications of the ACM</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>T.</given-names>
            <surname>Tietz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Bruns</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Söhn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tolksdorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Posthumus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Steller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Fliegl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Norouzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Waitelonis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Schrade</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Sack</surname>
          </string-name>
          , From Floppy Disks to 5-
          <string-name>
            <surname>Star</surname>
            <given-names>LOD</given-names>
          </string-name>
          :
          <article-title>FAIR Research Infrastructure for NFDI4Culture</article-title>
          , in: DaMaLOS, co-located
          <source>with ESWC</source>
          <year>2023</year>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Patel-Schneider</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Boley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tabet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Grosof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dean</surname>
          </string-name>
          , et al.,
          <article-title>Swrl: A semantic web rule language combining owl and ruleml, W3C Member submission (</article-title>
          <year>2004</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Peroni</surname>
          </string-name>
          , D. Shotton,
          <article-title>FaBiO and CiTO: Ontologies for describing bibliographic resources and citations</article-title>
          ,
          <source>Journal of Web Semantics</source>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>