<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>S</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>modelling for Duchenne and Becker Muscular Dystrophy rare diseases</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pietro Spitali</string-name>
          <email>P.Spitali@lumc.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Núria Queralt-Rosinach</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pablo Perdomo-Quinteiro</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sergiu Siminiuc</string-name>
          <email>sergiu@duchennedatafoundation.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paraskevi Sakellariou</string-name>
          <email>sakellariou.elvina@duchennedatafoundation.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Roos</string-name>
          <email>M.Roos@lumc.nl</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Duchenne Data Foundation</institution>
          ,
          <addr-line>Veenendaal</addr-line>
          ,
          <country country="NL">the Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>FAIR data</institution>
          ,
          <addr-line>semantic modelling, BYOD, Duchenne Muscular Dystrophy</addr-line>
          ,
          <country>Becker Muscular Dystrophy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Leiden University Medical Center</institution>
          ,
          <addr-line>Leiden</addr-line>
          ,
          <country country="NL">the Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>1</volume>
      <issue>3</issue>
      <abstract>
        <p>The BIND project is a EU-funded project that attempts to improve the characterisation of brain involvement in Duchenne and Becker Muscular Dystrophies (DMD and BMD). Here, we present our ongoing work on making multimodal data FAIR, the semantic models, and discuss challenges and opportunities based on our FAIRification experience. SWAT4HCLS 2023: The 14th International Conference on Semantic Web Applications and Tools for Health Care and Life</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Motivation</title>
      <p>The BIND project is a EU-funded project that attempts to improve the characterisation of brain
involvement in Duchenne and Becker Muscular Dystrophies (DMD and BMD). To achieve
this, 19 organizations across Europe and Japan are collaborating and obtaining phenotypic and
molecular data such as transcriptomic, proteomic, clinical, behavioral, and MRI brain images. It
is crucial to make this information Findable, Accessible, Interoperable and Reusable (FAIR), as
making data FAIR will allow machines to exchange, integrate and analyze these data within
the consortium and with other external FAIR data sources. Additionally, making data FAIR
improves transparency, reproducibility, and maximizes the use of scientific output. Ultimately,
FAIR data will advance our understanding of DMD and BMD and drive innovation on new
therapeutic strategies. In order to achieve FAIRification, i.e., the process of making data FAIR,
data must be properly described and annotated using standard metadata models and ontologies
used by the DMD and BMD community for research, and must be stored in a persistent and
Sciences
∗Corresponding author.
accessible repository. Additionally, data must be available in machine-readable formats and
should follow best practices for data management and sharing.</p>
      <p>
        The FAIRification process has been developed together with EBRAINS, a digital research
infrastructure created by the EU-funded Human Brain Project with the objective of
understanding human brain function and disease. By making use of EBRAINS standards, the project can
benefit from all the information gathered in EBRAINS, and at the same time, other researchers
can make use of the data provided by BIND. The FAIRification of the data has been done using
Semantic Web technologies (RDF, OWL and SHEX) and reusing the semantic models provided
by the European Joint Programme on Rare Diseases (EJP RD). Our approach is to use FAIR
implementations selected by the rare disease community. To this end, we worked in
collaboration with the EJP RD, as a key driver project on FAIRification of rare disease resources for
research and a large representation of the rare disease community. Our method is based on the
FAIRification workflow [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], where the first step is to assess the FAIR status of the data.
      </p>
      <p>
        Several Bring Your Own Data (BYOD) workshops [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] have been organized throughout 2022,
with the two-fold objective of assessing the FAIR status of the data and explaining the meaning
of the data to FAIR experts and bringing the FAIRification methodology closer to the data
owners. In these workshops, we as FAIR experts needed to fully understand the data to correctly
perform the FAIRification process. One of the challenges is that data owners are often unfamiliar
with the FAIRification process, and viceversa, FAIR experts are often not familiar with domain
specific knowledge. Another challenge is to reach a consensus of what the data mean and how
best to represent it. The modelling has been developed following the semantic core data model
of the set of common data elements for rare disease patient registries provided by the EJP RD [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
Here, we present our ongoing work on making multimodal data FAIR, the semantic models we
are developing to represent data such as patient phenotypic behavioral information compiled in
the form of questionnaires or single cell RNA-seq data obtained in preclinical mouse models;
and discuss challenges and opportunities based on our FAIRification experience so far.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Acknowledgments</title>
      <p>We thank all BIND team, the data owners and domain experts. We also thank the EJP RD for
providing the semantic models used in this project. This project was supported by a grant
from the European Union’s Horizon 2020 research and innovation programme under grant
agreement No 847826 (Brain Involvement iN Dystrophinopathies (BIND)).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Jacobsen</surname>
          </string-name>
          , et al.,
          <article-title>A Generic Workflow for the Data FAIRification Process</article-title>
          ,
          <source>Data Intelligence</source>
          <volume>2</volume>
          (
          <year>2020</year>
          )
          <fpage>56</fpage>
          -
          <lpage>65</lpage>
          . URL: https://direct.mit.edu/dint/article/2/1-2/56/9988/ A-Generic-
          <article-title>Workflow-for-the-Data-FAIRification</article-title>
          .
          <source>doi:1 0 . 1 1</source>
          <volume>6</volume>
          <fpage>2</fpage>
          <string-name>
            <surname>/ D I N T _</surname>
          </string-name>
          <article-title>A _ 0 0 0 2 8</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hooft</surname>
          </string-name>
          , et al.,
          <source>ELIXIR-EXCELERATE D5</source>
          .
          <article-title>3: Bring Your Own Data (BYOD) (</article-title>
          <year>2019</year>
          ). URL: https://zenodo.org/record/3207809.
          <source>doi:1 0 . 5 2</source>
          <volume>8</volume>
          <fpage>1</fpage>
          <string-name>
            <surname>/ Z E N O D O</surname>
          </string-name>
          .
          <volume>3 2 0 7 8 0 9.</volume>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kaliyaperumal</surname>
          </string-name>
          , et al.,
          <article-title>Semantic modelling of common data elements for rare disease registries, and a prototype workflow for their deployment over registry data</article-title>
          ,
          <source>Journal of</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>