<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <article-id pub-id-type="doi">10.1038/sdata.2016.18</article-id>
      <title-group>
        <article-title>Overview of a suite of middle-ware services for implementing FAIR data principles</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mark Thompson</string-name>
          <email>m.thompson@lumc.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Luiz Bonino</string-name>
          <email>luiz.bonino@dtls.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark D Wilkinson</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rajaram Kaliyaperumal</string-name>
          <email>r.kaliyaperumal@lumc.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kees Burger</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shamanou van Leeuwen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Annika Jacobsen</string-name>
          <email>a.jacobsen@lumc.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claudio Carta</string-name>
          <email>claudio.carta@iss.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Erik Schultes</string-name>
          <email>erik.schultes@dtls.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David van Enckevort</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Richard Finkers</string-name>
          <email>richard.finkers@wur.nl</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mascha Jansen</string-name>
          <email>mascha.jansen@dtls.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Barend Mons</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marco Roos</string-name>
          <email>m.roos@lumc.nl</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dutch Techcentre for Life Sciences</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Istituto Superiore di Sanita</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Leiden University Medical Centre</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Universidad Politcnica de Madrid</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University Medical Center Groningen</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Wageningen Plant Research</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>Principles of Findable, Accessible, Interoperable, and Reusable data for humans
and computers (FAIR)1are widely endorsed by organizations such as the
European Open Science Cloud, the life science data infrastructure ELIXIR, the NIH
via its commons program, the biobanking infrastructure consortium
BBMRIERIC, the G20 and the G7. Implementing a data ecosystem based on FAIR
principles requires guidelines, tools, and training, and FAIR data stewards to
help apply them. The principles as such do not recommend any particular
implementation: user communities will have to decide the most appropriate
implementation for their domain. Here, we demonstrate the use of a suite of Semantic
Web-based middle-ware services that help communities implement FAIR data
principles2. Aiming to facilitate adoption, the services are made to complement
existing data infrastructures, including local and centralised data resources, and
thus establish a robust, federated ecosystem of FAIR resources. The services
are also particularly suited for training data stewards. We demonstrate the
application of the services by rare disease and plant breeding communities where
the combination of Ontologies, Linked Data, and light-weight FAIR services are
being explored as the means to implement FAIR data principles.</p>
      <p>Overview of a suite of FAIR services</p>
    </sec>
    <sec id="sec-2">
      <title>FAIR services</title>
      <p>We present the following middle-ware services and tools to implement FAIR
principles:
{ F: a FAIR Data Search Engine based on harvesting metadata from FAIR
Data Points using widely adopted metadata structures and standards (see
\A" below). The FDP web interface also exposes (bio)schema.org-compatible
metadata for use by third-party search engines.
{ A: the FAIR Data Point RESTful API that uses the Data Catalogue
Vocabulary (DCAT3) and Datacites Registry of Research Data Repositories
(RE3Data4) to provide high level metadata descriptors about data deposits,
and to provide instructions to access various distributions of data sets (such
as both an original CSV le and its fully interoperable RDF representation).
{ I: a FAIRi er tool that is based on OpenRe ne and its RDF plug-in and used
to convert tabular data into ontology-grounded RDF; this tool is actively
used for events such as the bring your own data workshops (BYODs) , and
other FAIR data training courses.
{ R: for reusability we provide the FAIR Metadata Editor to support richer
(meta)data to optimise future reuse of FAIR data sources, and early
prototypes that apply machine readable metadata to govern access, such as
license, consent, and privacy preservation.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>In the context of our contribution to cross-national infrastructure for data
stewardship in communities such as the rare disease community and the plant
breeding community, we consider that tools described above, and their implementation
based on Semantic Web technologies to help the adoption of the FAIR approach.
The services we present are light-weight and still allow communities su cient
freedom to make design decisions together with FAIR data stewards. Training
data stewards is therefore an important objective for further adoption and we
found the service suite valuable for training events and BYODs. We also work
with software service providers in user communities. Turning for instance
patient registry software and biobank cataloguing software such as MOLGENIS
into FAIR data generating tools will substantially lower the burden of FAIRi
cation.</p>
      <p>Acknowledgments. We acknowledge the participants of the BYODs, FAIR
workshops and hackathons for their kind contributions and feedback, and
FAIRdICT, ODEX4All, ELIXIR and ELIXIR-EXCELERATE, BBMRI-ERIC and
BBMRI-NL, RD-Connect, Istituto Superiore di Sanita in Italy, the Dutch
Techcentre for Life Science (representative of ELIXIR-NL), Ministerio de Economa
y Competitividad (Spain) grant number TIN2014-55993-RM, and the NBDC/
DBCLS BioHackathon series.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>