<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Margherita Martorana[</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Connecting the Dots: Transparent FAIRi cation of Restricted Data</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Vrije Universiteit Amsterdam</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>0000</year>
      </pub-date>
      <volume>0001</volume>
      <fpage>41</fpage>
      <lpage>48</lpage>
      <abstract>
        <p>In the age of information technology and Linked Data, the importance of creating transparent infrastructures for management and exchange of data have emerged to play a focal role for the development of new open research. The FAIR guiding principles have been introduced to advise in the improvement of Findable, Accessible, Interoperable and Reusable technical resources, but the literature is still lacking of practical guidelines for the management and digitization of sensitive and restricted data. The purpose of this research is to introduce new insights aimed at the exploration and discovery of restricted data-sets. The methodology sets out by creating a model for FAIR metadata architecture, which will later be used for automatic ingestion of external restricted data-set les, as well metadata enrichment. Quantitatively and qualitatively evaluations will be performed to assess the work ow, which will later be implemented in the data-set's search engine. Initial results of the metadata ingestion and enrichment are presented as the RML mapper rules and the OPL ontology, respectively.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Data</kwd>
        <kwd>FAIR Principles</kwd>
        <kwd>RDF Mapping Language</kwd>
        <kwd>Restricted Data</kwd>
        <kwd>Metadata Enrichment</kwd>
        <kwd>Semantic Search</kwd>
        <kwd>Ontology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In the era of digitalisation and Open Science [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], the management of research
and sensitive data is still considered an issue to be solved [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Although extensive
e ort has been made by the scienti c community to share and re-use research
data, there is still uncertainty on the mechanisms underlying the management
of sensitive information. Despite the rapid increase of initiatives and regulations
(e.g. GDPR) focusing on this issue, data providers are still lacking technical
solutions to allow research groups and institutions to share and re-use their large
data banks. One of the most common challenges that is faced by researchers, is
the absence of digitized versions of the data. The growing body of information
collected by government organisations such as CBS (Statistics Netherlands)1, is
an essential resource for scientists, but it is generally di cult not only to access
but also to nd in the rst place. In order to support organisations in developing
solutions to enable data management and exchange, the FAIR Guiding Principles
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] have been introduced to facilitate reusability and ndability of data. The
aim is to generate data that is FAIR: Findable, Accessible, Interoperable and
Reusable. Central to the entire concept, is the need to integrate both human and
machine-readable formats of research data, in order to facilitate the retrieval,
analysis and discovery of knowledge.
      </p>
      <p>
        Many archives in the biomedical domain have already made their data open
and ndable, such as GenBank [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and menoci [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], just to mention a few. The
latter, together with EUDAT [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], are examples of research data
infrastructures based on metadata information. Other projects have proposed certi
cation processes [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and data sanitization techniques [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] as possible solutions to
reproducibility and di erential privacy challenges, in the context of con dential
research data.
      </p>
      <p>
        Due to the sensitive nature of a large number of available data-sets, a growing
number of initiatives are in facts developing research tools based on the top-level
information, the metadata. Nevertheless, formal guidelines for the management
and digitization of restricted data and metadata creation is still lacking.
Moreover, a broader perspective regarding the use of detailed metadata les as a
in uential factor in di erential privacy and privacy budgets supported queries
[
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] still need to be explored.
1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Restricted Data and Metadata</title>
      <p>
        The term \metadata" is generally understood to mean \data about data", and
it usually describes resources' embedded information such as authors, dates,
versions and technical details [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], but it is regularly seen that important pieces
of the puzzle are still missing. In the context of sensitive data, the metadata must
be complying with data protection rules, and often important non-con dential
information are lacking. For example, the metadata of a certain data-set can
report the type of license the researcher has to agree upon for using the data
once access is granted. Nevertheless, such information usually reports only the
name of the license, and not what the actual agreement de nes. For instance,
certain license agreements require the researcher to submit their work to the data
owner before publishing, and others do not allow for the data to be shown during
public speeches or presentations. At this point it is also important to specify the
de nition of restricted data in the context of this research: by \restricted" we
refer to data that is legally bound either by con dentiality or license agreements.
      </p>
      <p>It is clear that such information are truly valuable resources to allow
researchers for a more targeted data-set search and exploration, and there is an
evident need for such knowledge to be available to the end user.
2</p>
      <sec id="sec-2-1">
        <title>State of the Art</title>
        <p>A more detailed account of relevant researches are described in the next section.
Firstly, we will discuss the transformation from raw data into Linked Data with
the RDF Mapping Language (RML). Afterwards, we will present the Open
Digital Rights Language (ODRL) and the CESSDA Metadata Model (CMM), which
represent the core frameworks for the novel OPL ontology and the metadata
architecture model respectively.</p>
        <p>
          RML. An important step in this research is the creation of Linked Data from
unstructured metadata les. For this step, the RDF Mapping Language (RML)
[
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] has been chosen to map les from data providers, such as DANS, into RDF
format. A number of tools based on RML have been developed, and we found
that the most user-friendly one is YARRRML [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. YARRRML is built on top of
the RML language, and allows the use of all of its default functions as well as
the creation of jar les for custom functions.
        </p>
        <p>
          ODRL. The Open Digital Rights Language [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] was created with the aim to
provide a model and a vocabulary for the expression of statements referring to
the usage of services. ODRL has been extensively used thanks to its exibility
and interoperability, and it is now recommended by the W3C as the expression
language for describing policies' permissions, prohibitions and constraints. One of
the main bene t of using ODRL, is that it allows for the creation of new pro les,
grating the community the possibility of expressing additional semantics.
        </p>
        <p>
          CMM. The Consortium of European Social Science Data Archives (CESSDA)
has made extensive e orts in the introduction of various guidelines for data
management and maintenance, by highlighting the importance of structured
metadata in the social science domain. One of the core activity of the CMO (CESSDA
Metadata O ce) has been supporting the data-origination process by
establishing CMM [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. The aim of the CMM model is to allow the research community
to attain a consistent guide for metadata production, in order to increase
consistency and interoperability of this process.
3
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Problem Statement and Contributions</title>
        <p>The main question of this study is: How can restricted data be FAIRi ed?. This
study will focus on social science data, and more speci cally on tabular data,
but its approach could be extended to any eld of research. The assessment of
this speci c objective consists in the evaluation of the following sub-questions:</p>
        <p>RQ1.How can we represent tabular data that comes with access
restrictions in a FAIR manner? The rst step in applying the FAIR principle
on restricted research data is the creation of a model or system architecture that
is both open and transparent, as well as consistent with the requirements
essential to the research community. Due to the constraints imposed by the context
of this resource, we will focus on a metadata model that expresses
information about the underlying data, therefore avoiding con dential material being
exposed.</p>
        <p>RQ2.How can existing scienti c data-sets with restricted access be
FAIRi ed in a reproducible and transparent fashion? The second step
consists in the practical evaluation of the approach. In order to achieve this, we
aim to map and enrich the metadata of available data-sets, from a raw format
into their Linked Data counterparts.</p>
        <p>RQ3.How can researchers dealing with access-restricted data be
supported by semantic tools to create data-sets that are FAIR from
the start? We want to support the research community and data providers in
the generation of metadata for restricted tabular data, by creating a
transparent tool based on the model proposed. The tool will support the use of semantic
web technologies in to order to facilitate logic-based annotations, as well decrease
disambiguation in entity-recognition processes.</p>
        <p>RQ4.How can FAIR metadata for access-restricted data improve
the e ectiveness and usability of a scienti c data-set search engine?
The end goal of this study is to provide a data-set semantic search portal, where
users are able to nd resources by querying the metadata database. Therefore,
our nal question for this research, involves the comparison between the
performance of the data-set search portal created and other available search
environments. Moreover, we will evaluate whether certain implementations in the
semantic search of the portal are more bene cial and powerful than others, in
order to make suggestions for future work and progress.
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Contributions</title>
      <p>The contributions that this research is set to implement are:
{ The formulation of a comprehensive metadata model suitable for restricted
scienti c tabular data, with the input of eld experts in the social science
domain.
{ A method for generating enriched and FAIR metadata by applying the model
to existing restricted data-sets.
{ The creation of an automated yet transparent and FAIR tool for the
generation of Linked Metadata from the upload of data-sets.
{ The contribution to a data-set search portal, by implementing semantic
search features to optimise the ndability of resources.
4</p>
      <sec id="sec-3-1">
        <title>Research Methodology and Approach</title>
        <p>The overall approach of this research is based on the principles of: metadata
architecture and enrichment, con dentiality-aware data processing, semantic rights
representation and access negotiation. Having de ned the research questions, we
will now address the methodology in more details:</p>
        <p>
          RQ1. A systematic review will be performed to assess, summarise and
determine the relevant literature in regards to the management of con dential
data-sets, as well as common practices for the exploration and use of restricted
data. The outcome of the review will be translated into the metadata model'
requirements, and compared to already available resources such as the CMM
model, the RDF Discovery Vocabulary (disco) [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] and data management plans'
resources [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>
          RQ2. Raw metadata can be obtained by di erent data providers (e.g. CBS
and DANS), and can be transformed into Linked Metadata following the model
mentioned, using known technologies such as RML. Nevertheless, during this
process original information are kept intact, and no extra knowledge is derived.
In order to enrich the metadata we can utilise available vocabularies linkable
to the original data, such as the DBpedia Ontology [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and to the European
Language Social Science Thesaurus (ELSST) [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Moreover, metadata
information about data accessibility (e.g. open source, restricted) and data license (e.g.
creative common licenses) can be mapped to other known vocabularies such as
EuroVoc Thesaurus2 and the Creative Commons Rights Expression Language
(ccREL) [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. Furthermore, we have created a new Ontology for Policies and
Licenses that can also be utilised to map licensing agreements information found
in the metadata. In order to map the mentioned vocabularies to the terms in
the metadata we can use RML rules, such as the built-in \DBpedia Spotlight
Lookup" function.
        </p>
        <p>RQ3. A novel tool is needed to automatize the generation of metadata,
therefore supporting restricted-data owners in the creation and release of their
resources into digital libraries and metadata portals. Semantic tools, such as
entity-recognition and recommendation services will be put in place to map the
data-set terms to known thesaurus, such as ELSST, and users will also have
the option to customise the terms by adding or suggesting a more appropriate
mapping. We aim to create a secure and transparent environment where data
owners are in full control over their data, and where the tool work ow is open
and accessible to the users, in order to allow them to check every stage of the
transformation and decision-making processes.</p>
        <p>RQ4. Firstly, semantic search results among di erent portals and the
proposed data-set search engine will be compared, in order to better understand
its performance in nding information. Moreover, we aim to analyse the
ndability of resources, by using two versions of the novel designed portal: 1) this
version will be considered to be the \control" group, and it will consist of the
portal with all the search functionalities running, 2) this version, instead, will
be considered as the \knock-out" group, where certain search functionalities are
disabled. The experiment aims to understand which parameters are more
powerful and, therefore, to suggested further areas of study. Finally, we will evaluate
whether the FAIR implementation in the metadata model allow users to nd
restricted data-sets, as well as clearly stating the steps and requirements needed
to be taken in order to have access to the data from the data-owner. A possible
addition to this methodology, is also to investigate privacy budgets supported
queries, by tracking users' history and therefore calculating the level of security
of the portal.</p>
        <sec id="sec-3-1-1">
          <title>2 http://publications.europa.eu/resource/dataset/eurovoc</title>
          <p>5</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Evaluation Plan</title>
        <p>Having de ned the research questions and the methodology of this research, in
the following section we will discuss how to evaluate the results.</p>
        <p>RQ1. Following the systematic review, metadata requirements found in the
literature will be assessed and implemented in the metadata model, and major
importance will be given to literature exploring the Reusability and Findability
of restricted data. The focus group will be eventually be questioned regarding
the nal version of the model.</p>
        <p>RQ2. A collection of open data-sets in raw format will be used to manually
generate their correspondent metadata les. A case study will be performed to
quantitatively evaluate the results by counting the number of terms that have
been correctly mapped to the model, as well as counting the triples before and
after the metadata enrichment. Once the metadata creation is optimised, restricted
data-sets from providers such as CBS will be requested and investigated.</p>
        <p>RQ3. The tool will be tested by feeding it a collection of data-sets for the
automatic mapping to the metadata model, with the aim to achieve the same
performance as in the manual mapping mentioned above. Moreover, a focus
group consisting of open- and restricted-data owners, as well as researchers, will
be asked to perform the task extensively, and their feedback will be assessed.</p>
        <p>RQ4. To evaluate the last research questions, we will organise a
comparative user study to examine the features implemented in the last iteration of the
proposed portal, compared to other available resources. Moreover, we will check
whether all requirements found during the systematic review, and further
feedback that may arise during this study have been integrated. Furthermore, we
will qualitatively assess the strength of semantic search features, by checking the
resources returned by the \control" version and the \knock-out" version.
Privacy budget queries could also be evaluated by collecting analytical information
about the search history of users, and specify a clear privacy cost limit. Finally,
the aim of the portal is to include metadata from restricted data-sets, and make
them Findable for the research community.
6</p>
      </sec>
      <sec id="sec-3-3">
        <title>Intermediate Results</title>
        <p>To create an initial sample for experimentation, RML was used to create a
mapping of XML les from the DANSeasy archive, into RDF Linked Data format.
RML default functions were used to enrich the metadata, by mapping XML
access terms to the OPL ontology. Therefore, we have initials answers to research
questions 1 and 2:
{ RQ1: during the RDF transformation with RML, the CMM model has been
used as a template. We have seen that this model is exible and expressive
enough for initial experiments, but further discussion with the research group
are needed to evaluate potential implementations.
{ RQ2: metadata enrichment has been performed by mapping XML terms to
OPL, and initial feedback shows that a number of additional information
about license agreements and data usage have been added. This step is
important in the FAIRi cation of the model, as the aim is to provide end users
with transparent and e ective information about the data. The code for the
XML mapping can be found at 3, and for OPL ontology at 4.
7</p>
      </sec>
      <sec id="sec-3-4">
        <title>Conclusion</title>
        <p>The present study was designed to underline a model for FAIRi cation of
restricted research data. Although this research has only started at the beginning
of 2021, early ndings have signi cant implications in the rst and second
research questions, regarding the syntactic and semantic features of the model. In
fact, we have shown current metadata models (e.g. CMM) and resources (e.g.
ODRL) can be expanded to optimise ndability of data. Moreover, early
implementations of the OPL ontology have shown how license agreements' details can
express important features necessary to the research community. Furthermore,
we have acknowledged the usefulness of available tools such as RML during
the transformation from raw to structured metadata, and we have also shown
how such tools are exible enough to allow the creation of transparent custom
functions to better extract and map information.</p>
        <p>Considerable work is needed to gain access to restricted data-sets from providers
such as CBS, and a great focus is indeed necessary to guarantee the con
dentiality and e ectiveness of the model. Notwithstanding these limitation, the aim
of this study is to strengthens the importance of metadata and application of
the FAIR principles as solutions to restricted data-sets search, and although
it focuses on the eld of social science, the ndings may well have signi cant
implications in other scienti c communities.</p>
        <p>Acknowledgements. This project is funded by the Netherlands
Organisation of Scienti c Research (NWO), ODISSEI Roadmap project: 184.035.014.
Furthermore, I would like to thank Jacco van Ossenbruggen, Tobias Kuhn and
Ronald Siebes for the help and support needed for the success of this research
and its future endeavors.</p>
        <sec id="sec-3-4-1">
          <title>3 https://github.com/ritamargherita/opl 4 https://github.com/ritamargherita/yarrrml-mapper</title>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Hal</given-names>
            <surname>Abelson</surname>
          </string-name>
          , Ben Adida, Mike Linksvayer, and Nathan Yergler.
          <volume>10</volume>
          .
          <article-title>cc rel: The creative commons rights expression language</article-title>
          .
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Lorna</given-names>
            <surname>Balkan</surname>
          </string-name>
          ,
          <string-name>
            <surname>Taina</surname>
            <given-names>J</given-names>
          </string-name>
          aaskelainen, Christina Frentzou, and
          <string-name>
            <given-names>Chryssa</given-names>
            <surname>Kappi</surname>
          </string-name>
          .
          <article-title>European language social science thesaurus (elsst): issues in de-signing a multilingual tool for social science researchers</article-title>
          .
          <source>CHAT</source>
          <year>2011</year>
          :
          <article-title>Creation, Harmonization and Application of Terminology Resources</article-title>
          , page
          <volume>11</volume>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Dennis</surname>
            <given-names>A Benson</given-names>
          </string-name>
          , Mark Cavanaugh, Karen Clark, Ilene Karsch-Mizrachi, David J Lipman, James Ostell, and Eric W Sayers. Genbank.
          <source>Nucleic acids research</source>
          ,
          <volume>41</volume>
          (
          <issue>D1</issue>
          ):
          <source>D36{D42</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          , Jens Lehmann, Georgi Kobilarov, Soren Auer, Christian Becker, Richard Cyganiak, and
          <string-name>
            <given-names>Sebastian</given-names>
            <surname>Hellmann</surname>
          </string-name>
          .
          <article-title>Dbpedia-a crystallization point for the web of data</article-title>
          .
          <source>Journal of web semantics</source>
          ,
          <volume>7</volume>
          (
          <issue>3</issue>
          ):
          <volume>154</volume>
          {
          <fpage>165</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Kerrin</given-names>
            <surname>Borschewski</surname>
          </string-name>
          , Andre Forster, Tanja Friedrich, Wolfgang Zenk-Moltgen, Patr cia Miranda, Pedro Moura Ferreira, Jelena Banovic,
          <string-name>
            <surname>Aleksandra</surname>
            <given-names>BradicMartinovic</given-names>
          </string-name>
          , Larisa Malic, Henri Ala-Lahti,
          <article-title>and</article-title>
          et al.
          <article-title>Cmm cessda metadata model</article-title>
          .
          <source>Nov</source>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Thomas</surname>
            <given-names>Bosch</given-names>
          </string-name>
          , Richard Cyganiak, Arofan Gregory, and
          <string-name>
            <given-names>Joachim</given-names>
            <surname>Wackerow</surname>
          </string-name>
          .
          <article-title>Ddirdf discovery vocabulary: a metadata vocabulary for documenting research and survey data</article-title>
          .
          <source>In LDOW</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Andrew</surname>
            <given-names>M Cox</given-names>
          </string-name>
          ,
          <article-title>Stephen Pin eld, and Jennifer Smith. Moving a brick building: Uk libraries coping with research data management as a `wicked'problem</article-title>
          .
          <source>Journal of Librarianship and Information Science</source>
          ,
          <volume>48</volume>
          (
          <issue>1</issue>
          ):3{
          <fpage>17</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Anastasia</given-names>
            <surname>Dimou</surname>
          </string-name>
          , Miel Vander Sande, Pieter Colpaert, Ruben Verborgh, Erik Mannens, and Rik Van de Walle.
          <article-title>Rdf mapping language (rml)</article-title>
          .
          <source>Speci cation proposal draft</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Pieter</given-names>
            <surname>Heyvaert</surname>
          </string-name>
          , Ben De Meester, Anastasia Dimou, and
          <string-name>
            <given-names>Ruben</given-names>
            <surname>Verborgh</surname>
          </string-name>
          .
          <article-title>Declarative rules for linked data generation at your ngertips!</article-title>
          <source>In European Semantic Web Conference</source>
          , pages
          <volume>213</volume>
          {
          <fpage>217</fpage>
          . Springer,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>Renato</given-names>
            <surname>Ianella</surname>
          </string-name>
          .
          <article-title>Open digital rights language (odrl)</article-title>
          .
          <source>Open Content Licensing: Cultivating the Creative Commons</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Elizabeth</surname>
            <given-names>W King.</given-names>
          </string-name>
          <article-title>The ethics of mining for metadata outside of formal discovery</article-title>
          .
          <source>Penn St. L. Rev.</source>
          ,
          <volume>113</volume>
          :
          <fpage>801</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Christophe</surname>
            <given-names>Perignon</given-names>
          </string-name>
          , Kamel Gadouche, Christophe Hurlin, Roxane Silberman, and
          <string-name>
            <given-names>Eric</given-names>
            <surname>Debonnel</surname>
          </string-name>
          .
          <article-title>Certify reproducibility with con dential data</article-title>
          .
          <source>Science</source>
          ,
          <volume>365</volume>
          (
          <issue>6449</issue>
          ):
          <volume>127</volume>
          {
          <fpage>128</fpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Plato L Smith</surname>
            ,
            <given-names>Crystal</given-names>
          </string-name>
          <string-name>
            <surname>Felima</surname>
          </string-name>
          , Fletcher Durant, David Van Kleeck,
          <string-name>
            <surname>Helene Huet</surname>
          </string-name>
          , and
          <string-name>
            <surname>Laurie N Taylor. Building</surname>
          </string-name>
          socio
          <article-title>-technical systems to support data management and digital scholarship in the social sciences</article-title>
          .
          <source>In Anthropological Data in the Digital Age</source>
          , pages
          <volume>31</volume>
          {
          <fpage>57</fpage>
          . Springer,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>M. Suhr</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>C. R.</given-names>
          </string-name>
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Bender</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Knopp</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Freckmann</surname>
            ,
            <given-names>B. O</given-names>
          </string-name>
          <string-name>
            <surname>st Hansen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Henke</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Aschenbrandt</surname>
            ,
            <given-names>L. K.</given-names>
          </string-name>
          <string-name>
            <surname>Ku</surname>
          </string-name>
          <article-title>hlborn, and</article-title>
          et al.
          <article-title>Menoci: lightweight extensible web portal enhancing data management for biomedical research projects</article-title>
          .
          <source>BMC Bioinformatics</source>
          ,
          <volume>21</volume>
          (
          <issue>1</issue>
          ),
          <year>Dec 2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Ruben</surname>
          </string-name>
          Vicente-Saez and
          <article-title>Clara Mart nez-Fuentes. Open science now: A systematic literature review for an integrated de nition</article-title>
          .
          <source>Journal of business research</source>
          ,
          <volume>88</volume>
          :
          <fpage>428</fpage>
          {
          <fpage>436</fpage>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>Heinrich</given-names>
            <surname>Widmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>Hannes</given-names>
            <surname>Thiemann</surname>
          </string-name>
          .
          <article-title>Eudat b2 nd: a cross-discipline metadata service and discovery portal</article-title>
          .
          <source>In EGU General Assembly Conference Abstracts</source>
          , pages
          <fpage>EPSC2016</fpage>
          {
          <fpage>8562</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Mark D Wilkinson</surname>
          </string-name>
          , Michel Dumontier, IJsbrand Jan Aalbersberg, Gabrielle Appleton, Myles Axton, Arie Baak, Niklas Blomberg,
          <string-name>
            <surname>Jan-Willem</surname>
            <given-names>Boiten</given-names>
          </string-name>
          ,
          <source>Luiz Bonino da Silva Santos</source>
          ,
          <string-name>
            <surname>Philip E Bourne</surname>
          </string-name>
          , et al.
          <article-title>The fair guiding principles for scienti c data management and stewardship</article-title>
          .
          <source>Scienti c data</source>
          ,
          <volume>3</volume>
          (
          <issue>1</issue>
          ):1{
          <issue>9</issue>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>Jimmy</given-names>
            <surname>Ming-Tai</surname>
          </string-name>
          <string-name>
            <given-names>Wu</given-names>
            , Gautam Srivastava, Alireza Jolfaei, Philippe Fournier-Viger, and
            <surname>Jerry</surname>
          </string-name>
          <string-name>
            <given-names>Chun-Wei</given-names>
            <surname>Lin</surname>
          </string-name>
          .
          <article-title>Hiding sensitive information in ehealth datasets</article-title>
          .
          <source>Future Generation Computer Systems</source>
          ,
          <volume>117</volume>
          :
          <fpage>169</fpage>
          {
          <fpage>180</fpage>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Yang</surname>
            <given-names>Zhao</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jun Zhao</surname>
            ,
            <given-names>Jiawen</given-names>
          </string-name>
          <string-name>
            <surname>Kang</surname>
            , Zehang Zhang, Dusit Niyato,
            <given-names>Shuyu</given-names>
          </string-name>
          <string-name>
            <surname>Shi</surname>
          </string-name>
          , and
          <string-name>
            <surname>Kwok-Yan Lam</surname>
          </string-name>
          .
          <article-title>A blockchain-based approach for saving and tracking di erentialprivacy cost</article-title>
          .
          <source>IEEE Internet of Things Journal</source>
          ,
          <volume>8</volume>
          (
          <issue>11</issue>
          ):
          <volume>8865</volume>
          {
          <fpage>8882</fpage>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>