<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>What do Biodiversity Scholars Search for? Identifying High-Level Entities for Biological Metadata</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Felicitas Lo¨ ffler</string-name>
          <email>felicitas.loeffler@uni-jena.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Claas-Thido Pfaff</string-name>
          <email>claas-thido.pfaff@uni-leipzig.de</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Naouel Karam</string-name>
          <email>naouel.karam@fu-berlin.de</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>David Fichtmu¨ ller</string-name>
          <email>d.fichtmueller@bgbm.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Friederike Klan</string-name>
          <email>friederike.klan@uni-jena.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Botanic Garden and Botanical Museum (BGBM), Freie Universita ̈t Berlin</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Heinz-Nixdorf Endowed Chair for Distributed Information Systems</institution>
          ,
          <addr-line>FSU Jena</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Institute of Computer Science, Freie Universita ̈t Berlin</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Systematic Botany and Functional Biodiversity Lab, University of Leipzig</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Research questions in biodiversity are as diverse and heterogeneous as data are. Most metadata standards are mainly data-focused and pay little attention to the search perspective. In this work, we introduce a method to analyze the actual information need of biodiversity scholars based on two individual studies: (1) a series of workshops with domain experts and (2) an analysis of research and search questions collected in three different biodiversity projects. We finally present 12 high-level entities that appear in all kinds of biological data across the different sources evaluated.</p>
      </abstract>
      <kwd-group>
        <kwd>biological data</kwd>
        <kwd>life sciences</kwd>
        <kwd>biodiversity</kwd>
        <kwd>metadata</kwd>
        <kwd>information retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In the last decade, we have witnessed an unprecedented increase of open data ranging
from species-related observations, digitized specimen collections to genome or
environmental data offered, e.g., through remote sensing. This opens up unforeseen opportunities
particularly for biodiversity research, which relies on cross-disciplinary data analysis
to elucidate the interplay between individuals and the conditions of the environment
they inhabit, both on the macroscopic and microscopic level. At the flip side of this
development, discovering and filtering these large volumes of multidisciplinary data
becomes a more and more time-consuming and demanding task [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Thus, there are two
big challenges: exploring effective retrieval mechanisms that support humans in finding
relevant data and creating proper and rich metadata in order to make data findable (FAIR
principles [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]).
      </p>
      <p>The biodiversity community has responded to the latter requirement by developing
metadata standards for biological data, such as Darwin Core (DwC)5, ABCD6 or EML7 .</p>
    </sec>
    <sec id="sec-2">
      <title>5 Darwin Core, http://rs.tdwg.org/dwc/ 6 ABCD, http://www.tdwg.org/activities/abcd/ 7 EML, http://www.dcc.ac.uk/resources/metadata-standards/eml-ecological-metadata-language</title>
      <p>
        At the same time, considerable effort has been put on the formalization of domain
knowledge in terms of vocabularies and ontologies. By referencing this formal knowledge,
data can be richly annotated and become machine-readable. In the last years, numerous
ontologies for specific biological domains have been created, e.g., the Gene Ontology
(GO)8 for genes, the Chemical Entities of Biological Interest (ChEBI) ontology for
chemical compounds, the Environmental Ontology (ENVO) 9 for environmental features
and materials, the Phenotype Quality Ontology (PATO) for phenotypes and the NCBI
Taxonomy10 for species. In addition, high-level ontologies with an emphasis on
interlinking biological data from different sources have been developed, e.g., the Biological
Collections Ontology (BCO) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], the Extensible Observation Ontology (OBOE) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] or
the Semantic Sensor Network Ontology (SSN) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>In the search applications we are hosting within the biodiversity projects GFBio (The
German Federation for Biological Data)11 and AquaDiva12, we observe that existing
metadata standards and ontologies often take a data-centric view. They provide means
to well-described biodiversity data, their characteristics, their origin and the process of
their creation. However, when searching for data, scholars often do not have specific data
in mind, but rather a research question they would like to answer. Hence, we argue that
when designing metadata standards and ontologies for biodiversity both perspectives
have to be considered, the requirements given by available datasets and the way scholars
are looking for data. The contribution of the paper is twofold: (1) We propose and apply
a method that combines the findings of two different and independent approaches to
identify high-level entities that are relevant for biodiversity researchers when searching
for data (Sects. 2.1 and 2.2). (2) As a first result, we present the findings of the two
individual approaches and propose a consolidated set of biological entities (Sect. 2.3). We
consider this as a first step towards enriched metadata with information that is relevant
to information seekers. It also serves as a prerequisite for increasing the findability of
biodiversity data.
2</p>
      <sec id="sec-2-1">
        <title>Methodology</title>
        <p>Our approach to analyze the search perspective comprises two independent studies:
Assuming that properly described data can be found more easily, the goal in dedicated
workshops with scholars was to define an annotation schema that can be used to richly
describe ecological data (Sect. 2.1). The second study is oriented to evaluation methods
in information retrieval and analyzes research and search questions collected in three
biodiversity projects (Sect. 2.2). In both approaches, the aim was to enhance search
applications and to detect high-level entities that can be either used as metadata fields
or that can be linked with ontologies. The first result of biological high-level entities is
presented in Sect. 2.3.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>8 GO, http://www.geneontology.org/</title>
      <p>9 ENVO, https://github.com/EnvironmentOntology/envo
10 NCBI Taxonomy, https://www.ncbi.nlm.nih.gov/taxonomy
11 GFBio, https://www.gfbio.org
12 AquaDiva, http://www.aquadiva.uni-jena.de
2.1</p>
      <sec id="sec-3-1">
        <title>Workshops with domain experts</title>
        <p>In close collaboration between GFBio and the German Centre for Integrative
Biodiversity Research Halle – Jena – Leipzig (iDiv)13, we conducted ten workshops with 35
domain experts from ecology and adjacent disciplines to develop a metadata schema
and a controlled vocabulary, the Essential Annotation Schema for Ecology (EASE)14.
This annotation framework aims at describing ecological data from a scholar’s search
perspective. Annotation in this context refers to metadata.</p>
        <p>Two design principles have been formulated for the development: Parsimony: The
framework aims at being as simple as possible in structure and content. Optimization
here has to be done carefully to maintain a differentiated and consistent annotation. One
example: Larger time frames in ecology are referred to by a relative reference, (e.g.,
18 million years ago) or by named geological time periods. These periods are getting
more granular from eons to ages and are nested in each other. It could be argued to make
ages optional in the annotation which sacrifices some granularity but still maintains
a consistent larger temporal context. Comprehensiveness: The framework aims at a
certain comprehensiveness defining essential orthogonal dimensions of information
which allow ecological content to be described and located in the search space of
ecology. Comprehensiveness is not accomplished by using many different dimensions
and concepts but rather a few essential and complementary ones which also reflect the
mindset and questions of researchers when looking for data.</p>
        <p>Based on these guidelines, 8 top level categories have been selected. During the
workshops, the top level categories were substantiated in a top down approach with increasing
detail (~1600 concepts). Here, we relied on expert knowledge of the contributors but
also on other sources such as EML, ABCD and DwC, various topic specific textbooks
(e.g., related to organic and inorganic chemistry) and standardized vocabularies (e.g., the
World Reference Base for Soil15, and The International Chronostratigraphic Chart16).</p>
        <p>The top level categories are 1. Time (e.g., date, time, timezone), 2. Space (e.g.,
bounding box, coordinates, location names), 3. Sphere (e.g., pedo-, hydro-, atmosphere
aspects), 4. Biome (e.g., zones, water availability, land use), 5. Organism (species
classification), 6. Process (e.g., processes, objects and interactions), 7. Method (general
approach, setup of gradients), 8. Chemical (e.g., elements, compounds, functions). In
addition, the framework covers a set of general information to handle associations
between primary data and annotation (e.g., data format, contact person, download URL).
2.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Research and search questions in the biodiversity domain</title>
        <p>
          In information retrieval, a lot of research has been done towards a perfect ranking [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]
whereas little attention has been paid to a user’s actual information need. What research
questions are biodiversity scholars working on? What kind of data do they want to reuse?
Do the provided metadata actually reflect a researcher’s information need? Therefore,
13 iDiv, https://www.idiv.de/
14 EASE: https://github.com/cpfaff/ease
15 WRB, http://www.fao.org/soils-portal/soil-survey/soil-classification/world-reference-base/en/
16 ICS, http://www.stratigraphy.org/index.php/ics-chart-timescale
Do butterflies occur on How does agriculture affect the
calcareous grassland? groundwater composition?
Is there data on the influence of What are suitable methods to
geographic elevation on the characterize microbial soil
        </p>
        <p>growth rate and plant processes by gas analytical
development of Zea mays? techniques?
we collected 184 search and research questions from scholars who are involved in three
biodiversity projects in Germany: GFBio (73), AquaDiva (98) and iDiv (13). Examples
are presented in Table 1. We asked for full questions as well as keywords to get the
actual information need together with the search query. We left it to the scholars to either
provide search questions posed to a search interface or broader research questions they
are currently working on to get a wider spectrum of information needs.</p>
        <p>We analyzed the questions manually and explored whether the noun entities could be
grouped into high-level categories, such as Organism or Environment. For instance, given
the question: Is there DNA data about Amphimonhystrella (Nematoda)? the noun entities
are ’DNA data’ and ’Amphimonhystrella (Nematoda)’. The latter one is an Organism
whereas ’DNA data’ points to a certain Data Type. Finally, we grouped the noun entities
into 13 categories presented in Table 2. Organism comprises all individual life forms
including plants, fungi, bacteria, animals and microorganisms. All species live in certain
Environments and have certain characteristics that are summarized with Quality and
Phenotype. Biological, chemical and physical Processes are re-occurring and transform
materials or organisms due to chemical reactions or other influencing factors. Events
are processes that appear only once at a specific time, such as environmental disasters.
Chemical compounds, rocks, sand and sediments can be grouped as Materials and
Substances. Anatomical Entities comprise the structure of organisms, e.g., body or plant
parts, organs, cells and genes. The term Method describe all operations and experiments
that have to be conducted to lead to a certain result. Outcomes of research methods
are delivered in Data Types. All kinds of geographic information is summarized with
Location and time data including geological eras are described with Time. Person and
Organization are either projects or authors of data. As reflected in the search questions,
scholars in biodiversity are highly interested in Human Intervention on landscape and
environment, e.g., fishery, agriculture.
2.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Discussion</title>
        <p>Table 3 constitutes a consolidation of the main entities identified by the previously
described processes. While there is a broad consensus on entities such as Organism,
Process, Method and Time, some wording and classification on others are different.
Space in EASE actually means location information and Sphere comprises altitude
indications that is covered under Environment in the search questions. In EASE, Biome
Organism
quercus,
cyclothone,
globigerina
bulloides
Material and</p>
        <p>Substance
sediment, rock,</p>
        <p>CO2
contains subfields for Land use (Human Intervention in the search questions) and data
attributes are defined as Factor under Method (Quality and Phenotype in the search
questions). Chemical is grouped under Material &amp; Substance in the search questions
that additionally covers soil, sediments and rocks. Anatomical Entity, Data Type and
Event only occur in the search questions.</p>
        <p>Looking at potential linkage with existing ontologies, we finally selected 12
biological high-level entities. We left out Sphere since ontologies such as ENVO already cover
environmental features and conditions and could be extended with altitude information.
Preliminary, we will leave Data Type out. It needs to be further discussed and investigated
whether it can be classified under other entities, e.g., Method.
3</p>
        <sec id="sec-3-3-1">
          <title>Conclusion</title>
          <p>We described and applied a methodology for identifying high-level entities in the
biodiversity domain that reflect the scholars’ point of view. In our future work, we will link
the identified entities to existing ontologies. Our aim is to improve the indexing process
of search applications over research data by means of these 12 categories. We would
like to automatically extract information from metadata that is related to the entities. We
believe, that this will help to improve data retrieval methods in the biodiversity domain.</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>Acknowledgements</title>
          <p>We would like to thank the numerous scientists from GFBio, AquaDiva and iDiv who
took part in the workshops and/or provided questions. This work was partially funded
by the Deutsche Forschungsgemeinschaft (DFG) within the scope of the GFBio and
AquaDiva projects.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>M.</given-names>
            <surname>Compton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Barnaghi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bermudez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Garca-Castro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Corcho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Cox</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Graybeal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hauswirth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Henson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Herzog</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Janowicz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. D.</given-names>
            <surname>Kelsey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. L.</given-names>
            <surname>Phuoc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lefort</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Leggieri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Neuhaus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nikolov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Page</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Passant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sheth</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Taylor.</surname>
          </string-name>
          <article-title>The ssn ontology of the w3c semantic sensor network incubator group</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          ,
          <volume>17</volume>
          :
          <fpage>25</fpage>
          -
          <lpage>32</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>M.</given-names>
            <surname>Diepenbroek</surname>
          </string-name>
          ,
          <string-name>
            <surname>F.</surname>
          </string-name>
          <article-title>Gl o¨ckner</article-title>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Grobe</surname>
          </string-name>
          , A. G u¨ntsch,
          <string-name>
            <given-names>R.</given-names>
            <surname>Huber</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          <article-title>K o¨nig-</article-title>
          <string-name>
            <surname>Ries</surname>
            ,
            <given-names>I. Kostadinov</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Nieschulze</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Seeger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Tolksdorf</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Triebel</surname>
          </string-name>
          .
          <article-title>Towards an integrated biodiversity and ecological research data management and archiving platform: GFBio</article-title>
          .
          <source>In Informatik</source>
          <year>2014</year>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>J.</given-names>
            <surname>Madin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bowers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schildhauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Krivov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pennington</surname>
          </string-name>
          , and
          <string-name>
            <surname>V. F.</surname>
          </string-name>
          <article-title>An ontology for describing and synthesizing ecological observation data</article-title>
          .
          <source>Ecological Informatics</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>C. D. Manning</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Raghavan</surname>
          </string-name>
          , and H. Schu¨tze. Introduction to Information Retrieval. Cambridge University Press,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Walls</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Deck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guralnick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Baskauf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Beaman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Blum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bowers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. L.</given-names>
            <surname>Buttigieg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Davies</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Endresen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Gandolfo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Hanner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Janning</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Krishtalka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Matsunaga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Midford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Morrison</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. O.</given-names>
            <surname>Tuama</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Schildhauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. J.</given-names>
            <surname>Stucky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Thomer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wieczorek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Whitacre</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Wooley</surname>
          </string-name>
          .
          <article-title>Semantics in support of biodiversity knowledge discovery: An introduction to the biological collections ontology and related ontologies</article-title>
          .
          <source>PLoS ONE</source>
          ,
          <volume>9</volume>
          (
          <issue>3</issue>
          ):e89606+, Mar.
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>M. D. Wilkinson</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>I. J.</given-names>
          </string-name>
          <string-name>
            <surname>Aalbersberg</surname>
            , G. Appleton,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Axton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Baak</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Blomberg</surname>
            ,
            <given-names>J.-W.</given-names>
          </string-name>
          <string-name>
            <surname>Boiten</surname>
            ,
            <given-names>L. B. da Silva</given-names>
          </string-name>
          <string-name>
            <surname>Santos</surname>
            ,
            <given-names>P. E.</given-names>
          </string-name>
          <string-name>
            <surname>Bourne</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Bouwman</surname>
            ,
            <given-names>A. J.</given-names>
          </string-name>
          <string-name>
            <surname>Brookes</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Crosas</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Dillo</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Dumon</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Edmunds</surname>
            ,
            <given-names>C. T.</given-names>
          </string-name>
          <string-name>
            <surname>Evelo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Finkers</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>GonzalezBeltran</surname>
            ,
            <given-names>A. J.</given-names>
          </string-name>
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Groth</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Goble</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          <string-name>
            <surname>Grethe</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Heringa</surname>
          </string-name>
          , P. A.
          <string-name>
            <surname>t Hoen</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Hooft</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Kuhn</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Kok</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Kok</surname>
            ,
            <given-names>S. J.</given-names>
          </string-name>
          <string-name>
            <surname>Lusher</surname>
            ,
            <given-names>M. E.</given-names>
          </string-name>
          <string-name>
            <surname>Martone</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Mons</surname>
            ,
            <given-names>A. L.</given-names>
          </string-name>
          <string-name>
            <surname>Packer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Persson</surname>
            , P. RoccaSerra, M. Roos, R. van Schaik,
            <given-names>S.-A.</given-names>
          </string-name>
          <string-name>
            <surname>Sansone</surname>
            , E. Schultes,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Sengstag</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Slater</surname>
            , G. Strawn,
            <given-names>M. A.</given-names>
          </string-name>
          <string-name>
            <surname>Swertz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>J. van der</given-names>
          </string-name>
          <string-name>
            <surname>Lei</surname>
            , E. van Mulligen,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Velterop</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Waagmeester</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Wittenburg</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Wolstencroft</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Zhao</surname>
            , and
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Mons</surname>
          </string-name>
          .
          <article-title>The fair guiding principles for scientific data management and stewardship</article-title>
          .
          <source>Scientific Data</source>
          <volume>3</volume>
          , (
          <issue>160018</issue>
          ),
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>