<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marcos Martínez-Romero</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martin J. O'Connor</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Dorf</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jennifer Vendetti</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Debra Willrett</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Attila L. Egyedi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>John Graybeal</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark A. Musen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Biomedical Informatics Research, Stanford University</institution>
          ,
          <addr-line>1265 Welch Rd, Stanford, CA 94305</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The availability of associated descriptive metadata for scientific datasets is important for discovering and reproducing scientific experiments. The use of ontologies has become a key focus for increasing the quality of these metadata. Despite the wide availability of biomedical ontologies, scientists wishing to use these ontologies when developing metadata descriptions face a number of practical difficulties. A core difficulty is the lack of tools for developing ontology-linked metadata specifications that can be published and shared. Additional difficulties include the lack of support for defining new terms in cases when no existing terms are found and for creating custom term collections to meet domain-specific needs. To address these problems, we developed tools that allow scientists to find terms in ontologies for annotating their data and to dynamically create new terms and value sets. This work has been incorporated into a Web-based platform called the CEDAR Workbench. The resulting integrated environment presents a set of highly interactive interfaces for creating and publishing ontology-rich metadata specifications.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        In biomedicine, high-quality, standardized metadata are
crucial for facilitating the discovery of scientific datasets
and reproducibility of the corresponding experiments. In the
last few years, the biomedical community has driven the
development of metadata standards and guidelines for a
variety of experiment types. Scientists use these
specifications to inform their annotation of experimental results
        <xref ref-type="bibr" rid="ref15">(Tenenbaum, Sansone, &amp; Haendel, 2014)</xref>
        . One of the
earliest examples is the MIAME standard
        <xref ref-type="bibr" rid="ref1">(Brazma et al., 2001)</xref>
        ,
which is used to describe metadata about microarray
experiments. These standards and guidelines underpin metadata
submissions to many public metadata repositories
        <xref ref-type="bibr" rid="ref4">(Edgar,
Domrachev, &amp; Lash, 2002)</xref>
        . The BioSharing resource
        <xref ref-type="bibr" rid="ref7">(McQuilton et al., 2016)</xref>
        catalogs hundreds of these
standardization efforts.
      </p>
      <p>
        Despite the growing use of standards for defining
metadata and the wide availability of biomedical ontologies,
metadata submitted to public repositories rarely use standard
terms
        <xref ref-type="bibr" rid="ref2">(Bui &amp; Park, 2006)</xref>
        . As a result, finding or reusing the
metadata is a challenge and understanding the underlying
experiments can be extremely hard, often requiring
significant post-processing of metadata to extract useful content.
      </p>
      <p>A key problem is that scientists face considerable
practical barriers when attempting to link their metadata to
ontology terms. Submission mechanisms for biomedical
repositories are typically based on spreadsheets, with a variety of ad
hoc formats that rarely support inclusion of ontology-based
annotations. Even in cases where such annotations can be
entered, scientists have no easy way to find and use terms
from ontologies to include in their metadata submissions.
Other difficulties include poor support for on-the-fly term
creation when the necessary terms are not found and for
creating custom lists of terms to meet domain-specific
needs.</p>
      <p>
        A variety of tools have been developed to address the
challenge of metadata quality. Foremost among these are the
ISA Tools
        <xref ref-type="bibr" rid="ref13">(Rocca-Serra et al., 2010)</xref>
        , which allow curators
to create spreadsheet-based submissions for metadata
repositories. LinkedISA provides a means to interoperate with
Linked Open Data, effectively adding controlled term
linkage to templates
        <xref ref-type="bibr" rid="ref15 ref5">(González-Beltrán, Maguire, Sansone, &amp;
Rocca-Serra, 2014)</xref>
        . A similar spreadsheet-based tool called
RightField
        <xref ref-type="bibr" rid="ref17">(Wolstencroft et al., 2011)</xref>
        provides a mechanism
for embedding ontology annotation capabilities in Excel or
Open Office spreadsheets using ontologies from the
BioPortal repository
        <xref ref-type="bibr" rid="ref10">(Noy et al., 2009)</xref>
        . Annotare
        <xref ref-type="bibr" rid="ref14">(Shankar et al.,
2010)</xref>
        , which is used to submit experimental data to the
ArrayExpress metadata repository
        <xref ref-type="bibr" rid="ref12">(Parkinson et al., 2005)</xref>
        ,
also supports ontology-based suggestions. These tools
address specific issues of metadata quality but they do not
provide an integrated environment that can support the
entire metadata specification and submission process for
widely used biomedical repositories.
      </p>
      <p>
        The Center for Expanded Data Annotation and Retrieval
(CEDAR)1 is developing a computational ecosystem to
overcome the barriers to creating high-quality metadata in
biomedicine
        <xref ref-type="bibr" rid="ref8">(Musen et al., 2015)</xref>
        . CEDAR provides a suite
of highly sophisticated tools designed to make the authoring
of metadata as natural as possible, while also using
ontologies to enrich the generated descriptions with standard
terms.
      </p>
      <p>In this paper, we describe the main features CEDAR
developed to make it possible to easily construct Web-based
metadata-acquisition forms, enrich those forms with
ontology concepts, and then fill out the forms to create
ontologyannotated descriptions of scientific experiments.
1 https://metadatacenter.org/</p>
    </sec>
    <sec id="sec-2">
      <title>BACKGROUND</title>
      <p>The CEDAR Workbench2 is a suite of Web-based tools and
REST APIs centered on the use of highly-modular
metadata-acquisition forms called metadata templates (or simply
templates). These templates define the data attributes—
termed template fields or fields—needed to describe
biomedical experiments. For example, an experiment template
may have an organism field containing the name of the
organism being studied by the experiment (e.g., Homo
sapiens). The templates may specify lists of permissible values
for template fields. The central goal when designing a
template is to enable the capture of sufficiently precise and
complete metadata about experimental data to facilitate data
discovery, interpretation, and reuse.</p>
      <p>The CEDAR Workbench provides three core components
that form a metadata construction pipeline (Fig. 1): (1) a
Template Designer, which supports interactive template
creation; (2) a Metadata Editor, which allows end-users to
fill in templates with metadata; and (3) a Metadata
Repository for storing both templates and the metadata created
using those templates. The CEDAR Workbench also allows
scientists to upload the metadata created to public
biomedical repositories.
2.1</p>
      <sec id="sec-2-1">
        <title>Template Designer and Metadata Editor</title>
        <p>In the Template Designer, template authors assemble
templates from one or more input fields. There are numerous
field types available to template authors (e.g., text,
paragraph, e-mail, numeric, and date). Users can also define
reusable groups of fields, called elements. For example, the
fields that describe a publication (e.g., authors, title, year,
publication type, etc.) could be grouped together to form a
publication element, which can then be reused in multiple
templates. After a template is created, the Metadata Editor
can be used to automatically generate a forms-based
acquisition interface for entering metadata for that template.
Scientists entering metadata using the Metadata Editor are
prompted in real time with drop-down lists, auto-completion
suggestions, and verification hints, significantly reducing
their error rate while speeding metadata entry and repair.
These prompts are driven by the value constraints specified
in templates.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Metadata Repository</title>
        <p>
          Templates and metadata produced by the Workbench are
stored in CEDAR’s metadata repository. CEDAR
incorporates a standardized model of templates and metadata,
together with Web-based services to store, search, and share
these resources
          <xref ref-type="bibr" rid="ref11">(O’Connor et al., 2016)</xref>
          . This model is based
on the JSON Schema and JSON-LD specifications. It allows
users to publish their metadata as both JSON-LD and RDF,
thus facilitating interoperation with Linked Open Data.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Support for ontology-based metadata</title>
        <p>
          The CEDAR tools provide mechanisms for structurally
describing templates and publishing metadata created using
those templates in an open format. To increase the metadata
quality further, we offer the ability to enrich these
descriptions with controlled terms from ontologies. We extended
the Template Designer and Metadata Editor to let users
specify semantic content for templates and to easily enter
semantically precise terms in their metadata. These
extensions, can help to improve metadata adherence to the FAIR
data principles
          <xref ref-type="bibr" rid="ref16">(Wilkinson et al., 2016)</xref>
          and interoperability
with Linked Open Data.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>IMPLEMENTATION</title>
      <p>
        We have enhanced the CEDAR Workbench to provide the
ability to link ontology terms selected from BioPortal to
biomedical metadata. BioPortal, developed by the National
Center for Biomedical Ontology (NCBO)
        <xref ref-type="bibr" rid="ref9">(Musen et al.,
2012)</xref>
        , is a popular platform for hosting and sharing
biomedical ontologies. It provides access to more than 550
ontologies, and contains over 8 million classes and 64,000
properties. The BioPortal API provides a rich set of operations to
access and use ontologies. We extended this API to provide
the fine grained, highly interactive class and property
lookup features needed by CEDAR’s term search and
selection features. To facilitate general use of these features, we
encapsulated BioPortal’s API as a CEDAR service and
made it available as a public REST endpoint.3 We now
describe these extensions.
3 The REST endpoints that provide ontology-based services to the CEDAR
Workbench are documented at https://terminology.metadatacenter.net/api.
3.1
      </p>
      <sec id="sec-3-1">
        <title>Class and Property Search</title>
        <p>CEDAR allows template authors to search for ontology
terms to annotate their templates, that is, to add type and
property assertions to template elements and fields using
ontology classes and properties. Classes and object, data,
and annotation properties for performing these annotations
can be selected from terms supplied by BioPortal. Fig. 2
shows a screenshot of the ontology lookup user interface of
the CEDAR Workbench. In the example shown, the
template author entered the search term publication and then
selected the Publication class from the National Cancer
Institute Thesaurus (NCIT). The interface shows detailed
information both for the selected class and the associated
ontology, as well as for the position of the class in the class
tree of NCIT.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Value Set creation</title>
        <p>A value set is a list of possible values for a specific purpose.
In the CEDAR Workbench, value sets are a useful
mechanism to define pick lists of permissible values for template
fields. CEDAR works in conjunction with BioPortal to
allow template authors to dynamically create value sets
containing the terms in these pick lists. Value sets can contain
classes from any combination of BioPortal ontologies. Upon
creation, a value set is immediately assigned a unique,
provisional IRI. The CEDAR Workbench supports the creation,
retrieval, update, and deletion of these value sets.</p>
        <p>For example, suppose that the template author wishes to
constrain the values of a Study type field to three specific
types of longitudinal studies (prospective study,
retrospective study, and hybrid study). The Clinical Trials Ontology
(CTO) is a good source of these types since it contains 375
study type classes (represented as descendants of the Study
type class). Instead of selecting all these types, the template
author can create a value set containing only the desired
types. Fig. 3 shows a screenshot of value set creation
features for this example presented in the Template Designer.
Here, the user creates a value set named Longitudinal study
types with three terms selected from CTO.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Class creation</title>
        <p>Despite the vast number of classes and properties available
in biomedical repositories, ontologies often do not contain
the exact term a user requires. To address this problem,
CEDAR allows users to dynamically define new classes and
immediately to use them. When generating a new class,
users can optionally link it to one or several existing classes
by means of the RDFS subclassOf relationship and SKOS
relationships (closeMatch, exactMatch, broadMatch,
narrowMatch, relatedMatch). Upon creation, a class is
immediately assigned a unique, provisional IRI.</p>
        <p>For example, suppose that a user needs to use the
anatomical term adductor dorsalis. This term is not available in
any BioPortal ontology, though the adductor muscle class in
the UBERON ontology is a close conceptual match. In this
case, the user decides to create an adductor dorsalis class
via the CEDAR Workbench and indicate that the new term
is a subclass of the adductor muscle UBERON class. Fig. 4
shows the class creation interface for this example. The
adductor dorsalis class is stored in BioPortal as a CEDAR
provisional class and is immediately available to all
CEDAR users. Eventually, maintainers of UBERON may
decide to incorporate the adductor dorsalis class to the
ontology or may decide to reject it. If the class is added to
UBERON, the permanent identifier for the class will be
stored as part of the information of the provisional class. If
adductor dorsalis is not included in the next version of the
ontology, the subclassOf link will removed, but the class
will still be valid in CEDAR.
3.4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Value Constraints</title>
        <p>With the above functionality, the system can limit the
possible values of a template field to a predefined sets of
ontology terms or value sets. Some template authors may need to
define value constraints that go beyond predefined term
Fig. 4. Example of class creation in the CEDAR Workbench. The
user creates the adductor dorsalis class and links it to the adductor
muscle class in the UBERON ontology via the subclassOf relation.
lists. For example, a user may wish to constrain the values
of a disorder template field to all subclasses in three specific
branches of the DOID ontology rooted at the terms cognitive
disorder, sleep disorder, and dissociative disorder.</p>
        <p>To deal with use cases such as this one, the system
effectively allows template designers to constrain field values to
any combination of (1) all classes in an ontology branch, (2)
all classes from a specific ontology, (3) new or existing
classes, and (4) new or existing value sets. Multiple
constraint types can be specified for the same field.</p>
        <p>Users populating templates using the Metadata Editor are
presented in real time with a list of choices driven by these
value constraints. Fig. 5 shows an example of choices
presented for a disorder field that has had its values constrained
to come from the three DOID ontology disease branches
described in the earlier example. All terms from these three
branches are combined in real time and presented as a single
list.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>EVALUATION</title>
      <p>CEDAR is working with several biomedical communities to
perform an initial evaluation of our ontology-based
annotation functionality. This evaluation is being carried out in the
context of using CEDAR to develop metadata submission
pipelines for three biomedical groups. These groups are (1)
the LINCS Consortium,4 which is developing a catalog of
cellular signatures; (2) ImmPort,5 a portal for
immunologyrelated datasets; and (3) the AIRR Community,6 which is
developing standards for describing datasets acquired using
advanced sequencing technologies. In all three cases, the
workflow is: (1) design metadata templates for each group’s
4 http://www.lincsproject.org
5 http://www.immport.org
6 http://airr-community.org
relevant datasets; (2) enhance these templates with
ontology-based annotations; (3) scientists populate the templates
with metadata describing their experiments; and (4) submit
the generated metadata to the appropriate repositories.</p>
      <p>Working together with the LINCS, ImmPort, and AIRR
teams we first used the Template Designer tool to develop a
basic version of the templates required by each group. We
then annotated those templates using ontologies. Each
project required a slightly different annotation workflow.</p>
      <p>To annotate ImmPort data, members of the Human
Immunology Project Consortium (HIPC)7 performed an
analysis of all fields and value constraints in the ImmPort system
to identify appropriate controlled-term linkages. They used
the Template Designer to comprehensively annotate the
ImmPort templates with the controlled terms identified.
They also specified value constraints for controlled-value
fields to ensure that the generated acquisition interfaces
restricted the acquisition of metadata to appropriate terms. In
the cases where custom value sets were required for fields,
CEDAR used BioPortal’s value set features to let users
define these resources. The process for the AIRR community
was slightly different, since that community already
incorporated ontology-based annotations as an integral part of
their metadata-specification process. All these annotations
were available in spreadsheet format and the only required
step was to formalize them using the Template Designer.
Finally, the LINCS team identified and encoded controlled
term linkage for an initial subset of their templates.</p>
      <p>The system successfully represented all required
controlled-term annotations for the three groups. We are now
completing the metadata submission pipeline for each
group. For the LINCS and ImmPort projects, we are
submitting the generated metadata into their community domain
repositories. The AIRR submission process involves
submitting the generated metadata to the public NCBI
BioSample repository.8 We have completed prototype LINCS
and NCBI pipeline submissions and will evaluate the speed,
reliability, and completeness of the submission process
before releasing each submission pipelines for public use.</p>
    </sec>
    <sec id="sec-5">
      <title>5 DISCUSSION</title>
      <p>Despite the growing number of ontologies in biomedicine,
scientists rarely select standard terms for describing their
experiments. Consequently, finding scientific datasets and
understanding the corresponding experiments can be
extremely hard and time-consuming, and often requires
considerable post-processing of metadata to extract relevant
content. A fundamental problem is the lack of convenient
and openly available tools for linking metadata to
ontologies. It takes time and effort to create well-specified
metadata and scientists often view the task of metadata authoring as
a burden that does not bring them any direct benefit.
7 https://www.immuneprofiling.org/hipc
8 https://www.ncbi.nlm.nih.gov/biosample</p>
      <p>The CEDAR Workbench allows template authors to
make extensive use of ontologies from BioPortal to add type
and property assertions to template fields and to constrain
the values of fields to ontology terms. Once those templates
are created, metadata authors can easily use them to
generate rich metadata without needing any understanding of
ontology structures. The features described in this paper
represent a major step toward overcoming the barriers to the
creation of high-quality metadata in biomedicine. Through
our approach, we hope to make it easier, and even fun, for
scientists to annotate their experimental data in ways that
ensure their value to the scientific community.</p>
      <p>
        We are studying a variety of technologies to further ease
the work of entering metadata. We developed a
recommendation service that identifies common patterns in the
metadata repository and that generates real-time suggestions
for filling out templates
        <xref ref-type="bibr" rid="ref6">(Martínez-Romero et al., 2017)</xref>
        .
This service is the first of a planned set of intelligent
authoring components that will also include the extraction and
semantic annotation of templates and metadata from
semistructured sources, such as spreadsheets, scientific articles,
and Web pages.
      </p>
      <p>
        We also plan to develop an ontology enrichment pipeline
in which ontology owners receive term requests based on
the new classes created from CEDAR, which could be used
to refine and extend their ontologies. The TermGenie
        <xref ref-type="bibr" rid="ref3">(Dietze et al., 2014)</xref>
        tool for requesting new Gene Ontology
classes provides a model for the planned functionality.
      </p>
    </sec>
    <sec id="sec-6">
      <title>ACKNOWLEDGMENTS</title>
      <p>CEDAR is supported by the National Institutes of Health
through an NIH Big Data to Knowledge program under
grant 1U54AI117925. NCBO is supported by the NIH
Common Fund under grant U54HG004028. The CEDAR
Workbench is available at https://cedar.metadatacenter.net,
and on GitHub (https://github.com/metadatacenter).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Brazma</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hingamp</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Quackenbush</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sherlock</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spellman</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stoeckert</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aach</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al. (
          <year>2001</year>
          ).
          <article-title>Minimum information about a microarray experiment (MIAME)-toward standards for microarray data</article-title>
          .
          <source>Nat Genet</source>
          ,
          <volume>29</volume>
          ,
          <fpage>365</fpage>
          -
          <lpage>371</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Bui</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>J.-R.</given-names>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>An assessment of metadata quality: A case study of the National Science Digital Library Metadata Repository</article-title>
          .
          <source>Proceedings of CAIS/ACSI</source>
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Dietze</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berardini</surname>
            ,
            <given-names>T. Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foulger</surname>
            ,
            <given-names>R. E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hill</surname>
            ,
            <given-names>D. P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lomax</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>OsumiSutherland</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roncaglia</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , et al. (
          <year>2014</year>
          ).
          <article-title>TermGenie - a webapplication for pattern-based ontology class generation</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          ,
          <volume>5</volume>
          (
          <issue>1</issue>
          ),
          <fpage>48</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Edgar</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Domrachev</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Lash</surname>
            ,
            <given-names>A. E.</given-names>
          </string-name>
          (
          <year>2002</year>
          ).
          <article-title>Gene Expression Omnibus: NCBI gene expression and hybridization array data repository</article-title>
          .
          <source>Nucleic Acids Res</source>
          ,
          <volume>30</volume>
          (
          <issue>1</issue>
          ),
          <fpage>207</fpage>
          -
          <lpage>210</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>González-Beltrán</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maguire</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sansone</surname>
            ,
            <given-names>S.-A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Rocca-Serra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>linkedISA: semantic representation of ISA-Tab experimental metadata</article-title>
          .
          <source>BMC bioinformatics</source>
          ,
          <volume>15</volume>
          <issue>Suppl 1</issue>
          ,
          <fpage>S4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Martínez-Romero</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>O</given-names>
            <surname>'Connor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            ,
            <surname>Shankar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Panahiazar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Willrett</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Egyedi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L.</given-names>
            ,
            <surname>Gevaert</surname>
          </string-name>
          ,
          <string-name>
            <surname>O.</surname>
          </string-name>
          , et al. (
          <year>2017</year>
          ).
          <article-title>Fast and accurate metadata authoring using ontology-based recommendations</article-title>
          .
          <source>Proceedings of AMIA 2017 Annual Symposium</source>
          (to appear).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>McQuilton</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalez-Beltran</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rocca-Serra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thurston</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lister</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maguire</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Sansone</surname>
            ,
            <given-names>S.-A.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>BioSharing: curated and crowd-sourced metadata standards, databases and data policies in the life sciences</article-title>
          .
          <source>Database: the journal of biological databases and curation</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bean</surname>
            ,
            <given-names>C. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheung</surname>
            ,
            <given-names>K. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Durante</surname>
            ,
            <given-names>K. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gevaert</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalez-Beltran</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al. (
          <year>2015</year>
          ).
          <article-title>The Center for Expanded Data Annotation and Retrieval</article-title>
          .
          <source>Journal of the American Medical Informatics Association</source>
          ,
          <volume>22</volume>
          (
          <issue>6</issue>
          ),
          <fpage>1148</fpage>
          -
          <lpage>1152</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N. F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shah</surname>
            ,
            <given-names>N. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Whetzel</surname>
            ,
            <given-names>P. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chute</surname>
            ,
            <given-names>C. G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Story</surname>
            , M.-
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2012</year>
          ).
          <article-title>The National Center for Biomedical Ontology</article-title>
          .
          <source>Journal of the American Medical Informatics Association.</source>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Noy</surname>
            ,
            <given-names>N. F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shah</surname>
            ,
            <given-names>N. H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Whetzel</surname>
            ,
            <given-names>P. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dai</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dorf</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Griffith</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jonquet</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , et al. (
          <year>2009</year>
          ).
          <article-title>BioPortal: ontologies and integrated data resources at the click of a mouse</article-title>
          .
          <source>Nucleic acids research</source>
          ,
          <volume>37</volume>
          (
          <issue>Web Server issue</issue>
          ),
          <fpage>W170</fpage>
          -3.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>O'Connor</surname>
            ,
            <given-names>M. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martinez-Romero</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Egyedi</surname>
            ,
            <given-names>A. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Willrett</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Graybeal</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Musen</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>An open repository model for acquiring knowledge about scientific experiments</article-title>
          .
          <source>Proceedings of the 20th International Conference on Knowledge Engineering and Knowledge Management (EKAW2016)</source>
          . (Vol.
          <volume>10024</volume>
          , pp.
          <fpage>762</fpage>
          -
          <lpage>777</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Parkinson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sarkans</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shojatalab</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abeygunawardena</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Contrino</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coulson</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Farne</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al. (
          <year>2005</year>
          ).
          <article-title>ArrayExpress--a public repository for microarray gene expression data at the EBI</article-title>
          .
          <source>Nucleic acids research</source>
          ,
          <volume>33</volume>
          (Database issue),
          <fpage>D553</fpage>
          -
          <lpage>D555</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Rocca-Serra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brandizi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maguire</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sklyar</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , C.,
          <string-name>
            <surname>Begley</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Field</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et al. (
          <year>2010</year>
          ).
          <article-title>ISA software suite: Supporting standards-compliant experimental annotation and enabling curation at the community level</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <volume>26</volume>
          (
          <issue>18</issue>
          ),
          <fpage>2354</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Shankar</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parkinson</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burdett</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hastings</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srinivasa</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , et al. (
          <year>2010</year>
          ).
          <article-title>Annotare-a tool for annotating highthroughput biomedical investigations and resulting data</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <volume>26</volume>
          (
          <issue>19</issue>
          ),
          <fpage>2470</fpage>
          -
          <lpage>2471</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Tenenbaum</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sansone</surname>
            ,
            <given-names>S.-A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Haendel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2014</year>
          ).
          <article-title>A sea of standards for omics data: sink or swim?</article-title>
          <source>Journal of the American Medical Informatics Association</source>
          ,
          <volume>21</volume>
          (
          <issue>2</issue>
          ),
          <fpage>200</fpage>
          -
          <lpage>203</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Wilkinson</surname>
            ,
            <given-names>M. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aalbersberg</surname>
          </string-name>
          , Ij. J.,
          <string-name>
            <surname>Appleton</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Axton</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baak</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blomberg</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , et al. (
          <year>2016</year>
          ).
          <article-title>The FAIR Guiding Principles for scientific data management and stewardship</article-title>
          .
          <source>Scientific Data</source>
          ,
          <volume>3</volume>
          ,
          <fpage>160018</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Wolstencroft</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Owen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horridge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krebs</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mueller</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Snoep</surname>
            ,
            <given-names>J. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>du Preez</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , et al. (
          <year>2011</year>
          ).
          <article-title>RightField: Embedding ontology annotation in spreadsheets</article-title>
          .
          <source>Bioinformatics</source>
          ,
          <volume>27</volume>
          (
          <issue>14</issue>
          ),
          <fpage>2021</fpage>
          -
          <lpage>2022</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>