<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Prot e´g e´ 5.5 - Improvements for Editing Biomedical Ontologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Matthew Horridge</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rafael S. Gon c¸alves</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Csongor Nyulas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tania Tudorache</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark A. Musen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Stanford Center for Biomedical Informatics Research</institution>
          ,
          <addr-line>Stanford, California</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>7</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>We present Prote´ge´ 5.5, a significant update to the Prote´ge´ Desktop software, which contains new features that are geared towards editing biomedical ontologies. This version of Prote´ge´ contains user-interface enhancements and optimizations that should make the browsing and editing of OBO-library-style biomedical ontologies easier, faster and more efficient when compared to previous versions of Prote´ge´. In what follows we describe the major features that are included with this release of Prote´ge´. Inspiration for these features came directly from the biomedical ontology community.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
    </sec>
    <sec id="sec-2">
      <title>NEW FEATURES</title>
      <p>2.1</p>
      <sec id="sec-2-1">
        <title>Viewing and Searching for Terms</title>
        <p>This version of Prote´ge´ contains various enhancements related to
the display of terms in various views:
Display of OBO Ids Prote´ge´ now displays OBO-style identifiers
alongside human readable labels in various places throughout the
user interface. This includes presentation in the search results view
(Fig. 1), in tooltips that are shown when hovering over terms (Fig.
2) and also in the term browsing view (Fig. 3).</p>
        <p>Links in Annotation Values We have expanded the kinds of
tokens in annotation and XRef values that get rendered as clickable
links. In this latest version of Prote´ge´, substrings that correspond
to PubMed Ids (e.g. “PMID:18461074”), Wikipedia pages (e.g.
“Wikipedia:DNA repair”) and ISBNs (e.g. “ISBN:0198506732”)
are clickable and link out to the appropriate Web pages. In the future,
we intend to offer some form of customization of this functionality
so that other database identifiers can be made clickable.
Fig. 1: The Search Results View now displays OBO Ids for terms that have
IRIs corresponding to OBO Ids. Multi-word search is also now supported.
Search Improvements The default search functionality in Prote´ge´
now supports multi-word search that is insensitive to the order in
which search words are entered. For example, searching for “Heart
Valve” will now return terms whose labels contain “Heart Valve”
and also “Valve of Heart”. Search can also be performed on OBO Ids
without changing any configuration settings (e.g. the renderer). For
example, searching the Gene Ontology (GO) for “GO:0005634”
will find the term nucleus.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Browsing Hierarchical Structure</title>
        <p>Biomedical ontologies tend to be large, with hierarchies that are
deep and that have large spans. Furthermore, hierarchies are
multi-axial, with relationships other than IS-A (SubClassOf) being
important. We have made two major improvements with regards to
hierarchy browsing:
Mixed Hierarchy Viewing We have added the ability to display
relationships other than IS-A in the class hierarchy view. This
longawaited feature can be toggled on or off via the View menu. Figure
2 shows an example mixed hierarchy from the GO. SubClass
childparent relationships are distinguished from non-subclass
parentchild relationships using differently colored arrows — blue arrows
represent non-subclass relationships.</p>
        <p>Hovering over a class in the hierarchy displays a tooltip stating
the relationship that causes the child-parent relationship to hold. In
the future we intend to allow subsets of properties to be used in
the display of the tree. We are also considering using customizable
colors in order to provide more of a visual clue as to the underlying
relationship type between child-parent terms.</p>
        <p>Breadcrumb Trails When browsing large hierarchies it is easy to
lose sight of the positions of selected terms and the branch that they
appear in. We have therefore added a breadcrumb trail to the top of
the main Prote´ge´ window. This can be toggled on or off as necessary.
Horridge et al
Hovering over a term name in the breadcrumb trail shows the display
name for the term along with its OBO Id and its IRI. Term names
can be clicked to change the term selection in order to jump up to
parent and ancestor terms in the hierarchy.
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>Workflow Support for Obsoleting Terms</title>
        <p>One significant new feature that we have added is a wizard for
obsoleting (deprecating) terms. It is typically the case that several
detailed steps are involved in the deprecation of terms. Furthermore,
each ontology project can have slightly different deprecation
strategies. We have therefore provided a configurable deprecation
wizard that leads users through the steps that are required when
deprecating terms. Deprecation profiles, which are simple YAML
files, can be dropped into a Prote´ge´ installation in order to provide
project specific deprecation workflows. We have provided three
profiles out of the box (1) A generic “Basic” profile; (2) A “Gene
Ontology (GO)” profile, that follows the steps required by the GO
consortium1; and (3) A “Ontology for Biomedical Investigation
(OBI)” profile, that is as detailed as the GO profile, but requires
different steps.
2.4</p>
      </sec>
      <sec id="sec-2-4">
        <title>Integration with Git</title>
        <p>Biomedical ontology projects have long used issue trackers and
version control systems for project management. In recent years
GitHub has come to dominate the version control repository
landscape and projects such as the GO project now use this
technology. The GO workflow encourages editors to create Git
branches for additions and bug fixes. These branches are then
merged into the master branch once the editing for a particular
feature is complete. We have therefore provided a lightweight
integration of Prote´ge´ with Git.</p>
        <p>Display of Git Repository Status If Prote´ge´ detects that an
ontology has been loaded from a location within a Git repository, it
will display the checked out branch on the left hand side of the status
1 http://go-ontology.readthedocs.io/en/latest/
ObsoleteTerm.html
bar (Fig. 4). It will also display an indication as to whether there
have been uncommitted changes to the ontologies in the imports
closure of the loaded ontology.</p>
        <p>Use of Git User Name for Term Creation Metadata The last few
versions of Prote´ge´ have offered the ability for various term creation
metadata to be automatically stamped onto a term as annotations.
One auto-generated annotation is “created by”, which takes the
name of an editor. In this latest version, Prote´ge´ can now utilize the
Git user name for the current Git repository (Fig. 5), so that “created
by” annotation values correspond to Git user names in commits to a
repository.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>CONCLUSIONS</title>
      <p>We have presented the major features of Prote´ge´ 5.5 that are geared
towards the editing of biomedical ontologies. We seek feedback
from the biomedical ontology community in order that we may
continue to enhance Prote´ge´ so that it meets the needs of this
community.</p>
    </sec>
    <sec id="sec-4">
      <title>ACKNOWLEDGEMENTS</title>
      <p>We are heavily indebted to the Biomedical Ontology Community for
suggestions and feedback on the new features that we have presented
in this paper. We would especially like to thank the participants
of the 2017 Berkeley GO Editors Workshop: (in alphabetical
order) Kimberly Van Auken, Tanya Berardini, Seth Carbon, Karen
Christie, Eric Douglass, Pascale Gaudet, Harold Drabkin, David
Hill, Huaiyu Mi, Suzi Lewis, Donghui Li, Chris Mungall, Monica
Munoz-Torres, David Osumi-Sutherland, and Nicole Vasilevsky –
this work would not have been possible without their input. Special
thanks go to Chris Mungall for an invitation to this meeting.</p>
      <p>This work was funded by Grant GM121724 from the National
Institute of General Medical Sciences at the United States National
Institutes of Health.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>