<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Standardization of the Histopathology Cancer Report: An Ontological Approach</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Anna Maria Masci</string-name>
          <email>annamaria.masci@duke.edu</email>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shannon McCall</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessandro Racioppi</string-name>
          <xref ref-type="aff" rid="aff6">6</xref>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Helena Judge Ellis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jihad S. Obeid</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Barry Smith</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christian Stoeckert</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jie Zheng</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Biobanking Without Borders, LLC</institution>
          ,
          <addr-line>Durham, NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Biomedical Informatics Center, Medical University of South Carolina</institution>
          ,
          <addr-line>Charleston, SC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Department of Biostatistics and Bioinformatics, Duke University.</institution>
          <addr-line>Durham, NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Department of Genetics, University of Pennsylvania</institution>
          ,
          <addr-line>Philadelphia, PA</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Department of Pathology, Duke University Health System</institution>
          ,
          <addr-line>Durham, NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>Department of Philosophy, University of Buffalo NY</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>Duke University Department of Biostatistics and Bioinformatics Pilot grant 391-8538</institution>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>University of North Carolina</institution>
          ,
          <addr-line>Chapel Hill, NC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>7</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>Extended Abstract</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Background</title>
      <p>
        In recent years, the complexity of cancer pathology reporting
has increased significantly. The pathology report covers not
only general information such as the presence or absence of
cancer, but includes a collection of specific parameters such as
tumor size, grade, margin, lymphatic or vascular involvement
as well as molecular testing e.g. proteomics and genomics
(Figure 1). Soon, biomarkers and immune profiling will play
an increasingly more important role in determining the
eligibility for particular therapies, along with genetic
predisposition and social risk factors. The increased use of
digital pathology, which allows streamlined sharing of images,
has highlighted the importance of clear communication of the
information displayed in the pathology report. In the past
years, significant effort has been devoted to redefining the
way that histopathology report information is recorded. The
College of American Pathologists (CAP)
(http://www.cap.org/), a leading organization of
boardcertified pathologists, introduced synoptic cancer reports, a
structured checklist to standardize clinical documentation.
Despite continuous improvement and generation of electronic
reports, formal representation [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is still lacking. This lack of
standardization limits the ability to integrate pathology
information with other genomic and proteomic data and often
results in loss of information.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Ontology</title>
      <p>
        Standardized, computable representations, in the form of
ontologies and structured data, are foundational methods for
sharing and integrating data. Ontologies constitute
heterogeneous resources combinable for reasoning and
hypothesis testing. Ontology as we practice it draws on the
example of Gene Ontology (http://www.geneontology.org)
[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], which provides a controlled structured vocabulary for the
description of biological processes, molecular functions and
cellular components. The methodology is standardized
through the Open Biomedical Ontologies (OBO) Foundry
initiative (http://www.obofoundry.org) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Ontologies that
follow this methodology represent complex bodies of
knowledge drawing from various sources by using a consistent
framework, a feature that simpler network representations
lack. Ontologies allow us to use standard W3C
(https://www.w3.org) inference tools to reason over the data
that are annotated with ontology terms. They allow us to
enhance the quality of our knowledge by expanding the power
of data retrieval and analysis.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methods</title>
      <p>
        To address the lack of formal terminology and computational
resources critical to improving the use of the histopathology
reports, we are developing an ontological representation of the
required elements for cancer histopathology. As proof of
concept, we are creating a representation of the required terms
that are shared by most cancer histopathology reports. We are
developing our representation by applying the formalism used
in the OBO Foundry ontologies, including the use of upper
level BFO as common architecture
(http://ifomis.unisaarland.de/bfo/).
Our development activities are taking place in close
collaboration with the developers of several OBO ontologies,
notably, Ontology of BioBanking (OBIB)
(https://github.com/biobanking/biobanking) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], Ontology of
Biomedical Investigations (OBI) (http://obi-ontology.org) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ],
Quantitative Histopathology Image Ontology (QHIO) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], and
Information Artifact Ontology (IAO)
https://github.com/information-artifact-ontology/IAO/.
As a first step we identified the ontological classes
corresponding to the data elements within the report. We next
searched for the terms of interest in other OBO Foundry
ontologies to avoid duplication. For the ones that were not
already in another OBO Foundry ontology, we created new
terms with clear definitions. These new terms will be
submitted to the appropriate OBO Foundry ontology through
their tracker system for approval. In order to capture the
complexity represented in the pathology report we have begun
to connect the different classes. The result is a graph
representation, where the classes are the nodes and the
relations are the edges. To link the different classes we rely on
the relations defined in Relation Ontology (RO) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] an
example of the use of relations is shown in Figure 2.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Advantages</title>
      <p>We are constructing an ontological network that will allow
queries and inferences concerning diverse information in the
report. Examples are the following:
• If the tissue under observation is a breast tissue, a
progesterone receptor will automatically be a biomarker.
• Any specimen must have size, weight, and laterality.
• If lymph node invasion has been reported, one of the
specimens has to be a lymph node.</p>
      <p>This prototype for ontological representation of
histopathology report is an innovative approach that will
facilitate maximal integration of information from various
types of cancer reports, leading to a comprehensive picture of
all the parameters captured in the report. Implementation of
this approach can take different forms from tagging data
collected in existing systems to instantiation of the ontological
representation in semantic graph databases. Applying this
ontological approach will not only improve the annotation
capability of histopathology reporting, but additionally would
enhance the ability to share information and exponentially
increase the power of data retrieval.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.   Campbell,  W.S.,  et  al.,
          <article-title> A  computable  pathology  report   for  precision  medicine:  extending  an  observables   ontology  unifying  SNOMED  CT  and</article-title>
           LOINC.  J  Am  Med   Inform  Assoc,  
          <year>2017</year>
          .  
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.   Ashburner,  M.,  et  al.,  
          <article-title>Gene  ontology:  tool  for  the   unification  of  biology</article-title>
          .  
          <source>The  Gene  Ontology  Consortium.   Nat  Genet</source>
          ,  
          <year>2000</year>
          .  
          <volume>25</volume>
          (
          <issue>1</issue>
          ):  p.  
          <fpage>25</fpage>
          -­‐9.  
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.   Smith,  B.,  et  al.,  
          <article-title>The  OBO  Foundry:  coordinated   evolution  of  ontologies  to  support  biomedical  data   integration</article-title>
          .  Nat  Biotechnol,  
          <year>2007</year>
          .  
          <volume>25</volume>
          (
          <issue>11</issue>
          ):  p.  
          <fpage>1251</fpage>
          -­‐5.  
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.   Brochhausen,  M.,  et  al.,
          <string-name>
            <surname> OBIB</surname>
          </string-name>
          -­
          <article-title>‐a  novel  ontology  for   biobanking</article-title>
          .  J  Biomed  Semantics,  
          <year>2016</year>
          .  7:  p.  
          <volume>23</volume>
          .  
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.   Bandrowski,  A.,  et  al.,  The  Ontology  for  Biomedical   Investigations.  PLoS  One,  
          <year>2016</year>
          .  
          <volume>11</volume>
          (
          <issue>4</issue>
          ):  p.  
          <year>e0154556</year>
          .  
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.   Gurcan,  M.N.,  et  al.,
          <article-title> Developing  the  Quantitative   Histopathology  Image  Ontology  (QHIO):  A  case  study   using  the  hot  spot  detection  problem</article-title>
          .  Journal  of   biomedical  informatics,  
          <year>2017</year>
          .  
          <volume>66</volume>
          :  p.  
          <fpage>129</fpage>
          -­
          <lpage>‐</lpage>
          135.  
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.   Smith,  B.,  et  al.,  Relations  in  biomedical  ontologies.   Genome  Biol,  
          <year>2005</year>
          .  
          <volume>6</volume>
          (
          <issue>5</issue>
          ):  p.  
          <fpage>R46</fpage>
          .  
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>