<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Ontology-enabled Breast Cancer Characterization</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oshani Seneviratne</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sabbir M. Rashid</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shruthi Chari</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James P. McCusker</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kristin P. Bennett</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>James A. Hendler</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Deborah L. McGuinness</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Rensselaer Polytechnic Institute</institution>
          ,
          <addr-line>Troy NY 12180</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We address the problem of characterizing breast cancer, which today is done using staging guidelines. Our demo will show di erent breast cancer staging results that leverage the Whyis semantic nanopublication knowledge graph framework [8]. The system we developed is able to ingest breast cancer characterization guidelines in a semi-automated manner and then use our deductive inferencer to generate new information based on those guidelines as described in our ISWC resource track paper `Knowledge Integration for Disease Characterization: A Breast Cancer Example' [11]. In this paper we demonstrate the versatility of our framework using a synthetic patient pro le.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        There are existing ontological representations for cancer characterization based
on previous AJCC cancer staging editions [
        <xref ref-type="bibr" rid="ref2 ref7">7,2</xref>
        ]. The cancer characterization in
these ontologies is di erent from ours due to the inclusion of additional
biomarkers as per the AJCC 8th edition staging criteria. Unlike previous ontologies,
our ontologies also include mappings from breast cancer terms to
communityaccepted terms from the National Cancer Institute thesaurus (NCIt) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and
incorporate recommended tests and treatment plans from the openly reusable
CIViC database [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Furthermore, we include terms that are not included in NCIt
or AJCC, such as more speci c subclasses of tumor characteristics (T1, T1 as,
T1 am, T1NOS, etc.) that are available in the Surveillance, Epidemiology, and
End Results (SEER) dataset [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Additionally, we provide an end-to-end system
that demonstrates our ontology's utility for breast cancer characterization.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Prototype System Demonstration</title>
      <p>Some patients who do not respond as expected to traditional treatment paths
may require the physician to consider additional testing measures, new sources,
and/or alternative treatment plans. We will exemplify the interplay of disease
characterization and personalized medicine being o ered by our cancer
characterization tool, powered by semantic web tools, in our demonstration.</p>
      <p>Let's consider the patient pro le with the biomarkers1 given in Table 1.
Suppose there is a physician considering this multitude of parameters related to
tumor biology as well as standard pathology to inform a diagnosis, treatment,
and monitoring plan for this patient. With the utilization of the SDD
methodology, the data in Table 1 is converted to the nanopublication format in the
knowledge graph. Speci c to this use case, our cancer staging ontologies include
the axioms in Listings 3.1 and 3.2 that were extracted from the AJCC 7th and
8th editions respectively.
1 The abbreviations used include: HER2 (Human Epidermal Growth factor receptor
2 ), ER (Estrogen Receptor), PR (Progesterone Receptor).</p>
      <p>Once these axioms, along with the other 400+ staging axioms, are applied
on the patient nanopublication data, the stages as per the 7th and 8th editions
are determined (IIIA in 7th and IIB in 8th). Because of the additional data
streams considered in the 8th guideline (i.e. Tumor Grade, HER2, ER and PR),
the patient now has improved prognosis. The visualization tool in Fig. 1 shows
the changes to the treatment and monitoring options based on the new stage.</p>
      <p>More information on our system, including a video of the system
demonstration, is available at https://cancer-staging-ontology.github.io.
4</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>With new understanding of cancer biology, guidelines are expected to increase in
complexity and personalized care is more sought after as a component of
treatment. Similarly, as the data streams for diagnosing and treating cancer patients
becomes complicated, physicians may have to consult many di erent trusted
sources and use knowledge from clinical trials and literature to decide on
alternative treatment options which can take a great deal of the doctors' and the
patients' precious time. Using our cancer characterization tool, physicians have
access to patient records in RDF nanopublication format, and can infer the stage,
as per the cancer staging ontology, that models the new staging guidelines, and
then can investigate alternative, evidence-based treatment options. Our
visualizations present evidence-based, updated staging determinations and treatment
options, along with provenance. Physician-facing software applications can use
our cancer characterization tool to provide physicians with an e cient way to
investigate alternative treatment options based on di erent staging guidelines.</p>
      <p>In the future, new guidelines for cancer staging are expected to incorporate
genomic test results analyzed in the context of the patient's history. Many predict
a rapid in ux of information related to cancer from clinical trials, as well as
from basic science research. Leveraging all these heterogeneous data sources and
making connections to understand the data is of utmost importance. Our system
demonstration shows how semantics are being used to support this fast changing
landscape.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>This work is partially supported by IBM Research AI through the AI Horizons
Network. We thank our colleagues from IBM (Amar Das, Ching-Hua Chen) and
RPI (John Erickson, Alexander New, Rebecca Cowan) who provided insight and
expertise that greatly assisted the research.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Amin</surname>
            ,
            <given-names>M.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greene</surname>
            ,
            <given-names>F.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Edge</surname>
            ,
            <given-names>S.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Compton</surname>
            ,
            <given-names>C.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gershenwald</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brookland</surname>
          </string-name>
          , R.K., Meyer, L.,
          <string-name>
            <surname>Gress</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Byrd</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winchester</surname>
            ,
            <given-names>D.P.: The</given-names>
          </string-name>
          <string-name>
            <surname>Eighth Edition AJCC Cancer Staging</surname>
          </string-name>
          <article-title>Manual: Continuing to build a bridge from a population-based to a more personalized approach to cancer staging</article-title>
          .
          <source>CA: a cancer journal for clinicians 67(2)</source>
          ,
          <volume>93</volume>
          {
          <fpage>99</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Boeker</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Franca</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bronsert</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schulz</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>TNM-O: ontology support for staging of malignant tumours</article-title>
          .
          <source>Journal of biomedical semantics 7</source>
          (
          <issue>1</issue>
          ),
          <volume>64</volume>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Golbeck</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fragoso</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartel</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oberthaler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The National Cancer Institute's thesaurus and ontology</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>1</volume>
          (
          <issue>1</issue>
          ) (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gri</surname>
            <given-names>th</given-names>
          </string-name>
          , M.,
          <string-name>
            <surname>Spies</surname>
            ,
            <given-names>N.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krysiak</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McMichael</surname>
            ,
            <given-names>J.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Co</surname>
            <given-names>man</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>A.C.</given-names>
            ,
            <surname>Danos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.M.</given-names>
            ,
            <surname>Ainscough</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.J.</given-names>
            ,
            <surname>Ramirez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.A.</given-names>
            ,
            <surname>Rieke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.T.</given-names>
            ,
            <surname>Kujan</surname>
          </string-name>
          ,
          <string-name>
            <surname>L.</surname>
          </string-name>
          , et al.:
          <article-title>CIViC is a community knowledgebase for expert crowdsourcing the clinical interpretation of variants in cancer</article-title>
          .
          <source>Nature genetics 49(2)</source>
          ,
          <volume>170</volume>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hayat</surname>
            ,
            <given-names>M.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Howlader</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reichman</surname>
            ,
            <given-names>M.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Edwards</surname>
            ,
            <given-names>B.K.</given-names>
          </string-name>
          :
          <article-title>Cancer statistics, trends, and multiple primary cancer analyses from the Surveillance, Epidemiology, and End Results (SEER) Program</article-title>
          .
          <source>The oncologist 12(1)</source>
          ,
          <volume>20</volume>
          {
          <fpage>37</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>H.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puymon</surname>
            ,
            <given-names>M.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guru</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohler</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          :
          <article-title>NCCN clinical practice guidelines in oncology (</article-title>
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Massicano</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sasso</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tomaz</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oleynik</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nobrega</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patrao</surname>
            ,
            <given-names>D.F.</given-names>
          </string-name>
          :
          <article-title>An Ontology for TNM Clinical Stage Inference</article-title>
          .
          <source>In: ONTOBRAS</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>McCusker</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rashid</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Agu</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bennett</surname>
            ,
            <given-names>K.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          :
          <article-title>The Whyis Knowledge Graph Framework in Action</article-title>
          .
          <source>International Semantic Web Conference</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Mons</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velterop</surname>
          </string-name>
          , J.:
          <article-title>Nano-Publication in the e-science era</article-title>
          . In: Workshop on Semantic Web Applications in Scienti c Discourse (SWASD
          <year>2009</year>
          ). pp.
          <volume>14</volume>
          {
          <issue>15</issue>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Rashid</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chastain</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stingone</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCusker</surname>
            ,
            <given-names>J.P.:</given-names>
          </string-name>
          <article-title>The Semantic Data Dictionary Approach to Data Annotation &amp; Integration</article-title>
          .
          <source>Proceedings of the First Workshop on Enabling Open Semantic Science (SemSci)</source>
          pp.
          <volume>47</volume>
          {
          <issue>54</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Seneviratne</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rashid</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chari</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCusker</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bennett</surname>
            ,
            <given-names>K.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          :
          <article-title>Knowledge Integration for Disease Characterization: A Breast Cancer Example</article-title>
          .
          <source>International Semantic Web Conference</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>