<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Developing and Deploying an NLP Capability to Accelerate Cancer Research</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Aaron Cohen</string-name>
          <email>cohenaa@ohsu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Oregon Health &amp; Science University Portland</institution>
          ,
          <addr-line>Oregon</addr-line>
          ,
          <country country="US">United States</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>- It has been well documented that a great deal of data useful for medical research is present in clinical narrative text.</p>
      </abstract>
      <kwd-group>
        <kwd>Text mining</kwd>
        <kwd>Cancer research</kwd>
        <kwd>Translational medicine</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>There is perhaps less discussion about how often what was
structured data at its origin has become inaccessible except in
free text form. This problem is further compounded in tertiary
care institutions, like the OHSU Knight Cancer Institute, where
the entire history of a referred patient's condition may only be
present in the electronic health record (EHR) as free text.</p>
      <p>At the same time, future medical advances, such as in cancer
research, will require much more complete patient data than has
been previously available. Such advances include the discovery of
new cures, expanding early detection, and realizing the promise
of precision medicine. Phenotype description and outcome
characterization are two areas in particular where text sources
could greatly supplement our current data.</p>
      <p>The OHSU Knight Cancer Institute has begun a program to
create a natural language processing (NLP) capability to extract,
store, and link data from free text sources at the patient level,
and make this data available to researchers in a continuous,
reusable, efficient and timely manner through services delivery
from the Translational Research Hub (TRH). This talk will
present the challenges, progress, and future goals of our program
to build NLP capabilities that can help us use free text from the
EHR to first support the transformation of cancer research with
the hopes of positively impacting clinical care in the future.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>