<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Building Evidence Graph for Clinical Decision Support</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jing Mei</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Wen Sun</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jing Li</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Haifeng Liu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiang Li</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yiqin Yu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guotong Xie</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IBM Research</institution>
          ,
          <addr-line>Beijing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Evidence-based medicine intends to optimize clinical decision making by using evidence. Semantic query answering could help to find the most relevant evidence. However, at point of care, it still lacks time for clinicians to do literature study. In this poster, we propose to build an evidence graph for clinical decision support, in which an evidence ontology is defined with extension of SWRL rules. On top of this graph, we do evidence query and evidence fusion to generate the ranking list of decision options. Our prototype implementation of the evidence graph demonstrates its assistance to decision making, by combining a variety of knowledge-driven and data-driven decision services.</p>
      </abstract>
      <kwd-group>
        <kwd />
        <kwd>Evidence-based Medicine</kwd>
        <kwd>Clinical Decision Support</kwd>
        <kwd>Knowledge Graph</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Providing clinical decision support needs evidence [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The evidence could be a risk
model from cohort studies (such as the Framingham cardiovascular disease 10-year risk
model), or a rule model from clinical guidelines (such as the NICE guideline for
management of type 2 diabetes), or a similarity analysis from electronic health records (such
as the distribution of stroke treatment in similar patients), etc. However, at point of care,
given a patient, what are the most relevant evidence to provide a personalized clinical
decision support for him/her? To address this problem, there are two big challenges.
      </p>
      <p>First is how to find the most relevant evidence. A general solution is a search engine,
which requires (1) the evidence representation, (2) the evidence extraction, (3) the
evidence indexing. Considering that evidence has semantics (e.g. disease hierarchy, drug
interaction), the basic keyword-based search is not enough, and we propose to build an
evidence graph for answering semantic queries.</p>
      <p>Second is how to use the found evidence to provide a personalized clinical decision
support. The state of the art solution is to rank the evidence search results for human
reading, however, at point of care, clinicians rarely have time to do literature study. For
instance, at point of care, given a patient with non-rheumatic atrial fibrillation, the
clinician wants to know the patient’s risk score of stroke, instead of reading papers – even
if the most relevant papers such as the CHA2DS2-VASc study have been retrieved.
Therefore, our evidence graph aims to making evidence machine understandable and
processable, for giving the ranked decision options (instead of the ranked documents).</p>
    </sec>
    <sec id="sec-2">
      <title>Evidence graph</title>
      <p>Description
This class represents evidence sources from external files, web pages
or data sets (e.g. the NICE guideline pdf file, the Framingham Heart
Study web page, the CHARLS data repository).</p>
      <p>This class represents code systems (e.g. disease codes in ICD 10, lab
test codes in LOINC).</p>
      <p>This class represents the population studied in the evidence (e.g.
Chinese retired residents, or overweight diabetes patients).</p>
      <p>This class represents the intervention studied in the evidence (e.g.
insulin therapy, or life style intervention).</p>
      <p>This class represents the comparison studied in the evidence (e.g.
placebo, or an alternative intervention)
This class represents the outcome studied in the evidence (e.g. death,
or hospitalization)
This class represents features defined by logical and/or arithmetic
expressions of terminologies (e.g. the feature of elder is defined as:
age&gt;=60, the feature of male is defined as: gender=male).</p>
      <p>In Figure 1, the left hand side is the 11 properties of 8 classes, such as hasSource
(from Evidence to Source), hasPICO (from Evidence to P,I,C,O), hasFeature (from
P,I,C,O to Feature), hasTerminology (from Feature to Terminology). The right hand
side illustrates an evidence instance, which has data source from the Chinese Health
and Retirement Longitudinal Study (CHARLS1). It has a population instance, namely
retried, with 4 feature instances, i.e., gender=male, gender=female, age&gt;=55, age&gt;=60,
followed by two terminology instances, namely gender and age. The retried population
has an expression: ((gender=male) and (age&gt;=60)) or ((gender=female) and
(age&gt;=55)). Besides, this evidence instance has attributes of id, name, code and
restService, etc., where the restService is the URI of its decision service implementation.</p>
      <p>Here, we remark two key points in the evidence graph. One is the evidence semantic
query answering, and the other is the evidence fusion for decision option ranking.
Actually, the semantic part is nature for our evidence graph, which is defined by an
ontology with extension of SWRL rules. Below is a SWRL sample for evidence inference.
Evidence(?e1) &amp; hasPopulation(?e1, ?p1) &amp; hasFeature(?p1, ?f1) &amp; hasTerminology(?f1, ?t1) &amp;
Evidence(?e2) &amp; hasPopulation(?e2, ?p2) &amp; hasFeature(?p2, ?f2) &amp; hasTerminology(?f2, ?t2) &amp;
subTermOf(?t1, ?t2) =&gt; subEvidenceOf(?e1, ?e2)</p>
      <p>From the ICD 10 code system, we have subTermOf(Stroke, CardiovascularDisease),
which implies that, given a patient with diagnosis of non-rheumatic atrial fibrillation
(AF), the risk models of stroke and other cardiovascular diseases (CVD) could be all
applicable for him/her.</p>
      <p>
        The fusion part is novel, via calling the restService URI of each retrieved evidence.
Taking the above AF patient as an example, both the CHA2DS2-VASc evidence (for
stroke risk model) and the Framingham evidence (for CVD risk model) have been
retrieved. The CHA2DS2-VASc evidence service takes input features of hypertension,
diabetes, congestive heart failure, age &gt;= 75, pre stroke or transient ischemic attach or
thromboembolism, and outputs the stroke risk score from 0 to 6. The Framingham
evidence service takes input features of diabetes, smoking, age, systolic blood pressure,
total cholesterol, HDL cholesterol, and outputs the CVD risk score from 0 to 1. These
different evidence services will be activated (most possibly by input of different
features), and generate scores of different decision options. Finally, we will do a decision
fusion of evidence services, which could leverage fusion algorithms such as majority
voting, weighted average, and meta-classification [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In this example, the stroke risk
socre is 3, the CVD risk score is 0.75, and after decision fusion, the stroke evidence
would be diffused to the CVD evidence, getting the final CVD risk score of 0.86.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion and Outlook</title>
      <p>An evidence graph for clinical decision support is appealing and challenging. This
poster presents an early phase, and we have manually built an evidence graph, including
4 sources from Framingham, CHA2DS2-VASc, NICE, and CHARLS. Based on that,
we implemented the Decision Advisor with integration of a care management system
(namely Curam2), and Figure 2 is the screenshot of a patient’s care plan. At point of
care, suppose that a patient Cao Ping comes, and 3 objectives (a.k.a. outcomes) are
selected. By semantic query answering on the evidence graph, we retrieve out the
evihttp://charls.ccer.edu.cn/en
http://www-01.ibm.com/software/info/curam/
dences applicable for Cao Ping. After posting the patient data to each evidence
restService URI, 5 life-style intervention options are respectively scored by different
evidence services, and the decision fusion finally gives the ranked list. Social activity has
been first recommended to Cao Ping, with the highest rank score.</p>
      <p>Care plan and objectives defined
in Care Plan Mgmt System</p>
      <p>Case worker can select one or more
objectives to see decision advisor result
Options are
from care plan
mgmt. system.</p>
      <p>List ranking score of each option for this
care plan towards the selected objectives.</p>
      <p>Case worker can take use of the result to
make selection for planning.</p>
      <p>
        In future work, we are planning to release the evidence graph of large scale. First of
all, we will leverage Bio2RDF [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] which has published 11 billion triples across 35
bioinformatics datasets. In particular, PubMed could serve as the evidence source, from
which we expect to extract PICO elements. Besides, MeSH (Medical Subject Headings)
could contribute to the terminology class, and NDC (National Drug Code) could
contribute to the intervention class. A harder issue is the service generation, e.g. how to
extract the risk factors from the evidence, and implement the risk calculator with
publishment of the restService URI. Last but not least, we realize that evidence itself has
the quality level and confidence score, while text mining and data mining technologies
further exacerbate the uncertainty. Decision fusion paves the way to combine various
evidences, however it still needs more investigation, special for evidence conflicts.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Craig</surname>
            <given-names>JC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Irwig</surname>
            <given-names>LM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stockler</surname>
            <given-names>MR</given-names>
          </string-name>
          .
          <article-title>Evidence-based medicine: useful tools for decision making</article-title>
          .
          <source>Med J Aust</source>
          .
          <year>2001</year>
          ;
          <volume>174</volume>
          (
          <issue>5</issue>
          ):
          <fpage>248</fpage>
          -
          <lpage>53</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Huang</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Demner-Fushman</surname>
            <given-names>D</given-names>
          </string-name>
          .
          <article-title>Evaluation of PICO as a knowledge representation for clinical questions</article-title>
          .
          <source>AMIA Annual Symposium Proceedings</source>
          .
          <year>2006</year>
          :
          <fpage>359</fpage>
          -
          <lpage>63</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Mei</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xie</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            <given-names>Y.</given-names>
          </string-name>
          <article-title>A Decision Fusion Framework for Treatment Recommendation Systems</article-title>
          .
          <source>Studies in Health Technology and Informatics</source>
          .
          <year>2015</year>
          ;
          <volume>216</volume>
          :
          <fpage>300</fpage>
          -
          <lpage>4</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Belleau</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nolin</surname>
            <given-names>MA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tourigny</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rigault</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morissette</surname>
            <given-names>J.</given-names>
          </string-name>
          <article-title>Bio2RDF: towards a mashup to build bioinformatics knowledge systems</article-title>
          .
          <source>J Biomed Inform</source>
          . 2008 Oct;
          <volume>41</volume>
          (
          <issue>5</issue>
          ):
          <fpage>706</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>