<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Querying Biomedical Ontologies in Natural Language using Answer Set Programming</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Halit Erdogan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Umut Oztok</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yelda Erdem</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Esra Erdem</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Faculty of Engineering and Natural Sciences, Sabancı University</institution>
          ,
          <addr-line>I ̇stanbul</addr-line>
          ,
          <country country="TR">Turkey</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Research and Development Department, Sanovel Pharmaceutical Inc.</institution>
          ,
          <addr-line>I ̇stanbul</addr-line>
          ,
          <country country="TR">Turkey</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recent advances in health and life sciences have led to generation of a large amount of data. To facilitate access to its desired parts, such a big mass of data has been represented in structured forms, like biomedical ontologies. On the other hand, representing ontologies in a formal language, constructing them independently from each other and storing them at different locations have brought about many challenges for answering queries about the knowledge represented in these ontologies. One of the challenges for the users is to be able represent a complex query in a natural language, and get its answers in an understandable form: Currently, such queries are answered by software systems in a formal language, however, the majority of the users lack the necessary knowledge of a formal query language to represent a query; moreover, none of these systems can provide informative explanations about the answers. Another challenge is to be able to answer complex queries that require appropriate integration of relevant knowledge stored in different places and in various forms. In this work, we address the first challenge by developing an intelligent user interface that allows users to enter biomedical queries in a natural language, and that presents the answers (possibly with explanations if requested) in a natural language. We address the second challenge by developing a rule layer over biomedical ontologies and databases, and use automated reasoners to answer queries considering relevant parts of the rule layer. The main contributions of our work can be summarized as follows: - We introduce a controlled natural language, a subset of natural language with a restricted grammar and vocabulary, specifically for biomedical queries towards drug discovery; we call this controlled natural language as BIOQUERYCNL [4]. For instance, in this language, we can pose the following query: “What are the genes that are targeted by the drug Epinephrine and that interact with the gene DLG4?” - We present an algorithm that converts a biomedical query in BIOQUERYCNL into a program in answer set programming (ASP) - a formal framework to automate reasoning about knowledge [8] - making use of the parsing engine APE [5]. Figure 1 shows the overall idea behind this algorithm. For instance, according to this algorithm, the query above is translated into the following ASP program: what_be_genes(GN1) :gene_gene(GN1,"DLG4"), drug_gene("Epinephrine",GN1).</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        – Once we transform the biomedical query into an ASP program and extract the
relevant part of the rule layer (also an ASP program), we can compute its answers
(if exists) using a state-of-the-art ASP system, such as CLASP [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], DLV [
        <xref ref-type="bibr" rid="ref3 ref7">3,7</xref>
        ] or
DLVHEX [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], as described in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Figure 2 shows the overall idea behind this
algoQuerying Biomedical Ontologies using ASP
      </p>
      <p>rithm. For instance, using CLASP, we compute the following answer to the query
above: “ADRB1”.
– We construct an algorithm to provide minimal explanations to the answers. For
instance, for “ADRB1” our algorithm provides the following minimal explanation:
the drug “Epinephrine” targets the gene “ADRB1” according to CTD and
the gene “ADRB1” interacts with the gene “DLG4” according to BIOGRID.</p>
      <p>The applicability of our methods is illustrated with some complex queries over
PHARMGKB, DRUGBANK, BIOGRID, SIDER and CTD, using the ASP systems CLASP
(with GRINGO), DLV and DLVHEX.</p>
      <p>Acknowledgments. This work has been supported by TUBITAK Grant 108E229.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bodenreider</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Coban</surname>
            ,
            <given-names>Z.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Doganay</surname>
            ,
            <given-names>M.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erdem</surname>
          </string-name>
          , E.:
          <article-title>A preliminary report on answering complex queries related to drug discovery using answer set programming</article-title>
          .
          <source>In: Proc. of the 3rd International Workshop on Applications of Logic Programming to the Semantic Web and Web Services</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Eiter</surname>
            , T., G.Ianni,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Schindlauer</surname>
          </string-name>
          , H.Tompits:
          <article-title>Effective integration of declarative rules with external evaluations for Semantic-Web reasoning</article-title>
          .
          <source>In: Proc. of ESWC</source>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Eiter</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leone</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mateis</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pfeifer</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scarcello</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A deductive system for nonmonotonic reasoning</article-title>
          .
          <source>In: Proc. of LPNMR</source>
          . pp.
          <fpage>364</fpage>
          -
          <lpage>375</lpage>
          (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Erdem</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yeniterzi</surname>
          </string-name>
          , R.:
          <article-title>Transforming controlled natural language biomedical queries into answer set programs</article-title>
          .
          <source>In: Proc. of the Workshop on BioNLP</source>
          . pp.
          <fpage>117</fpage>
          -
          <lpage>124</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Fuchs</surname>
            ,
            <given-names>N.E.</given-names>
          </string-name>
          :
          <article-title>Attempto controlled english</article-title>
          .
          <source>In: Proc. of WLP</source>
          . pp.
          <fpage>211</fpage>
          -
          <lpage>218</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Gebser</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaufmann</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neumann</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schaub</surname>
          </string-name>
          , T.: T.:
          <article-title>Conflict-driven answer set solving</article-title>
          .
          <source>In: Proc. of IJCAI</source>
          . pp.
          <fpage>386</fpage>
          -
          <lpage>392</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Leone</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pfeifer</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faber</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eiter</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gottlob</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Perri</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scarcello</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>The dlv system for knowledge representation and reasoning</article-title>
          .
          <source>ACM Trans. Comput. Log</source>
          .
          <volume>7</volume>
          (
          <issue>3</issue>
          ),
          <fpage>499</fpage>
          -
          <lpage>562</lpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Lifschitz</surname>
          </string-name>
          , V.:
          <article-title>What is answer set programming?</article-title>
          <source>In: Proc. of AAAI</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>