<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>KB Bio 101: A Repository of Graph-Structured Knowledge</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vinay K. Chaudhri</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Wessel</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stijn Heymans</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Arti cial Intelligence Center, SRI International</institution>
          ,
          <addr-line>Menlo Park, CA, 94025</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The goal of Project Halo is to develop a \Digital Aristotle" | a reasoning
system capable of answering novel questions and solving advanced problems in a
broad range of scienti c disciplines and related human a airs [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. As part of this
e ort, SRI has created a system called Automated User-Centered Reasoning and
Acquisition System (AURA) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], which enables educators to encode knowledge
from science textbooks in a way that it can be used for answering questions by
reasoning.
      </p>
      <p>
        A team of biologists is currently using AURA to encode a popular biology
textbook that is used in advanced high school and introductory college courses in
the United States [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. The knowledge base called KB Bio 101 is an outcome of
this e ort and contains concept taxonomy for the whole textbook and detailed
rules for 20 chapters of the textbook. The current focus in the project is to
expand the KB Bio 101 to cover all the 56 chapters of the book by December
2013. In the longer-term, KB Bio 101 will be expanded both in expressiveness
and coverage. In terms of expressiveness, the Project Halo team is investigating
the use of defaults, exceptions, negations, disjunctions and a process language.
In terms of scope, the KB will likely be expanded to cover multiple textbooks
potentially spanning a full undergraduate curriculum.
      </p>
      <p>
        AURA uses a knowledge representation and reasoning system called
Knowledge Machine (KM) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. KM supports a variety of representation features that
include a facility to de ne classes and organize them into a hierarchy and de ne
concept partitions (disjointness and covering axioms), ability to de ne relations
(also known as slots) and organize them into a relation hierarchy, support for
nominals, a facility to de ne horn rules, a procedure language, a situation
mechanism, and a STRIPS representation for actions. KM performs reasoning by using
inheritance, description-logic style classi cation of individuals, backward
chaining over rules, and a heuristic uni cation. In addition, KM can use its situation
mechanism and STRIPS representation of actions to simulate their execution.
While the AURA team has experimented with the use of all of these features,
the current core of AURA leverages only a small subset. The Project Halo team
has invested signi cant e ort to identify these core features and to specify them
in a declarative manner. One example of such an e ort is the work to specify the
heuristic uni cation in KM using an answer set programming framework [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The
net result of these e orts is that the team is now able to export the KB Bio 101
in a variety of standard declarative languages, for example, rst order logic with
equality [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], SILK [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], description logics (DLs) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and answer set programming
[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        The KB Bio 101 is a central component of an electronic textbook application
called Inquire Biology [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] aimed at students studying from it. SRI has worked
with teachers and students to collect a large number of questions that are of
practical interest for this application. Working from those questions, the team
has formulated logical reasoning tasks that must be performed by a reasoner.
      </p>
      <p>
        The KB Bio 101 presents a unique opportunity for us to test our reasoners
and to motivate further development. Recognizing that logical reasoning is only
one component of the overall task of answering questions, the team at SRI is in
the process of formulating similar challenges for knowledge representation [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and
natural language generation [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] which are also centered on KB Bio 101. Taken
collectively, these multiple challenges position us to make major leaps in AI in
general, and knowledge-based question answering in particular.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Representation of Graphs in a Standard DL Syntax</title>
      <p>There are two problems that need to be addressed to provide a representation of
graphs in the DLs: de ning a syntax for describing graphs and de ning a family
of graph expressiveness layers. We explain this in more detail next.</p>
      <p>
        In principle, role value maps would be needed in order to truthfully represent
the content of the KB Bio 101. Role-value maps are a standard-way of
expressing graph-structured descriptions in DL syntax. Unfortunately, unrestricted role
value maps quickly lead to undecidability. There are decidable variants of role
value maps, e.g. the restricted role-value-maps in a description logic with
existential restrictions and terminological cycles (E L with cyclical TBoxes) of Baader
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and we will check the applicability of this work to KB Bio 101.
      </p>
      <p>
        In recent work on description graphs [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and description graph logic
programs [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], a DL knowledge base is extended using a graph structure. While this
proposal allows representation of graphs, it does not extend the conventional DL
syntax in a graceful manner in that the conventional syntax can be completely
abandoned in favor of this new syntax. The OWL export of KB Bio 101 extends
the conventional syntax of OWL to encode graph structures.
      </p>
      <p>Restrictions in description graphs prohibit the use of certain forms of cycles
are too severe for KB Bio 101 which needs cyclicity in addition to the ability
to express graphs. While the work on description graphs acknowledges the need
for more expressive formalisms that go beyond tree structures, the nature of
KB Bio 101 is su ciently di erent from the setting in description graphs that it
requires further research and could prove to be a data set that drives research
beyond the current state.</p>
    </sec>
    <sec id="sec-3">
      <title>Reasoning with Graph-Structured Descriptions</title>
      <p>Similarity reasoning and relationship reasoning are two tasks that are of great
practical interest to our application. In a similarity reasoning task, we are given
two graph structured descriptions A and B, and the task is to compute new
descriptions that correspond to their intersection and di erence.</p>
      <p>In the relationship reasoning task, we rst create an ABOX by instantiating
each concept in the TBOX, and then given two individuals A and B, we wish to
compute all possible paths of a certain length between those individuals.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Summary</title>
      <p>An initial version of the KB Bio 101 in OWL is now available. We are interested
in identifying collaborators interested in exploiting this KB in the context of
their tool set. We will work with them to rst de ne an acceptable translation,
and then participate in an experimental evaluation of the results of the reasoning
tasks suggested above.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work has been funded by Vulcan Inc.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <article-title>Deep knowledge representation and reasoning challenge</article-title>
          . https://sites. google.com/site/dkrckcap2011/ and https://sites.google.com/site/ 2nddeepkrchallenge/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Inquire</surname>
          </string-name>
          :
          <article-title>An Intelligent Textbook</article-title>
          . http://aivideo.org/
          <year>2012</year>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Project</given-names>
            <surname>Halo</surname>
          </string-name>
          . http://www.projecthalo.com.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Franz</given-names>
            <surname>Baader</surname>
          </string-name>
          .
          <article-title>Restricted role-value-maps in a description logic with existential restrictions and terminological cycles</article-title>
          .
          <source>In Proceedings of the International Workshop on Description Logics (DL</source>
          <year>2003</year>
          ),
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Franz</given-names>
            <surname>Baader</surname>
          </string-name>
          , Diego Calvanese, Deborah L.
          <string-name>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <surname>Daniele Nardi</surname>
          </string-name>
          , and
          <string-name>
            <surname>Peter F.</surname>
          </string-name>
          Patel-Schneider, editors.
          <source>The Description Logic Handbook: Theory, Implementation and Applications</source>
          . Cambridge University Press, 2nd edition,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Eva</given-names>
            <surname>Banik</surname>
          </string-name>
          , Claire Gardent,
          <string-name>
            <given-names>Donia</given-names>
            <surname>Scott</surname>
          </string-name>
          , Nikhil Dinesh, and Fennie Liang.
          <article-title>KBGen Text Generation from Knowledge Bases as a New Shared Task</article-title>
          .
          <source>In International conference on Natural Language Generation</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Vinay</surname>
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Chaudhri</surname>
            and
            <given-names>Tran C.</given-names>
          </string-name>
          <string-name>
            <surname>Son</surname>
          </string-name>
          .
          <article-title>Specifying and Reasoning with Underspeci ed Knowledge Bases Using Answer Set Programming</article-title>
          .
          <source>In Proc. of International Conference on Knowledge Representation and Reasoning (KR)</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Peter</surname>
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Clark</surname>
            and
            <given-names>Bruce</given-names>
          </string-name>
          <string-name>
            <surname>Porter</surname>
          </string-name>
          .
          <article-title>Knowledge machine userss guide</article-title>
          .
          <source>Technical report</source>
          , University of Texas at Austin.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Melvin</given-names>
            <surname>Fitting</surname>
          </string-name>
          .
          <source>First-Order Logic and Automated Theorem Proving</source>
          . Springer,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>M.</given-names>
            <surname>Gelfond</surname>
          </string-name>
          and
          <string-name>
            <given-names>V.</given-names>
            <surname>Lifschitz</surname>
          </string-name>
          .
          <article-title>Logic programs with classical negation</article-title>
          . In D.Warren and Peter Szeredi, editors,
          <source>Logic Programming: Proceedings of the Seventh International Conference</source>
          , pages
          <volume>579</volume>
          {
          <fpage>597</fpage>
          ,
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Benjamin</surname>
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Grosof</surname>
          </string-name>
          . SILK:
          <article-title>Higher Level Rules with Defaults and Semantic Scalability</article-title>
          . In Axel Polleres and Terrance Swift, editors,
          <source>Web Reasoning and Rule Systems</source>
          , Third International Conference (RR
          <year>2009</year>
          ), volume
          <volume>5837</volume>
          of Lecture Notes in Computer Science, pages
          <volume>24</volume>
          {
          <fpage>25</fpage>
          . Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. David Gunning,
          <string-name>
            <given-names>Vinay K.</given-names>
            <surname>Chaudhri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Clark</surname>
          </string-name>
          , Ken Barker,
          <string-name>
            <surname>Shaw-Yi</surname>
            <given-names>Chaw</given-names>
          </string-name>
          , Mark Greaves, Benjamin Grosof, Alice Leung,
          <string-name>
            <surname>David McDonald</surname>
            ,
            <given-names>Sunil</given-names>
          </string-name>
          <string-name>
            <surname>Mishra</surname>
            , John Pacheco, Bruce Porter, Aaron Spaulding, Dan Tecuci, and
            <given-names>Jing</given-names>
          </string-name>
          <string-name>
            <surname>Tien</surname>
          </string-name>
          .
          <source>Project Halo Update { Progress Toward Digital Aristotle. AI Magazine</source>
          ,
          <year>Fall 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Despoina</surname>
            <given-names>Magka</given-names>
          </string-name>
          , Boris Motik, and
          <string-name>
            <given-names>Ian</given-names>
            <surname>Horrocks</surname>
          </string-name>
          .
          <article-title>Modeling Structured Domains using Description Graphs and Logic Programming</article-title>
          .
          <source>In Proceedings of the International Workshop on Description Logics (DL</source>
          <year>2012</year>
          ),
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Boris</surname>
            <given-names>Motik</given-names>
          </string-name>
          , Bernardo Cuenca Grau, Ian Horrocks, and
          <string-name>
            <given-names>Ulrike</given-names>
            <surname>Sattler</surname>
          </string-name>
          .
          <article-title>Representing Ontologies using Description Logics, Description Graphs and Rules</article-title>
          .
          <source>Arti cial Intelligence</source>
          ,
          <volume>173</volume>
          :
          <fpage>1275</fpage>
          {
          <fpage>1309</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Jane</surname>
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Reece</surname>
          </string-name>
          , Lisa A.
          <string-name>
            <surname>Urry</surname>
          </string-name>
          , Michael L. Cain, Steven A.
          <string-name>
            <surname>Wasserman</surname>
          </string-name>
          ,
          <string-name>
            <surname>Peter</surname>
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Minorsky</surname>
          </string-name>
          , and
          <string-name>
            <surname>Robert</surname>
            <given-names>B. Jackson. Campbell</given-names>
          </string-name>
          <string-name>
            <surname>Biology</surname>
          </string-name>
          .
          <source>Benjamin Cummings, 9th edition</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>