<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The LogAnswer Pro ject at QA4MRE 2012</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ingo Glockner</string-name>
          <email>ingo.gloeckner@fernuni-hagen.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bjorn Pelzer</string-name>
          <email>bpelzer@uni-koblenz.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Arti cial Intelligence Research Group University of Koblenz-Landau</institution>
          ,
          <addr-line>Universitatsstr. 1, 56070 Koblenz</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Intelligent Information and Communication Systems Group (IICS), University of Hagen</institution>
          ,
          <addr-line>59084 Hagen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>For QA4MRE 2012, we have improved our prototype of a pure logic-based answer validation system that was originally developed for QA4MRE 2011. The system uses core functionality of the LogAnswer question answering (QA) system available on the web, which was combined and extended such as to best meet the demands of the QA4MRE task. In particular, coreference resolution is used in order to allow knowledge processing on the document level, and a fragment of OpenCyc has been integrated in order to allow more exible reasoning. In addition to a general consolidation of the original prototype that took part in QA4MRE 2011, the current system was optimized for accuracy on nonrejected questions. While last year's system only achieved an accuracy of 0.24 for answered questions, the various measures taken to improve the rejection decision now yield an accuracy of 0.31 for answered questions in the QA4MRE 2012 task. Results show that these improvements were su cient to raise the overall quality of answers to a c@1 score of 0.26, which is clearly above the c@1 baseline of 0.20 for random guessing. Due to the di culty of the reading tests in QA4MRE, we consider this result promising, recalling the near-baseline scores of most QA4MRE participants in 2011.1</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The DFG-funded project LogAnswer investigates the use of deep linguistic
processing and logical reasoning for QA, with a special emphasis on the issues
of robustness and e ciency. An experimental QA system for German (also
called LogAnswer) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] that evolved from this research can be tested online
(www.loganswer.de), a screenshot is shown in see Fig. 1. The system took part
in the QA@CLEF and ResPubliQA QA system evaluations [
        <xref ref-type="bibr" rid="ref3 ref4 ref5">3,4,5</xref>
        ]. Recently, we
have focused on the issue of providing automatic QA support for human Q&amp;A
portals. Users of the German FragWikia! portal (http://frag.wikia.com/) can
subscribe to the automatic answer service on the LogAnswer website. LogAnswer
keeps track of new questions at FragWikia! and sends an answer email whenever
1 LogAnswer is supported by the DFG with grants FU 263/12-2 and GL 682/1-2.
      </p>
      <p>
        How many people died when the MS Estonia sank?
more than 900 people
it nds su ciently good results for a question of a registered user. In this new
use case for QA, we face the challenge that there is often few lexical overlap
between question and answer. Moreover the system must validate answers in
order to achieve better precision. This is because users do not necessarily expect
the system to answer at all in this scenario. But if it does post a response, then
the answer should show a high probability of correctness, in order to avoid users
getting annoyed by the machine-generated answers. This scenario resembles the
present QA4MRE machine reading exercise [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] in that there is often no close
relationship between question and answers, and in the need to validate answers
rather than always providing a ranked list of answer candidates.
      </p>
      <p>Despite these similarities that make the QA4MRE participating appealing for
our project, application of existing LogAnswer system (even with the
precisionoriented re nements for QA portals) to the QA4MRE task does not make sense:
{ LogAnswer normally nds answer from scratch, based on a candidate
retrieval backend whose retrieval scores are considered in determining the
nal ranking. In QA4MRE, however, there is no retrieval score as one of the
starting points for reranking.
{ LogAnswer usually works with a sentence-level segmentation of documents,
while QA4MRE may demand a justi cation of answers from several parts of
a document or even from several documents.
{ Since the validation features of LogAnswer depend on sentence-level
segmentation and since the retrieval score as one of these features, the validation
models of LogAnswer as determined from a learning-to-rank approach are
not applicable.
{ The existing QA@CLEF-based training set for the machine learning (ML)
technique cannot be reused either because it shows strong e ect of lexical
overlap (unlike QA4MRE) and because it comprises sentence-level items
only.</p>
      <p>
        Owing to these di culties that were already present in QA4MRE 2011, we
decided to build a new LogAnswer prototype for the QA4MRE task in the last
year [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], using only those basic techniques from the regular LogAnswer system
that seemed useful for the task:
{ The core system for QA4MRE was designed to achieve logical answer
validation on document level representations;
{ Existing LogAnswer functionality was reused whenever possible;
{ Instead of using the supplied \QA4MRE Background Collection" or a
webbased validation approach, we decided to strengthen the background
knowledge of LogAnswer by integrating part of OpenCyc;
{ Generation of explanations for a chosen answer was added.
      </p>
      <p>For QA4MRE 2012, our goal was a general consolidation of the LogAnswer
prototype built for QA4MRE and an improvement of its capabilities for rejecting
wrong answers in particular.
2</p>
    </sec>
    <sec id="sec-2">
      <title>System Description</title>
      <p>
        The LogAnswer prototype for QA4MRE relies on a deep linguistic analysis.
{ The reading tests, questions and answer candidates are parsed by the WOCADI
parser [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
{ WOCADI computes a meaning representation for each sentence in the form
of a Multilayered Extended Semantic Network (MultiNet [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]).
{ The question is then subjected to a question classi cation, based on a system
of 248 classi cation rules that operate on the generated semantics networks.
In QA4MRE 2011, the question classi cation was of minor importance compared
to regular Q&amp;A here since no predictive annotation and no expected answer
type check was used. For QA4MRE 2012, some e ort was spent in utilizing the
computed question category, speci c answer types (as known from the former
QA@CLEF and ResPubliQA exercises) and other information determined by
the question classi cation rules in order to eliminate questions that the system
is unable to answer:
{ LogAnswer is designed for answering single-sentence questions. Therefore,
questions that span more than one sentence are now discarded.
{ LogAnswer has no special means for validating list answers (e.g. answers
involving a coordination such as `London and Paris'). Therefore question
that target at several answers (such as `Which cities. . . ') are now recognized
and discarded since they are not likely to be answered correctly.
{ Questions are now generally discarded if no expected answer type can be
established, except if the question is classi ed into the `DEFINITION'
category (for de nition questions).
      </p>
      <p>In the following we sketch the construction of the logical representation on the
document level.</p>
      <p>
        { Coreference resolution is essential to the success of logic-based processing on
whole documents: under the unique name assumption, the combined
information about a discourse entity can only be utilized if all mentions of the
entity in the text are represented by the same constant. We therefore use
a coreference resolver, CORUDIS [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], for handling anaphoric pronouns and
other forms of coreference.
{ The coreference resolver is complemented with a simplistic solution to
identifying the speaker from the manuscript of a talk which was activated in the
QA4MRE runs. The method simply chooses the rst name mentioned at the
beginning of the manuscript. Knowing the speaker is necessary for resolving
rst person deixis `Ich'/'I').
{ In a process called assimilation, the text representation is constructed from
all logical sentence representations, thereby using the results of coreference
analysis and speaker recognition.
      </p>
      <p>In order to capture more inferences, ca. 20,000 subsumption relationships were
extracted from OpenCyc and translated into our concept system for German,
see Section 3. For e ciency reasons, this subsumption hierarchy is not added to
the background knowledge seen by the prover. Instead, the text representation
is enriched by explicitly declaring each instance of a concept an instance of all
of its superconcepts as well.</p>
      <p>
        Example:
{ A document mentions a tsunami.
{ Logical representation contains statement like SUB(c24; tsunami.1.1),
expressing that entity c24 is an instance of a tsunami (with HaGenLex word
sense index 1.1, see [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]).
{ From the OpenCyc link, we know that every tsunami is a `Naturkatastrophe'
(natural disaster).
{ The OpenCyc expansion step then adds an assertion expressing that c24 is
also an instance of a natural disaster, viz SUB(c24; naturkatastrophe.1.1).
      </p>
      <p>Let us now turn to hypothesis construction. By a hypothesis, we mean a
declarative statement formed from the question and the candidate answer.
Consider the question `What is Bono's attitude with respect to the digital age?' and
the possible answer \enthusiasm". In this case, the textual hypothesis would
be `Bono's attitude with respect to the digital age is enthusiasm'. In our
prototype, the template-based construction of a textual hypothesis from question and
answer, followed by parsing, is only used as a fallback. This is because the
textbased hypothesis construction is problematic for German due to mismatches of
word forms with respect to syntactic case. Instead, our system normally
constructs the logical hypothesis directly on the logical level, by combining the
literal lists that represent question and answer.</p>
      <p>Based on the logical hypothesis (as a list of literals) and the logical
representation of the document for the given reading test, our system then uses robust
logical processing for justifying or rejecting the hypothesis (and in turn, the
answer candidate). This procedure follows the general principle of logical answer
validation:
{ Try to prove the logical hypothesis from the logical representation of the
reading test document and from the general background knowledge of the
system.
{ If such a proof succeeds, then the answer from which the hypothesis was
constructed is considered correct.</p>
      <p>In practice, linguistic analyses contain errors, coreference resolution can fail, and
the background knowledge is incomplete. Therefore, the proof attempt may fail
for correct hypotheses, and we cannot conclude from a failed proof that the
hypothesis is indeed wrong. For ensuring su cient robustness, we thus add a
relaxation loop to the logical validation process:
{ Remove problematic hypothesis literals one at a time until a proof of the
remaining hypothesis fragment succeeds.
{ Impose a limit on the number of admissible relaxation steps in order to keep
processing time within reasonable bounds.</p>
      <p>The skipped literal count and other criteria related to the relaxation result are
then used as features that a ect answer scoring.</p>
      <p>QA4MRE 2011 required systems to provide one or more so-called provenance
elements that justify the chosen answer. This requirement was simpli ed in this
year's task, which only assumes general information of the resources involved to
be provided. Still, we did not eliminate the extraction of detailled justi cations
from our system since it proved useful for computing validation scores, depending
on the coherence of found justi cations.</p>
      <p>The computation of provenance information (i.e. of the individual justifying
elements that together explain the answer) is based on prover results:
{ All names of used axioms (implicative rules) are added as provenance
elements, since axiom names in our system are chosen in a meaningful way.
{ Used facts from the background knowledge generally express lexical-semantic
relations, such as CHPA(hoch.1.1; hohe.1.1) (i.e., property `high' corresponds
to attribute `height'). Knowing the MultiNet documentation, such a fact is
self-explanatory and can be added as a provenance element.
{ For literals from the text representation such as SUB(c24; tsunami.1.1), the
system presents a witness sentence rather than the used fact itself. These
witness sentences are also considered as provenence elements and thus included
into the computed justi cation.</p>
      <p>Since no su cient training data was available for applying a machine-learning
technique that automatically determines validation scores, we used an ad-hoc
scoring metric that closely resembles the validation criterion already used in the
last year.2 Many of the validation features normally used by LogAnswer had to
be omitted since it was not clear how to utilize them in a simple hand-crafted
scoring criterion.</p>
      <p>As in the last year, we started from a basic scoring metric % de ned as the
arithmetic mean of:
{ %1 = #skipped-lits, where = 0:7.3
{ %2 = #skipped-lits #unknown-status-lits, where = 0:8.4
{ %3: optimistic proportion of provable question literals:
{ %4: pessimistic proportion of proved question literals:
{ %5: optimistic proportion of provable answer literals:
{ %6: pessimistic proportion of proved answer literals:
%3 = 1
#skipped-q-lits</p>
      <p>#all-q-lits
%4 =
#proved-q-lits</p>
      <p>#all-q-lits
%5 = 1
#skipped-a-lits</p>
      <p>#all-a-lits
%6 =
#proved-a-lits</p>
      <p>#all-a-lits
This basic validation score is then combined with a criterion intended to capture
the coherence of the justi cation of the answer as found by the system using a
relaxation proof. In order to determine this coherence score, we organize the
witness sentences into blocks of adjacent sentences. Let us say that a block of
adjacent witness sentences is `unconnected' to earlier blocks if it is not linked
by coreference to these blocks. Let u be the number of unconnected blocks of
witness sentences for the given answer a. The nal answer selection score is
then computed from the basic score % and the unconnected sentence score u as
= %
u 1
where = 0:7 in our experiments.</p>
      <p>The nal step in solving an individual reading test item is the selection of
the best answer, or alternatively the decision not to commit to any answer at
all, leaving the question unanswered.
2 The QA4MRE 2011 data for LogAnswer was considered too small for learning a
useful model.
3 #skipped-lits is the number of skipped hypothesis literals due to relaxation.
4 #unknown-status-lits is the number of non-skipped hypothesis literals not yet proved
in the last proof attempt of the relaxation loop.
{ The prototype generally refuses to answer if question analysis fails (i.e., if
there is no full parse of the question or an empty question literal pattern).
{ An answer candidate is also rejected if hypothesis construction fails (e.g. if
the constructed hypothesis contains no literals associated with the original
answer).</p>
      <p>
        As opposed to our system setup in QA4MRE 2011 described in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], the prototype
no longer rejects answer candidates for which provenance generation fails. In this
case that an empty justi cation for the answer is extracted, the system rather
determines based on u = 1 (i.e. assuming that the candidate answer is justi ed
by a single witness sentence).
      </p>
      <p>If all answers are rejected for one of the reasons mentioned, then the question
is left unanswered. Otherwise the answer with maximum score is chosen.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Resources Used</title>
      <p>The background knowledge of LogAnswer comprises:
{ A system of 49,000 synonym classes involving 112,000 lexemes, used for
synonym normalization (concepts are replaced by a canonical representative);
{ About 10,500 facts, e.g.</p>
      <p>{ nominalization relationships (`to attack' vs. `the attack')
{ adjective-attribute relationships (`high' vs. `height')
{ gender-speci c profession names (`Professor' vs. `Professorin' [female
professor] in German)
{ Approx. 120 rules for basic inferences (e.g. for making use of the
lexicalsemantic relationships listed above).</p>
      <p>
        In order to enhance the background knowledge, the system is further equipped
with knowledge from OpenCyc (http://sw.opencyc.org/), To this end, the
HaGenLex [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] concept system of German word senses was linked to a
fragment of OpenCyc using a semi-automatic alignment technique. We started with
automatic generation of hypotheses for HaGenLex-OpenCyc links:
{ HaGenLex ! wikipedia.de topic or redirect ! DBPedia map (de-en) !
wikipedia.en-OpenCyc link, or alternatively:
{ HaGenLex ! HaEnLex ! wikipedia.en topic/redirect ! wikipedia.en-OpenCyc
link,
where HaEnLex is a core lexicon for English designed analogously to HaGenLex
and interlinked with the German lexicon. The automatic generation of link
hypotheses was followed by a manual validation, checking several ten thousands of
links generated in this way for their actual correctness. The checked links are then
used for knowledge extraction: the validated alignment lets us map synonyms,
subsumptions, and disjoint declarations from OpenCyc into our HaGenLex
concept system. A nal re nement consists in a check for cyclic subsumptions and
inconsistencies, which are found automatically but had to be xed by hand.
Moreover, there is an automatic elimination of redundancy (i.e., optimization of
subsumption chains). This process resulted in 20,000 new redundancy-free
subsumption relationships, 1,500 new synonyms, and 4,000 disjointness declarations,
which substantially extend the background knowledge available to LogAnswer.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Experimental Results</title>
      <p>Two runs were submitted to QA4MRE 2012, using German as both the source
and target language. These runs were based on the following system con
gurations:
{ loga12011dede: only elementary synonyms (spelling variants, wrong spellings)
and elementary background knowledge (e.g. simple temporal and spatial
regularities), no OpenCyc integration.
{ loga12023dede: full background knowledge of LogAnswer including OpenCyc
expansion.</p>
      <p>
        The main evaluation results obtained for the 160 test questions of QA4MRE
2012 are shown in Table 1. Here, `aw' is the number of answered (non-rejected)
questions, `corr' is the number of answered questions with a correct choice of
the answer, 'accjaw' is the accuracy achieved for answered questions, `acc' is
overall accuracy, and `c@1' is the c@1 score achieved by the given run. The
system attempts to answer 94 questions, with 28 correct answers in the rst
run and 30 correct answers in the second run supported by the full background
knowledge of LogAnswer. Thus, there is a slight positive e ect of the added
knowledge including the OpenCyc integration. The achieved c@1 score of 0.26
clearly exceeds the c@1 score of 0.20 for random guessing. The accuracy of 0.31
for non-requested questions in the second run is substantially higher than the
corresponding value of 0.24 achieved in the last year [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Moreover the c@1 score
has improved from 0.22 to the current 0.26. This indicates a positive e ect of
the consolidation of the system and of the re nements for recognizing questions
that the system is unable to handle.
      </p>
      <p>Table 2 also shows the results by topics. T1. T2, T3 and T4 are the c@1
scores achieved for topics 1 (AIDS), 2 (Climate Change), 3 (Music and Society),
and 4 (Alzheimer), respectively. `all' is the overall c@1 score for comparison.</p>
      <p>While the self-assessment of the system with respect to failed answer
attempts has been improved, mainly by better exploiting the results of question
classi cation, some problems of the prototype with the QA4MRE tasks remain:
We have described the LogAnswer prototype for QA4MRE 2012, which resulted
from last year's system by a general consolidation and re nements for better
recognizing questions that the system cannot answer. These improvements have
raised the c@1 score of the system to 0.26, which is substantially better than the
expected c@1 score of 0.20 for random guessing. There was also a slight positive
e ect of using background knowledge including the OpenCyc integration,
compared to using no background knowledge at all. However, defects in reading test
documents (missing blanks and structuring) still make it di cult to apply deep
linguistic processing and subsequent logical reasoning for validating answers. No
`shallow' techniques were used as a fallback if the deep linguistic analysis fails,
since lexical overlap and similar shallow linguistic criteria will not help too much
in QA4MRE. It would have been best to complement logical answer validation
with web-based validation or hit scores obtained from the QA4MRE Background
Collection. The modest c@1 score achieved by LogAnswer can also be attributed
to the fact that the QA4MRE background collections were not yet used by the
system. Looking at concrete questions, a decomposition of coordinating answers
such as `London and Paris' would have been useful, but for the time being the
system discards questions that target at such answers. From a more general
perspective, we will continue working on techniques for increasing the precision of
results in order to make our system more useful in the QA portals use case.
One concrete approach that we are currently working on is the use of techniques
from Case-Based Reasoning (CBR). These techniques could help to utilize user
feedback for continually improving validation quality.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Peters</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deselaers</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ferro</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gonzalo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kurimo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mandl</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , Pen~as,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Petras</surname>
          </string-name>
          , V., eds.
          <source>: Evaluating Systems for Multilingual and Multimodal Information Access: 9th Workshop of the Cross-Language Evaluation Forum</source>
          ,
          <string-name>
            <surname>CLEF</surname>
          </string-name>
          <year>2008</year>
          , Aarhus, Denmark,
          <source>Revised Selected Papers</source>
          , Heidelberg, Springer (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Furbach</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          , Glockner, I.,
          <string-name>
            <surname>Pelzer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>An application of automated reasoning in natural language question answering</article-title>
          .
          <source>AI</source>
          Communications
          <volume>23</volume>
          (
          <issue>2-3</issue>
          ) (
          <year>2010</year>
          )
          <volume>241</volume>
          {
          <fpage>265</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. Glockner, I.,
          <string-name>
            <surname>Pelzer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Combining logic and machine learning for answering questions</article-title>
          . [1]
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Glockner, I.,
          <string-name>
            <surname>Pelzer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The LogAnswer project at CLEF 2009</article-title>
          . In: Working Notes for the CLEF 2009 Workshop, Corfu, Greece (
          <year>September 2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Glockner, I.,
          <string-name>
            <surname>Pelzer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The LogAnswer project at ResPubliQA 2010</article-title>
          . In: CLEF 2010 Working Notes. (
          <year>September 2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. Pen~as,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Hovy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Forner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Rodrigo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Sutcli</surname>
          </string-name>
          <string-name>
            <given-names>e</given-names>
            , R.,
            <surname>Sporleder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Forascu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Benajiba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Osenova</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          : Overview of QA4MRE at CLEF 2012:
          <article-title>Question Answering for Machine Reading Evaluation</article-title>
          . In:
          <article-title>CLEF 2012 Evaluation Labs</article-title>
          and Workshop Working Notes Papers, Rome, Italy (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Glockner, I.,
          <string-name>
            <surname>Pelzer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dong</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The LogAnswer project at QA4MRE 2011</article-title>
          . In: CLEF 2011 Working Notes. (
          <year>September 2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Hartrumpf</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Hybrid Disambiguation in Natural Language Analysis</article-title>
          . Der Andere Verlag, Osnabruck,
          <string-name>
            <surname>Germany</surname>
          </string-name>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Helbig</surname>
          </string-name>
          , H.:
          <article-title>Knowledge Representation and the Semantics of Natural Language</article-title>
          . Springer, Berlin (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Hartrumpf</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Helbig</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Osswald</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>The semantically based computer lexicon HaGenLex</article-title>
          .
          <source>Traitement automatique des langues 44(2)</source>
          (
          <year>2003</year>
          )
          <volume>81</volume>
          {
          <fpage>105</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>