<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Links with Answers: Query Answering for Customer Support?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Veronika Thost</string-name>
          <email>veronika.thost@ibm.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Achille Fokoue</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vittorio Castelli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Salim Roukos</string-name>
          <email>roukosg@us.ibm.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IBM Research</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>MIT-IBM Watson AI Lab</institution>
        </aff>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The increasing complexity of today's systems has triggered a tremendous
demand for customer support [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Customers often ask about standard procedures
or common issues that are well documented somewhere so that their questions
could be answered e ciently in an automated fashion. But current solutions
often fail in addressing their problems { we assume that every reader has enough
experience with unhelpful chat bots, automated phone systems, and
standardized e-mail replies in order to see that there is room for improvement.
      </p>
      <p>One main challenge for automated solutions is to understand (enough of) the
customer question to either link it to the answer, if it is known by the system, or
forward it to special, e.g., human support. Natural language understanding is not
easy in general and an entire area of research by itself. However, in the context
of a concrete system, the most relevant vocabulary can be narrowed down to
the one of the documents containing the answers and its speci c semantics are
known. For example, the user types and relevant tasks and tools are usually xed
and thus could be described semantically and applied for answering the queries.</p>
      <p>
        In this paper, we focus on a set of natural language questions asking for IT
support from within IBM. Each question is associated with an URL, the link
to a document containing the answer. These documents are available in HTML
but, alas, not annotated in the form of Semantic Web (SW) pages; we assume
that this is a scenario which is common, also outside of IBM. In this initial
investigation, we show that ontologies and reasoning using SW technologies can
help in resolving problems with underspeci cation, ambiguity, and variety in
the questions, and in nding answers which are missed by standard information
retrieval (IR) approaches. We compare a standard TF-IDF-based approach to a
semantic solution based on a custom ontology, the state-of-the-art rule learning
system AMIE [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and SWRL reasoning, using the learned rules. Although the
bene ts of including semantic knowledge become evident, we point out several
issues with and gaps in the current tool set; challenges for future development of
semantic technologies. In this sense, we propose a semantic baseline to motivate
research in an area of rising importance where, in our opinion, SW technologies
could provide fundamental bene ts.
? Copyright 2019 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
      </p>
    </sec>
    <sec id="sec-2">
      <title>The ITS Dataset</title>
      <p>We focus on the ITS dataset (IT Support dataset), a set of natural language
questions asking for IT support from within IBM, where each question is
associated with an URL, the link to a document containing the answer. Alas, we
were not able to disclose the data so far, but we are planning to do so.</p>
      <p>Altogether, there are 966 queries, all in English, with a min/max/avg length
of 1/150/24 words. Figure 1 gives a more detailed overview on the lengths. The
following two examples show that some questions are very short, mentioning only
the basic problem, while others are rather complex, containing more detailed
problem descriptions:</p>
      <p>Password is not working please send recovery key to change the password
My laptop has been having some issues. 1) Slows down and freezes.
2) sometimes Does not shut down because [...] 3) it has screen refresh
issues and when I am working in excel some numbers can not be read
There are 276 di erent links (URLs). Figure 1 shows an aggregated view of
the number of example questions associated with the same URL; for example,
there are 20 di erent URLs for which we have 3 query examples. Note that, for
175 links, we have only a single question as example. Initially, the answers to
the queries were determined by human agents who answered the questions by
consulting the corresponding documents. However, while our data contains only
the original user questions, it is likely that there was a discussion process where
the agent asked for additional information in order to capture the problem of
the user. For that reason, some of the original query and answer pairings cannot
be rated to be correct in a more general context. For instance, one query is
associated to a user guide in Chinese although it does not mention anything
related to Chinese or China at all; hence, the English version of the user guide
should rather be considered as the answer. The question and answer pairs have
been validated, 20% of the data by more than one annotator, to estimate the
con dence of that validation. Since the inter-annotator agreement was only 84%,
we afterwards checked the data by ourselves, changed some answers, and removed
pairs we found ambiguous. There are several observations we can make about
the dataset. In general, it is too small to apply learning as a standalone solution.
In particular, there are several links for which we have only few query examples.
Also, many queries are underspeci ed descriptions of the actual problem. For
instance, the rst example query above mentions a password but not its kind
{ users generally have dozens of passwords at IBM; on the other hand, the
mentioning of a recovery key might be a hint to the password of a PC.
3</p>
    </sec>
    <sec id="sec-3">
      <title>A Baseline for Ontology-Based Query Answering</title>
      <p>We propose a semantic baseline based on a (very small) custom ontology, the
rule learning system AMIE, and SWRL reasoning, using the learned rules.
Representing the Queries as Facts. Each query is preprocessed by repairing
common typos, ltering out stopwords, and stemming the words. Then, a set of
facts is created for each by considering the remaining words as constants and
using a single predicate HasKey. For the above example query, we have the
following facts (amongst others), including one for the answer:
HasKey(q0,laptop); HasKey(q0,issue); HasKey(q0,slow); HasKey(q0,down)
The Ontology. Our ontology is a small, manually built set of rdf:type
statements relevant for the ITS queries. The main goal is generalization during
training. For instance, we have statements to integrate di erent device types:</p>
      <p>Type(ipad,device); Type(laptop,device); Type(computer,device)
We have not found any publicly available ontology for the IT support domain.
Rule Learning. As outlined above, we use AMIE for learning rules such as:
SWRL Reasoning. Given a set of queries as facts (w/o answer facts), we then
can use SWRL reasoning to nd possible answers.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Experiments</title>
      <p>We compared our semantic baseline (SB) to a standard IR approach (IRB)
using TF-IDF and Linear Support Vector classi cation, with a train/test split of
4/1, and the test set containing only queries for whose answer the training set
contains at least 2 examples. We compared the top 1/5/10 answers per query
of IRB to those of SB, for which we considered a random subset of the same
size or less, if it did not yield that many answers. Note that SB may return
several (unrated) answers since several rules may apply. Best results are:
precision@1/5/10 is 0.50/0.15/0.08 (IRB) and 0.49/0.35/0.33 (SB); recall@1/5/10
is 0.50/0.73/0.79 (IRB) and 0.39/0.55/0.55 (SB). However, the approaches are
quite di erent in nature, so that a comparison based just on these numbers is
misleading. Random checks show, e.g., that the answers of SB are closer to the
correct ones than the additional ones (i.e., the ones not top-rated) of IRB. The
poster gives more details about all experiments we ran and also describes
performance e ects of the AMIE parameter selection, such as maximal rule length.
Observe also that, to learn any rules at all for a link, both approaches need some
examples, so they cannot be considered as stand-alone solutions. Nevertheless,
we believe that SW methods are useful to augment IR approaches.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>
        Our initial investigation of applying state-of-the-art SW technology for IT
support question answering revealed several open challenges, amongst others:
{ There is little work on ontologies for customer support [
        <xref ref-type="bibr" rid="ref1 ref4">1, 4</xref>
        ].
{ Semantic parsing of natural language { into a better representation than
our simple keyword facts { is an active area of research but has not been
considered much in the SW context (e.g., its integration with ontologies etc.)
{ State-of-the-art rule learning systems only consider Datalog, though some
with negation, and struggle with larger rule bodies. More complex rules
could model the queries more faithfully.
{ The links, or rather the linked documentation, could similarly be described
in an ontology. Such a hierarchy would also allow for suggesting more general
answers in case there is doubt about the exact answer.
{ There is basically no public example data.
{ The documents could be used as additional information for learning, and
especially help to link to answers with no or only few example queries.
Overall, the performance of our approach is mostly limited by the simple query
representation and by the rule learning system. On the other hand, it turned
out that in most cases an ontology which is a simple hierarchy is su cient to
resolve ambiguity and meaning. There are larger e orts at IBM to build an IT
support ontology and a question answering dataset. In this paper, we share our
insights in rst work with the data since customer support is a use case of rising
importance, to motivate discussion and research in the SW community.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>1. eClassOWL. http://www.heppnetz.de/projects/eclassowl/</mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. https://www.inc.com/peter-roesler
          <article-title>/american-express-study-shows-risingconsumer-expectations-for-good-customer-service</article-title>
          .html, accessed:
          <fpage>2019</fpage>
          -04-04
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Galarraga</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Te ioudi</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hose</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suchanek</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          :
          <article-title>Fast rule mining in ontological knowledge bases with AMIE+</article-title>
          .
          <source>VLDB J</source>
          .
          <volume>24</volume>
          (
          <issue>6</issue>
          ),
          <volume>707</volume>
          {
          <fpage>730</fpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Quan</surname>
          </string-name>
          , T.T.,
          <string-name>
            <surname>Nguyen</surname>
          </string-name>
          , T.D.:
          <article-title>Ontology evolution for customer services</article-title>
          .
          <source>In: Proceedings of the Knowledge Representation Ontology Workshop</source>
          . pp.
          <volume>61</volume>
          {
          <issue>69</issue>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>