<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards Comprehensive Noise Detection in Automatically-Created Knowledge Graphs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nandana Mihindukulasooriya?</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oktie Hassanzadeh</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sarthak Dash</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Al o Gliozzo</string-name>
          <email>gliozzog@us.ibm.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IBM Research</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Ontology Engineering Group, Universidad Politecnica de Madrid</institution>
          ,
          <addr-line>Madrid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Knowledge Graphs (KGs) play a key role in many arti cial intelligence applications. Large KGs are often constructed through a noisy automatic knowledge extraction process. Noise detection is, therefore, an important task for having high-quality KGs. We argue that the current noise detection approaches only focus on a speci c type of noise (i.e., fact checking) whereas knowledge extraction methods result in more than one type of noise. To this end, we propose a classi cation of noise found in automatically-constructed KGs, and an approach for noise detection focused on speci c types of noise.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Knowledge Graphs (KGs) are key components of most modern arti cial
intelligence and cognitive applications. These KGs are largely constructed from textual
corpora using automatic information extraction techniques. Such techniques
introduce noise in the KGs and noise detection and removal become essential steps.</p>
      <p>Most of the current noise detection in knowledge graphs is done with human
supervision. For instance, large KGs such as YAGO2, DBpedia, or Wikidata
use human contributors to verify the correctness of a given fact. This is a
timeconsuming task and requires a lot of human e ort for large KGs. Although
crowd-sourcing could be a solution for public general-domain KGs, it may not
be a viable solution for enterprise KGs, due to both privacy issues as well as
the need to rapidly create KGs from a large number of distinct corpora and in
speci c domains that require deep expertise. Thus, there is a need for automatic
techniques for detecting noise in KGs.</p>
      <p>Current automatic noise detection techniques are focused on factual
correctness of triples. In this paper, we discuss the need for di erent types of noise in
KGs and how to detect those speci c types of noise.
? This research is partially supported by the 4V project (TIN2013-46238-C4-2-R) and
the BES-2014-068449 grant. Work done while the author was an intern at IBM.</p>
      <p>Mihindukulasooriya et al.</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Knowledge Graph Re nement [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], more concretely, noise detection is an
important concern of industrial KGs. Noise detection is discussed mainly under two
tasks in the literature: Fact Checking (or Factual correctness ) and Triple
Classication. In fact checking [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], the correctness of a statement is veri ed by nding
external sources that con rm it. DeFacto [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] looks up the Web to nd statements
expressed in natural language in web pages while Liu et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] do so by nding
consensus in other external knowledge graphs. Triple classi cation [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is a binary
classi cation task to predict correctness of facts using graph embeddings. Given
two entities, e1 and e2 and a relation R, a function g(e1,R,e2) is de ned in a way
that a triple is true only if its value is above a given threshold [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In both these
tasks, the main focus is on di erentiating factual true triples in a single step.
We argue that this can be divided into several sub-tasks by analyzing di erent
types of noise. Our hypothesis is that by de ning methods for detecting speci c
types of noise, we could improve the overall KG noise detection results.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Types of Noise</title>
      <p>Based on the analysis of the output of a commercial information extraction
framework, which parses text corpora and produces triples, we de ned the
categories in Fig. 1. Out of these, we identify Factual True triples as the most
relevant and Inconsistent, Generic, Factual False categories as noise.</p>
      <p>Inconsistent triples contradict the domain model they represent. As such,
these triples are completely implausible and meaningless. For instance, (Barack
Obama, siblingOf, White House), is not plausible. If siblingOf relation is speci ed
formally to have a range of Person and if Person and Building are disjoint, that
leads to a logical inconsistency. Nevertheless, those granular axioms are not
Towards Comprehensive Noise Detection in Automatically-Created KGs
always present for such reasoning and data pro ling can be used to identify
common patterns in data that can provide heuristics of inconsistencies.</p>
      <p>Generic triples do not mention speci c entities and have less information
value. For example, (family, residesIn, New York ). Nevertheless, if both
entities are generic such triples can provide schema-level information, for example,
(family, residesIn, city ). Finally, factually false triples are the ones that contain
incorrect information. For example, (Boston, capitalOf, USA) is factually false.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Noise Detection Work ow</title>
      <p>We propose to lter inconsistent and generic triples prior to fact checking as
illustrated in the Fig. 2. This section describes approaches for detecting
inconsistent and generic triples using the knowledge in external KGs.</p>
      <p>Inconsistent triple checking is performed by mapping both entities as well as
relations to external KGs and considering both ontological axioms that de ne
formal conceptulizations (e.g., domain and range of properties or disjoint types)
and common patterns in data. A relation mapping is one-to-one if both
relations in information extraction and properties in KG have same granularity, for
example, siblingOf to dbo:sibling. Otherwise, the mapping is conditional based
on the domain class, for example, partOfMany maps to dbo:country in dbo:City
class and dbo:album in dbo:Single class.</p>
      <p>Generic triple detection is performed using part-of-speech tagging using NLP
tools. We use the intuition that when the subject or the object is not a proper
noun, it typically refers to a generic entity rather than a speci c one. If either
the subject or the object is generic, we label the triple generic. For determining
factual correctness, we use a similar approach to fact checking by looking for
evidences that con rm a given triple in an external KG using entity disambiguation
and relation mapping described in Algorithm 1.</p>
      <p>Fact checking is performed by nding evidences con rming a given triple
in external knowledge graphs similar to the approaches in Section 2. We
conducted preliminary experiments to evaluate the algorithms using 2,342
manuallylabelled triples. Each triple was labelled with its type (i.e., Inconsistent, Generic,
or Factual ) and its truth value (i.e., True, False, or True in the past ) by human
annotators. The results are presented in Table 1.</p>
      <p>Mihindukulasooriya et al.</p>
      <p>Algorithm 1: Inconsistent triple detection
Category
Inconsistent triple detection
Generic triple detection</p>
      <p>Con guration
Standford NLP
Open NLP
Combined</p>
      <p>Precision
86.84%
85.65%
78.16%
98.25%</p>
    </sec>
    <sec id="sec-5">
      <title>Ongoing and Future Work</title>
      <p>Our intuition is that the more-speci c noise detection will improve the overall
quality of KGs. One challenge for evaluating this hypothesis is the lack of a large
gold standard with noise types. We plan to create one using crowd-sourcing.</p>
      <p>Further, we also plan to test another alternative approach based on graph
embeddings to demote triples that are inconsistent with other triples. For this,
we learn representations for entities and relations by constructing functions that
represent interactions between related entities.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>Knowledge Graph Re nement: A Survey of Approaches and Evaluation Methods</article-title>
          .
          <source>Semantic Web</source>
          <volume>8</volume>
          (
          <issue>3</issue>
          ) (
          <year>2017</year>
          )
          <volume>489</volume>
          {
          <fpage>508</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Gerber</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Esteves</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Buhmann, L.,
          <string-name>
            <surname>Usbeck</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ngomo</surname>
            ,
            <given-names>A.C.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Speck</surname>
          </string-name>
          , R.:
          <article-title>DeFacto - Temporal and Multilingual Deep Fact Validation</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>35</volume>
          (
          <year>2015</year>
          )
          <volume>85</volume>
          {
          <fpage>101</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , dAquin,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Motta</surname>
          </string-name>
          , E.:
          <article-title>Measuring Accuracy of Triples in Knowledge Graphs</article-title>
          .
          <source>In: International Conference on Language, Data and Knowledge</source>
          . (
          <year>2017</year>
          )
          <volume>343</volume>
          {
          <fpage>357</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Socher</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>ReasoningWith Neural Tensor Networks for Knowledge Base Completion</article-title>
          .
          <source>In: Advances in Neural Information Processing Systems</source>
          <volume>26</volume>
          . (
          <year>2013</year>
          )
          <volume>926</volume>
          {
          <fpage>934</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>