<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Universal Schemas Using Shortest Dependency Paths for Free Word Order Languages</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jiho Kim</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sangha Nam</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Key-Sun Choi</string-name>
          <email>kschoig@kaist.ac.kr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Korea Advanced Institute of Science and Technology</institution>
          ,
          <country>Republic of Korea</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Universal schemas are a remarkable approach for solving relation extraction, in which new facts are extracted by jointly learning latent feature vectors of entity pairs and relation types through matrix factorization models. However, in free word order languages where surface form predicates do not fully reveal the characteristics of a sentence, universal schemas cannot be constructed in the same manner. Therefore, in this study, we introduce a novel expansion of universal schemas, dependency-path-based universal schemas. Our model uses shortest dependency paths and entity types instead of surface form predicates. For veri cation of our model, we constructed and evaluated a universal schema in Korean with a combination of Korean DBpedia and Korean Wikipedia distant supervision data.</p>
      </abstract>
      <kwd-group>
        <kwd>Universal Schemas</kwd>
        <kwd>Relation Extraction</kwd>
        <kwd>Shortest Depen- dency Paths</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Relation extraction (RE) is a core step in various natural language processing
applications, and can be de ned as the task of nding ontological relations
between two target entities in a sentence. The concept of universal schemas [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
was proposed as a considerable approach in RE. A universal schema can be
constructed using a combination of all usable knowledge bases (KBs) and natural
language text, then expressed through a huge matrix with entity tuples as rows
and relations as columns. The novelty of universal schemas comes from avoiding
pre-labeled datasets by using surface form predicates as a source for relation
types, and mutually supporting both unstructured and structured data.
      </p>
      <p>
        In English, surface form predicates between two entities can usually serve
as relations [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. However, in free word order languages, the extraction of surface
pattern relations between entities is di cult owing to exible word orders.
Without the presence of language-speci c surface form predicate extractors, universal
schemas cannot be built in these languages. Even though several algorithmic
extensions have been studied [
        <xref ref-type="bibr" rid="ref4 ref6">4, 6</xref>
        ], none have solved the dependency on surface
form predicates.
      </p>
      <p>In this study, we propose a novel expansion of universal schemas: dependency
path based universal schemas (DPUSs). Shortest dependency paths (SDPs)
illustrate the syntactic structure between entities according to a sequence of directed
binary grammatical relations. Unlike surface form predicates, grammatical
dependency is a common feature in all languages. We also injected entity types into
the matrix, expecting the e ect of narrowing of the range of relations that the
model should predict. To verify that our model shows a decent performance, we
constructed two DPUSs (baseline/extended) on a Korean dataset and measured
the average precision (AP) for each ontological relation.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>
        We rst built a matrix with entity tuples as rows and a combination of KB
relations, SDPs, and entity types as columns. For all observed and unobserved facts,
we lled in 1 and 0, respectively in the corresponding slot. We extracted SDPs
between the target entities, including directions of each dependency. Figure 1
illustrates the lling of the DPUS matrix based on raw text. Our model mainly
employs the nfe matrix factorization model from [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], which showed the best
performance among the proposed models. The embedding vector dimensions were
optimized using the leave-one-out cross-validation. To ensure consistency of slot
values, we normalized each row on a scale of 0 to 1.
3
3.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Experiments</title>
      <p>
        Dataset and Evaluation
For the baseline, we built a DPUS by using Korean DBpedia [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] as the KB and
Korean Wikipedia distant supervision data as raw text. The distant
supervision data is obtained from Korean Wikipedia documents, which were ltered
by Korean DBpedia triples. Speci cally, we extracted sentences, each of which
contained two entities with a relation in Korean DBpedia. We rejected sentences
with SDP of length longer than three because these SDPs are rarely observed
in our dataset. For our target relation set, we selected the most frequent 35
prede ned KB relations, each with more than 200 entity tuples in the distant
supervision data. Then, we chose entity tuples with two or more SDPs in the
distant supervision data. We nally obtained 12,360 entity tuples, 35 KB relations,
and 348 frequent SDPs. The size of the baseline DPUS matrix was 12360 383.
      </p>
      <p>For the extended model, we added coarse-grained entity types extracted by
Named Entity Recognition (NER) of each entity pair to the baseline DPUS
matrix. We used the following entity type categorization: artifacts (e.g., TV
programs, books, and movies), data/time, locations (e.g., countries, cities, and
towns), organizations (e.g., governments, public corporations, and companies),
persons, and others. For each entity tuple, we added a new slot indicating the
entity types in the tuple. A total of 15 new columns were added to the baseline
matrix, extending the size of the extended matrix to 12360 398.</p>
      <p>For evaluation, we performed a ten-fold experiment to measure APs for each
KB relation over all the entity tuples in the test set. Then we measured MAP and
WMAP for the performance of the whole model. MAP is simply the average of
nsubjpass</p>
      <p>nmod:at
The White House is located at Pennsylvania Avenue in Washington, D.C..</p>
      <p>Entity 1 Entity 2
Entity tuple
Relation
SDP
Entity Types
&lt; White House, Pennsylvania Avenue &gt;</p>
      <p>location
[nsubjpass, -nmod:at]</p>
      <p>&lt; LC, LC &gt;
Parsing result</p>
      <p>&lt;White House,
Pennsylvania Avenue&gt;
location founder -[nnsmuobdjp:aats]s, &lt;LC, LC&gt;
1
0
1</p>
      <p>1
Ontological Relations SDPs Entity Types</p>
      <p>
        DPUS matrix
APs, and WMAP is the weighted version of MAP in which each AP is weighted
by the number of entity tuples in each relation. MAP and WMAP are shown by
a previous work [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] to be robust and stable metrics for evaluating classi cation
models.
The experimental results of all KB relations are listed in Table 1. Winners of
each relation are marked in bold. The baseline DPUS showed performances of
0.62 (MAP) and 0.64 (WMAP), while the extended version of DPUS showed
performances of 0.72 (MAP) and 0.75 (WMAP), respectively. Thus, the
performance shows great improvement after the injection of entity types. This is
because for each entity tuple, the model can consider a signi cantly fewer set of
relations when the entity types are decided.
      </p>
      <p>Even though the extended DPUS shows better overall performance, the
baseline method performs better in some relations, such as birthPlace and writer. As
most of these relations are represented by unique SDPs, the injection of entity
types seems to confuse the model.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>In this study we introduced DPUSs, which are a novel expansion of universal
schemas. DPUSs are built by using SDPs and entity tuple types instead of
surface form predicates. SDPs contain information about the syntactic structure
of a sentence, which makes SDPs a suitable alternative of surface form
predicates. Entity tuple types greatly improve performance by reducing the relation
candidates for the model. Our model has higher potential in terms of future
development and versatility because DPUS is based on globally consistent features
rather than language speci c features.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgement</title>
      <p>This work was supported by Institute for Information &amp; communications
Technology Promotion(IITP) grant funded by the Korea government(MSIT)
(20170-01780, The technology development for event recognition/relational reasoning
and learning knowledge based system for video understanding)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Etzioni</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Banko</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soderland</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weld</surname>
            ,
            <given-names>D.S.</given-names>
          </string-name>
          :
          <article-title>Open information extraction from the web</article-title>
          .
          <source>Communications of the ACM</source>
          <volume>51</volume>
          (
          <issue>12</issue>
          ),
          <volume>68</volume>
          {
          <fpage>74</fpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>E.k.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weidl</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Choi</surname>
            ,
            <given-names>K.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Towards a korean dbpedia and an approach for complementing the korean wikipedia based on dbpedia</article-title>
          .
          <source>OKCon</source>
          <volume>575</volume>
          , 12{
          <fpage>21</fpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Raghavan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , Schutze, H., et al.:
          <article-title>Introduction to information retrieval</article-title>
          , vol.
          <volume>1</volume>
          . Cambridge university press Cambridge (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Neelakantan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roth</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mc-Callum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Compositional vector space models for knowledge base inference</article-title>
          .
          <source>In: 2015 AAAI spring symposium series</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Riedel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marlin</surname>
            ,
            <given-names>B.M.:</given-names>
          </string-name>
          <article-title>Relation extraction with matrix factorization and universal schemas</article-title>
          .
          <source>In: Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          . pp.
          <volume>74</volume>
          {
          <issue>84</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Yao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riedel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Universal schema for entity type prediction</article-title>
          .
          <source>In: Proceedings of the 2013 workshop on Automated knowledge base construction</source>
          . pp.
          <volume>79</volume>
          {
          <fpage>84</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>