<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>UNED PanMorCrepsTeam at M-WePNaD</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pablo Panero</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Manuel Moreno</string-name>
          <email>mmorenomaldonado@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tomas Crespo</string-name>
          <email>t.crespo.g@outlook.com</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jorge Carrillo-de-Albornoz</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Enrique Amigo</string-name>
          <email>enriqueg@lsi.uned.es</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Junta de Andaluc a</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>NLP&amp;IR Group</institution>
          ,
          <addr-line>UNED</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Salenda Software Factory</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <fpage>152</fpage>
      <lpage>156</lpage>
      <abstract>
        <p>This paper describes the participation of the PanMorCresp team in the Multilingual Web Person Name Disambiguation task of IberEval 2017. The solutions consisted of di erent variants of the traditional hierarchical agglomerative clustering algorithm. The four approaches have been de ned and implemented independently by three Master's students over the same vocabulary generation software. The purpose of this is to analyse to what extent the HAC design and implementation can a ect the e ectiveness of clustering. Using a simplistic approach based on hierarchical agglomerative clustering method, considering just word occurrence, is able to achieve relatively good results regarding the rest of systems presented in the campaign.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>This work proposes a new people name disambiguation system used in the
Multilingual Web Person Name Disambiguation task of IberEval 2017. In this task
we receive a set of training web pages and an associated gold standard with the
grouping of these web pages according to the di erent individuals they refer to.
The goal is to group a new set of web pages belonging to a test set data, where
no information about the correct grouping is provided.</p>
      <p>
        It is usual to search information on the Web about people, where the query
that expresses the information need is a person name. Because di erent people in
the world share the same name, the results returned by a search engine can
contain web pages related to several persons, not only for the searched individual.
For this reason this task is really interesting, especially because of the
multilingual nature of the Web. Despite this, the previous campaigns dedicated to this
task focus only on corpora with web pages in a single language, for instance, the
WePS campaigns [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] in English, and a Chinese campaign [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The
objective of the MWepDNaD task is providing a chance to develop person name
disambiguation systems, with the additional challenge that results for a query,
as well as each individual, can be written in multiple languages.
      </p>
      <p>In this work, using the same vector generation software, three Master's
students have implemented independently the HAC agglomerative clustering
methods taking di erent decisions about the vocabulary size, linkage and stop
criterion. The purpose of this is to check to what extent the implementation of
the HAC algorithm and the related decisions can a ect the e ectiveness of
approaches.</p>
      <p>The rest of the paper is organised as follows: our proposed methods to
disambiguate person names are described in Section 2. Section 3 present the results
obtained by our proposals, and the analysis and discussion of them can be found
in Section 4. Finally, Section 5 presents the conclusions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>In a rst step we transform each document into a vector of values which is
used as input for the hierarchical agglomerative clustering algorithm. To this
aim each document is divided in tokens by just splitting the text by blank
characters. After this, each token is transformed into a lowercase representation
in order to avoid ambiguity and decrease the number of words in the dictionary
for the vector representation. The vocabulary is generated independently for
each person name (query). Finally, all words with frequency one in the corpus
for the corresponding query were removed. Due to computational constrains,
for each entity only the most frequent n words in the dictionary generated in
the previous step were selected. The experiments include variants for several n
values. In all runs, we have used the presence of words as projection function.</p>
      <p>The four approaches have been de ned and implemented independently by
three Master's students over the same vocabulary generation software. The
evaluated approaches in the competition are:
{ PanMorCresp Team - run 1: The vocabulary contains the 4000 most
frequent terms.The feature projection is the word occurrence. The HAC
algorithm works under the complete linkage (maximum distance between
items from both clusters), and the used similarity criterion is the cosine. As
stop criterion, it considers a similarity threshold, which is adapted for each
clustering case. That is the average similarity between documents divided
by n. Several n values have been checked over the training corpus. Finally,
we set the n parameter at two.
{ PanMorCresp Team - run 2: The vocabulary generation criterion is the
same as in the previous approach. In this case, the employed linkage is the
average similarity between documents in both clusters. The similarity
threshold was tunned over the training corpus.
{ PanMorCresp Team - run 3: As well as in the previous approaches,
it uses cosine similarity. For this run, we have eliminated stopwords and
puntctuation marks. The vocabulary contains the 7500 most frequent terms
in the collection. It uses the single linkage and the stop criterion is based
on similarity (0.65). The similarity threshold was tunned over the training
corpus.
{ PanMorCresp Team - run 4: This approach is analogous than the
previous one, but using 9 clusters as stop criterion.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>
        We have submitted 4 runs and the results are shown in Tables 1 and 2. We report
the following metrics: Reliability (R), Sensibility (S) and their harmonic mean
F0:5(R; S) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The nal value of the evaluation will be the average of F0:5(R; S)
in all person names evaluated.
      </p>
      <p>Table 1 shows the results achieved by our methods considering in the
evaluation only related web pages, and Table 2 shows the results considering all web
pages. In addition, both tables show the result of the two baselines provided
by the organizers: One-in-one, where every Web page is assigned to a di erent
cluster, and All-in-one, where all Web pages are assigned to a single cluster.
The results suggest that increasing the vocabulary (from 4000 to 7500 words)
increases substantially the e ectiveness of the algorithm, as well as using single
linkage instead of other approaches such as the average linkage or complete
linkage (runs 3 and 4 vs. runs 1 and 2). However, notice that the rst run
(using complete linkage and the average cost function value as stop criterion)
achieves a high Reliability (precision) value. In fact, there exists a strong trade
o (Reliability vs. Sensitivity) between the rst run and the rest. Therefore, the
results cannot be compared objectively. They depend to a great extent on the
relative weight of Reliability and Sensitivity in the F measure.</p>
      <p>On the other hand, Run 4 outperforms substantially the third run: from
47 to 57 when considering only related documents and from 0.5 to 0.58 in F
when considering all documents. This improvement is mainly due to an increase
in Sensitivity (recall). That is, using 9 clusters as stop criterion captures more
relationships than using a similarity threshold without penalising the precision
(Reliability).
5</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions</title>
      <p>We have presented in this paper the evaluation of four di erent runs in the
Web Person Name Disambiguation task of IberEval 2017. The most remarkable
result is that using a simplistic method (word occurrences, HAC, single linkage
an number of clusters as stop criterion, is able to achieve an e ectiveness which
is (relatively) comparable with the best approach presented in the campaign. In
fact, other more sophisticated approaches produce lower evaluation results.</p>
      <p>The approaches based on HAC have been designed and implemented
independenly by di erent students. This experiment also suggests that, considering
the HAC algorithm, its e ectiveness is highly sensitive to the decisions about
linkage, vocabulary size and stop criterion, as well as the relative weight of the
complementary evaluation metrics (reliability and sensitivity).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Emrique</given-names>
            <surname>Amigo</surname>
          </string-name>
          &amp;
          <string-name>
            <surname>Julio</surname>
            <given-names>Gonzalo &amp; Felisa</given-names>
          </string-name>
          <string-name>
            <surname>Verdejo</surname>
          </string-name>
          .
          <article-title>A General Evaluation Measure for Document Organization Tasks</article-title>
          .
          <source>In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR</source>
          <year>2013</year>
          ), pp.
          <fpage>643</fpage>
          -
          <lpage>652</lpage>
          . (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Javier</given-names>
            <surname>Artiles</surname>
          </string-name>
          &amp;
          <string-name>
            <given-names>Andrew</given-names>
            <surname>Borthwick</surname>
          </string-name>
          &amp;
          <article-title>Julio Gonzalo &amp; Satoshi Sekine &amp; Enrique Amigo. WePS-3 Evaluation Campaign: Overview of theWeb People Search Clustering and Attribute Extraction Tasks</article-title>
          .
          <source>In Third Web People Search Evaluation Forum (WePS-3)</source>
          ,
          <source>CLEF</source>
          <year>2010</year>
          (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Javier</given-names>
            <surname>Artiles</surname>
          </string-name>
          &amp;
          <article-title>Julio Gonzalo &amp; Satoshi Sekine. Weps 2 Evaluation Campaign: Overview of the Web People Search Clustering Task</article-title>
          .
          <source>In 2nd Web People Search Evaluation Workshop (WePS</source>
          <year>2009</year>
          ),
          <source>18th WWW Conference</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Javier</given-names>
            <surname>Artiles</surname>
          </string-name>
          &amp;
          <string-name>
            <surname>Julio</surname>
            <given-names>Gonzalo &amp; Satoshi</given-names>
          </string-name>
          <string-name>
            <surname>Sekine. The SemEval-2007 WePS Evaluation</surname>
          </string-name>
          <article-title>: Establishing a Benchmark for the Web People Search Task</article-title>
          .
          <source>In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)</source>
          , pages
          <fpage>6469</fpage>
          , Prague, Czech Republic,
          <year>June 2007</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Ying</given-names>
            <surname>Chen</surname>
          </string-name>
          &amp;
          <string-name>
            <given-names>Peng</given-names>
            <surname>Jin</surname>
          </string-name>
          &amp;
          <string-name>
            <given-names>Wenjie</given-names>
            <surname>Li</surname>
          </string-name>
          &amp;
          <string-name>
            <surname>Chu-Ren Huang</surname>
          </string-name>
          .
          <article-title>The Chinese Persons Name Disambiguation Evaluation: Exploration of Personal Name Disambiguation in Chinese News</article-title>
          .
          <source>In CIPS-SIGHAN Joint Conference on Chinese Language Processing</source>
          , pp.
          <fpage>346</fpage>
          -
          <lpage>352</lpage>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Soto</given-names>
            <surname>Montalvo</surname>
          </string-name>
          &amp;
          <article-title>Raquel Mart nez</article-title>
          &amp; Leonardo Campillos &amp;
          <string-name>
            <surname>Agust</surname>
            n
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Delgado</surname>
          </string-name>
          &amp;
          <article-title>V ctor Fresno &amp; Felisa Verdejo. MC4WePS: a multilingual corpus for web people search disambiguation, Language Resources</article-title>
          and Evaluation. (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>