<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Temporal Evolution of Entity Relatedness using Wikipedia and DBpedia</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Narumol Prangnawarat</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Conor Hayes</string-name>
          <email>conor.hayesg@insight-centre.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Insight Centre for Data Analytics, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Entity relatedness is a task that is required in many applications such as entity disambiguation and clustering. Although there are many works on entity relatedness, most of them do not consider temporal aspects and thus, it is unclear how entity relatedness changes over time. This paper attempts to address this gap by showing how entity relatedness develops over time with graph-based approaches using Wikipedia and DBpedia as well as how transient links and stable links, by which is meant links that do not persist over di erent times and links that persist over time respectively, a ect the relatedness of the entities. We show that di erent versions of the knowledge base at di erent times give di erent accuracy on the KORE dataset, and show how using multiple versions of the knowledge base through time can increase the accuracy of entity relatedness over a version of the knowledge graph from a single time point.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Many researches have made use of Wikipedia and DBpedia for relatedness and
similarity tasks. However, most approaches use only the current information
but not temporal information. For example, the entities Mobile and Camera
may have been less related in the past but may be more related at present.
This work shows how semantic relatedness develops over time using graph-based
approaches over the Wikipedia network as well as how transient links, links that
do not persist over di erent times, and stable links, links that persist over time,
a ect the relatedness of the entities.</p>
      <p>We hypothesise that using graph-based approaches on the Wikipedia network
provides higher accuracy in term of relatedness score to the ground truth data
in comparison to text-based approaches. We take each Wikipedia article as an
entity, for example, the article on Semantic similarity 1 corresponds to the entity
Semantic similarity. Although the term article and entity may be referred to
interchangeably in this work, article refers to Wikipedia article and entity refers
to a single concept in the semantic network. Wikipedia users provide Wikipedia
page links within articles, so we can make use of the provided links as entities. We
assume that entities which are closely related to an entity, a Wikipedia article in
1 https://en.wikipedia.org/wiki/Semantic_similarity
this case, are mentioned in that Wikipedia article. Hence, closely related entities
share the same adjacent nodes in the Wikipedia article link network. We make
use of the temporal Wikipedia article link network to demonstrate the evolution
of entity relatedness and how transient or stable links a ect the relatedness of
the entities. We analyse di erent models of aggregated graphs, which integrate
networks at various times into the same graph as well as time-varying graphs
which are the series of the networks at each time step.</p>
      <p>We rst show that the proposed graph-based approach outperforms the
textbased approach in terms of the accuracy of relatedness score. Then, we present
our proposed similarity method, which outperforms the baseline graph-based
similarity methods in term of relatedness score accuracy over various di erent
network models. We also show the evolution of relatedness as well as how
transient links and stable links e ect the relatedness using graph-based similarity.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Related Works</title>
      <sec id="sec-2-1">
        <title>Semantic Relatedness</title>
        <p>
          Semantic relatedness works have been carried out for words (natural language
texts such as common nouns and verbs) and entities (concepts in semantic
networks such as companies, people and places). The approaches used for semantic
relatedness includes corpus-based or text-based approaches as well as
structurebased or graph-based approaches. Wikipedia and DBpedia have been widely
used as resources to nd semantic relatedness. However, most approaches use
only the current information without temporal aspects. One of the well known
system is WikiRelated [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] introduced by Strube and Ponzetto. WikiRelated
use the structure of Wikipedia links and categories to compute the relatedness
between concepts.
        </p>
        <p>
          Gabrilovich and Markovitch proposed Explicit Semantic Analysis (ESA) [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]
to compute semantic relatedness of natural language texts using high-dimensional
vectors of concepts derived from Wikipedia. DiSER [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], presented by Aggarwal
and Buitelaar, improves ESA by using annotated entities in Wikipedia.
        </p>
        <p>
          Leal et al. [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] proposed a novel approach for computing semantic relatedness
as a measure of proximity using paths on DBpedia graph. Hulpus et al. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]
provided a path-based semantic relatedness using DBpedia and Freebase as well
as presenting the use in word and entity disambiguation.
        </p>
        <p>
          Radinsky et al. proposed a new semantic relatedness model, Temporal
Semantic Analysis (TSA) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. TSA computes relatedness between concepts by
analysing the time series between words and nd the correlation over time.
Although this work make use of the temporal aspect to nd relatedness between
words, it does not show how the relatedness evolve over time.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Time-aware Wikipedia</title>
        <p>
          There have been a number of works in the area of time-aware Wikipedia
analysis. The authors have primarily focused on content and statistical analysis.
WikiChanges [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] presents a Wikipedia article's revision timeline in real time as
a web application. The application presents the number of article edits over time
in day and month granularity. The authors also provide the extension script for
embedding a revision activity to Wikipedia. Ceroni et. al. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] introduced a
temporal aspect for capturing entity evolution in Wikipedia. They provided time-based
framework for temporal information retrieval in Wikipedia as well as statistical
analysis such as the number of daily edits, Wikipedia pages having edits and top
Wikipedia pages that has been changed over time. Whiting et. al. [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] presented
Wikipedia temporal characteristics, such as topic coverage, time expressions,
temporal links, page edit frequency and page views, to nd how the knowledge
can be exploited in time-aware research. However, in-degree and out-degree are
the only networks properties that are discussed in the paper.
        </p>
        <p>
          Recent research from Bairi et. al. [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] presents statistics of categories and
articles, such as the number of articles, the number of links and the number
of categories, comparing between the Wikipedia instance in October 2012 and
the Wikipedia instance in June 2014. The authors also analysed the Wikipedia
category hierarchy as a graph and provided statistics of the category graph such
as number of cycles and and the cycle length of the two Wikipedia instances.
Contropedia [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] identi es when and which topics have been most controversial
in Wikipedia article using Wikipedia links as the representation of the topics.
However, the approach also focus on the content changed in the article.
        </p>
        <p>Our work make use of the changes in Wikipedia to analyse evolution of entity
relatedness. We show how entity relatedness develops over time using graph
based approaches over Wikipedia network as well as how transient links and
stable links a ect the relatedness of the entities.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Dataset</title>
      <p>The Wikipedia data can be downloaded from the Wikimedia download page2,
where dumps are extracted twice a month. A major limitation of data availability
from Wikipedia dumps is that the oldest available Wikipedia dump at the time
of our experiment was from 20 August 2016.</p>
      <p>We treat each Wikipedia article as an entity. Each Wikipedia article has user
provided Wikipedia article links which link to the corresponding Wikipedia
articles. We extract Wikipedia links within Wikipedia articles from each Wikipedia
dump. Figure 1 shows an example of a part of Wikipedia article link network
of the DBpedia article3. The part of DBpedia article contains links to Database,
Structured content, Wikipedia, World Wide Web, Semantic Query, Tim
BernersLee, Dataset, and Linked Data. The article links of all articles in Wikipedia
construct the full Wikipedia article link network.</p>
      <p>As the oldest Wikipedia dumps available for download on Wikimedia
Downloads page is Wikipedia dump on 20 August 2016 at the time we conduct this</p>
      <sec id="sec-3-1">
        <title>2 https://dumps.wikimedia.org/</title>
      </sec>
      <sec id="sec-3-2">
        <title>3 https://en.wikipedia.org/wiki/DBpedia</title>
        <p>experiment, in order to get older Wikipedia link data, we obtained data from
DBpedia4 which is an open knowledge base extracted from Wikipedia and other
Wikimedia projects. We use the page links datasets which are the
relationship of article links within each article, the same information as the Wikipedia
links we extracted from Wikipedia. Each DBpedia concept corresponds to a
Wikipedia article. For example, the DBpedia concept http://dbpedia.org/
resource/Semantic_similarity corresponds to the article Semantic
similarity 5. We refer to both the Semantic similarity article and the DBpedia concept
http://dbpedia.org/resource/Semantic_similarity as the entity
Semantic similarity. We make use of page links dataset from DBpedia to construct a
series of Wikipedia article link networks for each year from 2007 to 2016. The
DBpedia versions used are shown in Table 1.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Methodology</title>
      <p>We take each Wikipedia article as an entity. We assume that entities which are
closely related to an entity, a Wikipedia article in this case, are mentioned in that
Wikipedia article. Hence, closely related entities share the same adjacent nodes
in the Wikipedia article link network. However, there might be some articles that
link to a lot of the articles that might not be semantically related. For example,
the main page, which is the landing page for featured articles and news, is not
semantically related to the articles it links to. On the other hand, articles that
do not have many links to other pages might have more semantically relation
to their links. Because of this reason, we apply weights to the relationships to
penalise the articles that link to many other unrelated articles.</p>
      <sec id="sec-4-1">
        <title>4 http://wiki.dbpedia.org</title>
      </sec>
      <sec id="sec-4-2">
        <title>5 https://en.wikipedia.org/wiki/Semantic_similarity</title>
        <p>First, we introduce di erent models we used to represent Wikipedia article
link networks. Then, we explain our approach we used to nd relatedness over
the proposed models.
To re ect how relatedness changes over time, we represent temporal information
from Wikipedia article link networks in two di erent ways. One is as
timevarying graphs which are a series of Wikipedia article link snapshots at each time
step. We use each version of datasets to construct a network as each snapshot.
Another is as an aggregated graph, which aggregates all networks at each time
step together with time information as weights.</p>
        <p>Time-Varying Graphs Given an article a for a corresponding entity, the series
of networks GaS at the set of time T = f1; ::; ng is constructed as a set of time
step graphs fG1a; G2a; :::; Gang. Each graph Gta = (Vta; Eta) represents a snapshot
of the 2-hop ego network of the article links around an entity a at the time t,
where Vta is a set of nodes where each node represents a Wikipedia entities that
have links with a or have links with the nodes that are adjacent to a at time t and
Eta is a set of edges where each edge eijt is an internal link between Wikipedia
entities i and j at time t. In other words, v 2 Vta if at time t; eavt 2 Eta or
eabt 2 Ea t</p>
        <p>t ^ ebvt 2 Ea. Figure 2 demonstrate the example of a 2-hop ego network
around the entity a at time t. We construct a series of 2-hop ego networks of
Wikipedia article links over time around each seed entity that we are interested
in.</p>
        <p>Aggregated Graphs We create di erent models of aggregated graphs as
follows.</p>
        <p>Intersection model
Given a Wikipedia article a for a corresponding entity, GIa = (VIa; EIa), VIa is a
set of nodes where each node represents a Wikipedia article that have links with
a or have links with the nodes that are adjacent to a at all time points and EIa is
a set of edges where each edge eij is an internal link between Wikipedia entities
i and j which appear at all time. In other words, v 2 VIa if v 2 V1a \ V2a \ ::: \ Vna
and eij 2 EIa if e 2 E1a \ E2a \ ::: \ Ena for [1::n] 2 T .</p>
        <p>Union model
Given a Wikipedia article a for a corresponding entity, GaU = (VUa; EUa ), VUa a set
of nodes where each node represents a Wikipedia article that have links with a
or have links with the nodes that are adjacent to a at any time in T and EUa is
a set of edges where each edge eij is an internal link between Wikipedia entities
i and j. In other words, v 2 VUa if v 2 V a
1 [ V2a [ ::: [ Vna and eij 2 EUa if
eij 2 E1a [ E2a [ ::: [ Ena for [1::n] 2 T .
4.2</p>
        <sec id="sec-4-2-1">
          <title>Proposed Extended Jaccard Similarity</title>
          <p>Jaccard similarity coe cient measures similarity between two objects using
binary attributes. Given object a and b with a vector of features A and B
respectively, Jaccard similarity coe cient of a and b, J (a; b) is computed as the
following equation.</p>
          <p>J (a; b) = jA \ Bj</p>
          <p>jA [ Bj</p>
          <p>
            Taking adjacent nodes of an entity a in the Wikipedia article link network
as the features of the entity a, Jaccard similarity coe cient can re ect our
assumption that closely related entities share the same adjacent links in Wikipedia
article links network. However, Jaccard similarity coe cient cannot take into
account non-binary features. As we discussed before, there might be some article
that links to a lot of the pages that might not be semantically related so we
want to apply weights to the relationships to penalise pages that link to many
unrelated pages. PageRank [
            <xref ref-type="bibr" rid="ref11">11</xref>
            ] is used to rank the importance of nodes. It was
originally created to rank web pages in World Wide Web network in Google
search engine. We use the same idea to apply to our Wikipedia article link
network. An article that mentions a lot of other articles may not be semantically
related to the articles that they link to. On the other hand, an article that is
mentioned in a lot of articles may just be a general article that is not
semantically related to them. We make use of Tanimoto similarity [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ] to extend Jaccard
similarity using reciprocal PageRank, which is 1 divided by PageRank score, as
weights to nd similarity between each entity in the network. The underlying
assumption is that articles with lower PageRank scores might have more
semantically relation to their links as they only link to fewer articles that are really
related to them. Given A is a vector of reciprocal PageRank score of the articles
having links with an entity a in the 2-hop ego network around an entity a and
B is a vector of reciprocal PageRank score of the articles having links with an
entity b in the 2-hop ego network around an entity b, the relatedness between
two entities a and b is computed as:
          </p>
          <p>R(a; b) =</p>
          <p>A B
jAj2 + jBj2</p>
          <p>A B</p>
          <p>We apply the extended Jaccard similarity with reciprocal PageRank to our
models, time-varying graphs and aggregated graphs.
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiment Results</title>
      <p>
        We conducted experiments and evaluated with the KORE [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] dataset. The
KORE dataset has been created to measure relatedness between named
entities. It consists of 420 related entity pairs from a selected set of 21 seed entities
from the YAGO2 [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] knowledge base from 4 di erent domains, which are 5
entities from IT companies, 5 entities from Hollywood celebrities, 5 entities from
video games, 5 entities from television series, and one singleton entity. Each of
the entities has 20 ranked related entities. All entities in the KORE dataset
corresponds to entities in our Wikipedia article link networks. We use
Spearman Correlation to compare the relatedness scores from each approach with the
scores from the KORE dataset. As the KORE dataset provides only the ranking
but not the score, we assume that the highest entity has a score of 20 and each
subsequent entity has a score 1 lower.
5.1
      </p>
      <sec id="sec-5-1">
        <title>Proposed Extended Jaccard Similarity results</title>
        <p>For each DBpedia version stated above, we constructed the series of networks of
entities seeding the entities from the KORE dataset as described in Section 4.1.</p>
        <p>The experiments were conducted to compare two di erent perspectives. In
the rst evaluation perspective, we performed experiments to show that the
proposed graph-based approach outperforms the text-based approach in terms of
the relatedness scores. We used Term Frequency-Inverse Document Frequency
(TF-IDF) based similarity as the text-base baseline to compared to the proposed
extended Jaccard similarity. In the second evaluation perspective, we performed
experiments to show that our proposed link-based extended Jaccard
similarity with Reciprocal PageRank outperforms the baseline approaches in terms of
the relatedness score accuracy. We used Jaccard similarity as the baseline for
the graph-based approach to compare to our extended Jaccard similarity. We
analysed variations of features for Jaccard methods by considering only direct
predecessors of the nodes which are the entities that have links to the nodes,
only direct successors of the nodes which are the entities that have links from
the nodes, and both direct predecessors and direct successors of the nodes which
are entities that have links to or from the nodes.</p>
        <p>We performed TF-IDF based similarity on three di erent Wikipedia text
revisions. One is the revision at the time when YAGO2 was created
(17-Aug2010) which is used to constructed the KORE dataset. The second one is the
revision at the dump time of DBpedia 2009 dataset (20-Sep-2009) and the last
one is the revision at the dump time of DBpedia 2010 dataset (11-Oct-2010).
We performed Spearman Correlation to evaluate with the gold standard dataset,
the KORE dataset, as described previously. The Spearman Correlations of the
3 di erent datasets compared to the KORE gold standard are shown in Table 2.
We can see that the result of the data acquired at the time when YAGO2 was
created has the highest correlation as the same information is captured at that
time.</p>
        <p>
          We compared the text-based approach over each version of dataset to the
graph-based approaches. In this section, we focused on DBpedia 2009 dataset
and DBpedia 2010 dataset as they are the closest snapshots to the Wikipedia
dump from 2010-08-17 which is used to constructed YAGO2 using by the KORE
dataset. We found that the graph-based approaches outperform the result from
the text-based approach in term of accuracy of relatedness score as shown in
Table 3. The Spearman Correlation to the KORE dataset from our Extended
Jaccard with Reciprocal PageRank is statistically signi cantly better than TF-IDF
based similarity (p-value &lt; 0.05) on both datasets. Moreover, the results show
that our proposed extended Jaccard with reciprocal PageRank gives a better
accuracy of relatedness score than the baseline Jaccard methods. The Spearman
Correlation to the KORE dataset from our Extended Jaccard with Reciprocal
PageRank is statistically signi cantly better than the baseline Jaccard methods
(p-value &lt; 0.05) on DBpedia 2009.
We analysed the change of correlations to the KORE gold standard in di erent
datasets to demonstrate how relatedness progress over subsequent years. We
found that the dataset from the network in 2009 and 2010 got the highest results.
This is because the KORE dataset is constructed using YAGO2 that use the
Wikipedia dump from 2010-08-17 [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], which is in between DBpedia version 2009
(2009-09-20) and 2010 (2010-10-11). Figure 3 shows the comparison of Spearman
Correlation result of di erent methods from each dataset.
        </p>
        <p>We can see from the result that the most updated knowledge bases will not
give the highest correlation if the ground truth data is created in di erent time.
This is because the relatedness score varies according to the relatedness of the
entities at that time. For instance, Facebook and Justin Timberlake has a higher
relatedness score in 2010 as Justin Timberlake stared in The Social Network
movie, the lm portrays the founding of Facebook, and the score faded after
that as shown in Figure 4.</p>
        <p>As shown in Figure 5, Leonardo DiCaprio became related to Barack Obama
after 2008 as he supported Barack Obama's presidential campaign in the 2008
election and became more related again after 2012 as his support to Obama's
2012 campaign6,7. As the events occurred the end of the years, the changes in
relatedness are not shown in these versions (2008 and 2012) but appear in the
next versions (2009 and 2013) instead.</p>
        <p>We also did a qualitative analysis for the entities which are not in the KORE
dataset as they become more related to the seed entities after the dataset was
created. For instance, Tim Cook started to have high relatedness with Apple Inc.
since 2011 when he become the CEO of the company. Figure 6 shows the
relatedness of Apple Inc. and Tim Cook in comparison to Apple Inc. and Steve Jobs.</p>
        <sec id="sec-5-1-1">
          <title>6 https://en.wikipedia.org/wiki/Leonardo_DiCaprio</title>
        </sec>
        <sec id="sec-5-1-2">
          <title>7 http://www.hollywoodreporter.com/news/julianne-moore-leonardo</title>
          <p>dicaprio-obama-video-375796</p>
          <p>Another example is the relatedness between Jennifer Aniston and Justin
Theroux 8 which became higher from 2011 when they started dating9. While the
relatedness between Jennifer Aniston and Brad Pitt stayed high for a while after
they divorced in 200510 and then faded as shown in Figure 7.</p>
          <p>We performed the Spearman Correlation between di erent datasets and found
that the correlations of the entity relatedness are higher to the dataset versions
that are closer in time to themselves and lower when the time is more di erent
as shown in Table 4. This shows that the entity relatedness gradually change
over time.
In order to nd how transient links, links that do not persist over di erent
times, and stable links, links that persist over time, a ect the relatedness of the
entities, we constructed di erent models of aggregated graphs as described in
Section 4.1. We then applied variations of Jaccard methods described previously
and our proposed extended Jaccard similarity with Reciprocal PageRank over
the models.</p>
          <p>Table 5 shows the comparison of Spearman Correlations to the KORE gold
standard of di erent methods from each dataset over time-varying graphs and
aggregated graphs. ?, , and show that the results are signi cantly lower
than the union model with the Extended Jaccard with Reciprocal PageRank
where p-value &lt; 0.1, p-value &lt; 0.05, p-value &lt; 0.01 and p-value &lt; 0.001
respectively. We use P for direct predecessors, S for direct successors, P+S for both
direct predecessors and RP for reciprocal PageRank. We can see that
aggregating temporal information as a union graph with the Extended Jaccard with</p>
        </sec>
        <sec id="sec-5-1-3">
          <title>8 https://en.wikipedia.org/wiki/Justin_Theroux</title>
        </sec>
        <sec id="sec-5-1-4">
          <title>9 http://people.com/celebrity/jennifer-aniston-justin-theroux-engaged/ 10 http://people.com/celebrity/week-ahead-brad-jen-finalize-divorce/</title>
          <p>Reciprocal PageRank gives more relatedness accuracy than the results from each
dataset from time-varying graphs. Intersection graph gives the least relatedness
accuracy to the KORE dataset but it represents entities that are strongly
related to each other at all time. For instance, the top 3 entities with the highest
relatedness scores from the Intersection model with the Extended Jaccard with
Reciprocal PageRank of Apple Inc. are Steve Jobs, IPhone and IMac.
6</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>We have shown that our proposed graph-based extended Jaccard similarity with
reciprocal PageRank outperforms the baseline text-based approach and
graphbased Jaccard methods. Our relatedness score from DBpedia 2009 and DBpedia
2010, which is the time period when the KORE dataset was created, are the most
correlated to the KORE dataset and using the most recent version of DBpedia
loses a lot of accuracy. This shows that even in a short space of time the
evaluations made by the annotators of the KORE dataset have become outdated.
However, we show that by aggregating temporal information as one graph, the
accuracy of relatedness is better than any other results from each dataset,
demonstrating the value of considering not just the most recent version of a semantic
graph but also temporal information when performing entity relatedness.</p>
      <sec id="sec-6-1">
        <title>Acknowledgements</title>
        <p>This work was supported by Science Foundation Ireland (SFI) under Grant
Numbers SFI/12/RC/2289 (Insight). We would also like to thank Dr. John P.
McCrae for the discussions about this work.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Aggarwal</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buitelaar</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Wikipedia-based Distributional Semantics for Entity Relatedness</article-title>
          . In:
          <article-title>Association for the Advancement of Arti cial Intelligence (AAAI) Fall Symposium</article-title>
          .
          <source>AAAI-2014</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bairi</surname>
            ,
            <given-names>R.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramakrishnan</surname>
          </string-name>
          , G.:
          <article-title>On the Evolution of Wikipedia: Dynamics of Categories and Articles</article-title>
          . In: Ninth
          <source>International AAAI Conference on Web and Social Media (Apr</source>
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Borra</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weltevrede</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ciuccarelli</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaltenbrunner</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laniado</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Magni</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mauri</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rogers</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Venturini</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Societal controversies in wikipedia articles</article-title>
          .
          <source>In: Proceedings of the 33rd annual ACM conference on human factors in computing systems</source>
          . pp.
          <volume>193</volume>
          {
          <fpage>196</fpage>
          .
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ceroni</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Georgescu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gadiraju</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naini</surname>
            ,
            <given-names>K.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fisichella</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          : Information Evolution in Wikipedia.
          <source>In: Proceedings of The International Symposium on Open Collaboration</source>
          . pp.
          <volume>24</volume>
          :
          <issue>1</issue>
          {
          <fpage>24</fpage>
          :
          <fpage>10</fpage>
          . OpenSym '14,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gabrilovich</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markovitch</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis</article-title>
          .
          <source>In: Proceedings of the 20th International Joint Conference on Arti cal Intelligence</source>
          . pp.
          <volume>1606</volume>
          {
          <fpage>1611</fpage>
          . IJCAI'
          <fpage>07</fpage>
          , Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ho</surname>
            <given-names>art</given-names>
          </string-name>
          , J.,
          <string-name>
            <surname>Seufert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>D.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Theobald</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>Kore: Keyphrase overlap relatedness for entity disambiguation</article-title>
          .
          <source>In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management</source>
          . pp.
          <volume>545</volume>
          {
          <fpage>554</fpage>
          . CIKM '12,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ho</surname>
            <given-names>art</given-names>
          </string-name>
          , J.,
          <string-name>
            <surname>Suchanek</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berberich</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia</article-title>
          .
          <source>Arti cial Intelligence</source>
          <volume>194</volume>
          ,
          <fpage>28</fpage>
          {
          <fpage>61</fpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Hulpus</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prangnawarat</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hayes</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Path-Based Semantic Relatedness on Linked Data and Its Use to Word and Entity Disambiguation</article-title>
          . In: The Semantic Web - ISWC
          <year>2015</year>
          . pp.
          <volume>442</volume>
          {
          <fpage>457</fpage>
          . Springer, Cham (Oct
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Leal</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodrigues</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Queiros</surname>
          </string-name>
          , R.:
          <article-title>Computing Semantic Relatedness using DBPedia</article-title>
          . In: Simo~es,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Queiros</surname>
          </string-name>
          , R.,
          <source>da Cruz, D. (eds.) 1st Symposium on Languages, Applications and Technologies. OpenAccess Series in Informatics (OASIcs)</source>
          , vol.
          <volume>21</volume>
          , pp.
          <volume>133</volume>
          {
          <fpage>147</fpage>
          . Dagstuhl,
          <string-name>
            <surname>Germany</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Nunes</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ribeiro</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>David</surname>
          </string-name>
          , G.:
          <article-title>WikiChanges: Exposing Wikipedia Revision Activity</article-title>
          .
          <source>In: Proceedings of the 4th International Symposium on Wikis</source>
          . pp.
          <volume>25</volume>
          :
          <issue>1</issue>
          {
          <issue>25</issue>
          :
          <fpage>4</fpage>
          . WikiSym '08,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Page</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motwani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winograd</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The pagerank citation ranking: Bringing order to the web</article-title>
          .
          <source>Technical Report 1999-66</source>
          ,
          <string-name>
            <surname>Stanford</surname>
            <given-names>InfoLab</given-names>
          </string-name>
          (
          <year>November 1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Radinsky</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Agichtein</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabrilovich</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markovitch</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A Word at a Time: Computing Word Relatedness Using Temporal Semantic Analysis</article-title>
          .
          <source>In: Proceedings of the 20th International Conference on World Wide Web</source>
          . pp.
          <volume>337</volume>
          {
          <fpage>346</fpage>
          . WWW '11,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Strube</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S.P.:</given-names>
          </string-name>
          <article-title>WikiRelate! Computing Semantic Relatedness Using Wikipedia</article-title>
          .
          <source>In: Proceedings of the 21st National Conference on Arti cial Intelligence - Volume</source>
          <volume>2</volume>
          . pp.
          <volume>1419</volume>
          {
          <fpage>1424</fpage>
          . AAAI'
          <fpage>06</fpage>
          , Boston, Massachusetts (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Tanimoto</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>An Elementary Mathematical Theory of Classi cation and Prediction</article-title>
          .
          <source>International Business Machines Corporation</source>
          (
          <year>1958</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Whiting</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Jose, J.,
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Wikipedia As a Time Machine</article-title>
          .
          <source>In: Proceedings of the 23rd International Conference on World Wide Web</source>
          . pp.
          <volume>857</volume>
          {
          <fpage>862</fpage>
          . WWW '14 Companion,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>