<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Study concept drift in 150-year English literature</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Twente</institution>
          ,
          <addr-line>Drienerlolaan 5, 7522 NB Enschede</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The meaning of a concept or a word changes over time. Such concept drift re ects the change of the social consensus as well. Studying concept drift over time is valuable for researchers who are interested in language or culture evolution. Recent word embedding technologies inspire us to automatically detect concept drift in large-scale corpora. However, comparing embeddings generated from di erent corpora is a complex task. In this paper, we propose to use a simple approach for detecting concept drift based on the change in word contexts from di erent time periods and apply it to subsequent time periods so that the detailed drift could be detected and visualised. We dive into certain words to track how the meaning of a word changes gradually over a long time span with relevant historical events which demonstrates the e ect of our method.</p>
      </abstract>
      <kwd-group>
        <kwd>concept drifting</kwd>
        <kwd>word embedding</kwd>
        <kwd>historical event</kwd>
        <kwd>word context</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Concept drift or diachronic semantic shift studies how the meaning of a concept
or a word changes over time [
        <xref ref-type="bibr" rid="ref12 ref15">15, 12</xref>
        ]. Concept drift re ects the change of the
social consensus. For example, the word gay was originally used to mean carefree,
cheerful, or bright [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. However, from the 1960s, the word gay started to describe
homosexual men [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Studying concept drift over time is valuable for researchers
who are interested in language or culture evolution. For people who want to
identify societal changes in literature, who research on historical texts, such as
librarians, historians or linguists, it is desirable if they can discover potential
concept drift in large-scale textual content before conducting in-depth investigation.
Automatically identifying concept drift over time can improve their e ciency.
      </p>
      <p>
        Recent word embedding technologies inspire us to automatically detect
concept drift in large-scale corpora [
        <xref ref-type="bibr" rid="ref5 ref6 ref9">9, 6, 5</xref>
        ] . However, comparing embeddings
generated from di erent corpora is a complex task [
        <xref ref-type="bibr" rid="ref5 ref6">6, 5</xref>
        ]. How to visually inspect
concept drift is also a challenge [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
      <p>In this paper, we propose to use a simple approach to quantify the concept
drift based on their contexts generated from two time periods and apply it to
subsequent time periods to study concept drift over a long period of time. We
study more than 50 thousand English books published between 1800 and 1950.
We rst divide the whole period into subsequent 20-year time spans. Based on
word embeddings corresponding to each individual time span, we calculate the
context of each word at that particular period. Secondly, we measure the concept
drift by comparing the contexts of the same word from di erent periods. Looking
how the context changes from the starting period to the ending period, we can
easily identify the most dynamic words over the 150 years. For these words which
potentially underwent drastic change in their meaning, we further measure how
their contexts change in subsequent time periods and visualise the change over
time. Some interesting associations to historical events are also discovered.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Diachronic semantic shifts has gained much attention because of the availability
of large corpora and recent success in computing distributional semantics [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] in
natural languages [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. When computing distributional semantics, words are
represented by sparse or, more recently, dense vectors based on their co-occurrences
in a corpus. In other words, words are embedded in a semantic space and, more
importantly, similar words are embedded nearby each other in this space.
      </p>
      <p>
        To study semantic shift, it is therefore natural to rst construct word
embeddings in separated time periods before comparing these embeddings across
time. However, the similarity between the word embeddings generated from
separate time-speci c corpora cannot be computed directly because the stochastic
embedding algorithms could only roughly guarantee the stability of the pairwise
similarities between words but the numerical embeddings are often rotated after
each training, i.e. invariant under rotation. An earlier proposal was to
incrementally update diachronic embedding models [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], where the word embeddings
trained on the previous time period are used to initialise the training for the
current period. Later researchers proposed to align these spaces by unifying the
coordinates via local word alignments [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] or by projecting one space to another
via orthogonial procrustes [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Another approach is to study the neighbours or the context of a word at
two di erent time periods to measure semantic shift. Azarbonyad et. al. in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
used the neighbors of a word to determine its stability. Their best model uses
the traditional alignment-based method weighting the neighbors' rank and their
stability, requiring computation on whole vocabulary. Later, Gonen et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
simply took the top-k neighbors in each of the two corpora and measure the
overlap of these two lists. A smaller overlap suggests more drastic change. They
applied this method to corpora separated based on di erent criteria, such as age,
gender, profession, time of tweet. However, the concept stability of neighboring
words is unsure for a long period, for example over centuries. In this paper, we
apply this method to study English literature that spans over 150 years.
      </p>
      <p>
        More recently, researchers also proposed dynamic word embeddings models to
jointly learn word embeddings across di erent times periods [
        <xref ref-type="bibr" rid="ref17 ref2">2, 17</xref>
        ]. By enforcing
the alignments simultaneously, there is no need to train separate time-speci c
embeddings, i.e., the resulting word embeddings are time-aware already. We will
explore this aproach in the future.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Method</title>
      <p>Here, we rst describe the method which measures the changes in word meaning
based on its contexts at two di erent time periods. Secondly, we divide the whole
corpus into subsets corresponding to consequent time periods and study how the
meaning of a word change over time.
3.1</p>
      <sec id="sec-3-1">
        <title>Measuring drift based on context</title>
        <p>
          Inspired by the method proposed in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], given two periods in time, we collect
separate corpora corresponding to each period. As shown in Figure 1, we generate
embeddings for each word in the separate corpus. For each word, we calculate its
context as the top K most similar words. For the same word that occurs in both
corpora, we measure the similarity of its contexts at two di erent time periods.
This similarity re ects the changes in word meaning: the more similar the two
contexts are, the less change in meaning the word has.
Word embedding First, we need to generate embeddings for each word that
occurs in each corpus. All words in the corpus are embedded as high-dimensional
vectors, and semantically similar words embedded near to each other in a
semantic space. The purpose of this step is to use numeric vectors to represent
the meaning of words, so that the similarity between words could be computed
easily.
Word context After the embedding spaces, two semantic spaces are generated
from each corpus. These two semantic spaces can be aligned to use the same
coordinate axes before we could compares words in these two spaces, as proposed
in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. Here, we adopt the method proposed in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] to use the closest neighbours of
a word, i.e. its context, to re ect the extensional meaning of the word, therefore,
the change in context at two di erent times re ects the drift in the meaning of
the word. For each word, we select the top K most similar words as its context.
Drift based on context similarity For a word that occurs in both time
periods, how much its context changes from one period to the other re ects the
change in its meaning. Since the context is de ned as the set of top K most similar
words, we use Jaccard similarity coe cient [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] to measure the similarity between
two contexts of the same word but from two di erent time periods. Jaccard
similarity coe cient is a statistic used for gauging the similarity and diversity
of sample sets. Let A and B are two sets, the Jaccard similarity coe cient is
calculated as follows,
        </p>
        <p>J (A; B) = j A \ B j =
j A [ B j</p>
        <p>j A \ B j
j A j + j B j j A \ B j
(1)
A high Jaccard similarity coe cient suggests that the context of the word has
not changed much, while a low value suggests that the meaning of the word
might have changed over time.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Analysing concept drift over time</title>
        <p>Given a corpus which spans a long period of time, it is then possible to study
how individual words change over time. We divide the whole corpus into
multiple subsets corresponding to subsequent time periods. We measure the drift of
words from the beginning period to the last. This way, we could detect the most
dynamic and static words over the long period of time. It is also possible to look
more carefully when a word has undergone a critical moment when its meaning
changed dramatically.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Data set</title>
      <p>We download the full text content of 51,625 English books from Project
Gutenberg.1 Unfortunately, the exact years of publication for these books are missing.
However, the birth and death years of the author are available in the data. We
therefore took the average of the birth and death year of the author as the
approximate year of publication for the book.</p>
      <p>After grouping books by their year of publication, we nd that, although
the earliest books were written before 500 BC, the number of books published</p>
      <sec id="sec-4-1">
        <title>1 https://www.gutenberg.org/</title>
        <p>earlier than the 18th century is far less than that from the 18th century to the
mid 20th century. Because of the copyright restriction, books in recent years are
also limited. In this study we focus on the books published from 1800 to 1950.
We further divide the corpus into consequent groups of 20-year time spans, as
hown in Table 1.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiment and results</title>
      <sec id="sec-5-1">
        <title>Word embedding and context computation</title>
        <p>
          For each time period show in Table 1, we trained a word2vec model using the
gensim library2 using the full text content of the books published within that
period of time. We chose the continuous Bag-of-Words(CBOW) model [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], set
the vector size as 100, the minimum count 10 (ignoring words that occur less than
10 times), the window size 20 (taking 20 words behind and 20 words ahead as
the training context) and took the rest parameters as their default values. After
embedding, each word at each time period is represented as a 100-dimensional
vector. For each word at a particular time period, we calculated the top 20 most
similar words as its context in that period.
5.2
        </p>
      </sec>
      <sec id="sec-5-2">
        <title>Measuring drift</title>
        <p>After the context of a word is calculated for each time period, we can now
measure how much this word has drifted from one time period to another.
Sensibility of parameter K The model is generally stable. We examine how k
parameter, which de nes the length of similar words lists, a ect the stability.
We change k as [20,30,60,100], and the curves highly coincide with each other
in the whole plot for Fig.2, so in most part of these plot, we regard the same</p>
        <sec id="sec-5-2-1">
          <title>2 https://radimrehurek.com/gensim/models/word2vec.html</title>
          <p>distribution as proof of the stability of our model. In the following experiments,
we take k = 20.</p>
          <p>We calculate the Jaccard similarity between the context at 1800 and that
at 1950. The distribution of the Jaccard similarity is shown in Fig. 3. As the
distribution shows, very few words have stable context throughout these 150
years. Words such as sister, daughter, mother, wife and husband are rather stable
word, while other words such as witch, foster, hive and potion have changed
dramatically.</p>
          <p>Many words have completely changed their contexts. However, these words
are mostly infrequent words, such as chestnut, hive, vantage, and co n. Their
embeddings and consequent contexts are over sensitive to the corpus. We could
not make solid conclusions in terms of the drift of their meaning.</p>
          <p>This still helps us to identify interesting cases of concept drift among the
words that have a low context similarity. Once identi ed, we can dive into the
more granule time periods and inspect the drift more closely. For example, Fig. 4
shows the drift of word peace over time. The Jaccard similarities between the
current context and that of the previous period are plotted. The sharp decrease
from 1890 to 1930 suggests that there was a drastic change of the meaning of
the word peace in that period.
5.3</p>
        </sec>
      </sec>
      <sec id="sec-5-3">
        <title>Visualising individual drift</title>
        <p>The change in Jaccard similarity as shown in Fig. 4 only provides a signal of
drift, but not the content of drift. To dive deeper into what exactly happened for
speci c words in speci c periods, we present a visualization method that helps
to see how words have changed.</p>
        <p>
          Our visualization is inspired by the work of Wijaya and Yeniterzi[
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. In
their visualization, each word is a node and there is an edge between two words
if they co-occur in the same context. The width of the edges is the frequency of
co-occurrence. As shown in Fig. 5, our visualization consists of two clusters. One
is the target word with its top 20 context words at the rst time span. The other
is the target word with its top 20 context words at the second time span. The
line width shows the cosine similarity between the target word and the context
word.
        </p>
        <p>In Fig. 5, the word bishop at 1910 is mostly associated with religious words,
such as prelate, church, cathedral, and dioce while at 1950, it is more related to
the chess game, as its context includes words like checkmate, pawn, and knight.
It is worth mentioning that ktxb, bxkt, and rxkt are notation words in chess but
not meaningless words.</p>
        <p>
          Compared to the popular t-SNE visualization [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] used in [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], our
visualization is more comprehensible and intuitive to compare the intersection and
the unique section. It can also be used for tracking how the concept of a word
changed gradually over a long time span. However, this visualization often
become unreadable because of the complexity when the number of the context
words increases.
        </p>
        <p>Fig.6 shows the sequential change of the word peace from 1800 to 1950. Fig.6
(b) shows that war is in the same context with peace at 1800 and 1910. It suggests
that peace and war were often mentioned together .</p>
        <p>However, the link between war and peace disappeared in 1950. New
relationships emerged, such as spirit, humanity, tyranny, and poverty. Link to the statistics
of war. There were high-intensity con icts around the 1800s (Napoleonic Wars,
etc), 1860s (American Civil War, etc.), and 1910s (World War I). There were
few wars after World War II (1945). It makes sense that people transferred their
(a) The word peace at 1800 and 1850
(b) The word peace at 1800 and 1910
(c)The word peace at 1800 and 1950</p>
        <p>Fig. 6. Concept Drift of the word peace over time
concerns about peace into other topics like spirit, humanity, tyranny, and poverty
in the 1950s.</p>
        <p>Because of the copyright restriction, Gutenberg data set only has few books
after 1970. It limits us to apply our method to recently published books. A
potential future work for this work is to apply this approach on possible
contemporary book corpus. There would be more interesting ndings closer to our
life.
6</p>
        <p>conclusion
Concept drift re ects the change of the social consensus. Detecting word usage in
di erent periods is an important research method. We propose a computational
approach to discover the drastic concept drifts by their context in the historical
English books over centuries. It quanti es the extend of concept drift and makes
the rank of drastic change possible. We also present a new way to compare the
concept of a word in di erent periods. We show that our visualization is simple
and intuitive. It also has the unique advantage of demonstrating the gradual
change of concept overtime.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Hosein</given-names>
            <surname>Azarbonyad</surname>
          </string-name>
          , Mostafa Dehghani, Kaspar Beelen, Alexandra Arkut, Maarten Marx, and
          <string-name>
            <given-names>Jaap</given-names>
            <surname>Kamps</surname>
          </string-name>
          .
          <article-title>Words are malleable: Computing semantic shifts in political and media discourse</article-title>
          .
          <source>In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management</source>
          ,
          <source>CIKM '17, page</source>
          <volume>1509</volume>
          {
          <fpage>1518</fpage>
          , New York, NY, USA,
          <year>2017</year>
          .
          <article-title>Association for Computing Machinery</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Robert</given-names>
            <surname>Bamler</surname>
          </string-name>
          and
          <string-name>
            <given-names>Stephan</given-names>
            <surname>Mandt</surname>
          </string-name>
          .
          <article-title>Dynamic word embeddings</article-title>
          .
          <source>In Doina Precup and Yee</source>
          Whye Teh, editors,
          <source>Proceedings of the 34th International Conference on Machine Learning</source>
          , volume
          <volume>70</volume>
          <source>of Proceedings of Machine Learning Research</source>
          , pages
          <volume>380</volume>
          {
          <fpage>389</fpage>
          ,
          <string-name>
            <surname>International</surname>
            <given-names>Convention Centre</given-names>
          </string-name>
          , Sydney, Australia,
          <volume>06</volume>
          {
          <issue>11</issue>
          <year>Aug 2017</year>
          . PMLR.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. Oxford English Dictionary and
          <string-name>
            <given-names>WALTON</given-names>
            <surname>Street</surname>
          </string-name>
          .
          <article-title>Oxford english dictionary</article-title>
          .
          <source>Retrieved February</source>
          ,
          <volume>4</volume>
          :
          <year>2019</year>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>John</given-names>
            <surname>Firth</surname>
          </string-name>
          .
          <article-title>A synopsis of linguistic theory</article-title>
          .
          <year>1930</year>
          -
          <fpage>1955</fpage>
          . Blackwell,
          <year>1957</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Hila</given-names>
            <surname>Gonen</surname>
          </string-name>
          , Ganesh Jawahar, Djame Seddah, and
          <string-name>
            <given-names>Yoav</given-names>
            <surname>Goldberg</surname>
          </string-name>
          .
          <article-title>Simple, interpretable and stable method for detecting words with usage change across corpora</article-title>
          .
          <source>In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics</source>
          , pages
          <volume>538</volume>
          {
          <fpage>555</fpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>William L. Hamilton</surname>
            , Jure Leskovec, and
            <given-names>Dan</given-names>
          </string-name>
          <string-name>
            <surname>Jurafsky</surname>
          </string-name>
          .
          <article-title>Diachronic word embeddings reveal statistical laws of semantic change</article-title>
          .
          <source>In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source>
          , pages
          <fpage>1489</fpage>
          {
          <fpage>1501</fpage>
          , Berlin, Germany,
          <year>August 2016</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Archie</given-names>
            <surname>Hobson</surname>
          </string-name>
          .
          <article-title>The Oxford dictionary of di cult words</article-title>
          . Oxford University Press, USA,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Paul</given-names>
            <surname>Jaccard</surname>
          </string-name>
          .
          <article-title>The distribution of the ora in the alpine zone. 1</article-title>
          . New phytologist,
          <volume>11</volume>
          (
          <issue>2</issue>
          ):
          <volume>37</volume>
          {
          <fpage>50</fpage>
          ,
          <year>1912</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Adam</given-names>
            <surname>Jatowt</surname>
          </string-name>
          and
          <string-name>
            <given-names>Kevin</given-names>
            <surname>Duh</surname>
          </string-name>
          .
          <article-title>A framework for analyzing semantic change of words across time</article-title>
          .
          <source>In IEEE/ACM Joint Conference on Digital Libraries</source>
          , pages
          <volume>229</volume>
          {
          <fpage>238</fpage>
          . IEEE,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Yoon</surname>
            <given-names>Kim</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yi-I Chiu</surname>
            , Kentaro Hanaki, Darshan Hegde, and
            <given-names>Slav</given-names>
          </string-name>
          <string-name>
            <surname>Petrov</surname>
          </string-name>
          .
          <article-title>Temporal analysis of language through neural language models</article-title>
          .
          <source>In Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science</source>
          , pages
          <volume>61</volume>
          {
          <fpage>65</fpage>
          ,
          <string-name>
            <surname>Baltimore</surname>
            ,
            <given-names>MD</given-names>
          </string-name>
          , USA,
          <year>June 2014</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Vivek</surname>
            <given-names>Kulkarni</given-names>
          </string-name>
          , Rami Al-Rfou,
          <string-name>
            <given-names>Bryan</given-names>
            <surname>Perozzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Steven</given-names>
            <surname>Skiena</surname>
          </string-name>
          .
          <article-title>Statistically signi cant detection of linguistic change</article-title>
          .
          <source>In Proceedings of the 24th International Conference on World Wide Web, WWW '15, page</source>
          <volume>625</volume>
          {
          <fpage>635</fpage>
          , Republic and Canton of Geneva, CHE,
          <year>2015</year>
          . International World Wide Web Conferences Steering Committee.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Andrey</surname>
            <given-names>Kutuzov</given-names>
          </string-name>
          , Lilja vrelid, Terrence Szymanski, and
          <string-name>
            <given-names>Erik</given-names>
            <surname>Velldal</surname>
          </string-name>
          .
          <article-title>Diachronic word embeddings and semantic shifts: a survey</article-title>
          .
          <source>In Proceedings of the 27th International Conference on Computational Linguistics</source>
          , pages
          <volume>1384</volume>
          {
          <fpage>1397</fpage>
          ,
          <string-name>
            <surname>Santa</surname>
            <given-names>Fe</given-names>
          </string-name>
          , New Mexico, USA,
          <year>August 2018</year>
          .
          <article-title>Association for Computational Linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Tomas</surname>
            <given-names>Mikolov</given-names>
          </string-name>
          , Kai Chen,
          <string-name>
            <given-names>G. S.</given-names>
            <surname>Corrado</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <article-title>E cient estimation of word representations in vector space</article-title>
          .
          <source>In ICLR</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Laurens Van der Maaten and Geo rey Hinton.
          <article-title>Visualizing data using t-sne</article-title>
          .
          <source>Journal of machine learning research</source>
          ,
          <volume>9</volume>
          (
          <issue>11</issue>
          ),
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Shenghui</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stefan Schlobach</surname>
            , and
            <given-names>Michel</given-names>
          </string-name>
          <string-name>
            <surname>Klein</surname>
          </string-name>
          .
          <article-title>Concept drift and how to identify it</article-title>
          .
          <source>Journal of Web Semantics</source>
          ,
          <volume>9</volume>
          (
          <issue>3</issue>
          ):
          <volume>247</volume>
          {
          <fpage>265</fpage>
          ,
          <year>2011</year>
          .
          <string-name>
            <given-names>Semantic</given-names>
            <surname>Web Dynamics Semantic Web Challenge</surname>
          </string-name>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. Derry Tanti Wijaya and
          <string-name>
            <given-names>Reyyan</given-names>
            <surname>Yeniterzi</surname>
          </string-name>
          .
          <article-title>Understanding semantic change of words over centuries</article-title>
          .
          <source>In Proceedings of the 2011 international workshop on DETecting and Exploiting Cultural diversiTy on the social web</source>
          , pages
          <volume>35</volume>
          {
          <fpage>40</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Zijun</surname>
            <given-names>Yao</given-names>
          </string-name>
          , Yifan Sun, Weicong Ding,
          <string-name>
            <given-names>Nikhil</given-names>
            <surname>Rao</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Hui</given-names>
            <surname>Xiong</surname>
          </string-name>
          .
          <article-title>Dynamic word embeddings for evolving semantic discovery</article-title>
          .
          <source>In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining</source>
          , WSDM '
          <volume>18</volume>
          , page
          <volume>673</volume>
          {
          <fpage>681</fpage>
          , New York, NY, USA,
          <year>2018</year>
          .
          <article-title>Association for Computing Machinery</article-title>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>