<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Map for</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Visualizing the Drift of Linked Open Data Using Self-Organizing Maps</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Albert Meroño-Peñuela</string-name>
          <email>albert.merono@vu.nl</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Wittek</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sándor Darányi</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Vrije Universiteit Amsterdam</institution>
          ,
          <addr-line>NL</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>ICFO-The Institute of Photonic Sciences</institution>
          ,
          <addr-line>ES</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Swedish School of Library and Information Science, University of Borås</institution>
          ,
          <addr-line>SE</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2012</year>
      </pub-date>
      <volume>201</volume>
      <fpage>2</fpage>
      <lpage>05</lpage>
      <abstract>
        <p>The urge for evolving the Web into a globally shared dataspace has turned the Linked Open Data (LOD) cloud into a massive platform containing 100 billion machine-readable statements. Several factors hamper a historical study of the evolution of the LOD cloud, and hence forecast its future: its ever-growing scale, which makes a global analysis difficult; its Web-distributed nature, which challenges the analysis of its data; and the scarcity of regular and time-stamped archival dumps. Recently, a scalable implementation of self-organizing maps (SOM) has been developed to visualize the local topology of high-dimensional data. We use this methodology to address scalability issues, and the Dynamic Linked Data Observatory, a regular biweekly centralized sample of the LOD cloud as a time-stamped collection. We visualize the drift of Linked Datasets between 2012 and 2016, finding that datasets with high availability, high vocabulary reuse, and modeling with commonly used terms in the LOD cloud are better traceable across time.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Data</kwd>
        <kwd>Semantic Drift</kwd>
        <kwd>Visualization</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The original vision of the Semantic Web [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] is becoming a reality by the
publishing paradigm of Linked Data, which so far consists of a global, Web-accessible
graph of knowledge of 100 billion RDF statements known as the Linked Open
Data (LOD) cloud 4. In this Web dataspace, dynamics are the norm: new data
are added, old are removed, and schemata evolve to accommodate new
requirements and changes in the domain. Terms like semantic drift are used to refer to
problems that arise as a consequence of these dynamics. One of the typical access
methods for Linked Data is SPARQL, a protocol and query language. Datasets
become inaccessible, for instance, due to query templates that no longer
correspond to the schema [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. The maintainability of ontologies is cumbersome –
large change logs are costly and client applications cannot adapt to them. It is
impossible to have an abstract perception of the history of the LOD cloud, and
it is equally infeasible to forecast how it might evolve in the future.
      </p>
      <p>
        Recent work addressing some areas of semantic drift shows that solutions
to specific problems in this domain can provide great real-world benefits, such
as automated maintenance of biomedical ontologies [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and efficient update of
caches in applications using LOD [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. However, the global challenge of semantic
drift is emphasized by three inherent factors of structured data on the Web:
(1) its unprecedented and unconstrained scale; (2) its highly distributed nature;
and (3) the scarcity of regular and systematic timestamped dumps. Several
approaches study semantic drift over time in the context of LOD and the Semantic
Web, although none of them addresses all of these factors. For example, Wang et
al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] provide theoretical ground for heuristics under the assumption of existing
timestamped dumps, but do not address problems (1) and (2). Similarly, the well
established field of ontology evolution [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] has seen a rebirth in approaches that
model it using machine learning [
        <xref ref-type="bibr" rid="ref10 ref9">9,10</xref>
        ]. The Dynamic Linked Data Observatory
(DyLDO) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] covers (2) and (3), but does not solve the problem of scale.
Recently, a massively parallel implementation of self-organizing maps (SOM) has
proven useful to model, reduce the dimensionality of, and visualize large scale
evolving data in an efficient way [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]). In this paper, we study the requirements
derived from problems (1), (2), and (3) for visualizing Linked Data that evolve
over time; and we argue that the combination of DyLDO and Somoclu fulfills
these requirements in a comprehensive and visually explicit manner. We find
that datasets published as LOD in the period 2012-2016 with high availability,
high reuse of vocabularies5, and high adherence to commonly used terms are
better traceable in time. More specifically, the contributions of the paper are as
follows:
– We describe requirements for visualizing the temporal drift of datasets in
      </p>
      <p>LOD (Section 3);
– We argue the convenience and describe the foundations of SOM techniques
to address these requirements (Sections 2 and 4);
– We use a scalable implementation of SOM over the DyLDO dataset, a large
crawl of timestamped LOD in the period 2012-2016 (Section 4). Datasets
that were highly available with online and de-referenceable URI resources,
reused existing vocabularies in the LOD cloud, and chose common and highly
used terms to describe their resources are better traceable over time in
lowdimensional representations of high-dimensional input Linked Data.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        The problems of semantic change and drift concern various research fields. In
the areas of Semantic Web and knowledge representation, ontology evolution [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]
addresses “the timely adaptation of an ontology and consistent propagation of
changes to dependent artifacts” [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Features of evolution have been studied [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
and used for prediction using machine learning [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Gonçalves et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] use
      </p>
      <sec id="sec-2-1">
        <title>5See http://lov.okfn.org/dataset/lov/</title>
        <p>
          Description Logics to calculate differences between ontologies (so-called semantic
diffs). Wang et al. [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] define the semantics of concept change and drift, and how
to identify them. General surveys of semantic change in other fields, including
language, have recently appeared [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>
          Caching Linked Data is the primary goal of repositories like LODCache6. To
the best of our knowledge, the Dynamic Linked Data Observatory, DyLDO [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ],
is the only available crawl of LOD with rigorous timestamps.
        </p>
        <p>
          Dimensionality reduction is a standard technique in statistical data analysis
and machine learning. In sparse spaces, such as the ones we obtain from LOD, the
global topology of the space is often less important than the local regions where
many data instances have overlapping nonzero elements: techniques that focus
on preserving local topology are preferred. Self-organizing maps are an example
of this type [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. They were introduced to the study of drifts [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] which was
enabled at scale by a massively parallel implementation of the methodology [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
After training a map, each data instance will have a matching point called the
best matching unit (BMU) on the map – the immediate surroundings of this
point reflect the local topology of the original space. Intense colours on the map
indicate high distances between the original data points.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Requirements for Visualizing LOD Drifts Over Time</title>
      <p>
        We describe an empirical application of SOM to different subsequent snapshots
of the LOD cloud over time, using the Somoclu algorithm [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and the DyLDO [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
dataset7,8. Somoclu and DyLDO fulfill the requirements of Table 1, and hence
their combination poses a promising solution for visualizing LOD drifts.
Somoclu fulfills the algorithm requirements in terms of efficiency, visualization,
dimensionality reduction, topology preservation and unsupervised learning; while
DyLDO does for the requirements of centralization, sampling and provenance.
      </p>
      <sec id="sec-3-1">
        <title>6https://datahub.io/dataset/openlink-lod-cache 7http://swse.deri.org/dyldo/</title>
        <p>8All experimental data and code are available online at https://github.com/
albertmeronyo/somoclu-dyldo
Name Description
Efficiency Solutions addressing semantic drift in LOD should be
ef</p>
        <p>ficient in order to address its scale.</p>
        <p>Centralization Linked Data is scattered over the Web, and analysis of
drift is impracticable in Web-distributed data. Datasets
to be analyzed need to be centralized.</p>
        <p>Visualization The result of the analysis of drift in LOD should be an
understandable, meaningful and coherent visualization to
the human eye
Sampling The distributed architecture of the Web makes complete
studies of drift over all LOD very difficult. Samples of
different sizes are needed.</p>
        <p>Dimensionality Most features of Linked Data are usually in a
highreduction dimensional space (e.g. total number of URIs in predicate
position). Representing these in a lower-dimensional space
is necessary for human-understandable visualizations
Topology preser- Any technique reducing the dimensionality of the input
vation space must preserve its topology in order to faithfully
rep</p>
        <p>resent the subtleties of drifts in LOD
Unsupervised The high-scale of LOD makes the manual annotation of
learning instances by humans impracticable, and algorithms
analyzing drifts in LOD should operate in an unsupervised
manner.</p>
        <p>Provenance The Linked Data subject to the study of drift should
have accurate and structured provenance information,
especially regarding its collection time in the form of
timestamps
Alg./Data</p>
        <p>Alg.</p>
        <p>Data
Alg.</p>
        <p>Data
Alg.</p>
        <p>Alg.</p>
        <p>Alg.</p>
        <p>Data
the matrix construction process, at the time we focus the analysis on the usage
of vocabulary terms.</p>
        <p>First, we build an index of common unique predicates (275,412) and common
unique graph names (4,506) in all these snapshots. All these predicates and graph
names occur in all snapshots. Secondly, for each snapshot we build a sparse
matrix that indicates the frequency of each predicate in each graph name. These
highly sparse vectors characterize the contents of the graphs: every instance
is a vector containing counts of each possible predicate in that graph name.
This means we generate 79 sparse matrices with 275,412 dimensions and 4,506
instances, with an average sparsity (i.e. non-zero elements) of 0.03%.
4.2</p>
        <sec id="sec-3-1-1">
          <title>Training SOMs</title>
          <p>
            Training self-organizing maps is a computationally expensive task. We rely on
a massively parallel implementation called Somoclu [
            <xref ref-type="bibr" rid="ref15">15</xref>
            ]9. This implementation
9See http://peterwittek.github.io/somoclu/. We use its Python interface for
visualization, which is available at https://pypi.python.org/pypi/somoclu/
can also handle sparse input data. Apart from the computational requirements, a
key challenge in using SOMs is the large number of parameters that require
finetuning. As a rule of thumb, toroid topology yields higher quality maps because we
avoid jamming best matching neurons along the edges as we would with a planar
layout. We seldom see fundamental topological differences between hexagonal
and square-shaped neurons, so we use the second for their easier visual treatment.
Due to memory limitations, we establish a map size of 15 rows and 25 columns.
To ensure a smooth initial map, we train it for a longer time (ten epochs instead
of three) and at a higher learning rate (0.1 instead of 0.05) than the rest. We
use the output neuron weights of snapshot t as a codebook to initialize the map
of snapshot t + 1.
4.3
          </p>
        </sec>
        <sec id="sec-3-1-2">
          <title>Results</title>
          <p>– 2012-05-13: There are at least three recognizable clusters in the middle of the
map. Some BMUs arise in the centers of these clusters, while others appear
in their neighborhoods. The named graphs that correspond to these BMUs
are, by its proximity to the cluster’s center:
• 3987: http://www.bbc.co.uk/nature/life/Morelia_amethistina;
• 312: http://en.openei.org/wiki/Special:ExportRDF/Colorado’s_7th_
congressional_district;
• 4370: http://www.salon.com/writer/glenn_greenwald/;
• 335: http://en.openei.org/wiki/Special:ExportRDF/Property:Name;
• 4376: http://www.steelers.com/;
– These clusters move over the map, and get almost perfectly merged together
by 2013-04-07. At this point important BMUs in this area are:
• 3991: http://www.bbc.co.uk/nature/life/Myodes_glareolus;
• 313: http://en.openei.org/wiki/Special:ExportRDF/Colorado;
• 4373/2: http://www.sports-reference.com/olympics/athletes/br/heinz-brandt-1.</p>
          <p>html / http://www.snee.com/bob/foaf.rdf;
• 3819: http://www.bbc.co.uk/nature/life/Gelada;
• 335: http://en.openei.org/wiki/Special:ExportRDF/Property:Name;
– 2013-8-11: The clusters have perfectly merged. The BMUs of the previous
key epochs continue appearing as “shared” cluster centers;
– End of 2013: BMUs that did qualify as important in the previous epochs,
but that did not make it as cluster centers, are not that obivously related to
the cluster centers anymore. At this point, the BBC BMU and the OpenEI
BMU seem to compete in importance in the unique remaining cluster;
– During 2014: BBC clusters get merged, so does the OpenEI by becoming
indistinguishable within the BBC cluster;
– This tendency continues until the final data epoch in 2016-03-27.</p>
          <p>10See the videos at https://youtu.be/3nK3teAzzCM, https://youtu.be/
XTcsv2i2Hlg, and https://youtu.be/-UKKIdIyKGA
(c) Map for 2013-08-11.</p>
          <p>(d) Map for 2013-12-29.
(e) Map for 2014-06-15.</p>
          <p>(f) Map for 2016-03-27.
The resulting maps described in Section 4.3 and the BMUs displayed therein
seem to indicate that certain named graphs cluster together around some very
specific datasets, namely:
– The BBC Nature data dataset. Some other BBC named graphs cluster
together outside of the main cluster, but end up merging into one. The common
vocabularies and predicates that can be found in these are RDF (rdf:type),
RDFS (rdfs:label), the Wildlife Ontology, FOAF, and Dublin Core.
– The Open Energy Information wiki export as Linked Data. Like the BBC
graphs, some other OpenEI named graphs are clustered separately before
merging into one. The common vocabularies and predicates that are found in
these are RDF (rdf:type), Semantic MediaWiki terms, OWL (owl:imports,
owl:sameAs), RDFS (rdfs:label), and custom OpenEI ontological terms.
– Olympics athletes Linked Data / FOAF file of Bob DuCharme, and more
generally pages with personal details and FOAF data. In the olympics
athletes pages, the used vocabularies are Facebook, the Open Graph Protocol,
and XHTML; while in personal page’s Linked Data the most common are
RDFS (rdfs:seeAlso), RDF, and FOAF.</p>
          <p>Besides the common usage of typical terms of RDF and RDFS, these datasets
represent some significant actors of the LOD cloud: life sciences data and BBC
datasets, encyclopedic knowledge coming from wikis, and RDF/RDFa data
embedded in personal pages describing personal profiles. Apart from their
fundamental differences (e.g. usage of FOAF, MediaWiki), subsequent SOMs tend to
emphasize their common characteristics, grouping them. A fundamental question
is, though, what makes these datasets so different from the surrounding, planar
surface around the main clusters? We conjecture about plausible explanations
for this. First, the higher number of instances using these terms might lead to
a higher frequency in their sparse matrices. Second, the SOM maps reward the
stability of these datasets versus others that changed their used vocabularies over
time. Third, we observe that these datasets tend to reuse existing vocabularies,
instead of minting their own (with few exceptions). Finally, all of them reuse
very common terms that tend appear elsewhere in the LOD cloud. Anyhow,
Figure 1 highlights publishers that kept their data accessible and that did not
introduce new terminology nor removed it. We observe that this corroborates
how systematic the DyLDO crawls were.
5</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions and Future Work</title>
      <p>In this paper we studied the requirements for visualizing large amounts of Linked
Data that evolve over time, and proposed candidate solutions for fulfilling them.
These candidate solutions were Somoclu, an efficient parallel implementation of
self-organizing maps – an unsupervised dimensionality reduction technique; and
DyLDO, a timestamped and systematic sample of the LOD cloud between 2012
and 2016. Our findings arose from an animation that told a story of stability,
but that emphasized datasets with high availability, high vocabulary reuse, and
priority for frequently used terms.</p>
      <p>Many paths remain open, and we plan on extending this work in several
ways. First, we will prepare the timestamped LOD with different assumptions
beyond usage of predicates in named graphs. Second, we will treat different
scalable datasets as both content repositories and fodders for semantic reasoning.
Finally, we will integrate metadata of version control systems and data
engineering workflows to better visualize, and understand, the semantic drift of Web
data over time.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgment</title>
      <p>The second and third author acknowledge research funding by the European
Commission Seventh Framework Programme under Grant Agreement Number
FP7-601138 PERICLES.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Alexander</given-names>
            <surname>Mäedche</surname>
          </string-name>
          , Boris Motik, and Ljiljana Stojanovic:
          <article-title>Managing multiple and distributed ontologies in the Semantic Web</article-title>
          .
          <source>The VLDB Journal - The International Journal on Very Large Data Bases</source>
          <volume>12</volume>
          (
          <issue>4</issue>
          ),
          <fpage>286</fpage>
          -
          <lpage>300</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hendler</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lassila</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>The Semantic Web</article-title>
          .
          <source>Scientific American</source>
          <volume>284</volume>
          (
          <issue>5</issue>
          ),
          <fpage>34</fpage>
          -
          <lpage>43</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Dividino</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gottron</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scherp</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Strategies for efficiently keeping local linked open data caches up-to-date</article-title>
          .
          <source>In: Proceedings of ISWC-15</source>
          , 14th International Semantic Web Conference. pp.
          <fpage>356</fpage>
          -
          <lpage>373</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gonçalves</surname>
            ,
            <given-names>R.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Analysing Multiple Versions of an Ontology : A Study of the NCI Thesaurus</article-title>
          .
          <source>In: Proceedings of DL-11, 24th International Workshop on Description Logics</source>
          . vol.
          <volume>745</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Käfer</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abdelrahman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Umbrich</surname>
            , J.,
            <given-names>O</given-names>
          </string-name>
          <string-name>
            <surname>'Byrne</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Observing linked data dynamics</article-title>
          .
          <source>In: Proceedings of ESWC-13, 10th International Conference on the Semantic Web: Semantics and Big Data</source>
          . pp.
          <fpage>213</fpage>
          -
          <lpage>227</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kohonen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Self-Organizing Maps</surname>
          </string-name>
          . Springer (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Leenheer</surname>
            ,
            <given-names>P.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mens</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Ontology evolution: State of the art and future directions</article-title>
          .
          <source>In: Ontology Management for the Semantic Web, Semantic Web Services, and Business Applications</source>
          . Springer (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Meroño-Peñuela</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Semantic Web for the Humanities</article-title>
          .
          <source>In: Proceedings of ESWC13, 10th International Conference The Semantic Web: Semantics and Big Data</source>
          . pp.
          <fpage>645</fpage>
          -
          <lpage>649</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Meroño-Peñuela</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Refining Statistical Data on the Web</article-title>
          .
          <source>Ph.D. thesis</source>
          , Vrije Universiteit Amsterdam (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Pesquita</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Couto</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          :
          <article-title>Predicting the Extension of Biomedical Ontologies</article-title>
          .
          <source>PLoS Computational Biology</source>
          <volume>8</volume>
          (
          <issue>9</issue>
          ),
          <year>e1002630</year>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Stavropoulos</surname>
            ,
            <given-names>T.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andreadis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Riga,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Kontopoulos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Mitzias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Kompatsiaris</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.:</surname>
          </string-name>
          <article-title>A framework for measuring semantic drift in ontologies</article-title>
          .
          <source>In: Proceedings of SuCCESS-16, 1st Int. Workshop on Semantic Change &amp; Evolving Semantics, colocated with the 12th European Conference on Semantics Systems (SEMANTiCS16)</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Stojanovic</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Methods and Tools for Ontology Evolution</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Karlsruhe (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schlobach</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>M.C.A.</given-names>
          </string-name>
          :
          <article-title>What Is Concept Drift</article-title>
          and How to Measure It? In: Proceedings of EKAW-
          <volume>10</volume>
          , 17th International Conference on
          <article-title>Knowledge Engineering and Management by the Masses</article-title>
          . pp.
          <fpage>241</fpage>
          -
          <lpage>256</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Wittek</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Darányi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kontopoulos</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moysiadis</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kompatsiaris</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Monitoring term drift based on semantic consistency in an evolving vector field</article-title>
          .
          <source>In: Proceedings of IJCNN-15, International Joint Conference on Neural Networks</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Wittek</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>S.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lim</surname>
            ,
            <given-names>I.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Somoclu: An efficient parallel library for self-organizing maps</article-title>
          .
          <source>arXiv:1305.1422</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>