<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Statistical Comparison of Current Knowledge Bases</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Michael Färber</string-name>
          <email>michael.faerber@kit.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Achim Rettinger</string-name>
          <email>rettinger@kit.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute AIFB, Karlsruhe Institute of Technology (KIT)</institution>
          ,
          <addr-line>Karlsruhe</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>18</fpage>
      <lpage>21</lpage>
      <abstract>
        <p>In the last years, many knowledge bases have been developed and used in real-world applications. These include DBpedia, Wikidata, and YAGO which all cover general knowledge and therefore similar topics. In this poster, we present statistical measurements on these KBs. Our experiments reveal that despite that fact that these KBs cover the same domains to a considerable amount, they di er from each other signi cantly w.r.t. their graph-based structure and ontological aspects.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Knowledge Bases</kwd>
        <kwd>Knowledge Graphs</kwd>
        <kwd>Statistics</kwd>
        <kwd>Metrics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>In the last years, several knowledge bases (KBs) have been
developed and found their way into industrial applications.
Although KBs have been used a lot, to the best of our
knowledge, comparative studies on the statistical characteristics of
KBs are very limited so far. This is in particular true for the
KBs DBpedia, Wikidata, and YAGO. These KBs are freely
available and do not cover a speci c domain, but general
knowledge in general. In this paper, we focus on these KBs
and exhibit their particularities w.r.t. their structural and
ontological conditions. Based on the fact that these KBs are
{ from a conceptual point of view { directed graphs
consisting of RDF triples,1 we come up with simple graph-based
and RDF-based metrics such as in-degree, out-degree and a
variety of other metrics. Given the results of these metrics,
we can gain a better insight into the particularities of these
current KBs and learn to what extent they di er from each
other.
1See http://www.w3.org/RDF/.</p>
      <p>Hence, our main contributions in this paper are:
We calculate a variety of statistical measurements on
the widely used KBs DBpedia, Wikidata, and YAGO.
We give an analysis regarding these results.</p>
      <p>We make our framework for statistical analysis of KBs
available for the public2 so that other KBs can be easily
integrated.</p>
      <p>The remainder of this paper is organized as follows: First we
give an overview of related work of semantic graph analysis.
We then introduce the KBs which we selected for our
analysis, and provide details regarding the current versions of the
KBs. We then present the results of applying several
graphbased and semantics-based metrics on the KB datasets in
question. After discussing particularities of our analysis in
Section 3, we conclude in Section 4.</p>
    </sec>
    <sec id="sec-2">
      <title>2. COMPARISON OF KNOWLEDGE BASES</title>
    </sec>
    <sec id="sec-3">
      <title>2.1 Related Work</title>
      <p>
        Firstly, some work on the analyis of the graph structure of
the (HTML) Web has been carried out. Early studies of Web
topology were published already in the 1990s (see, e.g., [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]).
In 2000, Broder et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] found out that the structure of the
Web can be modeled in the shape of a bow tie. Rather
recently, Donato et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] developed some models which were
brought into accordance with their crawl dataset regarding
some characteristics such as the power law distribution for
degree.
      </p>
      <p>
        Secondly, related work has been carried out on the analysis
of the Linked Open Data (LOD) cloud:3 Rodriguez [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], for
instance, analyzed the graph of data sources in the LOD
cloud. Among other things, he concluded that, despite the
general assumption of the LOD cloud being a crowded \ravel",
the LOD cloud can be disaggregated into a component around
DBpedia and another component around DBLP.4 Gueret et
al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] con rmed that observation, but added a third
component around UniProt.5
Thirdly, a few analyses of single ontologies [
        <xref ref-type="bibr" rid="ref11 ref5 ref7">11, 7, 5</xref>
        ] were
made { as we do it in this paper: Theoharis et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]
focus on power-law degree distributions. According to them,
2The implementation of the framework is available for
download at http://www.aifb.kit.edu/web/KB-Statistics.
3See http://lod-cloud.net.
4See http://dblp.uni-trier.de. DBLP contains
bibliographical information and is not domain-independent.
5See http.//www.uniprot.org.
ontologies exhibit power law degree distributions as soon as
they have a su cient number of predicates or classes. In
this paper, we also calculate degree distributions and
examine whether they follow a power-law. Hoser et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] applied
social network analysis on the two ontologies SWRC6 and
Suggested Upper Merged Ontology (SUMO).7 According to
the authors, eigenvalue analysis provides deep insights into
the structure and focus of the ontology. In our work, in
contrary, we do not take eigenvectors into consideration. In the
context of describing and evaluating a benchmark generator
for Linked Data, Duan et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] used measurements such as
indegree and number of distinct subjects/objects of speci c
KBs such as DBpedia and YAGO (as of 2011). Their work
is therefore mostly related to our work. Duan et al. found
out that there is a bad t between the degree distribution
of the Semantic Web benchmark and curated Linked Data
datasets. They propose a new metric called coherence since
the existing graph-based metrics do not make a point about
the quality of a KB. However, as we see in our experiments,
this metric is not properly applicable for our KBs.
      </p>
    </sec>
    <sec id="sec-4">
      <title>2.2 Overview of the Knowledge Bases</title>
      <p>In the following, we shortly describe the di erent KBs which
we analyze in the following sections. We focus on these three
KBs since they cover general, cross-domain knowledge and
similar topics.</p>
      <p>
        DBpedia: DBpedia8 is the most popular and
prominent KB in the LOD cloud [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Since the rst public
release in 2007, DBpedia is updated roughly once a
year.9 DBpedia is created from automatically-extracted
structured information contained in the Wikipedia, such
as from infobox tables, categorization information,
geocoordinates, and external links. Due to its role as the
hub of Linked Open Data, DBpedia contains many
links to other datasets in the LOD cloud. DBpedia is
used extensively in the Semantic Web research
community, but is also relevant in commercial settings:
companies use it to organize their content, such as the
BBC [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and the New York Times [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. In our
experiments, we use the latest version of DBpedia, which is
DBpedia 2014.10
Wikidata: Wikidata11 started on October 30, 2012
as a project of Wikimedia Deutschland. The aim of
the project is to provide data which can be used by
any Wikimedia project, including Wikipedia.
Wikidata does not only store facts, but also the
corresponding sources, so that the validity of facts can be
checked. Labels, aliases, and descriptions for entities
in Wikidata are provided in more than 350 languages.
Wikidata is a community e ort, i.e., users
collaboratively add and edit information. Also, the schema is
maintained and extended based on community
agreements. In the near future, Wikidata will grow due to
6See http://ontobroker.semanticweb.org/ontologies/
swrc-onto-2001-12-11.oxml.
7See http://www.ontologyportal.org.
8See http://dbpedia.org.
9There is also DBpedia live which is updated when
Wikipedia is updated. See http://live.dbpedia.org.
10See our website for a list of the dump les used in our
experiments.
11See http://wikidata.org.
the integration of Freebase data.12 Our experiments
on Wikidata are based on the Wikidata simple
statements dataset from February 2015.13
YAGO: YAGO14 { Yet Another Great Ontology {
has been developed at the Max Planck Institute for
Computer Science in Saarbrucken since 2007. YAGO
comprises information extracted from the Wikipedia,
WordNet15, and GeoNames.16 As of March 24, 2015,
YAGO3 is available, which we use in our experiments.
Since the YAGO3 data set was not available in triple
format at the time of the experiments, we transformed
the available tsv les into the triple format.
      </p>
    </sec>
    <sec id="sec-5">
      <title>2.3 Analysis of the Knowledge Bases</title>
      <sec id="sec-5-1">
        <title>2.3.1 Number of Triples</title>
        <p>Comparing the number of triples in the di erent KBs (see
Figure 1a), we can see that YAGO has much more triples
than DBpedia or Wikidata. One reason for that might be
that in case of YAGO (and Wikidata) there was only one
dataset with all covered languages given (containing labels
in di erent languages), while for DBpedia we could restrict
the KB to the English language. Wikidata is rather small,
since knowledge stored in Wikidata was not extracted from
one text corpus { as in case of DBpedia {, but created by
users of the Wikidata community.</p>
      </sec>
      <sec id="sec-5-2">
        <title>2.3.2 Disk Space</title>
        <p>As visible in Figure 1b and as expectedly, the measured disk
space is directly correlated to the number of triples. Figure
1c shows the relative disk space. Interesting is here the fact
that { despite the relatively small number of triples {
Wikidata requires much less disk space than the other KBs. The
reason for that is that Wikidata uses non-human readable
URIs (such as http://wikidata.org/entity/Q1040) while
the other KBs rely on human-readable URIs (e.g., http://
dbpedia.org/resource/Karlsruhe and http://yago.org/
resource/Karlsruhe). In case of Wikidata, the
humanreadable labels for entities and properties are stored
separately.</p>
      </sec>
      <sec id="sec-5-3">
        <title>2.3.3 Number of Distinct Subjects and Number of</title>
      </sec>
      <sec id="sec-5-4">
        <title>Distinct Objects</title>
        <p>Comparing the number of distinct subjects across the KBs
in question (see Figure 1d) and the number of distinct
objects (see Figure 1e), it becomes apparent that DBpedia has
relatively few distinct subjects, but instead more distinct
objects. In other words: The set of resources with outgoing
edges is signi cantly smaller than the set of resources with
incoming edges (ratio 1 : 1:6). YAGO, in contrast, has the
opposite characteristic (ratio 21 : 1). Figure 1f and 1g show
the ratio of the set of distinct subjects/objects w.r.t. to the
entire set of resources in the KBs. Notable is that in case of
YAGO, only to relatively few resources is linked.
12See https://plus.google.com/u/0/
109936836907132434202/posts/bu3z2wVqcQc
13See http://tools.wmflabs.org/wikidata-exports/rdf/
exports/20150223/.
14See http://www.mpi-inf.mpg.de/departments/
databases-and-information-systems/research/
yago-naga/yago/downloads/
15See https://wordnet.princeton.edu.
16See www.geonames.org.</p>
        <p>Relative Diskspace
0 DBpedia 2014
70
60
re50
e
g
ide40
n
e
rg30
a
e
vA20
10
1010
108
s
noed106
f
o
rbe104
m
u
N102
100100
3
tsc2.5
e
jfsub 2
o
re1.5
b
m
u 1
N
0.5
10x 104
se 8
itr
e
frop 6
p
o
re 4
b
m
u
N2
20
ee15
r
g
e
d
in10
e
g
a
r
ve
A5
(j) Average number of
instances per class
Indegree Distribution</p>
        <p>DBPedia 2014 20
Wikidata
YAGO3</p>
        <p>Average outdegree
(l) Average indegree with
literals
Outdegree Distribution</p>
        <p>DBPedia 2014
Wikidata
YAGO3
0 DBpedia 2014</p>
        <p>Wikidata</p>
        <p>YAGO3
0 DBpedia 2014</p>
        <p>Wikidata</p>
        <p>YAGO3
(k) Average indegree
10
s
le 8
p
itfr
ro 6
e
b
um4
N
2</p>
      </sec>
      <sec id="sec-5-5">
        <title>2.3.4 Number of Distinct Properties</title>
        <p>From our analysis regarding the number of distinct
properties (see Figure 1h) we can derive that the used Wikidata
RDF version contains only around 1,323 distinct properties.
The reason for that is that properties are carefully
introduced by the Wikidata community and go through an
extensive discussion process before they are released for usage.
DBpedia contains many properties. However, they are very
heterogeneous and the non-mapping-based properties17 (i.e.,
properties which were extracted not based on human-de ned
mappings, but solely as they appeared in the info-boxes in
Wikipedia) are often very noisy.18 A similar situation holds
for YAGO.
17I.e. properties having the URI pre x
http://dbpedia.org/property/.
18There are, for instance, 53,930 triples with the property
http://dbpedia.org/property/s in DBpedia 2014 which
has obviously no meaning.</p>
      </sec>
      <sec id="sec-5-6">
        <title>2.3.5 Number of Distinct Classes</title>
        <p>For calculating the number of distinct classes (see Figure 1i),
we iterated over all instances contained in the KB datasets
and took the objects of the relation rdf:type.19 Although
DBpedia often contains several classes according to this
classassignment method, we only retrieved 526 distinct classes.
The small number in case of Wikidata can be justi ed again
by the community approach of Wikidata. YAGO has a
astonishing number of distinct classes since YAGO is mainly
an ontology, i.e., containing class-based information such as
the classes of the WordNet taxonomy. This last fact
becomes apparent in Figure 1j where the average number of
instances per class is visualized.</p>
      </sec>
      <sec id="sec-5-7">
        <title>2.3.6 Indegree</title>
        <p>Comparing the average indegree (de ned as the average
number of inlinks per node; see Figure 1k) where no triples with
19Standing for
22-rdf-syntax-ns#type.</p>
        <p>http://www.w3.org/1999/02/
literals (values) on the object position were considered and
comparing the average indegree where triples with literals
were considered in addition (see Figure 1l), we can see that
in general (i.e., for all KBs) the average indegree with
literals is much lower than the average indegree where no literals
were counted. The indegree for DBpedia and Wikidata is
roughly the same. One reason might be that a considerable
amount of Wikidata was taken from Wikipedia. It can be
assumed that YAGO has a higher average indegree than
DBpedia and Wikidata, since YAGO comprises many di erent
ontologies.</p>
        <p>The indegree distribution diagram (see Figure 1m) shows
almost ideal logarithmic decreases of the number of nodes
for all considered KBs. This is especially interesting since all
KBs were created in di erent ways: automatically extracted
from Wikipedia (DBpedia), partly created by the
community (Wikidata), or composed of several sources which were
used partly automatically, partly manually (YAGO). In the
light of the gure we can also con rm that the power law
is still applicable to the indegree distribution of semantic
graphs such as the considered KBs.</p>
      </sec>
      <sec id="sec-5-8">
        <title>2.3.7 Outdegree</title>
        <p>
          Considering the average outdegree for each KB (de ned as
the average number of outlinks per node; see Figure 1n), we
can see that nodes in the DBpedia knowledge graph have the
highest number of outgoing links on average. Wikidata
contains currently some domains of knowledge which are
represented very densely (such as persons) while other domains
are rarely covered yet. On average, however, Wikidata
performs similarly as YAGO w.r.t. the average outdegree of
nodes.20
The average outdegree of the KBs (see Figure 1o) suggest
{ as in the case of the average indegree { a power law
distribution. However, if the outdegree is low, the power law
distribution is broken. This con rms the theory of [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] which
states that a su cient number of predicates or classes is
necessary for observing a power law distribution.
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>3. LESSONS LEARNED</title>
      <p>
        According to Theoharis et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], ontologies exhibit power
law degree distributions as soon as they have a su cient
number of predicates or classes. Based on our experiments,
we can con rm that for the KBs we considered.
      </p>
      <p>
        Duan et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] stated that \traditional" graph analysis
metrics such as the degree or the number of classes are not
suitable when KBs should be compared. Given our
experimental results, we can con rm that to a certain extent. Duan
et al. proposed a new metric called coherence metric where
the \ lling degree" of all entities of the di erent classes is
calculated and aggregated. This might be a good indicator,
however, the calculation for our KBs is tricky, since we
often do not know the set of possible properties an entity of
a speci c class is able to have. Iterating over all existing
properties of entities of this class is problematic since the
KBs are often very noisy (di erent properties use the same
meaning, di erent object types are used for the same
property, etc.) and the considered KBs may contain multiple
classes per instance.
20The outlier where the outdegree is 108 can be traced back
to the fact that Wikidata contains many blank nodes with
a high outdegree.
      </p>
    </sec>
    <sec id="sec-7">
      <title>4. CONCLUSIONS</title>
      <p>A measurement how current knowledge bases such as
DBpedia, Wikidata, and YAGO look like and how they are
structured, is to a large extent missing. In this paper, we
presented a (freely available) framework for statistical
analysis of KBs where any KB with triple format can easily be
integrated. We calculated a variety of statistical
measurements on the KBs DBpedia, Wikidata, and YAGO, since
they all cover general knowledge and are used in many
applications. Our investigations revealed that all current KBs
performed very di erently w.r.t. the presented metrics.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgement</title>
      <p>This work was carried out with the support of the German
Federal Ministry of Education and Research (BMBF) within
the Software Campus project SUITE (Grant 01IS12051).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          , G. Kobilarov,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          , and
          <string-name>
            <surname>Z. Ives.</surname>
          </string-name>
          <article-title>DBpedia: A Nucleus for a Web of Open Data</article-title>
          .
          <source>In Proceedings of the 6th ISWC and 2nd ASWC</source>
          , pages
          <volume>722</volume>
          {
          <fpage>735</fpage>
          . Springer,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Bray</surname>
          </string-name>
          .
          <article-title>Measuring the Web</article-title>
          .
          <source>In Proceedings of the Fifth International World Wide Web Conference on Computer Networks and ISDN Systems</source>
          , pages
          <fpage>993</fpage>
          {
          <fpage>1005</fpage>
          . Elsevier Science Publishers B. V.,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Broder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Maghoul</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Raghavan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Rajagopalan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Stata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tomkins</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Wiener</surname>
          </string-name>
          .
          <article-title>Graph structure in the web</article-title>
          .
          <source>Computer networks</source>
          ,
          <volume>33</volume>
          (
          <issue>1</issue>
          ):
          <volume>309</volume>
          {
          <fpage>320</fpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D.</given-names>
            <surname>Donato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Laura</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Leonardi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Millozzi</surname>
          </string-name>
          .
          <article-title>The Web As a Graph: How Far We Are</article-title>
          .
          <source>ACM Trans. Internet Technol.</source>
          ,
          <volume>7</volume>
          (
          <issue>1</issue>
          ), Feb.
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Duan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kementsietsidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Srinivas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>O.</given-names>
            <surname>Udrea</surname>
          </string-name>
          .
          <article-title>Apples and Oranges: A Comparison of RDF Benchmarks and Real RDF Datasets</article-title>
          .
          <source>In Proceedings of the 2011 ACM SIGMOD</source>
          , pages
          <volume>145</volume>
          {
          <fpage>156</fpage>
          , New York, NY, USA,
          <year>2011</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Gueret</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Schlohbach</surname>
          </string-name>
          .
          <article-title>The Web of Data is a Complex System { First Insight into Its Multi-Scale Network Properties</article-title>
          .
          <source>In Proceedings of the European Conference on Complex Systems</source>
          , pages
          <fpage>1</fpage>
          {
          <fpage>12</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>Hoser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hotho</surname>
          </string-name>
          , R. Jaschke, C. Schmitz, and
          <string-name>
            <given-names>G.</given-names>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>Semantic Network Analysis of Ontologies</article-title>
          . In Y. Sure and J. Domingue, editors,
          <source>The Semantic Web: Research and Applications</source>
          , pages
          <volume>514</volume>
          {
          <fpage>529</fpage>
          . Springer Berlin Heidelberg,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Kobilarov</surname>
          </string-name>
          et al.
          <article-title>Media Meets Semantic Web { How the BBC Uses DBpedia and Linked Data to Make Connections</article-title>
          .
          <source>In Proceedings of the 6th ESWC</source>
          , pages
          <volume>723</volume>
          {
          <fpage>737</fpage>
          , Berlin, Heidelberg,
          <year>2009</year>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Rodriguez</surname>
          </string-name>
          .
          <article-title>A graph analysis of the Linked Data cloud</article-title>
          .
          <source>arXiv preprint arXiv:0903.0194</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>E.</given-names>
            <surname>Sandhaus</surname>
          </string-name>
          . Semantic Technology at the New York Times:
          <article-title>Lessons Learned and Future Directions</article-title>
          .
          <source>In Proceedings of the 9th ISWC</source>
          , pages
          <volume>355</volume>
          {
          <fpage>355</fpage>
          , Berlin, Heidelberg,
          <year>2010</year>
          . Springer-Verlag.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Theoharis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tzitzikas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kotzinos</surname>
          </string-name>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Christophides</surname>
          </string-name>
          .
          <article-title>On Graph Features of Semantic Web Schemas</article-title>
          .
          <source>IEEE Trans. on Knowl. and Data Eng</source>
          .,
          <volume>20</volume>
          (
          <issue>5</issue>
          ):
          <volume>692</volume>
          {
          <fpage>702</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>