<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>DING! Dataset Ranking using Formal Descriptions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nickolai Toupikov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ju¨ rgen Umbrich</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Renaud Delbru</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Hausenblas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giovanni Tummarello</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>DERI, National University of Ireland</institution>
          ,
          <addr-line>Galway, IDA Business Park, Galway</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2009</year>
      </pub-date>
      <volume>20</volume>
      <issue>2009</issue>
      <fpage>3</fpage>
      <lpage>7</lpage>
      <abstract>
        <p>Considering that thousands if not millions of linked datasets will be published soon, we motivate in this paper the need for an e cient and e ective way to rank interlinked datasets based on formal descriptions of their characteristics. We propose DING (from Dataset RankING) as a new approach to rank linked datasets using information provided by the voiD vocabulary. DING is a domain-independent link analysis that measures the popularity of datasets by considering the cardinality and types of the relationships. We propose also a methodology to automatically assign weights to link types. We evaluate the proposed ranking algorithm against other well known ones, such as PageRank or HITS, using synthetic voiD descriptions. Early results show that DING performs better than the standard Web ranking algorithms.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>MOTIVATION</title>
      <p>
        Following Marshall and Shipman [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], we understand linked
datasets in terms of the distributed database perspective.
The primary targeted consumers are expected to be
machines; a fair degree of automation needs to be guaranteed
in order to enable new types of Web applications. While
nowadays the number of datasets|published in accordance
to the linked data principles [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]|is somewhat limited, this
is expected to change soon. Considering thousands if not
millions of linked datasets1, one can expect to get lost, soon
when trying to identify appropriate datasets for a certain
task.
      </p>
      <p>
        Two issues come to mind when talking about selecting
(possibly many) datasets: e ciency and e ectiveness.
While the former basically refers to how fast certain datasets
can be identi ed, the latter focuses on the relevancy, that
is how well the dataset ful lls the stated requirements in a
certain context (the domain of the Web application). When
faced with a list of potential candidates, one usually wants
to rank them according to certain criteria in order to select
the most relevant ones.
1A simple estimation might support this argument: take for
example relational databases such as MySQL found in nearly
every modern Web application, or the manifold
repositories in the software development domain (for example, CVS
or SVN) or registries (LDAP, OPACs, etc.)|each of them,
once on the Web of Data (using out-of-the-box linked data
publishing tools such as Triplify [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) represents at least one
dataset.
      </p>
      <p>
        Our thesis at hand now is that, based on a formal
(highlevel) description of a dataset's content and interlinking
provided by voiD, Semantic Web clients can e ectively rank
datasets using well-known strategies such as PageRank [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
or HITS [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] in a very e cient way. Without such
highlevel descriptions, the client would have to \crawl" a large
number of documents in order to analyze and derive precise
statistics about the content of a dataset, hereby requiring
an excessive amount of time and resources.
      </p>
      <p>The rest of the paper is structured as follows: the next
section discusses exitsing approaches. Then, we lay out
the foundations regarding the formal description of linked
datasets in sec. 3 and render our proposal in detail (sec. 4).
Further, in sec. 5, we report on early ndings when
comparing our approach to widely used ones such as PageRank
or HITS. We conclude in sec. 6 by discussing the proposed
ranking methodology and point out possible future steps.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>EXISTING WORKS</title>
      <p>
        Link analysis has proven to be e ective for query
independent quality web search. PageRanks [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and HITS [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] have
been successfully applied to measure the importance of web
pages by analysing their link structure. These two
algorithms consider only one type of links, i.e. hyperlinks, but
has been shown to improve the e ectiveness of web search
engines [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ].
      </p>
      <p>When working on a ner granularity level - such as
entity level - with more heterogeneous links, the previous
approaches are no longer applicable. In such condition, by
assuming that links are equivalent, the analysis of entity
relationships does not provide accurate results since links
of di erent types can have various impact on the ranking
computation.</p>
      <p>
        Recent works [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ] have extended PageRank to consider
di erent types of relations between entities or objects.
PopRank [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], a domain independent object-level link analysis,
proposes a machine learning approach to automatically
assigns a \popularity propagation factor" to each type of
relations. ObjectRank [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] goes further by applying
authoritybased ranking to keyword search in databases where various
objects are connected with semantic relations.
      </p>
      <p>
        The Swoogle search engine [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] was the rst one to
propose, OntoRank, an adaptation of PageRank for Semantic
Web resources. In their work, they compute popularity of
resources based on three levels of granularity: documents,
terms and RDF graphs. In [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], a link analysis is applied
at query time for computing the popularity of resources
and contexts (which can be seen as documents or datasets).
Their approach di erentiates two levels of link analysis,
resources and context graphs, and the di erent relationships
between them.
      </p>
      <p>In this paper, we are studying how to improve search
results by ranking datasets according to their popularity. Our
approach is based on link analysis between datasets by
using the information provided by the voiD descriptions. We
consider the types of relationships but also the cardinality of
link sets. We propose also an automatic weighting scheme
to nd appropriate weights for relation types.</p>
    </sec>
    <sec id="sec-3">
      <title>DESCRIBING DATASETS</title>
      <p>
        In order to realise our vision of a semantic ranking, we
build upon a formal description of the datasets and their
interlinking. Only recently the Vocabulary of Interlinked
Datasets (voiD) [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] has been released; voiD is an RDFS
vocabulary for describing linked datasets. A dataset in voiD
is \a collection of data, published and maintained by a
single provider, available as RDF, and accessible, for example,
through dereferenceable HTTP URIs or a SPARQL
endpoint". Interlinking in voiD is modeled utilising a so called
linksets. A linkset in voiD is \a subset of a dataset, used
for storing triples to express the interlinking relationship
between datasets; in each interlinking triple, the subject is a
resource hosted in one dataset and the object is a resource
hosted in another dataset".
      </p>
      <p>
        Given that such voiD descriptions are published alongside
with the datasets, they can be collected via pings, by
crawling, or simply follow-your-nose by a semantic indexer such
as Sindice [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] or the Yahoo! Search Monkey [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. We
assume such a collection of voiD descriptions in the following.
We note further that, as voiD being metadata about linked
data, is RDF-grounded, we can use all current RDF tools
and libraries to process, store and visualise it. Further, it
is perfectly possible to go from the meta-level to the
metameta-level, that is having a voiD description about voiD
descriptions.
      </p>
    </sec>
    <sec id="sec-4">
      <title>DING—DATASET RANKING</title>
      <p>Our proposal for a semantic ranking of RDF datasets is
called DING (from Dataset RankING) and is based on voiD
descriptions of the datasets.
4.1</p>
    </sec>
    <sec id="sec-5">
      <title>Exploiting voiD’s characteristics</title>
      <p>
        Based on the voiD guide [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] we will review the relevant
features of voiD in the following and discuss their suitability
with respect to dataset selection and ranking.
      </p>
      <p>
        The size of the dataset, that is, for example the
number of triples or the number of distinct subjects
can be used for ranking. In voiD this is a void:statItem
property along with one of ve prede ned dimensions
such as void:numberOfTriples or
void:numberOfDocuments. We have argued in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] recently that the sheer
numbers of triples is likely not a good measure for its
value.
      </p>
      <p>
        Categorisation of datasets in voiD is done using
dcterms:subject along with DBpedia [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] resources. This
can be used in a rst step to massively decrease the
search space. It acts as a sort of lexicon allowing to
lookup a category and nd related datasets. As a
second step, DING can be used to rank the list of datasets
matching a certain category.
      </p>
      <p>The interlinking of a dataset in voiD, that is, its
outgoing and incoming links, is represented using the
void:linkPredicate property. We identify two
potential dimensions that might be exploited for ranking:
{ regarding the semantics of the links (such as
rdfs:seeAlso vs. foaf:knows) and
{ on a quantitative level, that is regarding the
number of interlinking triples.</p>
      <p>The kind of and number of used vocabularies in a
dataset can be seen from the void:vocabulary
property value.</p>
      <p>Other voiD characteristics such as
void:uriRegexPattern or the technical features of a dataset (such as
available serialisations) via void:TechnicalFeature)
can not directly be used for ranking, though perfectly
for ltering (as in case with categorisation).</p>
      <p>The following example in Fig. 1 may help highlight our
thinking:</p>
      <p>In this section, we present how we adapted the weighted
PageRank algorithm in order to perform the Dataset
ranking based on their interconnection. We then explain how it
is possible to assign automatically a weight to a certain link
type.</p>
      <p>PageRank is a ranking system that originates from Web
Search Engines using a random walk algorithm. The
Ranking system evaluates the probability of nding the web surfer
on any given page. This algorithm is based on the
assumption that when someone publishes a resource on the web, he
will do his best to link the published resource | be it a web
page, or in our case | a dataset | to the most relevant
and trustworthy resources availiable on the web. Hence the
relevancy is assumed to be related to a high degree of inlinks
from other web resources. And from a probabilistic point of
view | the more inlinks a dataset has, the most likely the
'random surfer' will be lead to it in his journey.</p>
      <p>The original PageRank r(Pi) of a web page i is given by
r(Pi) =</p>
      <p>X</p>
      <p>r(Pj)
Pj2BPi jPjj
(1)
10
1 : DS1 a void : Dataset ;
2 foaf : homepage &lt; http :// example . org / cats / &gt; ;
3 dcterms : subject</p>
      <p>&lt; http :// dbpedia . org / resource / Cats &gt; ;
void : subset : DS1toDS3 ;
void : subset : DS1toDS4 .
4
5
6
7 : DS2 a void : Dataset ;
8 foaf : homepage &lt; http :// petfood . example . org / &gt; ;
9 dcterms : subject</p>
      <p>&lt; http :// dbpedia . org / resource / Cats &gt; ;
dcterms : subject</p>
      <p>&lt; http :// dbpedia . org / resource / Pet_foods &gt; ;
void : subset : DS2toDS1 .
Where BPi is the set of pages linking to Pi and jPjj is the
total number of pages linked by Pj . Hence, jP1jj is in fact
the probability for the random surfer to choose to go from Pj
to Pi out of all pages linked by Pj. This probability referred
to as pj!i, can be modi ed in order to provide a weighting
of the "importance" of the hyperlink.</p>
      <p>The parallel from web documents to voiD descriptions
is done in a naive way. The web pages are now datasets,
and the hyperlinks correspond to linksets joining the dataset
they belong to another:</p>
      <p>Pi corresponds to an element Di de ne by void:dataset.
A hyperlink form in the page Pi pointing to the page Pj
will correspond to a void:linkset element connecting
Di and Dj, de ned as void:subset of a dataset Di.
The linkset will be referred to as Li!j. We also de ne
n(Li!j) as the number of relations in the linkset, and
s(Li!j) as the predicate declared in the linkset. For
example, in Fig. 1 n(L1!3) = 600 and s(L1!3) =
"foaf:interest". L is the set of linksets de ned in
the entire data collection.</p>
      <p>Similarly to the set BPi of pages linking to Pi, we
de ne O(i) = fjj9Li!j 2 Lg as the indices of datasets
linked from Di.
pi!j can be modi ed according to the information
available about the linkest Li!j, such as s(Li!j) or
n(Li!j), as well as general statistics over L.</p>
      <p>Like in the web page link analysis, the links between
datasets deserve a deeper analysis in order to obtain a ner
ranking. For example in Fig. 1 the probability of the user
going from DS1 to DS3 is likely to be di erent from the
(2)
(3)
(4)
(5)
probability of going to DS4 - since the predicate and
number of links associated to L1!3 are not the same as the ones
associated to L1!4.</p>
      <p>The goal will hence be to de ne a weight function w(Li!j).
The weight will then be normalized in order to generate the
transition probability pi!j as follows.</p>
      <p>pi!j = Pk2O(i) w(Li!k)</p>
      <p>w(Li!j)
The rst approach is simply to de ne w(Li!j) = n(Li!j) .
600
In the case of Fig. 1, p1!3 = 2000+600 ' 0:23 and p1!4 =
2000
2000+600 ' 0:77. However, this de nition does not take into
account the nature of the link, and the likelihood that the
user may well chose foaf:interest above dc:author to browse
into another dataset. As a result, additional weights can be
assigned based on the nature of the predicate involved in
the link.</p>
      <p>The values assigned can be either statically prede ned, or
computed dynamically, given the accumulated voiD
information. We present our approach, based on TF-IDF a well
known algorithm when it comes to weight the relevance of
a term(in our case - the predicate), given its frequency in a
data collection. Hence, the weight, given by TF-IDF would
be</p>
      <p>T F (Li!j) =</p>
      <p>n(Li!j)
maxk2O(i) n(Li!k)
IDF (s(Li!j)) = log</p>
      <p>N
1 + f req(s(Li!j))
Where f req(s(Li!j)) is the frequency of occurrence of linksets
using the predicate of Li!j in the collection's datasets.
Finally, we de ne w as
w(Li!j) = T F (Li!j)</p>
      <p>IDF (s(Li!j))</p>
    </sec>
    <sec id="sec-6">
      <title>5. EXPERIMENTS AND EARLY FINDINGS</title>
      <p>In order to verify our thesis that formal descriptions of
linked datasets help yielding better results for the ranking
of the datasets, we have set up an evaluation framework that
executes various ranking algorithms on a synthetic voiD
description2(see Fig. 2). It is composed of 15 arti cial dataset
descriptions interlinked using 8 di erent predicates and
partitioned into two clouds (datasets 1 to 9 and 10 to 15). The
experiment used several ranking algorithms to estimate the
generic relevancy of every arti cial dataset within the
synthetic cloud.
5.1</p>
    </sec>
    <sec id="sec-7">
      <title>The setup</title>
      <p>For the evaluation we use the Java Universal
Network/Graph Framework (JUNG)3 to compare the DING
algorithm with other established and well known ranking
algorithms. Further, we use a naive link-sum rank function
(DRank ) as a baseline to discuss the results. Three out
of the four ranking algorithms are also extended with the
DING link weight function. In detail we evaluate and
compare the following ranking algorithms:
2The full benchmark data is available at http://sw.deri.
org/2009/02/DING/example-void-collection.ttl.
Unfortunately no real-world voiD cloud was readily availiable
for the experiment at the time of writing.
3http://jung.sourceforge.net/
DRank: A baseline ranking algorithm using a naive
approach. The datasets are ranked according to the
number of links they have with other datasets.</p>
      <p>
        PageRank Google's page rank algorithm [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        DING PageRank modi cation of the PageRank
Algorithm as described in Sec 4.2
HITS Another well known ranking algorithm is HITS
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. For each data set in the voiD graph a
\hubs-andauthorities" importance measure is calculated.
5.2
      </p>
    </sec>
    <sec id="sec-8">
      <title>Results</title>
      <p>Table 1 lists the results of the evaluation. The naive
ranking approach, DRank, completely leaves out the rst cloud
for having much less links in its linksets than the second
cloud. We see that standard PageRank and HITS
algorithms do not take into account the nature of the links and
rank DS1 rst. Although DS1 is indeed heavily linked by
other datasets, it is mostly inlinked by "weak" links like
owl:sameAs or rdfs:seeAlso. The information-theory view
de ned in tfidf suggests that these links | being the most
common ones | do not hold as much information content
as less common ones, and are therefore less signi cant. For
example while looking for information about an article, the
user will get more precise information following
dcterms:author than a generic property such as rdfs:seeAlso, and
is hence more likely to follow the former. As a result a
dataset linked by uncommon links will likely be more
significant than one linked by common ones - and should have a
higher voiD ranking.</p>
      <p>Another advantage of PageRank that makes it very
relevant for the Linked Data approach is that it gives a low
ranking to datasets that do not have inlinks. The value of
a dataset within the cloud is dependent on how well it is
linked by other datasets.</p>
    </sec>
    <sec id="sec-9">
      <title>CONCLUSION</title>
      <p>We have presented DING, a new approach to rank linked
datasets based on voiD descriptions. Though one might
object that currently there are not many voiD descriptions
available4 we argued that this is very likely to change soon.
Further, the infrastructure to collect voiD descriptions is in
place (voiD being RDF, the requirements to do so are
minimal).</p>
      <p>We have motivated the need for a e cient and e ective
way to rank datasets based on their characteristics
(contentwise and with respect to the interlinking). Finally we have
shown how DING performs in relation to existing ranking
algorithms and discussed the results.
4Indeed one nds voiD descriptions at time of writing,
already; see for example http://void.rkbexplorer.com/.</p>
    </sec>
    <sec id="sec-10">
      <title>Acknowledgements</title>
      <p>Our work has partly been supported by the European
Commission under Grant No. 217031, FP7/ICT-2007.1.2, project
\Domain Driven Design and Mashup Oriented Development
based on Open Source Java Metaframework for Pragmatic,
Reliable and Secure Web Development" (Romulus)5, by the
European FP7 project Okkam - Enabling a Web of
Entities (contract no. ICT-215032), and by Science Foundation
Ireland under Grant No. SFI/02/CE1/I131.
7.
5http://www.ict-romulus.eu/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.C.</given-names>
            <surname>Marshall</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.M.</given-names>
            <surname>Shipman</surname>
          </string-name>
          .
          <source>Which Semantic Web? In HYPERTEXT '03: Proceedings of the fourteenth ACM conference on Hypertext and hypermedia</source>
          , pages
          <volume>57</volume>
          {
          <fpage>66</fpage>
          , New York, NY, USA,
          <year>2003</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          .
          <article-title>Linked Data</article-title>
          . http://www.w3.org/DesignIssues/LinkedData.html,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dietzold</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hellmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Aumueller</surname>
          </string-name>
          .
          <article-title>Triplify - Lightweight Linked Data Publication from Relational Databases</article-title>
          .
          <source>In International World Wide Web Conference (WWW 09)</source>
          , Madrid, Spain, page upcoming,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Lawrence</given-names>
            <surname>Page</surname>
          </string-name>
          , Sergey Brin, Rajeev Motwani, and
          <string-name>
            <given-names>Terry</given-names>
            <surname>Winograd</surname>
          </string-name>
          .
          <article-title>The pagerank citation ranking: Bringing order to the web</article-title>
          .
          <source>Technical Report 1999-66</source>
          ,
          <year>November 1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Jon</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Kleinberg</surname>
          </string-name>
          .
          <article-title>Authoritative sources in a hyperlinked environment</article-title>
          .
          <source>J. ACM</source>
          ,
          <volume>46</volume>
          (
          <issue>5</issue>
          ):
          <volume>604</volume>
          {
          <fpage>632</fpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Sergey</given-names>
            <surname>Brin</surname>
          </string-name>
          and
          <string-name>
            <given-names>Lawrence</given-names>
            <surname>Page</surname>
          </string-name>
          .
          <article-title>The anatomy of a large-scale hypertextual web search engine</article-title>
          .
          <source>Comput. Netw. ISDN Syst</source>
          .,
          <volume>30</volume>
          (
          <issue>1-7</issue>
          ):
          <volume>107</volume>
          {
          <fpage>117</fpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Soumen</given-names>
            <surname>Chakrabarti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Byron E.</given-names>
            <surname>Dom</surname>
          </string-name>
          , David Gibson,
          <string-name>
            <given-names>Jon</given-names>
            <surname>Kleinberg</surname>
          </string-name>
          , Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, and
          <string-name>
            <given-names>Andrew</given-names>
            <surname>Tomkins</surname>
          </string-name>
          .
          <article-title>Mining the web's link structure</article-title>
          .
          <source>Computer</source>
          ,
          <volume>32</volume>
          (
          <issue>8</issue>
          ):
          <volume>60</volume>
          {
          <fpage>67</fpage>
          ,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>Wenpu</given-names>
            <surname>Xing</surname>
          </string-name>
          and
          <string-name>
            <given-names>Ali</given-names>
            <surname>Ghorbani</surname>
          </string-name>
          .
          <article-title>Weighted pagerank algorithm</article-title>
          .
          <source>In CNSR '04: Proceedings of the Second Annual Conference on Communication Networks and Services Research</source>
          , volume
          <volume>0</volume>
          , pages
          <fpage>305</fpage>
          {
          <fpage>314</fpage>
          , Washington, DC, USA,
          <year>2004</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Ricardo</given-names>
            <surname>Baeza-Yates</surname>
          </string-name>
          and
          <string-name>
            <given-names>Emilio</given-names>
            <surname>Davis</surname>
          </string-name>
          .
          <article-title>Web page ranking using link attributes</article-title>
          .
          <source>In WWW Alt. '04: Proceedings of the 13th international World Wide Web conference on Alternate track papers &amp; posters</source>
          , pages
          <volume>328</volume>
          {
          <fpage>329</fpage>
          , New York, NY, USA,
          <year>2004</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Zaiqing</surname>
            <given-names>Nie</given-names>
          </string-name>
          , Yuanzhi Zhang,
          <string-name>
            <surname>Ji-Rong Wen</surname>
          </string-name>
          , and
          <string-name>
            <surname>Wei-Ying Ma</surname>
          </string-name>
          .
          <article-title>Object-level ranking: bringing order to Web objects</article-title>
          .
          <source>In Proceedings of the 14th international conference on World Wide Web - WWW 05 WWW 05, page 567. ACM</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Andrey</surname>
            <given-names>Balmin</given-names>
          </string-name>
          , Vagelis Hristidis, and
          <string-name>
            <given-names>Yannis</given-names>
            <surname>Papakonstantinou</surname>
          </string-name>
          .
          <article-title>Objectrank: authority-based keyword search in databases</article-title>
          .
          <source>In VLDB '04: Proceedings of the Thirtieth international conference on Very large data bases</source>
          , pages
          <volume>564</volume>
          {
          <fpage>575</fpage>
          . VLDB Endowment,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Li</surname>
            <given-names>Ding</given-names>
          </string-name>
          , Rong Pan, Timothy W. Finin, Anupam Joshi, Yun Peng, and
          <string-name>
            <given-names>Pranam</given-names>
            <surname>Kolari</surname>
          </string-name>
          .
          <article-title>Finding and ranking knowledge on the semantic web</article-title>
          .
          <source>In International Semantic Web Conference</source>
          , pages
          <volume>156</volume>
          {
          <fpage>170</fpage>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Aidan</surname>
            <given-names>Hogan</given-names>
          </string-name>
          , Andreas Harth, and
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Decker</surname>
          </string-name>
          .
          <article-title>Reconrank: A scalable ranking method for semantic web data with context</article-title>
          .
          <source>In Proceedings of Second International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS</source>
          <year>2006</year>
          ), Athens, GA, USA.,
          <year>11 2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>K.</given-names>
            <surname>Alexander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhao</surname>
          </string-name>
          .
          <article-title>Describing Linked Datasets - On the Design and Usage of voiD, the 'Vocabulary of Interlinked Datasets'</article-title>
          .
          <source>In WWW 2009 Workshop: Linked Data on the Web (LDOW2009)</source>
          , Madrid, Spain,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>G.</given-names>
            <surname>Tummarello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Delbru</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Oren. Sindice.</surname>
          </string-name>
          <article-title>com: Weaving the Open Linked Data</article-title>
          .
          <source>In The Semantic Web, 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC 2007 + ASWC</source>
          <year>2007</year>
          , pages
          <fpage>552</fpage>
          {
          <fpage>565</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mika</surname>
          </string-name>
          .
          <article-title>Microsearch: An Interface for Semantic Search</article-title>
          . In Semantic Search,
          <source>International Workshop located at the 5th European Semamntic Web Conference (ESWC</source>
          <year>2008</year>
          ), volume
          <volume>334</volume>
          <source>of CEUR Workshop Proceedings</source>
          , pages
          <volume>79</volume>
          {
          <fpage>88</fpage>
          . CEUR-WS.org,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>K.</given-names>
            <surname>Alexander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Zhao.</surname>
          </string-name>
          <article-title>voiD guide|Using the Vocabulary of Interlinked Datasets</article-title>
          . Community Draft, voiD working group,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>Michael</given-names>
            <surname>Hausenblas</surname>
          </string-name>
          , Wolfgang Halb, Yves Raimond, and Tom Heath.
          <article-title>What is the Size of the Semantic Web</article-title>
          .
          <source>In Proceedings of I-Semantics</source>
          <year>2008</year>
          , Graz, Austria,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Auer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bizer</surname>
          </string-name>
          , G. Kobilarov,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lehmann</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cyganiak</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z. G.</given-names>
            <surname>Ives. DBpedia</surname>
          </string-name>
          :
          <article-title>A Nucleus for a Web of Open Data</article-title>
          .
          <source>In The Semantic Web, 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC 2007 + ASWC</source>
          <year>2007</year>
          , pages
          <fpage>722</fpage>
          {
          <fpage>735</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>