Investigations into Trust for Collaborative Information
              Repositories: A Wikipedia Case Study
                      Deborah L. McGuinness1 , Honglei Zeng1 , Paulo Pinheiro da Silva2 ,
                            Li Ding3 , Dhyanesh Narayanan1 , Mayukh Bhaowal1
                                1
                                    Knowledge Systems, AI Lab, Department of Computer Science
                                                 Stanford University, California, USA
                                       {dlm, hlzeng, dhyanesh, mayukhb}@ksl.stanford.edu
                                                  2
                                                    Department of Computer Science
                                             University of Texas at El Paso, Texas, USA
                                                           paulo@utep.edu
                                                  3
                                                    Department of Computer Science
                                      University of Maryland, Baltimore County, Maryland, USA
                                                          dingli1@umbc.edu
ABSTRACT                                                                  1.      INTRODUCTION
As collaborative repositories grow in popularity and use, issues             One emerging pattern for building large information repositories
concerning the quality and trustworthiness of information grow.           is to encourage many people to collaborate in a distributed manner
Some current popular repositories contain contributions from a wide       to create and maintain a repository of shared content. The notion
variety of users, many of which will be unknown to a potential end        of open editing has grown in popularity along with the notion of a
user. Additionally the content may change rapidly and informa-            Wiki, which in its simplest form allows users to freely create and
tion that was previously contributed by a known user may be up-           edit web pages1 . Wikipedia [1] is one popular Wiki that is a freely
dated by an unknown user. End users are now faced with more               available online encyclopedia. Its size and diversity is one aspect
challenges as they evaluate how much they may want to rely on in-         of it that makes it an interesting motivating use case for our work.
formation that was generated and updated in this manner. A trust          It has more than 900,000 registered authors2 and three million ar-
management layer has become an important requirement for the              ticles. It has become perceived as a valuable resource and many
continued growth and acceptance of collaboratively developed and          people cite it as a credible information source. While recent studies
maintained information resources. In this paper, we will describe         (e.g. [2]) show that the science articles in Wikipedia are generally
our initial investigations into designing and implementing an ex-         trustworthy, there have been some reports of claimed inaccuracies
tensible trust management layer for collaborative and/or aggregated       appearing in Wikipedia. For example, there was a widely reported
repositories of information. We leverage our work on the Inference        situation where a journalist and a former official in the Kennedy
Web explanation infrastructure and exploit and expand the Proof           administration, stated that Wikipedia contained an inaccurate bi-
Markup Language to handle a simple notion of trust. Our work is           ography article about him in 2005 [3]. The media coverage led
designed to support representation, computation, and visualization        to discussions about trustworthiness of content sources that have
of trust information. We have grounded our work in the setting of         fairly liberal editing policies and also led to changes in Wikipedia’s
Wikipedia. In this paper, we present our vision, expose motiva-           editing policy of anonymous authors.
tions, relate work to date on trust representation, and present a trust      One of the strengths of a collaborative information repository is
computation algorithm with experimental results. We also discuss          that it may benefit from contributions of a wide diversity of users.
some issues encountered in our work that we found interesting.            Of course some of these users will have expertise levels that are
                                                                          untested and unknown to some end users. Additionally content
                                                                          in these repositories may change rapidly. Thus, trust management
Categories and Subject Descriptors                                        has become a critical component of such a system design. With-
H.3.3 [Information Storage and Retrieval]: Information Search             out some form of trust management, these kinds of collaborative
and Retrieval; H.3.5 [Online Information Services]: [Data Shar-           information repositories will have difficulty defending any particu-
ing, Web-based services]; I.2.4 [Artificial Intelligence]: Knowl-         lar level of authoritativeness and correctness. Additionally, without
edge Representation Formalisms and Methods                                some notion of accountability in addition to the trust, these systems
                                                                          will only be able to provide end users with information but not with
                                                                          information about where the information came from and how trust-
General Terms                                                             worthy that source might be. The popular large implementations
Design, Languages, Management                                             such as Wikipedia are currently addressing some of these issues,
                                                                          although currently not to the level that they will need to in the long
                                                                          run if they are to achieve their true potential.
Keywords                                                                     Our work focuses on designing and building an extensible trust
Trust, Wikipedia, Inference Web, Proof Markup Language, Open              framework. We are investigating representation needs for the en-
Editing                                                                   coding of trust, methods for computing trust, and visualization of
Copyright is held by the author/owner(s).                                 1
WWW2006, May 22–26, 2006, Edinburgh, UK.                                      http://wiki.org/wiki.cgi?WhatIsWiki
                                                                          2
.                                                                             http://en.wikipedia.org/wiki/Special:Statistics
information that is informed by trust encodings. In our previous          mantic Web applications to generate portable proofs that contain
work on Inference Web, we have been designing and implementing            information required to explain answers. One of challenge for users
an infrastructure for explaining answers from intelligent applica-        of any explanation system is evaluating trustworthiness of answers.
tions. One information source for these applications may be a col-        Presentations of knowledge provenance, sources used and informa-
laboratively generated information repository such as Wikipedia.          tion manipulation steps performed to produce an answer help. It is
Our work on explaining answers focused us on where information            also important to know how trustworthy any particular piece of in-
came from and how it was manipulated to generate an answer. This          formation is, how trusted the author is etc. We thus have been mo-
work has also led us to investigate forms of trust encodings for in-      tivated to add a trust representation extension to the Proof Markup
formation.                                                                Language. We will report here on our extension and describe how
   As we began to look more closely at aggregated information             we are and plan to use it in our case study using Wikipedia.
sources and collaborative, evolving information sources such as              We view Wikipedia as an example of a collaborative, evolving
Wikipedia, we have found even more requirements for trust for-            information repository that has variety in quality and coverage of
mulation. It is worth noting that an open (or mostly unrestricted)        its subject matter. We were inspired to look at Wikipedia as a case
editing environment is quite different from some other social net-        study for our trust extension work for the following reasons: (i) it
works (e.g., eBay and Epinions) that have addressed trust. These          is a large and growing collaborative repository yet is contained. It
social networks may be viewed as focusing on interactions between         can be viewed as large enough to provide challenges of scale and
users while generating growing content but not typically generating       trust. (ii) it stores much rich provenance information in comparison
changing content. For example, a transaction on eBay or a review          to typical collaborative information repository. (iii) it is in need of
on Epinions is typically created once and then remains unchanged.         a trust solution.
On the other hand, the content of collaborative information repos-           Additionally, we believe that trust relationships can be computed
itories like Wikis may be quite dynamic as it may be continually          from information contained and maintained by Wikipedia. Further,
reviewed, shared, and updated by many different users. Trust for-         we believe that a solution infrastructure appropriate for Wikipedia
mulation and requirements for rapidly changing repositories thus          may be widely reusable in other online system settings.
may be quite different from (mostly) monotonically growing repos-            The rest of our paper is structured as follows. In section 2, we
itories even though both may be perceived as trust problems.              provide a vision of how we will use trust values once available
   Some social networks that have trust approaches that rely on ex-       to present trust information to users. We do this by describing
plicit assertion of trust in a user resulting from feedback from trans-   a customizable trust view of information. In section 3, we show
actions or ratings. Trust in Wikipedia has not been addressed ex-         a citation-based approach, the link-ratio algorithm for computing
plicitly in this manner. We began exploring the view that trust may       trust. In section 4, we present some experimental results using the
be viewed as an implicit feature of the environment and we began          link-ratio algorithm in Wikipedia. In section 5, we discuss the im-
looking for ways to make trust levels explicit and inspectable.           plications of citation trust in Wikipedia and related work. We con-
   Significant research has been done on trust in various contexts        clude our paper with a discussion of related work and future work.
(e.g., [4],[5]); however, most of the work assumes homogeneous               Contributions presented in this paper to trust formulation in open
context. Encryption and authentication (e.g., [6]) help secure trust-     collaborative, evolving settings include: an extension to the Proof
worthiness in terms of the integrity and authenticity of information      Markup Language that creates a proof interlingua capable of en-
through pre-defined representation and functions. Distributed trust       coding trust, a citation based trust algorithm (Link-ratio trust) de-
management (e.g., [7]) offers a flexible policy framework for judg-       signed to demonstrate our computational component and explore
ing if a person is trustworthy enough to perform an action through        some characteristics of trust in Wikipedia; and a customizable visu-
a common policy ontology and corresponding policy inference en-           alization component for presenting Wikipedia content in a manner
gine. Reputation systems (e.g., [8], [9]), and trust networks (based      that has been informed by trust information.
on social networks or P2P network) (e.g., [10],[11]) help compute
trustworthiness of a person or an entity; again, using a pre-defined      2.    TRUST TAB
trust ontology and a common computation method.
                                                                             In order to extend Wikipedia with a trust management compo-
   The Web offers easy access to information from various sources
                                                                          nent, we propose a new “trust” tab associated with each Wikipedia
and computational services at different locations. Thus, distributed
                                                                          article. This trust tab will appear in addition to the conventional
web environments provide diverse and heterogeneous settings for
                                                                          tabs of Wikipedia, i.e., “article”, “edit”, “history” and “discussion”.
trust researchers. For repositories of information like Wikipedia,
                                                                          The motivation is to render Wiki articles in ways that users can vi-
trustworthiness information concerning an article or an author could
                                                                          sually compare and identify text fragments of an article that are
be computed and published by many sources with varying degrees
                                                                          more (or less) credible than other fragments. The trust tab is sup-
of reliability. When an end user is evaluating how to use (portions
                                                                          posed to be a primary tool for helping users to decide how much
of) a Wikipedia article, it may be useful to view an aggregation
                                                                          they should trust a particular article fragment. The rendering of
of the trust information available concerning the article. The end
                                                                          each text fragment is to be based on degrees of trust. These degrees
user may thus want to effectively combine trust information from
                                                                          of trust may be between individual authors or they may be aggre-
multiple sources using different representation schemes potentially
                                                                          gated and thus may be viewed as a community trust level associated
using personalized trust computation methods. Unfortunately, re-
                                                                          with an author of each fragment of the document.
search focused on enabling this scenario is sparse. Our investiga-
                                                                             Our present endeavor is to calculate and display trust information
tions have been driven by our desire to work on distributed, hetero-
                                                                          based on information already available in the Wikipedia and with-
geneous, collaborative environments such as the web in general and
                                                                          out the use of any external information sources, e.g., Wikipedia
collaborative, evolving information repositories in particular. Our
                                                                          users. In the future, we will extend this approach to include feed-
goal is to provide an open, interoperable, and extensible framework
                                                                          back from external sources so as to inform the trust calculations
that can provide a solution framework to the problems of trust we
                                                                          with a wider set of input.
mentioned above.
                                                                             The trust tab is an addition to the conventional article tab in the
   In the way of background, Inference Web (IW) [12] enables Se-
                                                                          sense that, when compared to the article tab, it adds a colored back-
                                                 Figure 1: A Trust Tab Example in Wikipedia.
ground to text fragments in the article as shown in Figure 1. The          mented into a sequence of text fragments where each fragment has
new background color conforms to a color scheme which makes                a single author. We assume that several fragments in the article can
the presentation and its inherent meaning in terms of trust obvious        have a single author. In order to compute a trust level for each frag-
and comprehensive.                                                         ment, the trust tab needs: (i) to identify each individual fragment
   According to the color code legend in the Figure 1, the degrees of      in the article; (ii) to identify the author (and time stamp) of each
aggregated trust of the fragments in the Rhinoplasty article range         fragment; and (iii) to compute a degree of trust for each author.
from 0.2 to 0.8 in a scale [0,1] where 0.0 is the total absence of            The Wikipedia database schema does not store individual frag-
trust and 1.0 is the total presence of trust. The exact meaning of         ments although it archives complete revisions of articles. Thus, one
this scale of trust is irrelevant for the trust tab that aims to provide   approach to fragment identification is to compare successive article
a visual mechanism to compare the parts of the page that are more          revisions, e.g., using diff, and identify changes. Note, the granular-
or less credible. The relative differential between the trust values       ity of the difference measure used is something we are exploring.
is information that is useful to the end user. For instance, the trust     By performing successive comparisons, the trust tab retrieves the
tab says that the last fragment composed of the two last paragraphs        individual fragments of an article as required in (i). Simultane-
of the page has a higher degree of trust than any other fragment in        ously, it identifies the time stamps and authors for the fragments
the page. Moreover, the second paragraph has the lowest degree of          as required in (ii). Trust computation associated with authors is
trust although the fragment “the surgery (...) in 1898 to help those”      discussed below in Section 3.
inside the paragraph have been added by a more credible author. 3
   The implementation of the trust tab has raised several issues re-
lated to Wikipedia. In the rest of this section, we briefly describe       2.2    Provenance Annotation
an approach to implement the trust tab. We will also present some             Even though manual monitoring on Wikipedia has been enhanced
experimental results of our effort to compute aggregated degrees of        recently, there may always be some users who will want informa-
trust for the authors of article fragments as required for rendering       tion about degrees of trust in particular authors. Additionally, some
useful trust tabs when no personalized trust relations are used.           malicious authors or programs may attempt to insert inappropriate
                                                                           or unwanted content in collaborative open systems like Wikipedia.
2.1     Fragment Identification                                            As these systems grow, any level of manual monitoring will not be
  The trust tab relies on the fact that Wikipedia articles can be seg-     adequate since it will not be able to scale with the content size. Au-
                                                                           tomatic methods are required to augment administrator’s abilities to
3
  The actual trust values used to render this page are just for expos-     monitor updates and to help manage their workloads. Automated
itory purposes and are not intended to reflect that actual trust levels    tools built upon the trust values may substantially improve the trust-
for this page; the figure is manually generated for demonstration          worthiness of Wikipedia: for example, as mentioned above, a trust
purposes.                                                                  tab implementation may provide users with trust information about
the articles they are viewing and help them to decide how much          thresholding, etc.
they should trust the articles.
   Our trust tab approach depends on a mechanism for storing trust      2.3    Provenance Visualization
relations between authors as well as aggregated degrees of trust           The trust tab applies conventional rendering techniques used by
inferred from the Wikipedia content. This new stored content how-       the article tab for rendering so that the typical style of articles is
ever, may not be enough to capture some important trust aspects         preserved in the trust tab. In addition to the use of these techniques,
of the system since Wikipedia is managed in a centralized manner.       the trust tab also compares the content of the article with the PML
For instance, we still need to face two important issues in repre-      encoding of the article. The trust tab views the PML encoding to
senting and obtaining knowledge provenance: (iv) how to capture         be metadata for the page in the article tab. By comparing the page
provenance information not originally written by a user, e.g. a user    content with its PML encoding, the trust tab identifies fragments
may copy and paste some content from the Web to an Wiki article;        and the fragment authors. It also retrieves a pre-computed aggre-
and (v) how to make trust computation components independent of         gated degree of trust for each author as stored in the newly created
data storage.                                                           storage for trust in the Wikipedia database. From these degrees of
   For (iv), we need a more comprehensive vocabulary for anno-          trust and a color schema, the trust tab eventually identifies and sets
tating the provenance information. We are using the provenance          the appropriate background color for each fragment.
part of Proof Markup Language (PML) [13] to fulfill this job. Be-
side person, PML also identifies many other types of information
sources including website, organization, team, publication, and on-
                                                                        3.    CITATION-BASED TRUST
tology. Upon updating a Wikipedia article, the editor may pro-
vide additional justification for his/her modifications. For example,   3.1    Trust issues in Wikipedia
when an editor adds one definition to an article, he/she may also          In our work, we begin by considering how citation-based mea-
specify that the definition is obtained from an online article and      sures may be used to determine trust values. In some settings, an
even specify the location of the related span of text.                  end user may be more inclined to rely on the content in a news story
   For (v), we need explicit representation of provenance informa-      from a reputable newspaper, such as the New York Times, over the
tion. This is especially helpful when integrating multiple knowl-       content that is published on a personal Blog, especially if the end
edge repositories which are managed independently. Our solution         user has no knowledge of the Blog or its author.
is to use the RDF/XML serialization of PML. To implement this              One way of computing trust of an author is to take an aggregated
idea, our design adds another “provenance” tab and exposes PML          value from trust rankings of all of the articles written by the author.
provenance information in RDF/XML format to agents (or web ser-         In order to share and visualize such trust information, we formalize
vices) which are capable of computing trust using provenance in-        trust as a numerical value between 0 and 1 and we view it as a mea-
formation.                                                              sure of trustworthiness. In our setting, a value of 1 represents com-
                                                                        plete trust and value 0 represents unknown trustworthiness. Note,
 
   "Article Fragment"
                                                                        as complete distrust. Although we have chosen a rather simplistic
   en                                  trust model in this work, we are also evaluating other, more sophis-
                                                     ticated trust models that we may use to enhance our current model.
                                                         In this work, citation-based algorithms are a family of algorithms
                         that derive trust based on citation relationships among entities. We
                                                     refer to such derived trust as citation-based trust, or simply citation
                                                        trust. We ground our work in Wikipedia and use it as a sandbox for
           Harry
           20051109                 evaluating citation trust.
           2425693                       One distinguishing characteristic of Wikipedia articles in com-
                                                       parison to general web documents is that Wikipedia articles are
       
                                                     meant to be encyclopedia entries. We will refer to the title of a
                                                    Wikipedia article (e.g. “Gauss’s law”), as an encyclopedia index
                                         term. We note that encyclopedia index terms may occur, with or
   
                                                                        without citation, in other articles in Wikipedia. Since Wikipedia is
   Harry                       an encyclopedia, one might expect that occurrences of encyclope-
   0.434                           dia index terms in other articles would refer back to the encyclo-
 
                                                                        pedia index term article, and in fact if a term appears but does so
                                                                        without citation, it might be viewed as a negative indicator of the
                                                                        quality of the index term entry. We will explore this notion and
             Figure 2: PML provenance annotation
                                                                        compute the number of non-citation occurrences of encyclopedia
                                                                        index terms. Two other useful measures of note in collaborative
   The next step is to encode the trust information in PML. Figure      content settings are the number of citations a term (or article) re-
2 shows an example of such an encoding. In this example, Harry          ceives and the citation trust of articles in which it is cited.
is the author of a fragment in the Stanford page and the Wikipedia         Consider the scenario where an article (i.e. its encyclopedia
community has an aggregated degree of trust of 0.434 in Harry.          term) has many non-citation occurrences but few actual citations.
The use of a float for hasTrustValue is a simplification of the PML     One interpretation of this scenario is that the article may not be per-
capabilities for representing trust values. More sophisticated, re-     ceived to be worthy of a high trust value since few authors choose
alistic approaches are discussed in [14]. PML encodings can then        to cite the article when they mention the term 4 . In contrast, non-
be used by automated programs for other presentations of trust in-
formation, or for use in more complex reasoning and question an-        4
                                                                          We will come back to this point in the discussion since another
swering applications that may want to use trust input for filtering,    interpretation of a non-citation is simply ignorance of the article.
citation occurrences of a word or phrase on a typical web page may             Our goal is to infer trustworthiness of authors, fragments and
not mean anything about any associated trust levels since typical           articles based on the above link structures. We also assume most
web page authors do not necessarily link every phrase that one              Wikipedia authors have the genuine intention of providing accurate
would typically find in an encyclopedia to a web page describing            content.
the phrase.                                                                    In the following sections, we will show two citation-based trust
   In our work, we have begun explorations into citation ratios as a        algorithms, the Link-ratio algorithm and the PageRank algorithm.
potential input to trust algorithms. In this paper, we will report on       We will explain the link-ratio algorithm in detail but only briefly
our investigations concerning link ratios. We define the Link-ratio         mention the well-known PageRank algorithm.
of an article (i.e., the page with title x) as the ratio between the
number of citations and the number of non-citation occurrences of           3.3     Link-ratio Algorithm
the encyclopedia term x.                                                        We first compute article-level trust in Wikipedia based on its
   We provide the following motivation for exploring Link-ratio:            rich citation structure. Assume d is an article, then [[d]] refers to
                                                                            the hyperlink citation to this article d. For example, the article
    • Link-ratio is a trust measure unique to collaborative reposito-       Grape refers to the article Wine by stating that “... used for making
      ries of encyclopedic content. The fact that it is a ratio rather      [[wine]]”. When an article is linked to from another one, a certain
      than a raw count of non-citation occurrences helps to min-            trust is implied5 . In this example, the author of Grape expresses
      imize the impact of the difference between the numbers of             his trust towards the article Wine by creating a citation to it. He be-
      occurrences of common vs. uncommon terms.                             lieves that a user may benefit from further information on the wine
                                                                            topic by accessing the information contained in the article Wine.
    • Link-ratio is in the same family as the well respected PageR-             In the link-ratio algorithm, we are interested in non-citation oc-
      ank [15], citation-based algorithm, which has been success-           currences of an encyclopedia term. Thus, the algorithm looks for
      fully used in many web settings. PageRank has also been               articles that contain a term d but do not link to article d. For exam-
      studied in the context of Wikipedia. We will cite and dis-            ple, in the article Beer, it is said that “Unfiltered beers may be stored
      cuss the results of this related research from other researchers      much like wine for further conditioning ...” Both Grape and Beer
      ([16]).                                                               mention the term “wine”, but only Grape links to the article Wine.
                                                                            There may be many reasonable explanations for the omission of
    • Unlike other social networks such as eBay, Wikipedia has no
                                                                            the wine citation in Beer: Beer may have been created before Wine
      explicit trust assertions among authors and articles. Trust al-
                                                                            was created; the author of Beer may be unaware that Wine exists;
      gorithms based on the transitivity property of trust cannot be
                                                                            the Beer author may be in a hurry and may be limiting citations; the
      directly applied without an initial set of trust values. Obtain-
                                                                            Beer author may not believe that the readers of this page need extra
      ing trust values manually for a content repository the size of
                                                                            information on wine; or the author believes Wine is untrustworthy.
      Wikipedia is a large task. The Link-ratio approach may be
                                                                            Without further information, we are not able to determine the exact
      used as one way to obtain initial trust values.
                                                                            cause of a missing citation; therefore, we assume missing citations
3.2     A Simple Wikipedia Model                                            decreases the trustworthiness of an article that was not cited. Si-
                                                                            multaneously, if one is keeping measures of how ”known” a page
   Wikipedia may be (partially) characterized by the abstract model         it, the missing citation decreases this measure.
in Figure 3. Intuitively, Wikipedia consists of a set of articles (i.e.         We define Trust doc(d) to be the trust value of an article d.
articles d1 , d2 , ..., dm in Figure 3). Each article (di ) consists of a   Based on the citation trust we discussed above, the more frequent
set of article fragments (fi1 , fi2 , ..., fini ), each of which is writ-   [[d]] occurs, the higher Trust doc(d) is; the more non-citation oc-
ten by an author (aj ). An author may write more than one frag-             currences of d are, the lower the trust value is.
ment in the same article. In addition, a fragment could link to
other articles as citations. There are three types of links in Fig-
ure 3: author-fragment authorship links (solid lines from ai to fjk ),                                    occurrences([[d]])
                                                                                Trust doc(d) =                                                   (1)
fragment-article citation links (dotted lines from fij to dk ), and                               occurrences([[d]]) + occurrences(d)
article-fragment membership links.                                             Occurrences([[d]]) denotes the number of citations to an article
                                                                            d and occurrences(d) is the non-citation occurrences of term d. The
                                                                            citation trust is thereby defined to be the ratio between the occur-
                                                                            rences of the citations to article d and the total occurrences of term
                                                                            d as a citation and a non-citation.
                                                                               Wikipedia articles are often under constant revision. We refer
                                                                            to the change that an author commits in one edit session as atomic
                                                                            change. The latest version of an article can be simply viewed as
                                                                            the original article followed by a sequence of atomic changes. We
                                                                            define Documents(a) as the set of articles that author a has ever
                                                                            created and changed. We can calculate the aggregated trust value
                                                                            of an author a, Trust author(a), based on the trustworthiness of
                                                                            Documents(a). Intuitively, the trust value of an author is an ag-
                                                                            gregated value of the trust values of all the articles he has con-
                                                                            tributed to. In Equation (2), we adopt the simple arithmetic mean,
                                                                            but other weighting functions are possible (e.g. weighted mean).
                                                                            5
                                                                             This assumes that the link from the original text does not contain
                                                                            negative anchor text or description such as “examples of bad pages
           Figure 3: An Abstract Model of Wikipedia.                        include [[d]]”.
| Documents(a) | is the size of Documents(a), i.e., the number of                 a. Consider the presence of Wikipedia (as a collection of web
articles that author a has contributed to.                                     pages) on the Web. This approach would take account into consid-
                                                                               erations the links between Wikipedia articles as well as the links
                               P                                               from external websites to Wikipedia articles.
                                  d∈Documents(a) Trust doc(d)
        Trust author(a) =                                                (2)      b. Consider Wikipedia as a set of interlinked articles in isolation
                                     | Documents(a) |                          and calculate the PageRank. This approach would account only for
   One of our primary goals is to help users understand how much               links that exist within Wikipedia. One could view it as an “internal
they should rely on information in articles. Since articles are com-           PageRank” that is exclusive to the articles and associated citation
posed of fragments, this also means that we want to help users                 structure in Wikipedia.
compare trustworthiness of article fragments in the same article,                 We are more interested in the second approach, because we in-
each of which may be written by different authors. Since we have               tend to study the relative trustworthiness of articles within the Wikipedia
established author trust in Equation (2), we use a simple notion               collection. Consequently, allowing PageRank from external links
that assumes fragment trust is the same as the trust value of its au-          to flow into this computation might not yield the desired results.
thor. If f is a fragment of an article and Author(f ) denotes the              Note that accounting for links from external pages would definitely
author of this fragment, then we can define the trust of this frag-            help to account for added value to a Wikipedia article from the per-
ment T rust f rag(f ) as follows.                                              spective of the entire Internet.
                                                                                  PageRank has been computed and studied in Wikipedia [16]. In
                                                                               section 5, we will cite and discuss the results, putting it in the con-
           Trust frag(f ) = Trust author(Author(f ))                     (3)   text of citation trust and relating it to the Link-ratio algorithm and
   The notion of fragment trust being identical to author trust is             the general citation-based approach.
a bit too simplisitic. Fragment trust may also depend on context.
For example, Equation (3) would produce the same results for two               4.    EXPERIMENTS
article fragments from the same author, despite the possibility the               The main data set used in our experiments was the dump of the
author of is an expert on the topic of one fragment and is not an              Wikipedia database taken in December, 2005. We computed the
expert on the topic of another fragment.                                       trustworthiness of Wikipedia articles using the link-ratio algorithm
   Fortunately, Wikipedia classifies articles into different categories;       in Equation (1). In order to determine the citation trust of a given
for example, the Mathematics category is meant to hold articles                article, all the other articles in Wikipedia were parsed searching for
about mathematics. If we define c1 , c2 , ..., ct to be the categories         the reference of the article under consideration, whether it was a
in Wikipedia, such that each of ci is a collection of articles relating        plain occurrence or a linked reference.
to the same topic, we can rewrite Equation (2) and Equation (3) to                The first experiment was to compute the link-ratio values of fea-
be topic-dependent.                                                            tured articles, normal articles, and clean-up articles in Wikipedia.
                                                                               Featured articles are expected to be the best articles in Wikipedia;
                             P
                                                 V                             they were reviewed for accuracy, completeness, and style by ex-
                                d∈Documents(a)       d∈ci Trust doc(d)
  Trust author(a, ci ) =                                                       perts in the same fields. On the contrary, clean-up articles are those
                                     | Documents(a, ci ) |
                                                                   (4)         articles below the quality standard of Wikipedia and are viewed by
   The trust of an author a on topic ci (T rust author(a, ci )) is the         editors as being in need of major revisions. Clean-up articles are
ratio between the average trust values of his contributed articles on          typically manually marked by Wikipedia administrators or other
topic ci .                                                                     authors. Normal articles are articles that are neither featured ar-
                                                                               ticles nor clean-up articles. Intuitively, featured articles are most
                                                                               trustworthy, clean-up articles are least trustworthy, and normal ar-
          Trust frag(s) = Trust author(Author(s, ci ))                   (5)   ticles are somewhere in between.
                                                                                  We randomly chose 50 featured articles, 50 normal articles and
   The trust of a fragment is now modified to be the trust of its
                                                                               50 clean-up articles from the Geography category. Table 1 shows
author on the topic ci , which the article of the fragment belongs
                                                                               the average link-ratio values of each type of articles.
to. Topic-specific trust may be viewed as a coarse approximation
to context-based trust.
                                                                               Table 1: Average link-ratio values of 50 articles in the Geogra-
3.4     PageRank                                                               phy category
   We briefly mention the well known PageRank algorithm in this                         Type of the articles Average Link-ratio value
section as another example of citation-based approaches. PageR-                         Featured articles    0.34
ank is an algorithm for ranking web pages used by Google and                            Normal articles      0.26
other retrieval engines. Web pages that have high PageRank values                       Clean-up articles    0.21
are typically more highly regarded and trusted and many end users
prefer to have them returned first.                                               As we may expect, featured articles have the highest link-ratio
   According to [15], PageRank of a web page A is defined to be                values while clean-up articles have the lowest value. The differ-
                                   P R(t1 )         P R(tn )                   ences between normal articles and clean-up articles are rather small,
        PR(A) = (1 − d) + d(                + ... +          )           (6)   possibly because normal articles have a wide range of trustworthi-
                                    C(t1 )           C(tn )
                                                                               ness and quality. In practice, we have viewed articles with a link-
   In the Equation (6), t1 , t2 , ..., tn are pages linking to page A and      ratio over 0.30 as trustworthy, and articles with a value less than
C(ti ) is the number of outgoing links that a page Ti has. d is a              0.15 as having unknown trustworthiness. For example, the article
damping factor, empirically set to 0.85.                                       Cleveland, Ohio has a link-ratio 0.53, which means that over 50%
   When calculating the PageRank of articles in Wikipedia, one can             of the times that the string ”Cleveland, Ohio” occurs in documents,
take two possible approaches:                                                  that string is linked to the article Cleveland, Ohio.
   Our results are limited by the size of the article samples and their    5.    DISCUSSION AND RELATED WORK
categorization. One source of rated articles was the class of fea-            In general, our experiments support our intuition that the link
tured articles. Unfortunately, currently, only 0.1% of Wikipedia           ratio approach computes high trust values for specialized articles
articles are featured articles. In particular, there are less than 80      that are trustworthy. For example, we may conclude that the article
featured articles in the Geography category, which was our chosen          Lake Burley Griffin is probably more trustworthy than the article
topic area for evaluation. Since we are interested in topic-specific       Lingaraj temple since both terms are specialized geography names,
trust, lack of featured articles (and clean-up articles to a lesser ex-    and the former has a link-ratio 0.57 while the latter has only 0.1.
tent) poses one challenge in evaluating the effectiveness of citation-     This comparison of link ratio values was done between terms of
based approach and other approaches, because there are no other            the same type. Nevertheless, it is not informative to compare the
explicit trust assertions in Wikipedia.                                    link ratio value of Lake Burley Griffin article to the link-ratio value
   Our second observation is that the link-ratio value depends on          for the article on Love. When the link-ratio of an article is low,
not only the trustworthiness of an article but also on how “link-          we can not determine whether it is because the article is untrust-
able” the encyclopedic index term is. For example, if one writes           worthy or if it is low for another reason, such as would be the case
an article and it has the word “Love” in it, it is unlikely that the       for a common term like “love”. Therefore, we interpret low link-
author will consider the linking the occurrence of the term ”Love”         ratio values as being of unknown trustworthiness, because we may
to the article love. The author probably expects that readers of the       not have sufficient information to determine its trustworthiness, not
new article know what the definition of love is and there is no need       that we believe the article is untrustworthy. There are other con-
to link it to the encyclopedia entry. On the contrary, if one uses         siderations as well such as how new a page is - if the page has just
a scientific term such as “Gauss’s law”, it is likely that the author      been created, then there may be many non-citation occurrences of
will consider linking to the encyclopedia article gauss’s law, as the      the phrase simply because the entry did not exist previously. This
author may assume a typical reader may want more information               is an issue that could be handled with a kind of time stamp filtering
concerning the topic. Thus the link-ratio result can be dependent          though.
on how common the term is as well as how likely it is to require              We do not expect link-ratio to be an accurate trust measure in
supplemental information that is obtainable from an encyclopedic           isolation. It should either work with other trust measures, or be
web page entry. In another example, names of famous people will            one component in a solution that utilizes multiple trust computa-
have higher link-ratio values than those of general things like wine       tion measures. In section 2, we proposed using PML for building
or coal. Table 2 shows increasing link-ratio values for terms that         trust layer solution. Our extension to PML for representing trust is
are less common and more specialized.                                      intended to be used for encoding aggregated trust values that may
                                                                           have been computed using multiple approaches.
                                                                              PageRank is a good candidate for an additional trust compu-
Table 2: Link-ratio values of common and less common cyclo-
                                                                           tation method since it has been useful in similar settings and it
pedia terms
                                                                           is also based on citation structures. [16] calculated the (internal)
  Type                          Article            Value
                                                                           PageRank on a subset of Wikipedia articles. Specifically, approx-
  General terms                 English            0.003                   imately 109K articles from the normal entries of the Wikipedia
                                Love               0.004                   English database were considered for their experiment. [16] uses
                                Beer                0.05                   the PageRank implementation available in the Java Universal Net-
                                Wine                0.06                   work/Graph Framework (JUNG) [17] open-source library. They
  General scientific terms      Broadcasting        0.02                   noted that a large number of the highly ranked entries are the names
                                Electronics         0.07                   of countries or years. The top 5 articles with their associated Page-
  Specialized scientific terms Maxwell’s equations  0.44                   Rank values are presented below:
                                Gauss’s law         0.47                        Article              PageRank value Link-ratio value
  Names of famous people        John F. Kennedy     0.41                        United States        0.003748             0.13
                                Winston Churchill   0.59                        United Kingdom 0.001840                   0.19
                                                                                France               0.001663             0.19
   Our third observation is that co-references of a term also plays             2004                 0.001584             0.06
an important role in determining the link-ratio value. For example,             Centuries            0.001264             0.12
“Massachusetts Institute of Technology” has a much higher link-               The PageRank score may be viewed as a reflection of the rel-
ratio value than its acronym “MIT”, as shown in the Table 3. If an         ative popularity of an article in a collection of articles, as inferred
author writes the entire name as in the title, he likely does so as he     from the link-structure within that collection. Obviously, there is no
specifically wants to link it to that article. After all, “Massachusetts   strong correlation between the PageRank scores and the link-ratio
Institute of Technology” is a more precise encoding than “MIT”.            values, because PageRank is determined by the number of citations
                                                                           and the citation trust of cited articles, while link-ratio is determined
                                                                           by the number of citations and the number of non-citation occur-
Table 3: Link-ratio values of Universities and their acronyms
                                                                           rences. Nevertheless, it is useful to combine two approaches to find
   Article                                 Link-ratio value
                                                                           more evidence supporting accurate trust evaluation. For example,
   Massachusetts Institute of Technology 0.52
                                                                           if both methods are used to calculate high trust values for the same
   MIT                                     0.001
                                                                           article, we have more evidence that the article is trustworthy. Fur-
   California Institute of Technology      0.69                            ther, using the inference web approach, we can provide information
   Caltech                                 0.01                            concerning the trust value and how it was computed.
   Carnegie Mellon University              0.65                               Wikipedia is different from the Web because Wikipedia articles
   CMU                                     0.002                           are restricted to be encyclopedia entries. For example, the article
   University of California, Los Angeles 0.40                              “love” in Wikipedia may be viewed as a description of the def-
   UCLA                                    0.15                            inition of love, the scientific models and different point view of
love as opposed to any of the top 10 pages returned from a search         lations between users. We have also designed a citation-based trust
for “love” using Google. Those pages are mostly websites about            metric motivated by some characteristics of Wikipedia. We im-
matching and dating services or love poetry resources. Citation-          plemented the approach and presented some experimental results
based algorithms may yield different results in a more general web        using Wikipedia data indicating that neither the Link-Ratio algo-
setting. Popular (and potentially trustworthy) general web pages          rithm nor the PageRank algorithm proved to be effective enough
may be viewed as more interesting to link to than dry encyclope-          alone for computing trustworthiness of assertions in an aggregated
dic pages so they will return higher page rank scores and possibly        knowledge repository such as Wikipedia. Motivated by this ob-
higher link-ratio scores as well. We are continuing investigations        servation, we have begun exploring new directions for computing
into complementary methods and also on defining the conditions            trust in collaborative environments, using citation based trust as one
under which methods are more effective.                                   building block. We intend to leverage the PML trust extension that
   Our analysis is somewhat limited by the computational cost of          we have proposed in this paper to work in combination with new
the calculation of Wikipedia trustworthiness measures currently un-       trust algorithms.
der investigation. For each article, we need to navigate all other           While we implemented a single trust measure that was purely
articles for counting citations and non-citation occurrences. How-        computational, we plan to continue our work along a number of di-
ever, automated trust computing is essential in improving the trust-      mensions. First, we believe that trust measures should include com-
worthiness of Wikipedia. In practice, incremental calculation of          putational components yet we also want to allow stated trust values
citation trust is desired because articles in Wikipedia are under con-    between entities (among users, between users and other sources,
stant revisions.                                                          etc.) We are expanding our design to include stated trust values in
   The trustworthiness of a Wikipedia article may be measured in          addition to computed values. We are also expanding our design to
different ways, for example, trust as a measure of accuracy of the        include learning trust values by user instruction.
article. Lih [18] studied the impact of press citation on the quality        We have also begun investigations into more sophisticated mod-
of a Wikipedia article in terms of number of editors and number of        els of trust. We extended PML with a very simple notion of trust
changes. Stvilia et al. [19] conducted a comprehensive qualitative        and we are currently using a simple single value. We are explor-
analysis on various aspects of the information quality of Wikipedia       ing more complex measures of trust and we are working on formal
article. While qualitative approaches are important, we are more          descriptions so that different applications may use well defined def-
interested in deriving quantitative metrics which can be automati-        initions and values for trust and share those encodings among them-
cally computed from Wikipedia database.                                   selves. This would enable trust to be treated as a first-class entity
   Link structure analysis on the Web has been extensively studied        and offer better flexibility in expressing complex trust relationships
in the last of several years, e.g. [20] [21]. Social network and p2p      and multiple attributes that could codify trust.
network trust are also relevant to our work, e.g. [8] [10] [11] [22]         The citation-based trust measure is intended to work as one com-
[23]. Social networks usually have explicit trust assertions among        ponent in a solution that utilizes multiple computational trust mea-
the entities, such as user ratings of a movie, or to a transaction.       sures. We are exploring another approach based on the hypothesis
However, Wikipedia lacks such explicit trust assertions. This is          that revision history may be a useful component in a hybrid ap-
one of the reasons we began with the study of citation-based ap-          proach for computing a measure of trustworthiness of articles. For
proaches, in which trust is implicit. Nevertheless, a hybrid model        example, one may assume that an article may become more trust-
of trust propagation and a citation-based approach may be a more          worthy if it revised by a trustworthy author, and similarly, it may
effective hybrid solution.                                                become less trustworthy if revised by an author who is known to
   We are also interested in the representation of trust in large-scale   be less trustworthy. Given the rich and accessible revision informa-
and heterogeneous sources. Our markup representation for expla-           tion in Wikipedia6 , we are working on a hybrid model that utilizes
nation information was designed to interoperate between applica-          both citation-based trust and revision history-based trust. Prelimi-
tions needing to share answers and justifications. Similarly, our         nary experiments indicate that this hybrid approach using these two
extension to this markup representation was designed to encode            metrics performs far better than when a single model is used.
trust and to share that trust information between applications. This
approach makes it possible to aggregate different trust values as
calculated by different trust approaches. McGuinness and Pinheiro         7.    ACKNOWLEDGMENTS
da Silva [12] present Inference Web, a framework for storing, ex-            This research was largely supported by Stanford’s DARPA con-
changing, combining, abstracting, annotating, comparing and ren-          tract #HR0011-05-1-0019-P00001 and DTO contract #2003*H278-
dering proofs and proof fragments provided by reasoners embed-            000*000. We would also like to thank Cynthia Chang and Richard
ded in Semantic Web applications and facilities. We are currently         Fikes for valuable conversations and implementations.
extending our Inference Web toolkit, including the IWTrust com-
ponent, to include more support for encoding and sharing trust in-
formation.                                                                8.    REFERENCES
                                                                           [1] : Wikipedia. (http://www.wikipedia.com)
6.    CONCLUSION AND FUTURE WORK                                           [2] Giles, J.: Internet encyclopaedias go head to head. In: Nature
   Trust is a central issue when dealing with systems and environ-             438, 900-901 (15 Dec 2005) News. (2005)
ments that use information coming from multiple, unknown sources.          [3] : John seigenthaler sr. wikipedia biography controversy.
In this paper, we have presented a vision of how one can use trust             (http://en.wikipedia.org/wiki/John Seigenthaler Sr.
information to help users view and filter information in collabora-             Wikipedia biography controversy)
tive and evolving information repositories such as Wikipedia. Our          [4] Castelfranchi, C., Tan, Y., eds.: Trust and Deception in
tools enable users to develop their own opinion concerning how                 Virtual Societies. Kluwer Academic Publishers (2001)
much and under what circumstances, they should trust information.
We have extended PML to provide an interoperable and extensible           6
                                                                            Wikipedia authors have made approximately 41 million revisions,
encoding useful for capturing trust information including trust re-       an average of 12 versions per article, over the last four years.
 [5] Grandison, T., Sloman, M.: A survey of trust in internet
     application. IEEE Communications Surveys Tutorials
     (Fourth Quarter) 3(4) (2000)
 [6] Maurer, U.: Modelling a public-key infrastructure. In:
     ESORICS: European Symposium on Research in Computer
     Security, LNCS, Springer-Verlag (1996)
 [7] Blaze, M., Feigenbaum, J., Lacy, J.: Decentralized trust
     management. In: Proceedings of the 1996 IEEE Symposium
     on Security and Privacy. (1996) 164–173
 [8] Damiani, E., di Vimercati, S., Paraboschi, S., Samarati, P.,
     Violante, F.: A reputation-based approach for choosing
     reliable resources in peer-to-peer networks. (2002) In 9th
     ACM Conf. on Computer and Communications Security.
 [9] Mui, L.: Computational Models of Trust and Reputation:
     Agents, Evolutionary Games, and Social Networks. PhD
     thesis, MIT (2002)
[10] Kamvar, S.D., Schlosser, M.T., Garcia-Molina, H.: The
     eigentrust algorithm for reputation management in p2p
     networks. In: Proceedings of the 12th international
     conference on World Wide Web. (2003)
[11] Guha, R., Kumar, R., Raghavan, P., Tomkins, A.:
     Propagation of trust and distrust. In: Proceedings of the 13th
     international conference on World Wide Web, ACM Press
     (2004) 403–412
[12] McGuinness, D.L., Pinheiro da Silva, P.: Explaining answers
     from the semantic web: The inference web approach. In:
     Journal of Web Semantics. Volume 1. (2004) 397–413
[13] Pinheiro da Silva, P., McGuinness, D.L., Fikes, R.: A proof
     markup language for semantic web services. (In: Information
     Systems. (To appear))
[14] Cock, M.D., Pinheiro da Silva, P.: A many valued
     representation and propagation of trust and distrust. In: In
     Proceedings of International Workshop on Fuzzy Logic and
     Applications (WILF2005). (2005)
[15] Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank
     citation ranking: Bringing order to the web. Technical report,
     Stanford University (1998)
[16] : Pagerank report on the wikipedia.
     (http://www.searchmorph.com/wp/2005/01/26/pagerank-
     report-on-the-wikipedia)
[17] : Java universal network/graph framework (jung).
     (http://jung.sourceforge.net/)
[18] Lih, A.: Wikipedia as participatory journalism: Reliable
     sources? metrics for evaluating collaborative media as a
     news resource. In: Proceedings of the 5th International
     Symposium on Online Journalism. (2004)
[19] Stvilia, B., Twidale, M.B., Gasser, L., Smith, L.C.:
     Information quality discussion in wikipedia. In: Proceedings
     of the 2005 International Conference on Knowledge
     Management. (2005) 101–113
[20] Haveliwala, T.H.: Topic-sensitive pagerank. In: Proceedings
     of the Eleventh International World Wide Web Conference.
     (2002)
[21] Tomlin, J.A.: A new paradigm for ranking pages on the
     world wide web. In: Proceedings of the Twelveth
     International World Wide Web Conference. (2003)
[22] Xiong, L., Liu, L.: A reputation-based trust model for
     peer-to-peer ecommerce communities. (2003) Proceedings of
     the 4th ACM conference on Electronic commerce.
[23] Wang, Y., Vassileva, J.: Trust and reputation model in
     peer-to-peer networks. In: P2P’03. (2003)