<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Enabling cross-wikis integration by extending the SIOC ontology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fabrizio Orlandi</string-name>
          <email>orlandi.fabrizio.31985@unimore.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexandre Passant</string-name>
          <email>alexandre.passant@deri.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Digital Enterprise Research Institute, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universita degli studi di Modena e Reggio Emilia</institution>
          ,
          <addr-line>Modena</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper describes how we extended the SIOC ontology to take into account particular aspects of wikis in order to enable integration capabilities between various wiki systems. In particular, we will overview the proposed extensions and detail a webservice providing SIOC data from any MediaWiki instance, as well as related query examples that show how di erent wikis, designed as independant data silos, can be uniformally queried and interlinked.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic Web</kwd>
        <kwd>SIOC</kwd>
        <kwd>wikis</kwd>
        <kwd>MediaWiki</kwd>
        <kwd>Social Semantic Web</kwd>
        <kwd>Linked Data</kwd>
        <kwd>DBpedia</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        The SIOC Ontology { Semantically-Interlinked Online Communities [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] { is now
considered as one of the building blocks of the "Social Semantic Web". More
than 50 applications are currently using SIOC3, either as a common vocabulary
to expose their data in RDF, alongside with FOAF for instance, as well as using
existing SIOC data, as for instance Yahoo! SearchMonkey. Moreover, the use of
SIOC goes further than mainstream Web 2.0 services, from Enterprise 2.0
information integration4 to Health Care and Life Sciences discourse representation5.
      </p>
      <p>However, only a few work have been done so far regarding wikis, semantic
wikis and the SIOC ontology. While the SIOC Types6 module already provides
the Wiki and WikiArticle classes that can be used to represent the basic objects
manipulated by wikis, some particular features of wikis such as pages versioning
and backlinks are not taken into account, neither in the SIOC core nor in its
modules. Yet, providing wikis information using SIOC would have several
advantages in terms of integration with existing and constantly dynamically-created</p>
      <sec id="sec-1-1">
        <title>3 http://sioc-project.org/applications</title>
      </sec>
      <sec id="sec-1-2">
        <title>4 http://www.w3.org/2001/sw/sweo/public/UseCases/EDF/</title>
      </sec>
      <sec id="sec-1-3">
        <title>5 http://esw.w3.org/topic/HCLSIG/SWANSIOC</title>
      </sec>
      <sec id="sec-1-4">
        <title>6 http://rdfs.org/sioc/types</title>
        <p>SIOC data, as well as interlinking with other RDF data for advanced querying
purposes. For instance, one will be able to run the same SPARQL query to nd
latest created items on a MediaWiki instance or on a WordPress weblog. Hence,
we recently worked on extending the SIOC ontology for this purposes, as well
as providing a SIOC exporter for MediaWiki, potentially creating millions of
SIOC-based RDF documents from various popular wiki services.</p>
        <p>This paper is organized as follows. First, we will go through an overview of
wiki features that are important to consider in such a modeling approach and
explain how we took them in consideration in regards of the SIOC ontology and
how we extended it based on this analysis. Second, since some wikis already
expose their data in a machine-readable form thanks to Semantic Web
technologies, we will focus on a state of the art of existing models that achieve the
same goal. Then we will detail how we built a webservice that translates any
MediaWiki wiki page to RDF using the newly-extended SIOC ontology. We will
particularly focus on how this service produces RDF data compliant with the
Linked Data principles and how it relates to initiatives such as DBpedia. Then
we will show some relevant query examples, from advanced queries in a single
wiki to cross-querying capabilities. Finally, we will conclude the paper with an
overview of future works on the domain.
2</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Using and extending the SIOC ontology for advanced modeling of wiki structure</title>
      <p>In this section we spot and explore what we consider being the typical and
fundamental features of wikis in terms of structure and social interactions. Typically
wikis allow editing of documents and, by de nition, allow multiple users to
simultaneously contribute to the content; they track history of changes so that
pages can be restored to previous modi ed versions; they include comments or
discussion areas; they link to other external sources or within the wiki; they
describe categories into hierarchical structures. For each identi ed feature, we give
a brief overview of its goal, and detail how we extended, and generally use, the
SIOC Core ontology7 and its Types module8 taking each feature into account in
our modeling approach.
2.1</p>
      <sec id="sec-2-1">
        <title>Modeling relevant wiki features</title>
        <p>Multi-authoring. A fundamental feature of wikis is that multiple users are
allowed to modify the same content, enabling some kind of collective intelligence
process. In this regard the semantic infrastructure should provide a model to
identify users and theirs modi cations, marking events with a corresponding
timestamp.</p>
        <p>This feature can be modeled using the class sioc:User as object of the
property sioc:has_creator that describes a user account in an online community</p>
        <sec id="sec-2-1-1">
          <title>7 http://rdfs.org/sioc/ns, pre x sioc in this document</title>
        </sec>
        <sec id="sec-2-1-2">
          <title>8 http://rdfs.org/sioc/types, pre x sioct in this document</title>
          <p>site, and which is a subclass of foaf:OnlineAccount. In this way a foaf:Person
could be linked to several sioc:User belonging to di erent wiki sites. Another
way to model the relationships between pages and their authors is to reuse
properties from the Dublin Core ontology, i.e. dc:contributor (or dc:creator) and
dcterms:created. Yet, these properties do not link to a user URI but to a
simple text string, which can be an issue when querying information, especially for
cross-querying as we will detail in section 5.</p>
          <p>
            Categories. Wiki pages are generally related to categories, that allow
readers to nd sets of articles on related topics. Categories can also be organized
in a tree-like structure and their semantic model should mantain the original
taxonomical structure. In this regard an appropriate solution is provided by
the SKOS9 vocabulary [
            <xref ref-type="bibr" rid="ref11">11</xref>
            ], as it o ers a way to model hierarchical structures
between various categories, represented as instances of skos:Concept.
          </p>
          <p>As regards the SIOC ontology, a sioct:Category class was already present
into the SIOC Types module, allowing only the modeling of a at set of category
names. Hence, we decided to declare this class as a subclass of a skos:Concept,
giving it the ability to use the wide SKOS ontology capabilities to organize
categories into advanced taxonomies. Moreover, thanks to the sioc:topic property,
one can link any wiki page to such category.</p>
          <p>Social tagging. While not all wiki engines support that feature, we believe
this is particularly relevant, especially as it o ers an open and user-driven
classi cation scheme for wiki pages. The use of tags lead to a non-organised but
dynamic organisation process, known as a "folksonomy", rather than the more
widely used hierarchical structures.</p>
          <p>
            The properties sioc:topic and dc:subject can be used to represent tags
related to a particular wiki page, either using URIs for these tags (with sioc:topic)
or simple keywords (dc:subject). In addition, vocabularies such as the Tag
ontology [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ], SCOT [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ] or MOAT [
            <xref ref-type="bibr" rid="ref13">13</xref>
            ] allow to model tagging as tripartite actions
(between a wiki page, a user and a tag) as well as organize tags together or
link them to ontology concepts, in order to solve common tagging issues such as
ambiguity between tags.
          </p>
          <p>Discussions. Several wikis associate a discussion page to every wiki page, so
that each user is able to comment and argue his point-of-view on the topic. On
a discussion page, people can discuss about the article subject, or about the
way that subject is presented (see the Wikipedia's approach10). A rst modeling
solution could be to simply keep the native wiki text format of the wiki and just
semantically link the discussion page to the related article page.</p>
          <p>
            The SIOC's main class responsible for the modeling of a discussion is the
sioc:Forum class, but there could be other speci c classes that are more suitable
9 http://www.w3.org/2004/02/skos/
10 http://en.wikipedia.org/wiki/Wikipedia:Talk_page_guidelines
for these discussion purposes, as de ned in the Types module. The appropriate
class to choose depends also on the type and style of the discussion page. So it has
been necessary to identify a proper attribute to link a wiki page to its discussion
page. In this regard we decided to add a sioc:has discussion property to
the SIOC Core ontology, with domain sioc:Item and open range. This choice
has been done in order to make this property reusable also in other contexts,
for instance linking a simple webpage to a discussion forum. The discussions
happening within the related sioc:Forum can then be modeled either as
wikistyle discussions or threaded ones, and that feature also allows us to re-use
advanced SIOC-based argumentative discussion modeling as de ned in [
            <xref ref-type="bibr" rid="ref10">10</xref>
            ].
Backlinks. Backlinks are an important feature of wikis, as they allow to
visualize instantaneously all the incoming links to a website or web page. More
precisely they are wiki internal links pointing to a wiki article. It is a very
common wiki feature and they may be of signi cant interest: they indicate who is
paying attention to the linked page or topic.
          </p>
          <p>We modeled this feature using the already existing sioc:links to property.
This property identi es links extracted from hyperlinks within a SIOC concept
and is a subproperty of dcterms:references. It is important to remember that
this property has to be de ned into the RDF description of the original wiki
article which links back to the wiki article. Hence, to model for instance that the
Wikipedia page about "DERI" features a backlink from the page about "RDF",
the following statement would be added into the RDF description of DERI's
page.</p>
          <p>&lt; http :// en . wikipedia . org / wiki /</p>
          <p>Resource_Description_Framework &gt; sioc : links_to
&lt; http :// en . wikipedia . org / wiki /</p>
          <p>Digital_Enterprise_Research_Institute &gt; .</p>
          <p>Listing 1.1. Representing backlinks
Pages versioning. Usually all editable pages on wikis have an associated page
history. This history consists of the old versions of the wikitext, as well as a
record of the date and time of every edit, the username or IP address of the
user who wrote it, and their edit summary. All this is usually accessible through
a special "history" page which shows time-ordered links to all the revisions.
Commonly the latest revision of a wiki page has always the same URL (alias
name), meanwhile older versions have further parameters appended to the URL.</p>
          <p>The versioning of pages could be modeled in several ways. Taking into account
other semantic wikis, that we describe in Section 3, we took inspiration from
other existing approaches. Then we de ned our own di erent model because
we wanted to keep the pros of each model and we did not nd one capable to
satisfy completely our needs. An important requirement we take into account
is the fast and simple browsing capability that the model should have. For this
reason we chose to use transitive properties to express the temporal relation
between revisions of a wiki page. The model is displayed in Fig. 1 and all the
used properties are now de ned in the SIOC Core ontology.</p>
          <p>http://wikiexample.org/PageName
next_version
previous_version
earlier_version
latest_version
http://wikiexample.org/PageName_Vers_X
next_version</p>
          <p>previous_version
http://wikiexample.org/PageName_Vers_Y
next_version</p>
          <p>previous_version
http://wikiexample.org/PageName_Vers_Z
later_version</p>
          <p>The next/previous version properties link only the next/previous revision
of a generic sioc:Item. Meanwhile earlier/later version are de ned as
transitive properties and as super-properties respectively of next/previous version.
The main advantage of the de nition of transitivity and the declaration of
superproperties, is that they can be inferred automatically by a reasoner. Hence, using
an OWL level reasoning engine, when modeling a WikiArticle (or a sioc:Item
in general), it is only necessary to describe its previous and next revision and
the transitive super-properties will be automatically inferred by the system. This
can also be convenient during the querying process (described in Section 5): if
it is necessary to get all the earlier versions of a wiki page, with transitivity it
is su cient to use the sioc:earlier version transitive property, while in the
other case, it has to be implemented a query that recursively "jumps" on each
sioc:previous version of the latest wiki article. Another introduced property
is the sioc:latest version which points always to the newest revision.
Usually it is used in combination with an alias name of the latest version so that
it is not necessary to change the referred URI in all the earlier versions as soon
as a modi cation happens. All the wikis we analyzed adopt this solution as it
addresses scalability.
2.2</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>Changes summary</title>
        <p>All the changes we made to the SIOC ontology are summarized as follows:
{ De ned the sioct:Category class as a subclass of skos:Concept.
{ Added a sioc:has discussion property, with domain sioc:Item and open
range.
{ Added a sioc:latest version property, with sioc:Item as domain and
range.
{ Added two transitive properties: sioc:earlier version and sioc:later version,
with sioc:Item as domain and range.
{ De ned sioc:later version as inverse property of sioc:earlier version.
{ De ned sioc:next version as a subproperty of sioc:later version.
{ De ned sioc:previous version as a subproperty of sioc:earlier version.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Related work</title>
      <sec id="sec-3-1">
        <title>Existing models to represent structure of wikis</title>
        <p>While our aim is to model wiki features by extending SIOC, several vocabularies
have been already proposed to achieve this goal. We will overview some of them
in this section, by distinguishing models created with a general purpose and
models created for a particular wiki engine but nevertheless available on the
Web. In addition, it is worth noticing that we focus here on models and tools
de ned to represent the structure of wikis and not on the ones that allow domain
ontology modeling and population in RDF(S)/OWL from the wiki itself, while
these two levels of semantics can be obviously combined.</p>
        <p>
          As regards generic models WikiOnt11 [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] is an ontology for describing and
exchanging wiki articles and it aims at integrating Wikipedia (and by
extension other MediaWiki-based sites) into the Semantic Web framework, making
Wikipedia machine-processable. This OWL ontology uses DublinCore to
identify multiple authors of wiki pages as well as the editing date, and provides
Article and Category classes. Yet all the other features are currently not
modeled by the ontology. Wiki Interchange Format12 (WIF) [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] is a project
that allows data exchange between wikis and related tools. Di erent from other
approaches, it also tackles the problem of page content and annotations. WIF
de nes a subset of XHTML as an over-the-wire format for wiki content exchange.
It de nes the classes of WikiUser (subclassing foaf:Person) and WikiPage to
model pages and authors. It also privides a versioning system thanks to the
hasPreviousVersion and hasChangeDate properties. Categories, social tagging,
discussions and backlinks are features currently not modeled by the ontology.
        </p>
        <p>
          In addition to the previous models we considered some particular wiki engines
that expose their data in RDF providing an RDF vocabulary for such export.
IkeWiki13 [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] aims at creating instance data based on an existing ontology
but also at being a tool for creating and editing ontologies. In addition, IkeWiki
provides a complete export of the wiki structure using a dedicated OWL
ontology. It introduces a User class, subclass of foaf:Person, and uses a hasAuthor
11 http://sw.deri.org/2005/04/wikipedia/wikiont.html
12 http://wif.ontoware.org/2005/04/
13 http://ikewiki.salzburgresearch.at
property, subclass of foaf:maker, to associate an author/User with a resource
in the wiki. It is also worth noticing that IkeWiki uses SIOC to model
discussions, using a hasDiscussion property that links any wiki page to an instance of
sioc:Forum. A wiki article is de ned by a Page class. Social tagging, backlinks
and pages versioning are features currently not modeled by the ontology.
SweetWiki14 [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] is a semantic wiki based on the CORESE engine. It relies on
Web standards for the semantic annotations (RDFa, RDF) and for the ontologies
it manipulates (OWL Lite). The SweetWiki ontology manages versions of wiki
pages using a Version class, de ned as a subclass of the main WikiPage class.
A pageHasVersion property links each old version with the latest page
represented by the WikiPage class. The page version number is declared as an integer
number with the isTheVersionNumber attribute. To note that SweetWiki o ers
advanced social tagging features, which are not modeled by other Semantic wikis.
Keywords can be collaboratively structured through a lightweight ontology
editor and related either to pages, categories (de ned using a particular Category
class) or embedded media content. Sweetwiki also supports backlinks, but not
discussion pages at the moment. It also de nes its own classes to model authors
and wiki pages, respectively using Person and WikiPage.
        </p>
        <p>
          Semantic MediaWiki15 (SMW) [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] uses a particular ontology to represent the
semantic data exported from a page by a user, named SWIVT { Semantic Wiki
Vocabulary and Terminology. This ontology provides a basis for interpreting
the semantic data exported by SMW, and it incorporates various elements that
are closely related to SMW's metadata model. Yet, while some features such
as backlinks or categories are provided by SMW, they are not exposed in the
SWIVT model, exporting only a Wikipage class.
        </p>
        <p>
          Finally, regarding the use of SIOC for wikis, we can mention UfoWiki [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ],
that have been deployed in complement of other SIOC-related data in an
Enterprise 2.0 platform, while it does not support versioning in its RDF representation.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Comparison and positioning of our approach</title>
        <p>Based on the previous analysis, we produced a comparison matrix, to underline
the pros and cons of each approach. We may conclude that multi-authoring is a
feature supported by the ontologies of all the wiki models, and this is because it is
an inescapable characteristic of a semantic wiki. On the other hand backlinks and
versioning are not modeled by most of the considered wikis. These two features
are addressed only by SweetWiki, with the exception of WIF developing a very
simple versioning solution. Moreover, some features that are not modeled by
these vocabularies could be added by external vocabularies, as for instance social
tagging. Finally the most complete model to take into account is SweetWiki,
being able to accomplish to every requirement but the discussions.
14 http://sweetwiki.inria.fr/sweetwiki
15 http://semantic-mediawiki.org
Multi-authoring</p>
        <p>Categories
Social Tagging</p>
        <p>Discussions
Backlinks
Versioning</p>
        <p>IkeWiki SweetWiki SWIVT WikiOnt WIF SIOC
p p p p p p
p p p p p</p>
        <p>p p
p p
p p
p p p</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Exporting SIOC data from</title>
    </sec>
    <sec id="sec-5">
      <title>MediaWiki</title>
      <p>In order to see implications of our extension, our aim was to build an exporter
from a popular wiki platform, so that it can expose its data in RDF using our
proposed model. Hence we decided to create a web-service application to export
any MediaWiki instance. MediaWiki is one of the most popular wiki platforms,
hosting all the Wikimedia Fundation wikis (i.e. Wikipedia, Wiktionary, etc.) and
propulsing more that 25 millions of wiki articles from di erent wiki sites16.
4.1</p>
      <sec id="sec-5-1">
        <title>Principles of the SIOC-MediaWiki webservice</title>
        <p>In order to export SIOC data from MediaWiki's wikis we implemented a
webservice, written in PHP, that exports a wiki article in RDF with the structure
we explained in the previous sections. The webservice is publicly available at
http://ws.sioc-project.org/mediawiki/.
16 http://s23.org/wikistats/largest_html.php</p>
        <p>The MediaWiki exporter is relatively lightweight and built thanks to two
PHP classes: the SIOC-Mediawiki exporter itself and the already existing SIOC
API17. Our approach combines the use of the MediaWiki API as well as the
SIOC PHP API { that has been extended based on the previously-detailed
ontology changes { to create SIOC data. The exporter class is the part responsible
for querying the MediaWiki API and parsing the results, and the SIOC API is
responsible for exporting the content in RDF. The script indeed uses the
MediaWiki API to get all the information about the article inserted in the form, with
the following process (as represented in Fig. 3):
{ it automatically discovers the API location (if not detected it is possible to
manually specify the API path in the proper text eld);
{ it connects to the API sending HTTP requests as queries;
{ it parses the results of the queries and lls in the proper variables;
{ it calls the SIOC API to export in RDF the fetched structural information
and outputs the results in RDF/XML serialization.</p>
        <p>User query
RDF Output</p>
        <p>SIOC
MediaWiki
webservice</p>
        <p>MediaWiki</p>
        <p>API</p>
        <p>SIOC PHP</p>
        <p>
          API
One of our goals with this exporter was not only to create RDF data from any
MediaWiki page, but also to easily allow interlinking between various wikis, as
well as between wiki data and other RDF data, whatever it is social data modeled
with FOAF or SIOC or any other kind of RDF data. Hence, we followed the
Linked Data best practices de ned in [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] and [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>Particularly, to o er a better browsing experience and ease the process of
crawling SIOC exports of MediaWiki instances, our webservice automatically
17 http://wiki.sioc-project.org/index.php/PHPExportAPI
produces rdfs:seeAlso links between wiki pages. Actually, more than a simple
link to the wiki page, the exporter provides a link to the related RDF document,
as we can see in the following Listing 1.2 related to a particular sioc:User.
As we can also notice in that example, we distinguish the concept itself (i.e.
User:StefanDecker) and the related RDF page. These seeAlso links are very
useful not only to provide link to other related RDF documents, that can be
used for instance when browsing data with Tabulator, but also in a crawling
perspective. A RDF crawler could easily follow all the seeAlso links found on
every document and continue to crawl. In this regard, for example, we crawled
and exported entire wiki sites just following these links.</p>
        <p>&lt; sioc : User rdf : about =" http :// en . wikipedia . org / wiki / User :</p>
        <p>StefanDecker "&gt;
&lt; rdfs : seeAlso rdf : resource =" http :// ws . sioc - project . org /
mediawiki / mediawiki . php ? wiki = http :// en . wikipedia . org
/ wiki / User : StefanDecker "/ &gt;
&lt;/ sioc : User &gt;</p>
        <p>Listing 1.2. Modeling a user in the MediaWiki exporter</p>
        <p>Another interesting feature is the linkage to the corresponding DBpedia18
resource, if the article belongs to the english Wikipedia. Since DBpedia
semantically models the content of a Wikipedia page, this connection is very useful to
link semantic data about the content and the structure of a wiki article.
DBpedia resource URIs are used in rage of the foaf:primaryTopic property, this
because it relates a document to the main thing that the document is about.
5</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Cross-wikis integration and advanced querying process</title>
      <p>
        In order to evaluate our proposal, we exported and crawled di erent MediaWiki
instances. Four di erent wikis have been crawled { using for each crawl a single
entry point thanks to the use of the rdfs:seeAlso links { each one belonging to
the same area of interest in order to have a high probability of shared topics and
users: Semanticweb.org19, Protege Wiki 20, RDFa Wiki 21 and the ONTOLORE
Karlsruhe wiki22. In total, we collected about 1GB of RDF data and loaded it
in Sesame [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. As we needed an higher degree of inference (because of the OWL
transitive properties) we installed and con gured the reasoning engine OWLIM23
on the top of it.
18 http://www.dbpedia.org
19 http://www.semanticweb.org
20 http://protegewiki.stanford.edu
21 http://rdfa.info/wiki/RDFa_Wiki
22 http://logic.aifb.uni-karlsruhe.de/wiki/ONTOLORE
23 http://www.ontotext.com/owlim/
      </p>
      <sec id="sec-6-1">
        <title>Advanced querying for a single wiki</title>
        <p>A rst example of advanced querying for a particular wiki is the ability to
answer to the following question: "what are the collaborating users that worked
alternatively on the same wiki article?". In Listing 1.3 we provide the SPARQL
implementation of this query.</p>
        <p>SELECT DISTINCT ? wikiArt ? Contrib_a ? Contrib_b
WHERE {
?x sioc : latest_version ? wikiArt .
? wikiArt sioc : earlier_version ? VersA .
? VersA sioc : earlier_version ? VersB ;</p>
        <p>dc : contributor ? Contrib_a .
? VersB sioc : earlier_version ? VersC ;</p>
        <p>dc : contributor ? Contrib_b .
? VersC dc : contributor ? Contrib_a .</p>
        <p>FILTER (? Contrib_a != ? Contrib_b ) .
}</p>
        <p>Listing 1.3. Identifying collaborating users</p>
        <p>In Fig. 4 we display a diagram that summarizes the above query, meanwhile
in Fig. 5 we show the results we got querying on our SESAME triplestore. As
we can see, this query takes advantage of the transitivity of the newly created
property sioc:earlier_version, since we identify users that worked on earlier
versions, and not only immediately on the previous one.</p>
        <p>?wikiArt</p>
        <p>sioc:earlier_version
sioc:earlier_version
sioc:earlier_version
?VersA
?VersB
?VersC
dc:contributor
dc:contributor
dc:contributor
?Contrib_a
?Contrib_b</p>
        <p>The query results provide the article URI and the two usernames in case
the rst user (?Contrib a) re-edited the article after a modi cation made by
the second user (?Contrib b). It enables you to look for users sharing the same
interests and knowledge areas. The query is important expecially in a social
semantic context.</p>
      </sec>
      <sec id="sec-6-2">
        <title>Cross-wiki integration and querying</title>
        <p>Another interesting feature of our approach is the ability to do cross-wikis
querying, since they are based on the same model. Obviously, one can argue that since
all the exported wikis are based on MediaWiki, the same approach could have
been used simply with the MediaWiki API. Yet, our proposal has many
advantages as it relies on SPARQL instead of a particular API and it provides advanced
inference capabilities that the original API does not o er. The following query
identi es users involved in di erent wikis, looking for the same usernames.</p>
        <p>SELECT DISTINCT ? creator1 ? page1 ? page2 ? wiki1 ? wiki2
WHERE {
? page1 sioc : has_container ? wiki1 ;</p>
        <p>dc : contributor ? creator1 .
? page2 sioc : has_container ? wiki2 ;</p>
        <p>dc : contributor ? creator2 .</p>
        <p>FILTER ( str (? creator1 ) == str (? creator2 )) .</p>
        <p>FILTER ( str (? wiki1 ) != str (? wiki2 )) .
}
Listing 1.4. Identifying pages created by a single user in di erent wikis
While this is a very simple query it requires high computation capabilities
when ran through a large number of di erent wikis. Hence, in Fig. 6 we display
a screenshot of the results we get after running the same query between the
Semanticweb.org wiki and the Protege wiki. Instead of also displaying all the
other details, such as the related wiki pages and the two wiki containers, we
show only the distinct usernames of the found users.</p>
        <p>Yet, as this query relies on a FILTER clause, it will identify common users
only if they use the same account name on two di erent wikis. Moreover, we can
imagine that some common account names will be used by di erent people on
di erent wikis, e.g. JohnSmith. To that extend, we can bene t from the strong
ties that exist between FOAF and SIOC and the fact we are modeling a wiki user
using the sioc:User class. One person can indeed de ne in his FOAF pro le
the various wiki accounts he owns, using simple foaf:holdsAccount properties.
Then, the previous query can be adapted to deal not only with text strings to
identify the user, but with their related accounts from the FOAF URI, so that
a single query can be used to retrieve all the contributions of a user whatever
the wiki used was. Moreover, since the wiki model is based on SIOC, the same
query can be used to retrieve wiki pages, blog posts, etc. as follows.</p>
        <p>SELECT DISTINCT ? content
WHERE {
&lt; http :// example . org / js #me &gt; foaf : holdsAccount ? account .
? account rdf : type sioc : User .</p>
        <p>? content sioc : has_creator ? account .
}</p>
        <p>Listing 1.5. Cross-sites querying by combining FOAF and SIOC</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Conclusion</title>
      <p>In this paper we presented how the SIOC ontology and lightweight semantics
can be used and extended to represent the structure of wikis in an uni ed way.
We rst explained our motivations regarding some properties of wikis that we
focused on in our modeling process, particularly focusing on a versioning process,
and how we can bene t in this case of OWL reasoning capabilities. Then, we
described how we designed a webservice to translate any MediaWiki page into
SIOC data, following the Linked Data best principles to provide not only isolated
RDF, but interlinked data. Finally, we gave some examples regarding how this
data could be e ciently used for querying purposes.</p>
      <p>While the work done here have been only applied to MediaWiki, further
developments may include exporters and plug-in for other platforms to enable
better cross-wikis integration. We also consider extending the versioning system
de ned in SIOC regarding wiki pages to other user-generated content. Moreover
the semantic modeling of a wiki article might be improved adding more details
about the content of the article itself. One of our goals is also to run
crossqueries between our Wikipedia export and DBpedia, for instance to identify
which people where the most active on a particular wiki page or topic. Finally,
we also believe that this article gives a complete and nice overview regarding
how to extend an ontology such as SIOC for particular purposes.
7</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgements</title>
      <p>The work presented in this paper has been funded in part by Science Foundation
Ireland under Grant No. SFI/08/CE/I1380 (Lion-2).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Danny</given-names>
            <surname>Ayers</surname>
          </string-name>
          and
          <article-title>Max Volkel. Cool URIs for the Semantic Web</article-title>
          .
          <source>W3C Interest Group Note 03 December</source>
          <year>2008</year>
          , World Wide Web Consortium,
          <year>2008</year>
          . http://www. w3.org/TR/cooluris/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Tim</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          .
          <article-title>Linked Data. Design issues for the world wide web</article-title>
          ,
          <source>World Wide Web Consortium</source>
          ,
          <year>2006</year>
          . http://www.w3.org/DesignIssues/LinkedData.html.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Chris</surname>
            <given-names>Bizer</given-names>
          </string-name>
          , Richard Cyganiak, and Tom Heath.
          <article-title>How to Publish Linked Data on the Web</article-title>
          .
          <source>Technical report</source>
          ,
          <year>2007</year>
          . http://www4.wiwiss.fu-berlin.de/bizer/ pub/LinkedDataTutorial/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. John G. Breslin, Andreas Harth, Uldis Bojars, and
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Decker</surname>
          </string-name>
          .
          <article-title>Towards Semantically-Interlinked Online Communities</article-title>
          .
          <source>In Proceedings of the 2nd European Semantic Web Conference (ESWC2005)</source>
          , volume
          <volume>3532</volume>
          of Lecture Notes in Computer Science, pages
          <volume>500</volume>
          {
          <fpage>514</fpage>
          . Springer,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Jeen</given-names>
            <surname>Broekstra</surname>
          </string-name>
          , Arjohn Kampman, and Frank van Harmelen.
          <article-title>Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema</article-title>
          .
          <source>In The Semantic Web - ISWC 2002. First International Semantic Web Conference</source>
          , volume
          <volume>2342</volume>
          of Lecture Notes in Computer Science, pages
          <volume>54</volume>
          {
          <fpage>68</fpage>
          . Springer,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Michel</given-names>
            <surname>Bu</surname>
          </string-name>
          <string-name>
            <surname>a</surname>
          </string-name>
          , Fabien L. Gandon, Guillaume Ereteo,
          <string-name>
            <given-names>Peter</given-names>
            <surname>Sander</surname>
          </string-name>
          , and Catherine Faron.
          <article-title>SweetWiki: A semantic wiki</article-title>
          .
          <source>Journal of Web Semantics</source>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          ):
          <volume>84</volume>
          {
          <fpage>97</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Harth</surname>
          </string-name>
          , Hannes Gassert,
          <string-name>
            <surname>Ina O'Murchu</surname>
          </string-name>
          , John G. Breslin, and Stefan Decker.
          <article-title>WikiOnt: An Ontology for Describing and Exchanging Wikipedia Articles</article-title>
          .
          <source>In Proceedings of Wikimania</source>
          <year>2005</year>
          { The First International Wikimedia Conference,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Hak</given-names>
            <surname>Lae</surname>
          </string-name>
          <string-name>
            <given-names>Kim</given-names>
            ,
            <surname>Sung-Kwon</surname>
          </string-name>
          <string-name>
            <given-names>Yang</given-names>
            , John G. Breslin, and
            <surname>Hong-Gee Kim</surname>
          </string-name>
          .
          <article-title>Simple algorithms for representing tag frequencies in the scot exporter</article-title>
          .
          <source>In Proceedings of the 2007 IEEE/WIC/ACM International Conference on Intelligent Agent Technology</source>
          , pages
          <volume>536</volume>
          {
          <fpage>539</fpage>
          . IEEE Computer Society,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Markus</given-names>
            <surname>Kro</surname>
          </string-name>
          <article-title>tzsch, Denny Vrandecic, and Max Volkel. Semantic MediaWiki</article-title>
          .
          <source>In Proceedings of the 5th International Semantic Web Conference (ISWC</source>
          <year>2006</year>
          ), volume
          <volume>4273</volume>
          of Lecture Notes in Computer Science, pages
          <volume>935</volume>
          {
          <fpage>942</fpage>
          . Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Christoph</surname>
            <given-names>Lange</given-names>
          </string-name>
          , Uldis Bojars, Tudor Groza, John G. Breslin, and
          <string-name>
            <given-names>Siegfried</given-names>
            <surname>Handschuh</surname>
          </string-name>
          .
          <article-title>Expressing Argumentative Discussions in Social Media Sites</article-title>
          .
          <source>In Proceedings of the ISWC2008 Workshop on Social Data on the Web (SDoW2008)</source>
          , volume
          <volume>405</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Alistair</given-names>
            <surname>Miles</surname>
          </string-name>
          and
          <string-name>
            <given-names>Sean</given-names>
            <surname>Bechhofer. SKOS Simple</surname>
          </string-name>
          <article-title>Knowledge Organization System Reference</article-title>
          .
          <source>W3C Working Draft 29 August</source>
          <year>2008</year>
          , World Wide Web Consortium,
          <year>2008</year>
          . http://www.w3.org/TR/2008/WD-skos-reference-
          <volume>20080829</volume>
          /.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. Richard Newman, Danny Ayers, and
          <string-name>
            <given-names>Seth</given-names>
            <surname>Russell</surname>
          </string-name>
          . Tag ontology,
          <year>December 2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Alexandre</given-names>
            <surname>Passant</surname>
          </string-name>
          and
          <string-name>
            <given-names>Philippe</given-names>
            <surname>Laublet</surname>
          </string-name>
          .
          <article-title>Meaning Of A Tag: A collaborative approach to bridge the gap between tagging and Linked Data</article-title>
          .
          <source>In Proceedings of the WWW2008 Workshop Linked Data on the Web (LDOW2008)</source>
          , volume
          <volume>369</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>Alexandre</given-names>
            <surname>Passant</surname>
          </string-name>
          and
          <string-name>
            <given-names>Philippe</given-names>
            <surname>Laublet</surname>
          </string-name>
          .
          <article-title>Towards an Interlinked Semantic Wiki Farm</article-title>
          .
          <source>In Third Semantic Wiki Workshop { The Wiki Way of Semantics</source>
          , volume
          <volume>360</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Sebastian</surname>
          </string-name>
          <article-title>Scha ert</article-title>
          .
          <article-title>IkeWiki: A Semantic Wiki for Collaborative Knowledge Management</article-title>
          .
          <source>In First International Workshop on Semantic Technologies in Collaborative Applications (STICA 06)</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Max</surname>
          </string-name>
          <article-title>Volkel and Eyal Oren</article-title>
          .
          <article-title>Towards a Wiki Interchange Format (WIF) - Opening Semantic Wiki Content and Metadata</article-title>
          .
          <source>In Proceedings of the First Workshop on Semantic Wikis - From</source>
          Wiki to Semantics (SemWiki-2006), volume
          <volume>206</volume>
          <source>of CEUR Workshop Proceedings. CEUR-WS.org</source>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>