<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Introducing a Diversity-Aware Drupal Extension</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Simon Hangl</string-name>
          <email>simon.hangl@sti2.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ioan Toma</string-name>
          <email>ioan.toma@sti2.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas Thalhammer</string-name>
          <email>andreas.thalhammer@sti2.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Innsbruck</institution>
          ,
          <addr-line>Technikerstr. 21a, A-6020 Innsbruck</addr-line>
        </aff>
      </contrib-group>
      <fpage>20</fpage>
      <lpage>24</lpage>
      <abstract>
        <p>This demonstration paper introduces a diversity-aware extension for the content management system Drupal. It shows how different aspects, such as automatically recognized entities, topics and sentiment scores can be leveraged in a Web user interface. We introduce a coherent approach that enables readers to navigate to further related articles. In particular, we demonstrate new ways to quickly grasp what the articles' sentiments are and which topics they cover before the actual click.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The Diversity Enricher module provides several functionalities that present and
process information that can be considered to enrich Drupal articles with more
diverse information.
One of the most important things about the Drupal extension is the fact that no
user interaction is needed to extract the necessary diversity information. This
information is generated by the Enrycher service3 which is publicly available.
Enrycher utilizes natural language processing techniques to extract diversity
information such as topics, sentiments, sentiment scores or named entities
captured by the article text. This information is then described by using SIOC [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
in combination with the Knowledge Diversity Ontology 4 (KDO) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
2.2
      </p>
      <p>Links to Related Articles and Topics
Figure 1 shows an overview on the extension. On the left hand side the original
article is presented, whereas on the right hand side the main functionality of the
extension is located. There, related articles within the Drupal database are listed,
split up according to their extracted overall sentiment. An article is considered
to be related, if it has at least one topic in common with the currently shown
one and is located in the same cluster of the Diversity-Aware Ranking Service5.</p>
      <p>
        The topics of the related articles can be shown by clicking on the + button
in front of the article titles. In addition, tags extracted from the currently shown
article are presented in a tag cloud below the related article’s tree. The size of
the respective tag is determined by its number of occurrences in the triple store.
Named entities are recognized within the text and get marked. It is possible to
click on all tags, named entities, and topics in order to get articles with the same
tag/topic (see Figure 2). As a further diversity feature, each article’s sentiment
is displayed between the title and the actual text.
3 Enrycher – http://enrycher.ijs.si
4 KDO – http://kdo.render-project.eu/
5 Diversity-Aware Ranking Service – http://ranking.render-project.eu
The diversity data produced by the Enrycher service can be exported in the
following formats: RDF+XML, JSON, and Turtle.
Another important function of the extension is the ability to import additional
articles into the Drupal database, if they are stored in an Sesame store and are
described with the SIOC [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and KDO [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] ontologies. This option is only available
through the administration interface.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Key technologies and implementation</title>
      <p>The Drupal extension makes use of several technologies and tools. This section
describes how the main parts of the Diversity Enricher Drupal extension
interact.</p>
      <p>Diversity Mining Web Services (Enrycher) The main functionality of the
Enrycher service has already been described in Section 2.1. However, the
Enrycher service could be replaced by any Web service that is SIOC and
KDO compliant. This means, that the service has to support a subset of the
SIOC and KDO functionalities - namely the extraction and proper output
of topics, sentiments and sentiment scores.</p>
      <p>Sesame triple store Sesame is used as data store back-end. All other
components operate on the Sesame store by using SPARQL queries. The Enrycher
service returns RDF data which can be directly submitted to the Sesame
store. The Diversity-Aware Ranking Service component and the Drupal tool
read from the store using a set of predefined queries.
Diversity-Aware Ranking Service This service is used to retrieve related
articles with differing sentiments. It operates on a Sesame triple store. The
core of the ranking service is a clustering algorithm that operates using a
distance metric based on topics and sentiment scores. Articles that have at
least one topic in common with the current article are preselected and then
clustered by topic. All articles that are in the same cluster as the currently
browsed one are then marked as related.</p>
      <p>Drupal Integration The tool is connected to Drupal with so-called hooks. An
implemented hook is called each time a certain event occurs. The hooks of
the Diversity Enricher module are
– New Article Created: As soon as an article is created, the raw text data
is submitted to Enrycher, which extracts the diversity information. This
information is then stored to the local Sesame store.
– Article Viewed: If the article is viewed the first time and it has been in the
database before the Drupal extension has been activated, this hook acts the
same as in the case for New Article Created. Additionally, the information
needed to present is generated (by using ranking, SPARQL queries) and then
presented beside the raw article text.
– Article Changed: If an article is changed, Enrycher is again asked for diversity
information and the store is updated with the new enrichment.
– Article Deleted: If the article is removed, all links to the diversity information
is deleted from the Drupal database.
4</p>
    </sec>
    <sec id="sec-3">
      <title>Related work</title>
      <p>The integration of semantic technologies into CMSs brings clear benefits
especially for improving search, integration and intelligent management of the
content. During the last years several approaches have been published on how
semantics can be used within CMSs in general and Drupal in particular.</p>
      <p>Since version 7, Drupal natively supports RDF representation of posts,
making use of vocabularies like SIOC, FOAF, Dublin Core, and SKOS. Although the
new RDF module in Drupal easily enables publishing LOD, it does not provide
means for the automatic creation of links to relevant LOD resources.</p>
      <p>
        The approaches described in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] enable the production and
consumption of Linked Data in CMSs. In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], two Drupal modules are introduced, one for
creating RDFa annotations and another one for generating a SPARQL endpoint
for any Drupal site out of the box. The RDFa export module also enables
content providers to use their own vocabulary with RDF mappings management.
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] presents RDFaCE, a WYSIWYM (What You See Is What You Mean)
editor that extends traditional WYSIWYG editors by RDF statement and RDFa
output capabilities. This also enables the reuse of Linked Data sources such as
DBpedia. Both approaches focus on the manual or semi-automatic annotation
of articles with named entities and topics.
      </p>
      <p>VIE.js6 is a JavaScript-based semantic interaction framework. It facilitates
annotation and interaction with textual and RDFa-annotated content on Web
pages. It is used in combination with Apache Stanbol7 that supports the
extension of CMSs with semantic services. Another annotation framework is given by
the OpenCalais8 Drupal extension that uses the OpenCalais API of Thomson
Reuters to annotate posts with named entities, facts, and events.</p>
      <p>While the above approaches focus on the named entity or topic aspects, we
introduce a new dimension given by the active utilization of automatic sentiment
extraction. Eventually, this is expected to support the content creation and
perception process (given a more fine-grained sentiment and opinion extraction).
Also, in contrast to the above approaches, our approach focuses on providing a
complete and fully automatic cycle to support the management of diversity; from
text analysis and annotation to different visualization methods within Drupal.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Current Work</title>
      <p>We developed a diversity-aware Drupal extension coined Diversity Enricher. The
module is currently available at http://drupal.org/sandbox/sti-innsbruck/
1991696. As of the time of writing (i.e., August 12, 2013) the extension is within
a review process to achieve “full project status” within the http://drupal.org
Web portal. Amongst our next steps will be the qualitative evaluation of the
Diversity Enricher Drupal module.</p>
      <p>Acknowledgment This research was partly funded by the European Union’s
Seventh Framework Programme (FP7/2007-2013) under grant agreement no.
257790 (RENDER project).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. John G. Breslin, Andreas Harth, Uldis Bojars, and
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Decker</surname>
          </string-name>
          .
          <article-title>Towards semantically-interlinked online communities</article-title>
          .
          <source>In The Semantic Web: Research and Applications</source>
          , volume
          <volume>3532</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>500</fpage>
          -
          <lpage>514</lpage>
          . Springer Berlin Heidelberg,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Stephane</given-names>
            <surname>Corlosquet</surname>
          </string-name>
          , Renaud Delbru, Tim Clark, Axel Polleres, and
          <string-name>
            <given-names>Stefan</given-names>
            <surname>Decker</surname>
          </string-name>
          .
          <article-title>Produce and consume linked data with drupal!</article-title>
          <source>In Proc. of the 8th Intl. Semantic Web Conf. (ISWC2009), Lecture Notes in Computer Science</source>
          , pages
          <fpage>763</fpage>
          -
          <lpage>778</lpage>
          . Springer,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Ali</given-names>
            <surname>Khalili</surname>
          </string-name>
          , S¨oren Auer, and
          <string-name>
            <given-names>Daniel</given-names>
            <surname>Hladky</surname>
          </string-name>
          .
          <article-title>The rdfa content editor - from wysiwyg to wysiwym</article-title>
          .
          <source>In Proc. of COMPSAC</source>
          <year>2012</year>
          , pages
          <fpage>531</fpage>
          -
          <lpage>540</lpage>
          . IEEE Computer Society,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Thalhammer</surname>
          </string-name>
          , Ioan Toma, Rakebul Hasan, Elena Simperl, and Denny Vrandeˇci´c.
          <article-title>How to represent knowledge diversity. Poster at the 10th intl</article-title>
          .
          <source>Semantic Web Conf. (ISWC2011)</source>
          ,
          <year>10 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>6 VIE.js Semantic Interaction Framework - http://viejs.org/</mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>7 Apache Stanbol - http://stanbol.apache.org/</mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          8
          <string-name>
            <given-names>OpenCalais</given-names>
            <surname>Drupal</surname>
          </string-name>
          module - http://drupal.org/project/opencalais
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>