<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>\Vernetzte Kirche": Building a Semantic Web</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>SÄoren Auer</string-name>
          <email>auer@informatik.uni-leipzig.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bart Pieterse</string-name>
          <email>bartel.pieterse@elkb.de</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Leipzig</institution>
          ,
          <addr-line>04109 Leipzig</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Vernetzte Kirche</institution>
          ,
          <addr-line>Marsstr. 19, 80335 MuÄnchen</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>The only possibility for federally structured organizations and communities to enable consistent views on their meta-data and contents is to establish methods to gather the meta-data from the distributed peers, integrate it into a common conceptual model and ¯nally make it accessible to humans and software systems. We present the approach taken to implement this strategy for the over 2000 a±liated organizations of the Bavarian Lutheran Church. It showcases how di®erent Semantic Web standards, vocabularies, and methodologies can be coherently integrated into a consistent framework delivering added value to the large audience of participating peers.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The Lutheran Church in the German federal state Bavaria has around 2000
af¯liated clerical, cultural or charitable communities, organizations, and societies.
All of them have their separate organizational structure with di®erent spheres
of activities, websites, contact information and various other information.</p>
      <p>Unlike strictly hierarchical structured organizations such as business
cooperations, for organizations which pro¯t most from their social and cultural diversity,
such as the Lutheran Church, the usage of a single uni¯ed data model or system
(e.g. content or knowledge management system) to manage such information
will be impossible.</p>
      <p>Nevertheless it is highly desirable to give people an overview of lutheran
organizations, activities, and content related to a geographical region or to a
speci¯c topic. Thus a web portal was requested interlinking all these resources
and making them easily accessible for interested parties within and outside the
Lutheran Church of Bavaria.</p>
      <p>To reach this aim a Semantic Web application was envisioned, which collects
information and meta data from distributed sources with diverse ownerships,
integrates them into a common extensible conceptual model, which should be
¯nally represented on the Web, exposing as many semantic relations as possible.
An additional aim is to establish a meta-data initiative focused on religious
related content and to support individuals and organizations accordingly with
its application.</p>
      <p>
        The implementation of a prototype was realized by the department
\Vernetzte Kirche" within the Bavarian Lutheran Church in cooperation with the
University of Leipzig. The implementation is based on the Semantic Web
application development framework Powl [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Powl itself makes use of the Web
technologies with greatest deployed base (PHP and MySQL) and enables the
rapid development of Semantic Web applications as well as appropriate
ontologies.
      </p>
      <p>The \Vernetzte Kirche" use case of Semantic Web technologies is presented
here as follows: We ¯rst describe the information structure used to represent the
various types of information. In Section 3 we give account how this information
structure is populated with concrete data from the distributed, heterogeneous
sources. We elaborate in Section 4 on ways to make this information accessible
to humans and software systems. Finally we give some concluding remarks and
an outlook on planned enhancements in Section 5.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Structuring Information</title>
      <p>The ¯rst challenge was to ¯nd an integrated conceptual model for the information
about relevant organizations and resources, which should be °exible enough to
allow consistent integration of new information as it arises.</p>
      <sec id="sec-2-1">
        <title>Content</title>
      </sec>
      <sec id="sec-2-2">
        <title>Classifications</title>
        <p>Schema ontologies
aligned
…
…
…
Instance ontologies
Syncronize</p>
      </sec>
      <sec id="sec-2-3">
        <title>LDAP</title>
      </sec>
      <sec id="sec-2-4">
        <title>Webservices</title>
        <p>Authenticate
Syndicate</p>
      </sec>
      <sec id="sec-2-5">
        <title>NetAPI</title>
      </sec>
      <sec id="sec-2-6">
        <title>SKOS</title>
      </sec>
      <sec id="sec-2-7">
        <title>FOAF</title>
        <p>Geo</p>
      </sec>
      <sec id="sec-2-8">
        <title>Dublin Core vCard</title>
        <p>Vocabularies</p>
        <p>
          Because of the heterogenic structure of this information an ontology seems to
be more adequate to capture it than for example a ¯xed database schema. Such
an ontology was developed in OWL [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] using Powl's schema editor. Its overall
ontological structure is depicted in Figure 1 and described in more detail in the
remainder of this section.
        </p>
        <p>The main concepts (or classes) were quickly identi¯ed: Organizations and
Content. The class Content is categorized into the subclasses Images, Books,
News, Events and Journals; additional classes can be added as needed. The
classi¯cation of organizations though is more complicated, since every organization
can be classi¯ed along di®erent taxonomies:
{ Regional,
² By German administrative units (federal state; district; region;
community or city),
² By established regional church structure (regional church; deanery;
clerical district; religious community),
{ Sphere of activity (e.g. education, arts, and culture; administration; souls
care),
{ Type of organization (e.g. choir; community; publishing house),
{ Topics and keywords.</p>
        <p>
          For the spheres of activity, types of organizations, topics and keywords
separate classes containing taxonomies of instances are introduced. The taxonomic
structure is encoded using a functional object property subTypeOf having the
respective class as domain and range. An elaborated keyword taxonomy was
created using the SKOS [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] methodology and much inspired by the IPTC
subject reference system [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. This allows for organizations and content to(such as
images provided by the photo service \FotoBayern" 1 or news provided by RSS
feeds) to be annotated in a consistent manner.
        </p>
        <p>Address information for a speci¯c organization is used in conjunction with
the data of OpenGeoDB 2 to calculate its geographical coordinates and relate
the instance to the regional classi¯cations. The geographical vocabulary (with
the namespace http://www.w3.org/2003/01/geo/wgs84_pos#) is used here to
capture this information within the ontology.</p>
        <p>For initial acquisition of instance data Powl's feature to import rows from an
Excel sheet as instances was very bene¯cial, since most of the information was
already available within conventional O±ce documents. As Powl enables the
customized generation of reports in spreadsheet compatible formats the traditional
storing of information within Excel documents became needless.</p>
        <p>The complete ontology schema is available from the \Vernetzte Kirche" web
site 3 and intended to be reusable by other established regional churches or
similar federally structured associations. Currently, it contains around 70 classes
and 40 properties.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Continuous Acquisition of Information</title>
      <p>
        To populate the information structure presented in the last section, instance data
has to be continuously acquired, synchronized, or updated. For content related
meta-data, specialized acquisition methods have been developed:
1 http://www.fotoBayern.de
2 http://opengeodb.de/
3 http://www.vernetzte-kirche.de
{ Images - meta data is envisioned to be integrated into web-accessible image
¯les according to the proposed \Vernetzte Kirche" classi¯cations. Common
image formats, such as JPEG or PNG o®er the ability to save meta-data
within the ¯le in custom text formats. Images and meta-date thus can be
gathered from speci¯ed web sources, such as the photo service \FotoBayern".
{ News - such as from publications as the \Sonntagsblatt"4 or the lutheran
press service EPD5 are syndicated by accessing respective RSS feeds. News
feeds are encouraged to use the \Vernetzte Kirche" or IPTC classi¯cations
to tag their news.
{ Events - for events as concerts, religious services or courses a specialized
database and administration portal \Evangelische Termine"6 existed.
Information is exchanged by establishing a synchronization between the database
schema and the ontology in the spirit of D2RQ [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>To continuously acquire instance data about organizations a distributed
ownership strategy was selected:
1. Organizations integrate meta information about them self inside the HTML
header of their website.
2. A crawler regularly gathers this meta data and updates existing instances
accordingly.</p>
      <p>
        To ease the task of integrating meta information into the organization
websites a meta-tag generator was developed. A publisher thus has to complete a
simple web form to generate an HTML header snippet for integration into the
organizations website. The integration of meta data and its extraction is
implemented according to the GRDDL mechanism [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The following code shows an
example meta tag snippet for integration into an HTML header:
&lt;meta name="DC.title" content="Vernetzte Kirche" /&gt;
&lt;meta name="DC.creator" content="Bart Pieterse" /&gt;
&lt;meta name="DC.subject" content="Semantisch Vernetzte Kirche" /&gt;
&lt;meta name="DC.description" content="Informationen zur AG
Vernetzte Kirche der Evangelisch-Lutherischen Kirche in
Bayern und semantische Suche nach ELKB Resourcen" /&gt;
&lt;meta name="DC.publisher" content="Vernetzte Kirche" /&gt;
&lt;meta name="DC.date" content="2005-05-03" /&gt;
&lt;meta name="DC.language" content="de" /&gt;
&lt;meta name="DC.coverage" content="Bayern" /&gt;
&lt;meta name="DC.rights" content="GFDL" /&gt;
&lt;meta name="geo.lat" content="48.1333" /&gt;
&lt;meta name="geo.long" content="-11.5833" /&gt;
&lt;meta name="geo.country" content="BRD" /&gt;
&lt;meta name="dmoz.id" content="" /&gt;
&lt;meta name="keywords" content="Vernetzte Kirche, Semantic Web,
      </p>
      <p>ELKB, semantische Suche" /&gt;
&lt;meta name="robots" content="follow" /&gt;
&lt;meta name="revisit-after" content="20 days" /&gt;
4 http://www.sonntagsblatt.de
5 http://epd.de
6 http://www.evangelische-termine.de/</p>
      <p>
        By using attributes of commonly used vocabularies (such as Dublin Core [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ],
vCard [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], FOAF [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], Geo [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]) this meta information can be easily interpreted
by other (search) agents as well.
      </p>
      <p>Since some, user and organization related information, is held in an LDAP
directory, for extranet authentication purposes, a synchronization between distinct
instance and property values and LDAP nodes was implemented.</p>
      <p>The overall instance data related to the Bavarian Lutheran Church contains
currently around 5,000 instances, accounting for 60,000 triples.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Sharing the Information</title>
      <p>
        After ¯lling the meta-data repository, this information should be shared with
\human" and arti¯cial agents. For machine consumption access is provided by
Web-Services or the NetAPI [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], which allows to share RDF ontology
(fragments) by using the HTTP protocol. For humans a bunch of di®erent, domain
or topic speci¯c views on the data is planned. Initially Powl's application
programming interface is used to present the information in a portal style way on
the web site http://www.vernetzte-kirche.de. Users are enabled to e±ciently
search and ¯lter the instance data in the ontology.
      </p>
      <p>The user interface of the web portal consists of two di®erent types of pages:
overview pages with lists of instances and detail pages, exposing the attributes
of a distinct instance with all the speci¯cs. The overview pages for organizations
follow a three column layout (as depicted in Figure 2):
{ the left part contains the regional ¯ltering options (by German
administrative units or established regional church structure),
{ in the middle part the list of organizations in the selected region (and
according to optionally applied ¯lters) is presented,
{ on the right the results can be ¯ltered according to available taxonomical
categorizations available for organizations in the selected region.</p>
      <p>By choosing the details link for an organization in the list the user reaches
a web page combining all information about that organization. The calculated
geo coordinates for a speci¯c organization are used to render a map with the
location highlighted and to get a list of nearby organizations.</p>
      <p>A full-text search is provided, to query the data for literal descriptions
containing the search string. Results are grouped by the instances and ordered by
the occurrence frequency of the search string. Search results may be ¯ltered to
contain only resources which are instances of a distinct class (e.g. restrict the
search to organizations or books) or which are described by the literal only in
conjunction with a distinct property (e.g. dc:author). This semantic search has
signi¯cant advantages compared to conventional full-text searches, since it
provides important feedback to the searching user on how to successfully re¯ne the
search.</p>
      <p>
        The portal framework further allows to export or extract complete
metadata sets or information fragments by NetAPI [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] or Web-Service interfaces.
This strategy is in particular used to syndicate content and authenticate
participating organizations and individuals at partnering web sites. The photo service
\FotoBayern" for example gives discounts to individuals and organizations
collaborating within the \Vernetzte Kirche" project.
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion and Outlook</title>
      <p>Semantic Web knowledge representation standards and especially the generic
Semantic Web application development framework Powl enabled the project
\Vernetzte Kirche" to rapidly develop an initial version of the ontology and a
prototype of the web portal with minimized resource and cost demands.</p>
      <p>Though already publicly accessible the application is still in Beta stage. First
experiences with real users showed the practicability of the approach. Currently,
it is worked on the transition into an easy to use, productive, scalable industry
strength system. To achieve this goal usability and documentation as well as the
stability of the crawling solution has to be improved further. It is additionally
planned to implement enhanced client-side meta-data annotation and extraction
methods with the focus for usage within the \Vernetzte Kirche" project.</p>
      <p>Summarizing we can state that the project showcases how di®erent Semantic
Web standards, vocabularies, and methodologies can be coherently integrated
into an consistent framework delivering added value to a large audience of
participating peers.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>SÄoren</given-names>
            <surname>Auer</surname>
          </string-name>
          .
          <article-title>Powl: A web based platform for collaborative semantic web development</article-title>
          .
          <source>In SÄoren Auer</source>
          , Chris Bizer, and Libby Miller, editors,
          <source>Proceedings of the Workshop Scripting for the Semantic Web, number 135 in CEUR Workshop Proceedings</source>
          , Heraklion, Greece, 05
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Sean</given-names>
            <surname>Bechhofer</surname>
          </string-name>
          , Frank van Harmelen,
          <string-name>
            <surname>Jim Hendler</surname>
          </string-name>
          , Ian Horrocks, Deborah L.
          <string-name>
            <surname>McGuinness</surname>
          </string-name>
          ,
          <string-name>
            <surname>Peter F. Patel-Schneider</surname>
          </string-name>
          ,
          <article-title>and Lynn Andrea Stein</article-title>
          .
          <article-title>Owl web ontology language reference</article-title>
          .
          <source>W3C Recommendation</source>
          (http://www.w3.org/TR/owlref/),
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Christian</given-names>
            <surname>Bizer</surname>
          </string-name>
          and
          <string-name>
            <given-names>Andy</given-names>
            <surname>Seaborne</surname>
          </string-name>
          .
          <article-title>D2rq -treating non-rdf databases as virtual rdf graphs (poster)</article-title>
          .
          <source>Poster at Third International Semantic Web Conference (ISWC2004)</source>
          , Hiroshima, Japan,
          <year>November 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Dan</given-names>
            <surname>Brickley</surname>
          </string-name>
          .
          <article-title>Rdf vocabulary for representing lat(itude), long(itude) and other information about spatially-located things</article-title>
          (
          <source>technical report)</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Dan</given-names>
            <surname>Brickley</surname>
          </string-name>
          and
          <string-name>
            <given-names>Libby</given-names>
            <surname>Miller</surname>
          </string-name>
          .
          <article-title>Foaf vocabulary speci¯cation</article-title>
          . http://xmlns.com/foaf/0.1/,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Dominique</surname>
            HazaÄel-Massieux and
            <given-names>Dan</given-names>
          </string-name>
          <string-name>
            <surname>Connolly</surname>
          </string-name>
          .
          <article-title>Gleaning resource descriptions from dialects of languages (grddl)</article-title>
          . http://www.w3.org/TR/grddl/, 13 April
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Renato</given-names>
            <surname>Iannella</surname>
          </string-name>
          .
          <source>Representing vcard objects in rdf/xml (technical report)</source>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Dublin Core Metadata Initiative.
          <article-title>Dcmi metadata terms</article-title>
          . http://dublincore.org/documents/dcmi-terms/,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Alistair</given-names>
            <surname>Miles</surname>
          </string-name>
          and
          <string-name>
            <given-names>Dan</given-names>
            <surname>Brickley</surname>
          </string-name>
          .
          <article-title>Skos core guide</article-title>
          . http://www.w3.org/TR/swbpskos-core-guide/,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>Andy</given-names>
            <surname>Seaborne</surname>
          </string-name>
          .
          <article-title>An RDF netAPI</article-title>
          .
          <source>In I. Horrocks and J</source>
          . Hendler, editors,
          <source>Proceedings of the First International Semantic Web Conference (ISWC2002)</source>
          , volume
          <volume>1201</volume>
          of Lecture Notes in Computer Science. Springer-Verlag GmbH, February 11
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Misha</given-names>
            <surname>Wolf</surname>
          </string-name>
          .
          <article-title>Metadata framework technical speci¯cation (iptc standards draft)</article-title>
          . http://iptc.org/pdl.php
          <article-title>?fn=DRAFT-NAR 1.0-spec-NMDF-TechSpec 6</article-title>
          .pdf,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>