<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Digital Archives</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Andreas Nürnberger, Otto-von-Guericke University of Magdeburg</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Annett Mitschick, University of Dresden</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Livia Predoiu, Otto-von-Guericke University of Magdeburg</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Seamus Ross, University of Toronto</institution>
          ,
          <country country="CA">Canada</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Steffen Hennicke, Humboldt University of Berlin</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2011</year>
      </pub-date>
      <abstract>
        <p>Vol-801 urn:nbn:de:0074-801-0</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Editors:
Copyright © 2011 for the individual papers by the papers’ authors. Copying permitted only
for private and academic purposes. This volume is published and copyrighted by its editors.</p>
    </sec>
    <sec id="sec-2">
      <title>Preface</title>
      <p>These proceedings are the result of an exciting workshop held in conjunction with the
first international conference on Theory and Practice of Digital Libraries, TPDL 2011,
formerly known as European Conference on Digital Libraries, ECDL. The name of
the workshop, Semantic Digital Archives – sustainable long-term curation
perspectives of Cultural Heritage (short: SDA 2011) already provides a first hint
towards its general topics and goals: to promote and discuss sophisticated knowledge
representation and knowledge management solutions specifically designed for
improving Archival Information Systems (AIS) and Archival Information
Infrastructures (AII).</p>
      <p>Over the past couple of decades, digitally created content has come to permeate all
aspects of our lives and the life cycle of these objects is increasingly exclusively
digital. A portion of this content can be expected to have enduring value as it delivers
insight into the contemporary trends and spirit of its time. Hence, it can be considered
being part of our cultural and scientific heritage. This vast corpus needs to be
appraised and items of enduring value selected, archived and kept accessible so that it
can be made available in response to requests from information professionals, and the
general public. Therefore, sustainable long-term curation perspectives for our digital
cultural heritage are essential. Digital content poses many socio-cultural and
technological challenges which create obstacles to long-term or indefinite
preservation. Changing technologies and shifting user communities as well as the
increasing complexity of digital content consisting of or being enriched with software
and multimedia attachments are only a few examples. Dealing with these challenges
was the central theme of the workshop.</p>
      <p>The workshop aimed to involve and stimulate discussions between the digital
archiving, the digital museums, the digital libraries and the semantic (web)
technologies communities. Archives, museums and libraries share a natural bond as
all three have a long history of experience with maintaining (storing and retrieving) a
large amount of objects, data and information. Hence, there is a lot potential for
crossfertilization between these related fields. Furthermore, libraries already started to
adopt semantic web technologies successfully as shown by various workshops and
conferences on this topic that recently have emerged. Most remarkably, also a W3C
incubator group on library linked data has been created. Hence, the workshop aimed
at fostering discussions about experiences and best practices of employing semantic
web technologies in the library domain yielding so called semantic digital libraries in
order to inspire and boost the adoption of semantic web technologies in the area of
digital archiving as well.</p>
      <p>The area of semantic (web) technologies is a broad scientific discipline that focuses
on providing promising technical solutions for knowledge representation and
knowledge management. It provides knowledge representation languages and
management technologies based on a solid artificial intelligence foundation and is
supported by appropriate W3C recommendations and a large user community. At the
forefront of making the semantic web a mature and applicable reality is the linked
data initiative. Using semantic (web) technologies in general and linked data in
particular can be expected to mature the area of digital archives as well and
technologically tighten the bond between digital libraries and digital archives.
Furthermore, digital archives and their users have special requirements that can also
inspire semantic (web) technologies research in general.</p>
      <p>The workshop was well accepted by the community and was able to attract 23
submissions from which we selected 13 papers with the help of our program
committee; giving an overall acceptance rate of 56%. The papers covered a broad
range of relevant topics in the area of semantic digital archives, bringing together
people from archives, museums, digital libraries and the semantic web as hoped and
expected. A lot of different research projects are represented in these proceedings, e.g.
the KEEP project (W. Bergmeyer), SHAMAN (J. Brunsman, K. Qian et al.), Semlib
(C. Morbidoni et al.), ASPI (C. Cortese and G. Mantegari), Europeana (S. Hennicke
et al.) and EUscreen (J. Oomen and V. Tzouvaras). Some of the papers that have been
presented during the workshop are very data-oriented and focus on a specific kind of
data to be preserved, maintained or kept accessible, like computer games (W.
Bergmeyer), metadata on products in a company (J. Brunsmann), digital libraries in
general (C. Morbidoni et al.), archival data in general (C. Cortese and Mantegari, S.
Mazzini and F. Ricci, S. Hennicke et al.) and pictures of museum items (T. Wray and
P. Eklund). Other papers focus on a general approach like the paper by A. Schröder et
al. who present a novel and promising approach for semantic hierarchical storage
management. Another example for a paper that focuses on a general approach is the
paper by Kai Eckert who proposes a basic linguistic indexer for digital libraries.</p>
      <p>The workshop started with an invited talk on The KEEP emulation framework (W.
Bergmeyer) which is also contained as publication in this volume. In this publication,
W. Bergmeyer presents the KEEP (Keeping Emulation Environments Portable)
project which is a research project of the European 7th Framework Programe. During
the workshop, a demo of the KEEP emulation software framework has been shown.
This talk brought the general trend of emulation as a preservation strategy which is
currently the method of choice when preserving software tools and multimedia
systems into the discussions of the workshop. Afterwards, focusing on hardware as
well, a semantic extension of a hierarchical storage management system for small
and medium-sized enterprises (A. Schröder et al.) has been discussed. Since such a
system saves costs, capacity and access time, it can be especially useful in large
digital archiving frameworks and infrastructures in order to distribute, store and
retrieve semantically coherent archival data.</p>
      <p>In the submission about the semantic exploration of archived product lifecycle
metadata under schema and instance evolution (J. Brunsmann), J. Brunsmann brings
a new view into the discussion since he considered a holistic approach for maintaining
the life cycle of linked data describing obsolete product ideas within a company
archive. Hence, he introduces an interesting application field for semantic digital
archives.</p>
      <p>The paper Towards a semantic data library for the social sciences (T. Grotton et
al.) brings a very interesting preliminary approach for a linked library data
infrastructure for statistical data in the social sciences into the discussion. Although
this work does not consider digital archiving and is on a very preliminary state, it
provides an insight into a statistical semantic digital library infrastructure and hence
stimulated the discussion on semantic digital libraries versus semantic digital
archives. More information on semantic digital libraries is provided by the paper on
introducing the Semlib project: semantic web tools for digital libraries (C. Morbidoni
et al.) which describes an annotation system for digital libraries. The proposed system
adds user interaction to digital libraries via annotation and provides semantic structure
to such annotations as well.</p>
      <p>The paper LOHAI: Providing a baseline for KOS based automatic indexing (K.
Eckert) proposes a free, open source and easy to use indexer tool for KOSs. This tool
can provide the fundament on which to build more ambitious tools; although it has
been developed for digital libraries, it can be used in other contexts like digital
archiving contexts as well.</p>
      <p>The publication on extending the digital archives of italian psychology with
semantic data (C. Cortese and G. Mantegari) discusses an approach for
implementing a semantic digital archive using CIDOC CRM for ontology modeling.
Similarly, the paper on EAC-CPF Ontology and Linked Archival Data (S. Mazzini
and F. Ricci) presents a topic that is relevant for digital archiving. More specifically,
the development of an ontology is described that corresponds semantically to the
EAC-CPF schema which is an archival standard for modelling and describing
individuals, families and corporate bodies that create, preserve, use and are
responsible for and/or associated with records in a variety of ways. A related topic is
discussed in the submission about the conversion of EAD into EDM linked data (S.
Hennicke et al.) as it deals with integrating archival finding aids into the portal of the
Europeanna project which is an ambitious european project aiming at integrating data
and information of museum, archives and libraries in one semantic web enhanced
portal.</p>
      <p>With Concepts and Collections: A case study using objects from the Brooklyn
Museum (T. Wray and P. Eklund), an interesting approach for a browsing framework
for digitised cultural collections based on Formal Concept Analysis has been
presented. This framework has also been evaluated with a case study using real data
of the Brooklyn museum which nicely demonstrates that appropriate NLP techniques
can be used to extract formal contexts from textual resources.</p>
      <p>By the paper Publishing Europe's television heritage on the web (J. Oomen and V.
Tzouvaras), the first results of the European project EUScreen that deals with
aggregating television heritage from European television archives for the European
digital library Europeana have been presented.</p>
      <p>Another very interesting paper is A security contextualization framework for
digital long-term preservation (K. Qian et al.) as it is concerned with semantic
security policies for digital archives. The approach extends the OAIS standard with
security related features. Hence, an often neglected but crucial aspect in digital
archiving is considered when establishing policies and infrastructures for long-term
preservation.</p>
      <p>The submission on DA-NRW: A distributed architecture for long-term
preservation (M. Thaller et al.) presents an ongoing project that aims at creating a
digital archive or long-term repository for the German state of North-Rhine
Westphalia. This system will have a kind of sandwich position having to ingest data
of depositors being archives and act as a pre-aggregator for portals like the Deutsche
Digitale Bibliothek or Europeana. A similar topic is dealt with by the paper on RDFa
as a lightweight metadata interoperability layer between repository software and
LOCKSS (F. Ostrowski) as it considers the extension of the LOCKSS framework with
RDF and ontologies using SPARQL endpoints.</p>
      <p>We would like to thank all members of the program committee for supporting us in
the reviewing process. Altogether, the diversity of the papers in these proceedings
represent a multitude of interesting facets about the new, exciting and promising
research field of semantic digital archives and semantic digital archiving
infrastructures. Hence, these proceedings provide a good and conclusive overview of
the current research in this area.</p>
    </sec>
    <sec id="sec-3">
      <title>Organization</title>
    </sec>
    <sec id="sec-4">
      <title>Program Chairs</title>
      <sec id="sec-4-1">
        <title>Livia Predoiu</title>
        <p>Steffen Hennicke
Andreas Nürnberger
Annett Mitschick
Seamus Ross</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Program Committee</title>
      <sec id="sec-5-1">
        <title>Sören Auer</title>
        <p>Kai Eckert
Armin Haller
Stijn Heymans
Pascal Hitzler
Yannis E. Ioannidis
Christian Keitel
Thomas Lukasiewicz
Knud Möller
Vit Novacek
Johan Oomen
Jacco van Ossenbruggen
Daniel Pitti
Andreas Rauber
Thomas Risse
Sebastian Rudolph
Francois Scharffe
Michael Seadle
Marc Spaniol</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Additional Reviewers</title>
      <sec id="sec-6-1">
        <title>Thomas Low Magnus Pfeffer</title>
      </sec>
      <sec id="sec-6-2">
        <title>Otto-von-Guericke University of Magdeburg,</title>
        <p>Germany
Humboldt University of Berlin, Germany
Otto-von-Guericke University of Magdeburg,
Germany
University of Dresden, Germany
University of Toronto, Canada</p>
      </sec>
      <sec id="sec-6-3">
        <title>University of Leipzig, Germany University Library of Mannheim, Germany CSIRO, Australia SemanticBits, USA</title>
        <p>Wright State University, USA
University of Athens, Greece
State Archive of Baden-Württemberg, Germany
University of Oxford, UK
DERI Galway, Ireland
DERI Galway, Ireland
Netherlands Institute for Sound &amp; Vision,
Netherlands
VU University Amsterdam, Netherlands
University of Virginia, USA
Vienna University of Technology, Austria
L3S Research Center Hannover, Germany
Karlsruher Institut für Technologie, Germany
University of Montpellier, France
Humboldt University of Berlin, Germany
Max-Planck-Institut Saarbrücken, Germany</p>
      </sec>
      <sec id="sec-6-4">
        <title>Otto-von-Guericke University of Magdeburg, Germany HdM Stuttgart, Germany</title>
        <sec id="sec-6-4-1">
          <title>Hardware and Product Lifecycle Support</title>
        </sec>
        <sec id="sec-6-4-2">
          <title>Linked Data Infrastructures and Ontologies</title>
          <p>Extending the Digital Archives of Italian Psychology with Semantic Data …......... 60
Claudio Cortese and Glauco Mantegari</p>
        </sec>
        <sec id="sec-6-4-3">
          <title>Digital Libraries and Museums</title>
          <p>Introducing the Semlib Project: Semantic Web Tools for Digital Libraries …......... 97
Christian Morbidoni, Marco Grassi, Michele Nucci, Simone Fonda and
Giovanni Ledda
LOHAI: Providing a Baseline for KOS based Automatic Indexing ….................... 121
Kai Eckert</p>
        </sec>
        <sec id="sec-6-4-4">
          <title>Archiving Frameworks and Infrastructures</title>
          <p>A Security Contextualisation Framework for Digital Long-Term Preservation ….. 131
Kun Qian, Maik Schott, Christian Kraetzer, Matthias Hemmje, Holger
Brocks and Jana Dittmann
DA-NRW: A Distributed Architecture for Long-Term Preservation ….................. 143
Manfred Thaller, Sebastian Cuy, Jens Peters, Daniel de Oliveira and
Martin Fischer</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>The</given-names>
            <surname>Keep Emulation</surname>
          </string-name>
          <string-name>
            <surname>Framework …........................................................................................</surname>
          </string-name>
          8 Winfried Bergmeyer
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>A</given-names>
            <surname>Semantic</surname>
          </string-name>
          <article-title>Extension of a Hierarchical Storage Management System for Small and Medium-sized</article-title>
          <string-name>
            <given-names>Enterprises …................................................................... 23 Axel</given-names>
            <surname>Schröder</surname>
          </string-name>
          , Ronny Fritzsche, Sandro Schmidt, Annett Mitschick and Klaus Meißner
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>Semantic Exploration of Archived Product Lifecycle Metadata under Schema and Instance Evolution ….</article-title>
          <string-name>
            <surname>...........................................................................</surname>
          </string-name>
          37 Jörg Brunsmann
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>Towards a Semantic Data Library for the Social Sciences ….</article-title>
          <string-name>
            <given-names>.................................. 48 Thomas</given-names>
            <surname>Gottron</surname>
          </string-name>
          , Christian Hachenberg, Andreas Harth and Benjamin Zapilko
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>EAC-CPF Ontology</surname>
            and
            <given-names>Linked Archival Data ….................................................... 72 Silvia</given-names>
          </string-name>
          <string-name>
            <surname>Mazzini</surname>
          </string-name>
          and Francesca Ricci
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <article-title>Conversion of EAD into EDM Linked Data ….</article-title>
          <string-name>
            <given-names>........................................................ 82 Steffen</given-names>
            <surname>Hennicke</surname>
          </string-name>
          , Marlies Olensky, Viktor de Boer, Antoine Isaac and Jan Wielemaker
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Publishing</given-names>
            <surname>Europe's Television</surname>
          </string-name>
          <article-title>Heritage on the Web ….</article-title>
          <string-name>
            <given-names>.......................................... 89 Johan</given-names>
            <surname>Oomen</surname>
          </string-name>
          and Vassilis Tzouvaras
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Concepts</surname>
          </string-name>
          and
          <article-title>Collections: A Case Study using Objects from the Brooklyn</article-title>
          <string-name>
            <given-names>Museum .................................................................................................................... 109 Tim</given-names>
            <surname>Wray</surname>
          </string-name>
          and Peter Eklund
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <article-title>RDFa as a Lightweight Metadata Interoperability Layer between Repository Software</article-title>
          and
          <string-name>
            <surname>LOCKSS …........................................................................................</surname>
          </string-name>
          150 Felix Ostrowksi
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>