<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Digital Archives</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Andreas Nürnberger, Otto‐von‐Guericke University Magdeburg</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Annett Mitschick, Technische Universität Dresden</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Fernando Loizides, Cyprus University of Technology</institution>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Livia Predoiu, Otto‐von‐Guericke University Magdeburg</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>Seamus Ross, University of Toronto</institution>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2012</year>
      </pub-date>
      <abstract>
        <p>held in conjunction with the 16th Int. Conference on Theory and Practice of Digital Libraries (TPDL)</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Copyright © 2012 for the individual papers by the papers’ authors. Copying permitted only for private and
academic purposes. This volume is published and copyrighted by its editors.
Preface
The 2nd Workshop on Semantic Digital Archives (SDA 2012) builds upon the success of the
previous edition in 2011, held in conjunction with the International Conference on Theory and
Practice of Digital Libraries, TPDL 2011 (formerly known as European Conference on Digital
Libraries, ECDL). Organized as full‐day workshop, SDA 2012 aims to advance and discuss
appropriate knowledge representation and knowledge management solutions specifically
designed for improving Archival Information Systems. The main objective is to have a closer
dialogue between the technical oriented communities with people from the (digital) humanities
and social sciences, as well as cultural heritage institutions in general in order to approach the
topic from all relevant angles and perspectives. This workshop is an exciting opportunity for
collaboration and cross‐fertilization.</p>
      <p>Intending to have an open discussion on topics related to the general subject of Semantic Digital
Archives, we invited contributions that focus on one of the following topics:
 Ontologies &amp; linked data for digital archives and digital libraries (incl. multimedia archives)
 Semantic search &amp; semantic information retrieval in digital archives and digital libraries (incl.</p>
      <p>multimedia archives)
 Implementations and evaluations of semantic digital archives
 Theoretical and practical archiving frameworks using Semantic (Web) technologies
 Semantic or logical provenance models for digital archives or digital libraries
 Visualization and exploration of content in large digital archives
 User interfaces for semantic digital libraries and intelligent information retrieval
 User studies focusing on end‐user needs and information seeking behavior of end‐users
 Semantic (Web) services implementing the OAIS standard
 Logical theories for digital archives
 Knowledge evolution
 Information integration/semantic ingest (e.g. from digital libraries)
 Trust for ingest &amp; data security/integrity check for long‐term storage of archival records
 Semantic extensions of emulation/virtualization methodologies for digital archives
 Semantic long‐term storage and hardware organization tailored for digital archives
 Migration strategies based on Semantic (Web) technologies
We received submissions covering a broad range of relevant topics in the area of semantic
digital archives. With the help of our program committee all articles were peer‐reviewed. These
proceedings comprise all accepted submissions which have been carefully revised and enhanced
by the authors according to the reviewers’ comments.</p>
      <p>These papers were joined by an invited keynote by Andreas Rauber (Vienna University of
Technology, Austria). In Digital Preservation in Data‐Driven Science: On the Importance of Process
Capture, Preservation and Validation he points out the necessity of capturing and documenting
processes (in addition to the context of data objects), especially in e‐Science and business
settings, and presents an approach for process preservation and verification upon later re‐
execution.</p>
      <p>The paper Entity Extraction and Consolidation for Social Web Content Preservation (S. Dietze et
al.) presents an approach to extract and consolidate information from archived social Web
content in order to facilitate semantic search of Web archives. The work was developed in the
EC‐funded Integrating Project ARCOMEM.
With Do we need metadata? ‐ An On‐line Survey in German Archives Marcel Ruhl presents the
revealing results of a survey among German archives regarding the use of metadata standards
for the annotation of audiovisual media.</p>
      <p>The paper Automatic Classification of Scientific Records using the German Subject Heading
Authority File (Ch. Wartena &amp; M. Sommer) introduces an approach to assign subject
classifications to records without using machine learning techniques but by the application of
the German Subject Heading Authority File (SWD).</p>
      <p>In Pundit: Semantically Structured Annotations for Web Contents and Digital Libraries (M. Grassi
et al.) the authors propose an annotation system, developed in the context of the Semlib project,
which provides the user with the ability to annotate distributed resources, i.e. multimedia
content published on the Web, using an extension of the Open Annotation Collaboration (OAC)
ontology.</p>
      <p>The paper Towards a Recommender System for Statistical Research Data (D. Bahls et al.) presents
the conceptual ideas and the system architecture of a case‐based recommender system for
statistical data used in scientific research, and discusses possible similarity measures and
notification services.</p>
      <p>With A method and guidelines for the cooperation of ontologies and relational databases in
Semantic Web applications L. Bozzato et al. showcase a methodology for mapping relational data
to an ontology structure to support SPARQL queries and inference and to take advantage of the
representation possibilities offered by both data models.</p>
      <p>A critical issue when developing RDF‐based semantic archives is the right choice of an
appropriate large‐scale storage solution for the data. Yet Another Triple Store Benchmark?
Practical Experiences with Real‐World Data (M. Voigt et al.) presents the experimental setting
and the results of extensive performance tests of state‐of‐the‐art RDF stores using non‐synthetic
RDF datasets.</p>
      <p>Finally, the paper Implementing CIDOC CRM Search Based on Fundamental Relations and OWLIM
Rules (V. Alexiev) proposes an approach to provide a higher‐level perspective on RDF data by
mapping complex sub‐graph patterns to simpler, more abstract descriptions using OWLIM rules.
The author presents an implementation of the concept regarding search with the CIDOC
Conceptual Reference Model.</p>
      <p>We sincerely thank all members of the program committee for supporting us in the reviewing
process. Altogether, the diversity of the papers in these proceedings represent a multitude of
interesting facets about the exciting and promising research field of semantic digital archives
and semantic digital archiving infrastructures.</p>
      <p>We would also like to thank Sun SITE Central Europe for hosting these proceedings on
http://ceur‐ws.org.</p>
    </sec>
    <sec id="sec-2">
      <title>September 2012</title>
      <p>A. Mitschick, F. Loizides, L. Predoiu, A. Nürnberger, and S. Ross</p>
    </sec>
    <sec id="sec-3">
      <title>Vassilis Christophides</title>
    </sec>
    <sec id="sec-4">
      <title>Kai Eckert Armin Haller Steffen Hennicke</title>
    </sec>
    <sec id="sec-5">
      <title>Stijn Heymans Pascal Hitzler Christian Keitel Birger Larsen</title>
    </sec>
    <sec id="sec-6">
      <title>Thomas Lukasiewicz Mathias Lux Knud Möller Kai Naumann</title>
      <p>Jacco van Ossenbruggen
Andreas Rauber
Thomas Risse
Sebastian Rudolph
Mike Salampasis</p>
    </sec>
    <sec id="sec-7">
      <title>Herbert van de Sompel</title>
    </sec>
    <sec id="sec-8">
      <title>Marc Spaniol Manfred Thaller</title>
    </sec>
    <sec id="sec-9">
      <title>Foundation of Research &amp; Technology ‐ Hellas, Greece University Library of Mannheim, Germany CSIRO ICT Centre, Australia</title>
      <p>Humboldt‐Universität zu Berlin,
Germany
SRI International, USA
Wright State University, USA
State Archive of Baden‐Württemberg, Germany
Royal School of Library and Information Science,
Denmark
University of Oxford, UK
Klagenfurt University, Austria
Talis, Birmingham, UK
State Archive of Baden‐Württemberg, Germany
VU University Amsterdam, Netherlands
Vienna University of Technology, Austria
L3S Research Center, Hannover, Germany
Karlsruher Institut für Technologie, Germany
Alexander Technology Educational Institute of
Thessaloniki, Greece
Los Alamos National Laboratory Research Library,
USA
Max‐Planck‐Institut Saarbrücken, Germany
University of Cologne, Germany</p>
      <sec id="sec-9-1">
        <title>Digital Preservation and Metadata</title>
        <p>Do We Need Metadata? ‐ An On‐line Survey in German Archives ............................................................. 30
Marcel Ruhl
Invited Contribution: Digital Preservation in Data‐Driven Science: On the Importance of
Process Capture, Preservation and Validation .....................................................................................................7
Andreas Rauber
Entity Extraction and Consolidation for Social Web Content Preservation .......................................... 18
Stefan Dietze, Diana Maynard, Elena Demidova, Thomas Risse, Wim Peters, Katerina Doka and
Yannis Stavrakas</p>
      </sec>
      <sec id="sec-9-2">
        <title>Structuring and Recommendation</title>
        <p>Automatic Classification of Scientific Records using the German Subject Heading Authority File
(SWD) .................................................................................................................................................................................. 37
Christian Wartena and Maike Sommer
Pundit: Semantically Structured Annotations for Web Contents and Digital Libraries ................... 49
Marco Grassi, Christian Morbidoni, Michele Nucci, Simone Fonda and Giovanni Ledda
Towards a Recommender System for Statistical Research Data ................................................................ 61
Daniel Bahls, Guido Scherp, Klaus Tochtermann and Wilhelm Hasselbring</p>
      </sec>
      <sec id="sec-9-3">
        <title>Semantic Technologies and Ontologies</title>
        <p>A Method and Guidelines for the Cooperation of Ontologies and Relational Databases in Semantic
Web Applications ........................................................................................................................................................... 73
Loris Bozzato, Stefano Braghin and Alberto Trombetta
Yet Another Triple Store Benchmark? Practical Experiences with Real‐World Data ....................... 85
Martin Voigt, Annett Mitschick and Jonas Schulz
Implementing CIDOC CRM Search Based on Fundamental Relations and OWLIM Rules ................ 95
Vladimir Alexiev</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>