<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Biblissima's Prototype on Medieval Manuscript Illuminations and their Context</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Equipex Biblissima</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Campus Condorcet</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paris</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>stefanie.gehrke</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>eduard.frunzeanu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>pauline.charbonnier</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Objectives of the Biblissima Observatory</institution>
        </aff>
      </contrib-group>
      <fpage>43</fpage>
      <lpage>48</lpage>
      <abstract>
        <p>Biblissima is an online digital library, which provides easy and coordinated access to a huge and complex mass of documentation on manuscripts and early printed books, the texts contained therein, their circulation and their readers, from the 8th to 18th centuries. This workshop presentation will give an overview of the steps followed and decisions made along the way to releasing a rst prototype of the Biblissima portal: from mapping data to a common ontology, via the establishment of a thesaurus, to the technical development of a single interface and a common triple store for data deriving from di erent iconographic databases on medieval manuscripts.</p>
      </abstract>
      <kwd-group>
        <kwd>cidoc crm</kwd>
        <kwd>frbroo</kwd>
        <kwd>medieval manuscript</kwd>
        <kwd>illumination</kwd>
        <kwd>in- teroperability</kwd>
        <kwd>descriptors</kwd>
        <kwd>thesaurus</kwd>
        <kwd>historical place names</kwd>
        <kwd>semantic web</kwd>
        <kwd>linked data</kwd>
        <kwd>library</kwd>
        <kwd>Middle Ages</kwd>
        <kwd>Humanism</kwd>
        <kwd>Renaissance</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The two main components of the observatory are a cluster of the project's
data on manuscripts and early printed books currently found in as many as 40
databases in di erent formats and with di erent research interests (including
illuminated manuscripts, history of the transmission of texts and history of
collections) and a digital image repository. The databases will be interconnected
using semantic web technologies and linked to a platform for digital editions and
to the project's digital image repository.
1.1</p>
    </sec>
    <sec id="sec-2">
      <title>Semantic Web Solutions for Historical Data</title>
      <p>
        In order to handle the heterogeneity of the database formats (MySQL, EAD,
TEI P5, UNIMARC, etc.) and the variety of Biblissima's data (manuscript
cataloguing databases, textual editions, iconographic databases) we have chosen
to use the CIDOC Conceptual Reference Model (Comite International pour la
Documentation Conceptual Reference Model [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]) and FRBRoo (Functional
Requirements for Bibliographic Records object oriented [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) as framework for a
project-speci c extension of those ontologies that facilitates the internal
mapping to a single common model and allows the partners to expose their data in
RDF compliant to a globally established standard.
      </p>
      <p>CIDOC CRM is an accepted ISO standard (ISO 21127). As an event-centric
ontology it covers di erent phenomena in space and time like provenance,
copying of texts, creation of works and expressions, as well as the production of
information carriers and attribute assignments. As CIDOC CRM and FRBRoo
(which combines the CIDOC CRM approach with the common vocabulary for
the transmission of works (WEMI) that is provided by the FRBR model) are
generic models for the museum and library domains, it was decided to de ne
a few more speci c classes and properties related to manuscripts, early printed
books and illuminations. For example, within the scope of the Biblissima project
a medieval manuscript is an instance of the class bibma:Manuscript, which is a
subclass of frbroo:F4 Manifestation Singleton (\This class comprises
physical objects that each carry an instance of F2 Expression, and that were
produced as unique objects, with no siblings intended in the course of their
production"). An instance of a bibma:Manuscript might be composed of several parts
(bibma:Component) and might carry both text and illustrations.</p>
      <p>
        As regards the illustrations, there are several possible modelling solutions
in CIDOC CRM, such as E38 Image (\This class comprises distributions of
form, tone and colour that may be found on surfaces such as photos, paintings,
prints and sculptures or directly on electronic media") or its subclass E36 Visual
Item (\This class comprises the intellectual or conceptual aspects of recognisable
marks and images"). These solutions have been adopted both for book
illustrations by the \Illustrations of Goethe's Faust " project [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and for maps by the
\Carte de la nouvelle frontire Turco-Grecque " project [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In order to model the
illumination genre, we decided instead to de ne an illumination as an instance of
a class called bibma:Illumination, which is a subclass of E26 Physical Feature
(`This class comprises identi able features that are physically attached in an
integral way to particular physical objects"). The following RDF triple expresses
this relationship.
:c a bibma:Component .
      </p>
      <p>:i a bibma:Illumination .</p>
      <p>:c crm:P56 bears feature :i .</p>
      <p>This is a shortcut for the fully developed path:
:folio a crm:E53 Place .</p>
      <p>:c a bibma:Component ;
crm:P59 has section :folio .
:i a bibma:Illumination ;
crm:P53 has former or current location :folio .</p>
      <p>Instances of E53 Place are a folio or a particular zone on a folio, for example.</p>
      <p>The ontology interacts with a thesaurus of technical terms used in medieval
studies (codicology, palaeography, iconography etc.) and descriptors used for
indexing medieval illuminations in the project's databases. The data is
structured in a thesaurus compliant with the international standard for thesauri and
interoperability with other vocabularies (ISO 25964). The di erent lexical and
semantic relationships that can be de ned between the descriptors will have an
intrinsic (semantic) role, in that they will help to show the relationships of
hyponymy or synonymy, as well as an extrinsic (technical) function for the search
engine. In addition, the project's data on people, corporate bodies, places and
titles are aligned with existing authority les and linked data repositories, such
as Rameau, VIAF, and GeoNames.
1.2</p>
    </sec>
    <sec id="sec-3">
      <title>The Historical Dimension of Biblissima's Data</title>
      <p>The majority of Biblissima's databases contain descriptive and structural
metadata for medieval manuscripts and early printed books, issued from the
cataloguing of these documents or scienti c research, but the project also includes
digital editions in TEI P5 of library inventories and texts and records on
illuminations. Metadata like the date of creation and place of origin of a manuscript
and its illuminations, the identi cation of the scribe, translator or commentator
of the copied text, former owners (people and corporate bodies) of a manuscript
throughout the centuries and lists of books kept in libraries at a given moment
in time can be used to study the history of the texts and manuscripts, as well
as reading and collecting practices.</p>
      <p>In order to develop the portal step by step we have chosen to begin by creating
a uni ed access point to two iconographic databases.</p>
      <sec id="sec-3-1">
        <title>Objectives of the Biblissima Prototype</title>
        <p>Using Semantic Web technologies, the Biblissima prototype aims to demonstrate
the potential of the available metadata produced by the Biblissima project. It
was developed using open source solutions and all the data is publicly available
under an open licence in order to facilitate reuse.</p>
        <p>
          The prototype is built on a subset of two iconographic databases: Mandragore
[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], the database of the Department of Manuscripts of the National Library of
France (BnF) and Initiale [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], the database of the IRHT.
        </p>
        <p>It provides federated access to a subset of data present in the two databases,
such as illumination related data: caption, descriptor, folio carrying the
illumination, illumination record, digital surrogate of the illumination, artist, context
of the illumination (author and title of the textual work per artistic unit), date
of origin and place of origin. The data set also contains manuscript related
data such as shelfmark, common name, grouping, repository, digital surrogate,
manuscript record.</p>
        <p>The subset is limited, in the case of both databases, to records on
illuminations indexed with at least one geographical descriptor, which equates to almost
5 000 descriptors for approximately 20 000 illuminations.</p>
        <p>
          A SPARQL endpoint and a federated search engine make it possible to search
all the data in the cluster. Users can search by descriptor, artist, date of origin,
place of origin, author or work title, and can re ne their search with a series of
facet lters. The results are displayed in a user-friendly manner by grouping them
in lists. Pages about manuscripts, their units and illuminations include frames
that display the corresponding digital surrogate using IIIF manifests [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ],
relating text and images. Other visualisation features are available to the user, such
as timelines and maps. The data from Initiale and Mandragore are augmented
with data on the digital surrogates of illuminations in their context, extracted
from other manuscript catalogues (Medium [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] - the IRHT manuscript
repository, Gallica - the digital library of the BnF, and BnF archives et manuscrits
[
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] - the catalogue of the Department of Manuscripts of the National Library
of France). Each illumination record in the prototype links back to the
original record from one of the two databases as well as to the full digitisation of
the manuscript when available. The search capabilities currently do not include
manuscript genealogies. This can be achieved by including more databases and
classes like bibma:Type of Use Manuscript and bibma:Source in the future,
when implementing solutions deriving from lessons learned about texts and their
transmission.
        </p>
        <p>Both databases have been used to index manuscript illuminations for the
last 25 years and di erent systems were chosen for structuring the descriptors.
A polyhierarchical classi cation of Biblissima's thesaurus may make it possible
to retain the original descriptor classi cations while reordering them in a new
systematisation. However, these classi cations do not re ect the medieval
practices of organising knowledge. The identi cation of the iconographic elements is
sometimes based on internal information such as heading titles, chapter titles,
inscriptions or notes present in the manuscript; in the absence of this kind of
information, identi cation is dependent on the scholar's culture, especially in the
case of living things and artefacts. This means that when using these descriptors
to study medieval illuminations, we must keep in mind that their identi cation
has a disparate chronological and cultural origin. Another feature of the indexing
practices speci c to these databases is that the data does not provide co-textual
information, and the contextual information is not always available. This makes
quite di cult to trace the diachronic evolution of the meaning of a word and of
its iconographic representation.</p>
        <p>The technical solutions adopted by Biblissima open new avenues and yield
new ways of searching through data that could contribute to the analysis of
iconographic representations. On the basis of the geographic descriptors, one
could attempt to answer several questions regarding the status of cities in artistic
imagery and de ne the notion of the city through iconographic choices: what are
the criteria which confer an urban identity to a community and what makes the
di erence between an urban and a rural environment? From what point in time
do cities begin to be represented and what cities are the most represented over
the centuries? How could one explain the cases of single occurrence? Could one
analyse the anachronistic representations of places, be they cities or battle elds?
What are the most common descriptors associated with toponyms?
3</p>
      </sec>
      <sec id="sec-3-2">
        <title>Conclusion</title>
        <p>
          The semantic web solutions that Biblissima has chosen could be adapted in order
to provide answers to other kinds of research topics. As such, Biblissima's
polyhierarchical thesaurus makes it possible to establish new classi cations of the
descriptors that already exist in the databases and to recreate a medieval
taxonomy of living species as it was conceived by an encyclopedist or a physician, for
example. One might also connect the thesaurus to the digital edition of
exegetical texts such as the biblical Glossa [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], one of Biblissima's partner projects,
and try to study the semantic relations between the four senses of the
Scripture (historical, allegorical, tropological and anagogical) and the iconographic
representation of the biblical words and scenes, for example.
        </p>
        <p>By adopting common standards for the ontology and for the thesaurus,
Biblissima's data might also be aggregated with and used by other semantic web
projects in the future.
http://</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Investissements</surname>
          </string-name>
          <article-title>d'avenir (CGI)</article-title>
          , ANR-11
          <string-name>
            <surname>-</surname>
          </string-name>
          EQPX-0007 investissement-avenir.gouvernement.fr
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <source>CIDOC CRM 6</source>
          .0, http://cidoc-crm.org/docs/cidoc_crm_
          <source>version_6</source>
          .0.pdf
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>3. FRBRoo 2.2, http://www.ifla.org/files/assets/cataloguing/frbr/frbroo_ v2.2.pdf</mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Abrami</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freiberg</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Warner</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Managing and Annotating Historical Multimodal Corpora with the eHumanities Desktop - An outline of the current state of the LOEWE project Illustrations of Goethe's Faust</article-title>
          . In: Historical Corpora, pp.
          <volume>353</volume>
          {
          <fpage>363</fpage>
          . Narr Francke Attempto, Tubingen (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gkadolou</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stefanakis</surname>
          </string-name>
          , E.:
          <article-title>A formal ontology for historical maps</article-title>
          , http://galaxy. hua.gr/~heraclitus/images/gkadolou/3\_Gkadolou\_ICA.pdf (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>6. Mandragore, http://mandragore.bnf.fr</mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>7. Initiale, http://initiale.irht.cnrs.fr/accueil/index.php</mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>8. IIIF, http://iiif.io/</mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>9. Medium, http://medium.irht.cnrs.fr/</mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>10. BnF, Archives et Manuscrits, http://archivesetmanuscrits.bnf.fr</mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>11. Biblical Glossa, http://www.glossae.net/</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>