<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Integrating Terminological Tools and Semantic Archaeological Information: the ICCD RA Schema and Thesaurus</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Achille Felicetti</string-name>
          <email>achille.felicetti@pin.unifi.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ilenia Galluccio</string-name>
          <email>ilenia.galluccio@pin.unifi.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cinzia Luddi</string-name>
          <email>cinzia.luddi@pin.unifi.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria Letizia Mancinelli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tiziana Scarselli</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonio Davide Madonna</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>MiBACT-ICCD, Istituto Centrale per il Catalogo e la Documentazione</institution>
          ,
          <addr-line>Rome</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>MiBACT-ICCU, Istituto Centrale per il Catalogo Unico</institution>
          ,
          <addr-line>Rome</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>PIN, VAST-LAB</institution>
          ,
          <addr-line>Prato</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <fpage>28</fpage>
      <lpage>43</lpage>
      <abstract>
        <p>This paper describes the process of mapping, translation and publication in SKOS format of the RA Thesaurus, a terminological tool developed by the Italian Ministry of Cultural Heritage (MiBACT) as a part of the o cial documentation used for the recording of archaeological nds. In particular, the RA Thesaurus is intended to provide uni ed and meaningful terminology for the description of archaeological objects according to the MiBACT o cial cataloguing standards. After describing the thesaurus, the logic with which it was developed and its internal structure, we report the various phases of the conversion, both from a theoretical and implementation point of view, and the various technologies used for the publication of the thesaurus on the web. This work is a collaborative e ort between PIN and MiBACT carried out under the ARIADNE project.</p>
      </abstract>
      <kwd-group>
        <kwd>Archaeology</kwd>
        <kwd>Mapping</kwd>
        <kwd>Thesauri</kwd>
        <kwd>ICCD</kwd>
        <kwd>CIDOC CRM</kwd>
        <kwd>SKOS</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        ARIADNE is a European project focusing on integration of existing
archaeological research data infrastructures to enable the use of distributed datasets and
services by means of new and powerful technologies as an integral component
of the archaeological research methodology. Among other activities, ARIADNE
is also actively working on building a coordinated system of multilingual
terminology tools able to meet the many needs of the international community of
archaeologists. As part of these integration activities, the valuable work of
mapping national catalogue schemas on international standards is a critical step;
at the same time integration of terminology resources is necessary to overcome
linguistic barriers that frequently slow down the integration processes. We have
extensively described the process of CIDOC CRM encoding of the RA Schema,
released by ICCD for documenting archaeological artefacts in Italian
archaeology, in a previous work [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The mapping was presented as work in progress at
that time. Since then, new extensions of the CIDOC CRM (and in particular
CRMarchaeo) have been released which are now able to provide more
possibilities for the enrichment of the semantic archaeological information and a more
archaeological oriented means of documentation. The release of new versions and
the creation of new extensions of the CIDOC CRM gave us the opportunity to
investigate how the mapping could be improved. This allowed us to bring the
mapping to a stage very close to completion, although much work still remains to
be done. The RA Schema is closely linked to the RA Thesaurus, a sophisticated
vocabulary providing all the necessary terminological facilities for an e cient
and well-structured recording of the objects coming from archaeological
excavations. The vocabulary has been implemented by ICCD to support the encoding
of two speci c elds (OGTD - CLS). These two elds describe the de nition
of the object and its class and production. This paper will focus and propose
integration between the RA Schema and its thesaurus, based on W3C
recommendations and using numerous tools developed and used by several partners in
the ARIADNE project.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>ICCD and the Standards for Cultural Heritage</title>
      <p>ICCD is the Italian Central Institute for Catalogue and Documentation, one of
the seven Central Institutes of the Italian Ministry of Cultural Heritage whose
main goal is to create a centralized national catalogue of Italian cultural
heritage. The activity of the Institute is based on the research and development
of tools, methods and standards for knowledge, protection and enhancement of
the cultural and artistic heritage in Italy. It mainly provides the management of
the national general catalogue of archaeological, architectural, historical, artistic
and ethno-anthropological heritage, the development of cataloguing
methodologies and standards, and the coordination of the technical institutions involved
in the cataloguing activities on the national territory. ICCD also provides tools
and best practices for implementing these standards with the clear intent of
unifying and streamlining processes related with the cataloguing activities, to
guarantee quality and to implement standardisation and interoperability at a
national level.</p>
      <p>To ensure that this happens e ciently, the Institute creates and releases a
series of organic resources and recommendations to support the standardization
process in all its aspects. These include detailed regulations describing the
various tools and the way they should be used, a set of schemas and forms to collect
information in a structured way according to the di erent asset types, authority
les to guarantee homogeneity for the common transversal key concepts and
entities, thesauri and terminological tools to provide uniform layers of information
and a common language. Among the latter category, one of the most important
tools released by ICCD is the RA Thesaurus, a tool providing standard names
for the de nition of archaeological artefacts described using the RA schema,
the ICCD standard schema used for the recording of movable objects. The RA
Schema is the most used and well established standard for Italian archaeology
so far. For this reason, ICCD has invested a lot of e ort in the de nition of
a terminological tool able to provide standardized and unambiguous names for
speci c elds of the schema. The creation of the RA thesaurus is one of the best
results of this e ort.
3</p>
    </sec>
    <sec id="sec-3">
      <title>The ICCD RA - CIDOC CRM</title>
    </sec>
    <sec id="sec-4">
      <title>Mapping</title>
      <p>
        In the previous work carried out together with ICCD, a detailed analysis of the
RA Schema was made to map the most signi cant model of ICCD archaeological
cataloguing system to CIDOC CRM. The RA Schema is used to record movable
objects. It is one of the most used for Italian archaeology because of the huge and
ever increasing amount of artefacts found during excavations. The RA Schema
contains a large number of descriptive information and \cross-sections" allowing
cross references with other ICCD resources. The RA Schema, together with the
RA Thesaurus, features one of the best tools of this kind in the international
panorama of cataloguing systems. The previous mapping work was carried out
on CIDOC CRM and took advantage of version 5 of the model, released in 2013.
However, in the last two years, a new version and numerous extensions of CIDOC
CRM have been released. Version 6 and the CRMarchaeo [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and CRMsci [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
extensions, much more suitable for the description of archaeological phenomena,
have strongly enhanced the representation and mapping of excavation entities.
CRMarchaeo, in particular, is being developed by the ARIADNE project to
facilitate the encoding of archaeological entities. Given this, we decided to update
the previous mapping in order to provide a stronger archaeology-oriented logic
to the various concepts and relationships that the RA Schema presents.
      </p>
      <p>One of the most di cult problems to solve during the previous mapping
was the representation of the \ nding" event, intended as the excavation
activity during which objects are found. This event is of paramount importance in
archaeology because it is fundamental to trace the object's provenance and to
reconstruct its history. Following the CIDOC-CRM model, we represented the
archaeological objects by using the E22 Man-Made Object class. However, to
describe their relationships with the two important activities of \survey"
(corresponding to the \RE" eld of the RA Schema) and \excavation" (speci ed
in the \DSC" eld), CIDOC CRM core only provided a \change of ownership"
relationship that hardly ts here but we decided to use it anyway. Our previous
mapping appeared as shown in Figure 1.</p>
      <p>
        Thanks to the release of the new extensions and a deep analysis of the
crosssection relating the RA Schema [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], the new mapping now shows a more
accurate rendering of these concepts. To express the \object found during an
excavation" relationship, CRMarchaeo provides the O19i was object found by
property, through which it is possible to link the artefact with the new S19
Encounter Event class, expressly designed to render the concept of \ nding" as
an event which occurred (P7 took place at ) at a given Site (E7 ) identi ed by
a given appellation (P57 is identi ed by - E44 Place Appellation), as shown in
Figure 2. This constitutes a more accurate representation of these concepts.
The RA Thesaurus was developed expressly to provide standardised values for
some of the \OG-OGGETTO" (Object ) elds of the RA Schema. The content
of the thesaurus is organized in a tabular structure with ve columns arranged
according to the hierarchical levels provided by the thesaurus. The rst three
columns, used to ll the CLS eld of the RA schema, present the categories' three
levels of hierarchy, to which any concept can belong; column four lists the main
terms for the de nition of the objects; column ve provides speci cations of the
main terms in accordance with morphological, functional or partitive criteria.
Both columns four and ve are meant to provide standard terms for the OGTD
eld of the RA Schema. Additional columns, reporting further attributes and
speci cations for each term and subterm, such as descriptive notes and sample
images, are also present (see Fig. 3). Images are an added value of this tool for
their ability to visually show what words are not always able to say. We have
already investigated some of the possibilities to encode gures in our mapping,
but unfortunately the tools at our disposal do not always allow a clear de nition
of these entities. For sure it will be important, in future versions of the thesaurus,
to de ne a standard mechanism for associating concepts with their images even
in the SKOS version of the thesaurus.
      </p>
      <p>
        The RA Thesaurus di ers from the other terminological tools created by
ICCD in the very sophisticated structuring criteria it follows, made more
complicated by the large amount of information deriving from Italian archaeology
and the huge number of classi cations and nomenclatures it provides. In
particular, the thesaurus is structured according to a multilevel schema based on
concept coordination, a typical KOS activity in which concepts are combined
with each other in order to produce meaningful \sentences" that de ne complex
concepts. Generally speaking, there can be two types of concept coordination:
pre-coordination and post-coordination. The key distinction between the two
relies on when the actual coordination occurs in relation to an information retrieval
event. Pre-coordination is decided and implemented before the information
retrieval time, by a KOS maintainer or by an indexer who is using the KOS itself.
This occurs, for instance, when an indexer takes two existing concepts from a
concept scheme, such as \Coins" and \Mintage", and explicitly combines them
with a given syntax, such as \Coins-Mintage", to index a particular document.
Post-coordination, on the other hand, is performed as part of an information
retrieval task, for instance through a SPARQL query able to retrieve all documents
indexed using both \Coins" and \Mintage" concepts [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The RA Thesaurus
follows the post-coordination approach to create ad hoc concepts by using the
elements of a given schema. Each concept is in fact provided with all the
necessary subterms depending on it, which can belong to three speci c semantic
areas according to the speci cation provided: either functional (i.e. relative to
the speci c function of the object), partitive (i.e. relative to a speci ed part of
the object) or morphological (i.e. linked to the di erent forms that from time
to time an object may present). The structure of the thesaurus is obviously
functional to the speci c cataloguing activities. Each concept is thus created on
the y by combining the main terms with all the related subterms required to
render the speci c name that a concept should show in a given context. Figure 4
provides an example of how the thesaurus is structured by reporting the various
facets of the term \cintura" (belt) and its related functional and morphological
subterms:
      </p>
      <p>It is evident from the example above that the thesaurus itself does not o er a
closed and exhaustive list of all possible terms that can be used during the
compilation of the schema. Instead, it is a reference tool that, after a general term
is xed, assists the user in proceeding to further speci cations by the addition
of suitable subterms to gradually approximate the precise semantic meaning of
the object to be described.</p>
      <p>The exibility of this structure allows it to achieve a signi cant depth of
semantics, where required, and to build speci c de nitions of several types of objects,
in order to have an entry like this:
Cintura per la sospensione delle armi, borchia (Belt for weapons suspension,
stud)
representing an exhaustive explanation of the fragmentary object itself and of
the bigger object which is part of, and also as a valid entry from the
terminological point of view following the formal recommendations provided by the ICCD
guidelines and validation systems.
3.2</p>
      <sec id="sec-4-1">
        <title>A SKOS Mapping Proposal</title>
        <p>SKOS is the standard chosen by the ARIADNE project for the encoding of all
terminological resources to be used in its integration plan, and for the
undeniable advantages provided to integration and interoperability by its RDF-based
format. As one can easily understand from what was previously stated, the
\combinatorial" nature of the RA Thesaurus, and especially of the sections intended
for the encoding of the OGTD eld (column four and ve), makes it very
difcult to encode in a SKOS compatible format, which requires that a complete,
self-consistent and self-su cient de nition exists in the thesaurus for each item
or concept. The SKOS vocabulary itself does not provide any mechanism for
expressing that a given concept may consist of other pre-coordinated concepts.
It is, of course, possible to extend SKOS to establish a pattern for representing
coordinated concepts, for instance by stating a new sub property, as in the
following example:
iccd:coordinationOf a rdf:Property ;
rdfs:domain skos:Concept ;
rdfs:range rdf:List.
and then use the new property this way:
iccd:coinsMintage a skos:Concept ;
iccd:coordinationOf (iccd:coins iccd:mintage);
skos:prefLabel \Coins-Mintage"@en.</p>
        <p>However, patterns for pre-coordination have not yet been exploited by the SKOS
community and solutions of this kind have not been explored fully enough to
warrant their inclusion in the o cial SKOS vocabulary. Analyzing the RA
Thesaurus, PIN and ICCU identi ed a possible solution. We tried to follow a
different approach, more \pre-coordination oriented" to rearrange, where possible,
the original content according to semantic criteria in order to de ne meaningful
self-consistent concepts in the SKOS representation. After discussing the matter
in depth, we proposed the following solutions:</p>
        <p>1. The partitive speci cation subterms are in many cases independent terms
related with the main term mostly by a part-whole relationship. Thus, it is
possible to describe this relationship by using the skos:narrowerPartitive property to
de ne them. This is particularly suitable if we consider that the same partitive
term could occur for di erent main concepts: both a belt and a ag could have a
puntale (ferrule) as partitive concept. Therefore, it is important to clearly de ne
the hierarchy of these kinds of objects. Alternatively, it would be possible to
combine main terms with their partitive terms in order to de ne complete and
self-consistent concepts, to be then de ned as narrower terms of the main ones.
In the previous example, we could de ne, for instance, a new puntale di cintura
(belt ferrule) term, which would be clearly distinguished by a puntale di insegna
( ag ferrule), the two being totally di erent, although very similar, objects.</p>
        <p>2. The morphology and functional speci cation subterms are meaningless in
themselves. They become meaningful only when combined with their main term.
Creating SKOS narrower terms from these elements requires, for each
morphological or functional term, the creation of a subterm obtained by combination
with the super concept, in order to obtain a set of semantically consistent narrow
terms. There is no semantic meaning in multipla itself unless this concept is used
together with cintura in order to specify, in this case, the typology of a given
belt. Cintura multipla is, on the other hand, a perfectly consistent concept.</p>
        <p>Multiple combination of partitive, morphological and functional sub concepts
to create speci c entries, even if not impossible, would be very di cult to
implement in SKOS due to the exponential growth of all possible combinations. At
present, we propose not to extend the pre-coordination operations beyond the
minimum requirements of semantic understandability and to use more than one
SKOS concept to describe speci c archaeological objects if required.
4</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>SKOS Encoding of RA Thesaurus</title>
      <p>From a technological point of view, the RA Thesaurus was created starting
from 2008 on the basis of the terms extracted from the database maintained
by the \Sistema Informativo Generale del Catalogo" (SIGEC). Its development
also went through various phases of data cleaning and strengthening. The RA
Thesaurus is currently an \open vocabulary", meaning that it is not meant to
have a stable form since its content can be updated and modi ed by ICCD during
further stages of research. Currently, the available version of the vocabulary is
in textual format that is organized in a tabular structure, whose elds comply
with the ISO standard norms for thesauri. In order to make the original textual
information interoperable and ensure integration with semantic terminological
tools, it was necessary to encode them in a structured and standard format.</p>
      <p>The process we implemented for the SKOS encoding of the RA Thesaurus is a
proposal for its re-engineering as a formal ontology and for making the knowledge
it provides explicit in a formal sense. The whole process of encoding required
a set of subsequent steps for data analysis, adjustment, conversion, publication
and enrichment, in which the original textual data has been processed using
both open source tools and ad hoc scripts.</p>
      <p>The process can be subdivided into two analytic phases (see Fig. 5):
1. In the rst analytic phase we focused on encoding the key elds of the original
thesaurus, such as concepts and classes. The result of the rst phase consisted
in the creation of a SKOS/RDF version of the RA Thesaurus obtained through
the mapping between the main concepts and the SKOS Core Vocabulary.
2. In the second phase, we focused on the integration of all morphological,
functional and partitive aspects related to thesaurus' concepts. The analysis of this
additional information required further investigation into how SKOS extensions
could be used for the publication of thesauri in a semantic format.
4.1</p>
      <sec id="sec-5-1">
        <title>Thesaurus Conversion Using SKOS Core Vocabulary</title>
        <p>
          The conversion of the RA Thesaurus initially required a deep data analysis to
dene a precise mapping between its main elds and the SKOS Core Vocabulary [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]
in order to use its set of properties and classes to express the conceptual content
of the thesaurus as an RDF graph. The elds examined in the rst analytic phase
are levels one and two, containing categories and subcategories, and level four,
containing the main terms for the description of the artefacts. With reference
to level ve, we limited our analysis to the functional facet only and we
considered the descriptive notes in the attribute elds. Classes and terms were mapped
using the skos:Concept entity, main terms were mapped as skos:prefLabel,
nonpreferential terms as skos:altLabel, notes were encoded using skos:scopeNote. The
skos:broader and skos:narrower properties were used to express the hierarchical
relationships between categories or concepts. The functional speci cation of a
term was expressed through the skos:narrower relation with a subterm obtained
by combination with the super concept.
        </p>
        <p>
          Figure 6 shows an example of the mapping expressed by using SKOS entities.
Each concept coming from the RA Thesaurus is represented by a blue circle.
The central circle depicts the concept of Cintura (belt) while the red circle
represents the thesaurus itself. Arrows connecting the various circles represent
the SKOS relationships existing among them. The mapping de nition on the
SKOS Core Vocabulary was followed by the use of an ad hoc script and of a
speci c tool that allowed the conversion of a huge textual le into RDF format.
At rst, the original thesaurus was manipulated and converted in order to create
a CSV le that satis ed some speci c technical requirements. The script was
developed in Perl language and was intended to select a speci c thesaurus' subset
of elds, to sort and to clean the information and to convert them into a custom
CSV le. Subsequently, the Stellar Console tool was applied to further elaborate
this le. Stellar Console is an open source command line utility application
developed in the framework of the AHRC-funded project \Semantic Technologies
Enhancing Links and Linked Data for Archaeological Resources" (STELLAR)
[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. The Console accepts input format such as CSV in order to produce a more
structured output such as SKOS/RDF or CIDOC-CRM/RDF by applying a set
of customizable templates. The templates look for the presence of particular eld
names in the input data, and process each row in turn using the values contained
in these elds. The use of the conversion feature from the custom CSV les to
the SKOS/RDF of Stellar Console is the nal step for the conversion of the
main subset of RA Thesaurus from a textual format to a structured, semantic
and interoperable format.
4.2
        </p>
        <p>Thesaurus Publication and Enrichment Using SKOS Extensions
In the second analytic phase, the publication of the thesaurus was analysed and
tested on a vocabulary server. Possible solutions for mapping and integrating
the elds that were not converted in the rst analytic phase were consequently
studied and tested.</p>
        <p>In order to produce the necessary results for the RA Thesaurus publication,
it was important to consider two fundamental aspects. The rst was a
vocabulary web server supporting international standards such as SKOS and the ISO
thesaurus norms; the second was a vocabulary web application which supports
multilingualism, semantic thesauri and data enrichment. All these aspects, in
our opinion, are fundamental to make the RA Thesaurus even more exible
for future study phases by expanding and integrating it with further multiple
extensions.</p>
        <p>
          We considered di erent possibilities to achieve the above-mentioned results,
by choosing TemaTres as the most pragmatic solution. TemaTres is an
opensource, web-based thesaurus management package [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] that supports the handling
of vocabularies in accordance with the ISO standard thesaurus norms, including
the last ISO-25964 [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. The main features of TemaTres include a functional user
interface for editing and browsing, good search capabilities, and the ability to
export all or part of the thesaurus in a number of standardized forms (Json,
JsonLD, SKOS Core, DC etc.). TemaTres easily allows data import in SKOS/RDF
format and some of the more advanced features include the ability to link terms
between two di erent vocabularies. A test version of TemaTres was installed on
a local server and used to import the SKOS/RDF thesaurus version containing
the main concepts in order to proceed further with the enrichment work. The
TemaTres publication of the RA Thesaurus provides many editing and search
facilities. One of the most important is the ability to customize and
automatically generate URIs used to unambiguously identify and reach resources from
any context. For generating suitable URIs we have used - by means of testing
- the o cial ICCD namespace (http://www.iccd.beniculturali.it), which will be
useful for the future installation of TemaTres on the ICCD server and for the
creation of consistent and unambiguous URI/URL to make the RA Thesaurus
available also in a Linked Open Data format. The conversion of the elds related
to morphological and partitive speci cation of terms required further actions on
the data. We used the TemaTres administration facilities for this semantic
enrichment. We mapped the morphological speci cations using skos:broader and
skos:narrower properties. The partitive speci cation subterms was mapped
using the last ISO standard on thesauri ISO 25964[
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. One of the innovations
introduced by the current norm is the possibility to make explicit the nature
of semantic relationships, in particular we focused on the changes regarding the
hierarchical relationships. To extend the richness of thesauri, the SKOS Core
hierarchical relationships depicted through the tags BT and NT can be
further divided into generic (BTG/NTG), partitive (BTP/NTP) and instantial
(NTI/BTI). ISO 25964 speci es that this relationship holds \between a pair of
concepts when the scope of one of them falls completely within the scope of
the other" [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].We introduced the BTP and NTP relationships using the
corresponding property in the `iso-thes' namespace: iso-thes:broaderPartitive and
iso-thes:narrowerPartitive [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. The example in Figure 7 shows that a \ bbia
di cintura" (belt buckle) concept stated this way, for instance, speci es that
the bbia is part of a \cintura" (belt), whereas a \ bbia" (buckle) per se could
also be part of other objects, for instance, a weapon, a garment and so forth.
Therefore, the BTP/NTP relationships cannot be automatically inferred by the
subconcept only because it could be part of many objects.
        </p>
        <p>
          The image eld of the RA Thesaurus is also a very interesting case. As
already mentioned, images increase the richness and meaningfulness of concepts,
their presence being sometimes crucial, especially in cases where proper
understanding of the archaeological objects may remain ambiguous. In a 2005 version
of the SKOS Core Guide W3C Working Draft [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], the Working Group
proposed the use of symbolic labels, as part of the labelling properties, to label a
concept with an image. Symbolic labels could be used to assign preferred and
alternative symbolic labels to a concept by means of the skos:prefSymbol and
skos:altSymbol properties. This solution would have been the most appropriate
for the mapping of the RA Thesaurus sample image, but in the subsequent W3C
Recommendation [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], symbolic labelling elements were removed, although no
explicit deprecation axioms were expressed in the schema. In order to achieve
a publication of the thesaurus that complies as far as possible with the W3C
speci cations, we preferred not to use the solution proposed in the SKOS Core
Guide W3C Working Draft, but to use the current W3C Recommendation only.
According to the latter, sample images can be regarded as accessorial
information of the SKOS concepts. The relationship can be mapped using the skos:note
property, considering that there is no restriction on the nature of the information
that the property can associate with the concept.
The AAT thesaurus (Art Architecture Thesaurus - Getty Institute) [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] was
chosen by ARIADNE to represent a common spine and to constitute a facet
allowing search and faceted browsing across all the terminological tools that
the project is collecting. Integration will be based on mappings of national/local
vocabularies to the AAT thesaurus. This will allow interoperability over the
subject metadata in di erent partner languages via the common AAT spine. The
issue of multilingualism is a matter that needs to be taken into account, not
only because of the variety of national thesauri that are going to be integrated
by the ARIADNE initiative, but also for the future creation of common and
transnational terminological tools. Linguistic issues often make the direct
mapping of a concept via the skos:exactMatch property on AAT concept di cult, but
hopefully the most signi cant issues will be resolved by the end of the project.
The conceptual mapping between the ICCD RA Thesaurus and AAT has been
completed and revised; for this purpose it was decided to manually construct a
mapping from the various terms and functions (if any), following in sequences
the three main categories of the RA Thesaurus. The work pattern was based
on an Excel representation of the thesaurus to which additional columns were
added in order to specify:
{ the targetLabel and the unique identi er ( ID ) of the corresponding de
nition/term selected in AAT;
{ the SKOS schema properties (skos:closeMatch; skos:exactMatch;
skos:broadMatch and skos:matchURI );
{ the name of the institution in charge of the de nition of each speci c
mapping (creator).
        </p>
        <p>Only a subset of the RA Thesaurus was taken into account to demonstrate the
feasibility of these operations. The subset includes one thousand, one hundred
and ninety one terms related to ten major categories (highlighted in the original
source as \livello 1 categoria ") relating to:
{ CLOTHING AND ACCESSORIES
{ FURNISHING
{ TRANSPORTATION
{ CONSTRUCTION INDUSTRY
{ PAINTING
{ ARCHAEOBOTANICAL FINDINGS
{ ARCHAEOZOOLOGICAL FINDINGS
{ SCULPTURE
{ INSTRUMENTS - TOOLS AND OBJECTS OF USE
{ GENERAL TERMS</p>
        <p>The analysis for nding the corresponding entries in the AAT thesaurus took
into account the information provided by scope notes and images accompanying
each concept; extensive web searches were performed to nd the most
appropriate matching term between Italian and English; and terminological researches
was carried out using di erent resources to identify synonyms to make the
associated targetLabel as unique and as precise as possible.</p>
        <p>The mapping work has identi ed :
{ 457 broadMatch associations
{ 104 closeMatch associations
{ 630 exactMatch associations</p>
        <p>Three examples of association are provided in the following table:</p>
        <p>At the end of the mapping work we can say that the most signi cant activity,
from the scienti c-methodological point of view, has been the review of the whole
process. Started as punctual control \1: 1" of correspondence between the terms
of the two terminology tools (thesaurus ICCD / RA and AAT), this review
has, in fact, been expanding by realizing the mapping of the terminological
categories relating to individual entries with the codes referring to the facet and
the hierarchy AAT. This has made possible:
1. disambiguating and correcting matches previously selected - and often
lexically corrected - but decontextualized from the original domain of belonging;
2. providing the basis for a future matching job between di erent categories of
multilingual thesauri.</p>
        <p>It is worth underlining that the focus of the whole work of mapping is the concept
of individual terms meant as records entered in a complete hierarchical structure
of related terms and notes. Among the results which have been achieved - and
which are highlighted though the mapping between classes - we can state the
high level of correspondence between the ICCD/RA thesaurus entries and the
AAT Thesaurus record types. Out of one thousand, one hundred and ninety one
basic records one thousand, one hundred and sixty four among them are linked
to \concept" and only twenty seven to \guide term". According to the AAT
Thesaurus guidelines:
{ Concept: Refers to records in the AAT that represent concepts; records for
concepts include terms, a note, and bibliography.
{ Guide term: Refers to records that serve as place savers to create a level in
the hierarchy under which the AAT can collocate related concepts. Guide
terms are not used for indexing or cataloguing.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions and Further work</title>
      <p>The study and analysis of the RA Thesaurus allowed us to fully understand
the complexity of the challenges arising from the need to de ne, by means of
standard nomenclatures, objects of such various and multifaceted nature as
archaeological objects are. The ICCD RA vocabulary, being the result of years of
research by a team of experts in the eld of Cultural Heritage, is de nitely an
irreplaceable resource that adequately meets this need. Its structure is certainly
an important point of arrival on the road to standardization. From a
methodological point of view, the work carried out has highlighted both conceptual and
procedural challenges that arise when attempts are made to handle a complex
structure in a standard tool. The results achieved so far are considered
satisfactory, also in consideration of the fact that the work is at an intermediate stage
and that further studies and investigations will be necessary before the
conversion of the entire thesaurus can be completed. Future activities will include a
clear and unambiguous de nition of complex concepts, such as those arising from
the combination of multiple terms and subterms; and the de nition of precise
criteria for the inclusion of images, which, as stated, is one of the distinctive
features of this vocabulary. The choice of AAT as the common standard partially
solves the multilingualism issues, providing labels in di erent languages for the
terms already mapped. We must instead provide appropriate translations for
those that have no equivalent in the thesaurus of the Getty Institute. At the end
of the ARIADNE project, the RA Thesaurus will become part of the rich set of
terminological tools that the project is already collecting in order to integrate
them into the platform on which real interoperability will take place. The
ARIADNE Portal will make this resource available and easily accessible online for
external use outside of the project. The publication as Linked Open Data, also
provided by the project, will guarantee its availability in other Cultural Heritage
scenarios.
7</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>The present work has been supported by the ARIADNE project, funded by the
European Commission (grant 313193) under the FP7 INFRA-2012-1.1.3 call.
The authors opinion do not necessarily re ect those of the European Commission</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Felicetti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scarselli</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mancinelli</surname>
            ,
            <given-names>M.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Niccolucci</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , (
          <year>2013</year>
          )
          <article-title>Mapping ICCD Archaeological Data to CIDOC-CRM: the RA Schema, Vladimir Alexiev</article-title>
          , Vladimir Ivanov, Maurice Grinberg (eds.):
          <article-title>Practical Experiences with CIDOC CRM and its Extensions (CRMEX</article-title>
          <year>2013</year>
          )
          <article-title>Workshop</article-title>
          , 17th
          <source>International Conference on Theory and Practice of Digital Libraries (TPDL</source>
          <year>2013</year>
          ), Valetta, Malta,
          <year>September 26</year>
          ,
          <year>2013</year>
          , CEUR-WS.org/Vol-
          <volume>1117</volume>
          , pp
          <fpage>11</fpage>
          -
          <lpage>22</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. CRMarchaeo http://www.ics.forth.gr/isl/index_main.php?l=e&amp;c=
          <fpage>711</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. CRMsci http://www.ics.forth.gr/isl/index_main.php?l=e&amp;c=
          <fpage>663</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ronzino</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Amico</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Felicetti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Niccolucci</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <article-title>European standards for the documentation of historic buildings and their relationship with CIDOC-CRM</article-title>
          ,
          <source>International Conference on Theory and Practice of Digital Libraries (TPDL</source>
          <year>2013</year>
          ),
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>5. http://www.willpowerinfo.co.uk/glossary.htm</mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. SKOS Core Guide http://www.w3.org/
          <year>2004</year>
          /02/skos/core/guide/
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>7. STELLAR project http://hypermedia.research.glam.ac.uk/kos/stellar/</mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>8. TemaTres website http://www.vocabularyserver.com/</mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <article-title>9. ISO 25964 thesaurus schemas</article-title>
          . http://www.niso.org/schemas/iso25964
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>International Standard</surname>
          </string-name>
          <article-title>Organization (ISO). Documentation - Thesauri and interoperability with other vocabularies: Part 1, Thesauri for information retrieval</article-title>
          ,
          <source>2011. Report ISO 25964.</source>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Alexiev</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isaac</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <article-title>On the composition of ISO 25964 hierarchical relations (BTG, BTP</article-title>
          , BTI),
          <source>International Journal on Digital Libraries</source>
          , pp
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Cardillo</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Folino</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Trun</surname>
            <given-names>o</given-names>
          </string-name>
          , R.,
          <article-title>Towards the reuse of standardized thesauri into ontologies</article-title>
          , Workshop on Ontology and
          <article-title>Semantic Web Patterns (WOP2014), co-located with the 13th</article-title>
          <source>International Semantic Web Conference (ISWC2014)</source>
          ,
          <source>Riva del Garda</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. SKOS Core Guide,
          <source>W3C Working Draft 10 May</source>
          <year>2005</year>
          http://www.w3.org/TR/ 2005/WD-swbp
          <article-title>-skos-core-guide-20050510/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>SKOS-Reference</surname>
            <given-names>http</given-names>
          </string-name>
          ://www.w3.org/TR/2009/REC-skos-reference-
          <volume>20090818</volume>
          /
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>15. Getty website http://vocab.getty.edu/http://vocab.getty.edu/</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>