<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Web for BIBLIMOS (position paper)</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>B E´ATRICE BOUCHOU MARKHOFF</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>SOPHIE CARATINI</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>FRANCESCO COREALE</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>MOHAMED LAMINE DIAKIT E´</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>ADEL GHAMNIA</string-name>
        </contrib>
      </contrib-group>
      <fpage>49</fpage>
      <lpage>56</lpage>
      <abstract>
        <p>We present the BIBLIMOS project, which aims to address the Western Saharan culture and history, by considering both local ancient Arabic manuscripts and European colonial archives. We describe the project's context and objectives before focusing on ancient Mauritanian manuscripts, the content of which covers many scientific fields. We assess the current state of such ancient manuscripts' digital processing and we analyse what the semantic web can bring for their use by scholars, from North and South: the ability for applications to operate jointly on several distributed and heterogeneous sources of digitized manuscripts and other kinds of archives, to support collaborative reflection.</p>
      </abstract>
      <kwd-group>
        <kwd>Ancient Arabic Manuscripts</kwd>
        <kwd>Data Integration</kwd>
        <kwd>Semantic Virtual Infrastructure</kwd>
        <kwd>Western Saharan Cultural Heritage</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>BIBLIMOS is a long standing programme, led by the CITERES laboratory4, that
proposes to collect information, and facilitate the constitution of thematic corpora, from
public and private archives pertaining to the history of the Western Saharan region.
Its first goal was to provide to local students and researchers the ability to study their
history, through a digital remote access to original materials (through images and
descriptions), and also the ability to collaborate more easily with foreign teams, on these
materials. Moreover, in the long run, it is also planned to deal with both primary sources
(original material created at the time under study) and secondary sources (material
written by scholars). In parallel, it is intended to address colonial archives about this
geographical area, from European countries (mainly France and Spain), in order to cross
complementary points of views, and thus, to discover new knowledge.</p>
      <p>Involving an international and cross-disciplinary team of researchers in the
humanities and, more recently, in computer science, BIBLIMOS aims to renew the knowledge
and analysis of Western Sahara’s societies, by making available to researchers from the
North and the South an open and interactive tool for searching and comparing local
archive funds, including the manuscripts of the desert, and European archives related to
these regions. There is also an important multilingual challenge, as we plan to perform
cross-referencing of Arabic, Pulaar, Soninke, Wolof, French, Spanish, Portuguese,
Italian, Dutch, German, English sources relating to the political, military, economic, legal,
social, scientific and religious history of the territories of the Western Saharan region,
from the modern era to the end of the Cold War.</p>
      <p>Concerning computer science, the BIBLIMOS programme is just getting started: it
aims to create an e-infrastructure based on a network of information around the history
of the Western Sahara. This open tool will offer (i) an access to sets of archival sources
and original manuscripts, (ii) a guide to navigate this knowledge network, (iii) an
automatic registration of new sources and (iv) new tools for knowledge creation and
visualizations. It will also be interfaced with various useful existing applications for research,
such as electronic publishing platforms, collaborative editing tools, bibliography
management tools, etc. To achieve this goal, three lines of work have been initiated. First,
to instigate, assist and sustain the creation of quality digital resources from the
original sources, second, to develop partnerships with providers of already existing digital
resources, and third, to incrementally build the target distributed e-infrastructure,
including a web portal as mediator, relying on semantic web resources and technologies.</p>
      <p>In the first line of work, BIBLIMOS stakeholders in Social Sciences and
Humanities (SSH) are engaged in actions aimed at discovering new local sources and
convincing their owners to join the programme. Concerning the second line of work, today
manuscript sources concealed in the Western Sahara are already partly inventoried, and
many European archive funds are now available to the public. As shown in Table 1, on
the one hand, online digitized full-text manuscripts exist, duly indexed and catalogued,
and on the other hand, institutions or associations offer to collaborate in order to index
digitized materials from many sources (cf. last lines in Table 1). Clearly the Web, that
provides information exploitable by humans, well supports all those very useful
initiatives. However, the query, the analysis, the combination and the overlapping of these
multiple funds, still represents a major challenge for every interested person. This paper
is dedicated to the third line of work in the BIBLIMOS programme, which addresses the
field of the automatic data-processing of such sources, in order to better assist humans
in these tasks. This is a field in which almost everything has to be designed and built.
The Semantic Web, i.e., the web knowledge exploitable automatically by computers, is
the way to cope with these challenges, as we argue in Section 3, after having presented
the state of the art of digital processing of Ancient Arabic Manuscripts in Section 2.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Digital processing of Mauritanian Ancient Arabic Manuscripts</title>
      <sec id="sec-2-1">
        <title>Mauritanian Ancient Arabic Manuscripts</title>
        <p>
          We focus on Mauritania’s manuscripts because Sophie Caratini, the instigator of the
BIBLIMOS programme, is an anthropologist specialist of Mauritania and she built
strong collaborations with scholars in Nouakchott, in particular through the IMRS5.
5 Institut Mauritanien de la Recherche Scientifique, see http://www.imrs.mr/spip.php?
page=sommaire_fr
http://gallica.bnf.fr/
http://www.tombouctoumanuscripts.org
http://omar.ub.uni-freiburg.de/
http://www.qdl.qa/en
makrim.org
http://www.islamicmanuscript.org/
extresources/manuscriptcatalogues.
aspx
http://openlibrary.org/
Site
http:
//www.westafricanmanuscripts.org/
http://www.archive.org/
http://digital.library.northwestern.
edu/arbmss/index.html
Mauritania is known [: : : ] for its enormously rich heritage of Arab manuscripts, many
brought from the Arab East by pilgrims returning from Makkah, some recopied from
those imported sources by students in the Qur’an schools [: : : ], and others composed
by Mauritania’s own jurists, poets and historians6 [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. According to researchers, some
Mauritanian manuscripts were written as early as in the 10th century, and their forms
and subjects are very diverse, including law, science and religion. To have access to
this legacy, the first step is to build up a precise survey of all manuscript repositories in
existence in the territories of the Western Saharan region. This has been the goal of long
term projects: for instance, the West African Arabic Manuscripts Database Project, from
the University of Illinois at Urbana-Champaign, started in 1987, provides a catalogue
(first line of Table 1) that references more than two thousand manuscripts. Currently, it
references eleven collections, which still is far from representing the actual reality of
family libraries. This is one of the web resources we plan to exploit in the BIBLIMOS
programme, in parallel of completing the repositories survey work performed by the
SSH teams. Several other websites provide information on Western Saharan or, more
generally, on Arabic manuscripts: the list presented in Table 1 shows that there is
already a lot of knowledge available on the web, but this knowledge still is exploitable
only through human labour.
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Digital Processing of Ancient Arabic Manuscripts</title>
        <p>
          Concerning manuscripts, many different descriptions may be stored in computer
memories: (i) seeing the manuscript as an archaeological object, i.e. starting from its external
aspect, a set of features may be evaluated, for instance the material it is made with,
the colour of ink, etc. This is called codicology [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] and a well-established vocabulary
for such a set of descriptors is provided by the IRHT7; (ii) a numerical image of the
manuscript can be taken; (iii) a transcription of the manuscript’s textual content can be
created, either manually or automatically from its numerical image (with OCR tools);
(iv) both the image and the transcription may be annotated, this is the case for many
European manuscripts, whose textual contents are encoded using the TEI standard; (v) the
manuscript can be catalogued, i.e. classified and described by librarians or archivists,
so it could be found again among collections: this supposes to define and identify
descriptors, including the location, and some general information about the content.
        </p>
        <p>
          For each of these descriptions, active research is conducted and, in some cases, they
converge to well established standards. Specifically for ancient Arabic manuscripts, in
[
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] the authors present the problem of cataloguing, assessing the difficulties involved
in identifying the metadata used by different schools (those dealing with specimen and
those addressing whole volumes). The solution proposed for enhancing interoperability
is to rely on the DCMI8 vocabulary. The TEI9, aimed at helping libraries, publishers,
museums and universities to encode texts in order to facilitate information retrieval from
6 http://www.saudiaramcoworld.com/issue/200306/mauritania.s.manuscripts.htm
7 Institut de Recherche et d’Histoire de Textes, see http://codicologia.irht.cnrs.fr
8 World widely used, simple and generic, digitized resources’ description, see http://
dublincore.org/
9 Text Encoding Initiative: http://www.tei-c.org/index.xml
textual contents, is another important medium for interoperability [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. Nevertheless we
cannot hope to use it in the short term because for now the only way to get transcriptions
of Mauritanian manuscripts is to manually enter the text. Indeed, automatic character
recognition algorithms hardly apply to these kinds of manuscripts, written with Arabic
graphemes but very often actually in many other languages (e.g. Pulaar, Wolof, etc.).
In [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], the authors recall the existing difficulties for applying OCR to ancient Arabic
manuscripts and, although recent advances are reported in [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] and [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], they need to
be further developed. Manuscript image analysis is not reduced to OCR: for instance,
word spotting may be a useful alternative to character recognition. This is why several
works propose to build ontological descriptions (or sets of metadata) of graphical image
features, in order to index and retrieve manuscripts’ digital images on this descriptive
basis [
          <xref ref-type="bibr" rid="ref6 ref7">7, 6</xref>
          ]. But to the best of our knowledge, such proposals have never been applied
to ancient Arabic manuscripts.
        </p>
        <p>
          When it comes to ontological representation of ancient manuscripts, the work
described in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], about the SAWS10 project (Sharing Ancient WisdomS), is clearly an
example of what we target in the BIBLIMOS framework. It deals with collections of
moral and social advice and/or philosophical ideas from Greek and Arab wisdom
literatures. Many of the concerned manuscripts have been transcribed and annotated using
TEI, and an extension of the FRBRoo ontology [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] has been developed to describe
the transmission of information (from one copyist to another and from one language
to another). The authors extract the relationships defined in the ontology from the TEI
annotations, to generate a conceptual network expressed in RDF11. This network
allows researchers to explore links between the different documents’ contents. This is an
example of how semantic web technologies contribute to the building of new means
of knowledge, by opening up and linking various sources for research which would
otherwise remain isolated and unused.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Semantic Web Architecture for BIBLIMOS</title>
      <p>For humans, carrying out some scientific work by using the resources listed in Table 1
is still difficult, as there are no means to perform cross-references, comparisons, or to
analyse the different points of view they provide, etc. Regarding BIBLIMOS’ aims,
other kinds of sources than manuscripts (e.g. European archives) should also be
exploited, which increases again these difficulties. Fortunately, while the web allowed
sources’ owners (or depositaries) to publish their resources through websites, the
semantic web now supports the development of softwares that help humans to cope with
these difficulties. Indeed, the semantic web is a network of semantic representations of
web-published information that relies on the same technical principles as the websites’
network, but allows programs to operate on data at this semantic level. Main semantic
web concepts are (i) web ontologies and (ii) linked (open) data; they provide a global
space of interoperability, thus they are important components for BIBLIMOS’ aims.</p>
      <p>
        Figure 1 illustrates the intended general architecture for the BIBLIMOS programme.
The novelties brought by the semantic web obviously start at the DATA level: to benefit
10 http://www.ancientwisdoms.ac.uk/
11 Data model standard: http://www.w3.org/RDF/
from these novelties, beyond all the work that has to be done to obtain results presented
in the previous section, digital sources should also be pushed up to the semantic level.
To this aim, the sources’ concepts and their relationships must be specified, from the
bottom-up (starting from the source contents), top-down (from already well defined
consensual ontologies), or both. The source’s content should be related to this
conceptual level, which may be done by using tools called Mapping Frameworks in Figure 1.
Some of those tools propose to export the source data into a set of RDF triples (the
standard data warehouse approach in data integration systems), and some of them propose
to access data through the conceptual level, based on the ontology-based data access
(OBDA) principles [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] (the mediation approach, which is provided by, e.g., ontop12).
Whatever the chosen approach, the source’s content is then searchable at the semantic
level, with SPARQL. Those contents may be combined using reference thesauri and
ontologies.
      </p>
      <p>WEB PORTALS</p>
      <p>Visualisations Navigations</p>
      <p>Preferences Interactive Analysis</p>
      <p>Collaborations Virtual Res. Inf.</p>
      <p>QUERIES</p>
      <p>Information Retrieval Entity Linking</p>
      <p>Data Mining Inferences</p>
      <p>Integration Mediation Aggregation</p>
      <p>
        Querying the semantic web through its linked data sets is still in its infancy.
Public well-established reference knowledge resources play the important role of hubs in
this linked data network. The most visible are resources of facts, e.g. DBpedia, but at
the conceptual level, reference domain ontologies also act as fundamental integration
means. This is the case for CIDOC CRM [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] for cultural heritage, with its extension
12 http://ontop.inf.unibz.it/
FRBRoo for libraries. These reference domain ontologies are the product of a long,
international collaborative work, reflecting a consensus among the domain experts. These
distributed and collaborative dimensions of the web are naturally inherited by the
semantic web. In the context of BIBLIMOS, this is extremely powerful because these
two features mirror the local structural organization of the Mauritanian family libraries,
open to communities but distributed in the country rather than centralized in only one
authoritative place.
      </p>
      <p>The semantic web resources also promote multilingualism, particularly in
vocabulary resources such as thesaurus, as evidenced by multilingual ones, e.g. VIAF13 or
RAMEAU14, the French national library thesaurus now accessible on the semantic web
(in SKOS), which is fully interlinked with a German (SWD) and an American (LCSH)
thesaurus (thanks to the Multilingual ACcess to Subjects project).</p>
      <p>
        Above the DATA layer is the LOGIC layer, in which all the well-known successful
inventions in the field of data operation (some of which are listed in Figure 1) may be
revisited to take into account the semantic dimension of data. A corner stone for most
of them is to access multiple sources conjointly, which supposes interoperability: one
of the solutions provided by the semantic web is to align the local lightweight
ontologies that describe the sources’ content to the reference ontologies, allowing mediator
systems to aggregate local data sets, for instance following the principles described
in [
        <xref ref-type="bibr" rid="ref12 ref2">12, 2</xref>
        ]. Very active researches are conducted in the semantic web community to
develop this LOGIC level, based on efforts to produce a strong semantic data layer. Lastly
comes the PRESENTATION layer, whose innovative potential is also greatly boosted
by the possibilities issued from the semantic web.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>We first drew a state of the art concerning the ways ancient Arabic manuscripts are
processed and made available to the public nowadays, considering that the picture is not so
different in the area of European archives (except that OCR tools are more usable). Once
digitized, sources must be pushed up to the semantic level, for the query, the analysis,
the combination and the intersection of these multiple funds to be supported by
automatic data-processing of sources. We presented the semantic-web Virtual Infrastructure
designed to cope with these challenges within the BIBLIMOS programme.</p>
      <p>
        We are aware that BIBLIMOS is a very ambitious programme - we are not aware
of the existence of a similar enterprise anywhere else - as semantic web applications
in this field are just beginning to emerge. For now, agreements are signed between our
universities (Tours and Nouakchott), both in the computer science side and the social
science side.AFD15 currently funds a training campaign for librarians of the IMRS16
on cataloguing documents, and the Mauritanian government is going to support all the
needed local actions. Concerning the semantic web level, we are building an ontology
13 Virtual International name Authority File: http://viaf.org/
14 http://data.bnf.fr/en/semanticweb
15 Agence Franc¸aise de De´veloppement: http://www.afd.fr/lang/en/home
16 Institut Mauritanien de recherches scientifiques: http://www.imrs.mr/spip.php?page=
sommaire_fr
for the IMRS’ manuscripts [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], a part of which is already digitized, and we plan to
work on designing and building an annotation tool based on this ontology. In order to
include the European side (archives on these countries), we are thinking about a MSC
Action (deadline in january, 2016). The campaign of partnerships with already existing
materials is still to be done, as we must first build the semantic web tools that we should
propose to them.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Abdel</given-names>
            <surname>Bela</surname>
          </string-name>
          <article-title>¨ıd and Nazih Ouwayed. Segmentation of ancient arabic documents. Guide to OCR for Arabic Scripts</article-title>
          , pages
          <fpage>2</fpage>
          -
          <lpage>16</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Beatrice</given-names>
            <surname>Bouchou</surname>
          </string-name>
          and
          <string-name>
            <given-names>Cheikh</given-names>
            <surname>Niang</surname>
          </string-name>
          .
          <article-title>Semantic mediator querying</article-title>
          .
          <source>In International Database Engineering and Applications Symposium (IDEAS)</source>
          , pages
          <fpage>29</fpage>
          -
          <lpage>38</lpage>
          . ACM,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>W.</given-names>
            <surname>Boussellaa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zahour</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>El Abed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Benabdelhafid</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Alimi</surname>
          </string-name>
          .
          <article-title>Unsupervised block covering analysis for text-line segmentation of arabic ancient handwritten document images</article-title>
          .
          <source>In 20th International Conference on Pattern Recognition (ICPR)</source>
          , pages
          <fpage>1929</fpage>
          -
          <lpage>1932</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Stefanie</given-names>
            <surname>Brinkmann</surname>
          </string-name>
          and Beate Wiesm u¨ller, editors.
          <source>From Codicology to Technology: Islamic Manuscripts and Their Place in Scholarship. Frank and Timme GmbH</source>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Antonella Poggi, Mariano Rodriguez-Muro,
          <article-title>Riccardo Rosati, Marco Ruzzi, and Domenico Fabio Savo. The MASTRO system for ontology-based data access</article-title>
          .
          <source>Semantic Web</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          ):
          <fpage>43</fpage>
          -
          <lpage>53</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>M.</given-names>
            <surname>Coustaty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pareti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Vincent</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.M.</given-names>
            <surname>Ogier</surname>
          </string-name>
          .
          <article-title>Towards historical document indexing: extraction of drop cap letters</article-title>
          .
          <source>IJDAR</source>
          ,
          <volume>14</volume>
          (
          <issue>3</issue>
          ):
          <fpage>243</fpage>
          -
          <lpage>254</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>B.</given-names>
            <surname>Cou</surname>
          </string-name>
          ¨asnonet, J. Camillerapp,
          <string-name>
            <surname>and I. Leplumey.</surname>
          </string-name>
          <article-title>Access by content to handwritten archive documents: Generic document recognition method and platform for annotations</article-title>
          .
          <source>IJDAR</source>
          ,
          <volume>9</volume>
          (
          <issue>2</issue>
          ):
          <fpage>223</fpage>
          -
          <lpage>242</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Mohamed</given-names>
            <surname>Lamine</surname>
          </string-name>
          <article-title>Diakite´ and Beatrice Bouchou Markhoff</article-title>
          . OMOS:
          <article-title>Ontology for Western Saharan Manuscripts</article-title>
          .
          <source>Technical Report 313</source>
          , Universite´ Franc¸ois Rabelais Tours,
          <article-title>Laboratoire d'Informatique (available in HAL: https://hal.archives-ouvertes</article-title>
          .
          <source>fr/hal-01134010)</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Martin</given-names>
            <surname>Doerr</surname>
          </string-name>
          and Patrick Le Boeuf.
          <article-title>Modelling intellectual processes: The frbr - crm harmonization</article-title>
          .
          <source>In Digital Libraries: Research and Development</source>
          , volume
          <volume>4877</volume>
          of Lecture Notes in Computer Science, pages
          <fpage>114</fpage>
          -
          <lpage>123</lpage>
          . Springer, Berlin / Heidelberg,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>A.</given-names>
            <surname>Jordanous</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. F.</given-names>
            <surname>Lawrence</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hedges</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Tupman</surname>
          </string-name>
          .
          <article-title>Exploring manuscripts: Sharing ancient wisdoms across the semantic web</article-title>
          .
          <source>In 2nd International Conference on Web Intelligence, Mining and Semantics (WIMS)</source>
          , pages
          <fpage>678</fpage>
          -
          <lpage>683</lpage>
          . ACM, New York,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>A.</given-names>
            <surname>Khemiri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kacem</surname>
          </string-name>
          , and
          <article-title>Belaid A. Towards arabic handwritten word recognition via probabilistic graphical models</article-title>
          .
          <source>In Frontiers in Handwriting Recognition (ICFHR)</source>
          , pages
          <fpage>678</fpage>
          -
          <lpage>683</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Cheikh</surname>
            <given-names>Niang</given-names>
          </string-name>
          , Be´atrice Bouchou, Yacine Sam, and
          <string-name>
            <given-names>Moussa</given-names>
            <surname>Lo</surname>
          </string-name>
          .
          <article-title>A Semi-Automatic approach For Global-Schema Construction in Data Integration Systems</article-title>
          . IJARAS,
          <volume>4</volume>
          (
          <issue>2</issue>
          ):
          <fpage>35</fpage>
          -
          <lpage>53</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Dominic</given-names>
            <surname>Oldman. The CIDOC Conceptual Reference</surname>
          </string-name>
          <article-title>Model (CIDOC-CRM): A Primer, Version 1</article-title>
          . CIDOC CRM (http://www.cidoc-crm.
          <source>org/docs/CRMPrimer v1.1.pdf)</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>Desmond</given-names>
            <surname>Schmidt</surname>
          </string-name>
          .
          <article-title>Towards an interoperable digital scholarly edition</article-title>
          .
          <source>Journal of the Text Encoding Initiative</source>
          [http://jtei.revues.org/979], 7,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>M. O. Soulah</surname>
            and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Hassoun</surname>
          </string-name>
          .
          <article-title>Which metadata for ancient arabic manuscripts cataloguing?</article-title>
          <source>In International Conference on Dublin Core and Metadata Applications</source>
          , The Hague, Netherlands,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>L.</given-names>
            <surname>Werner</surname>
          </string-name>
          .
          <article-title>Mauritania's manuscripts</article-title>
          . Saudi Aramco World,
          <volume>54</volume>
          (
          <issue>6</issue>
          ):
          <fpage>2</fpage>
          -
          <lpage>16</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>