=Paper= {{Paper |id=Vol-2022/paper49 |storemode=property |title= Digital Mathematical Libraries: Overview of Implementations and Content Management Services |pdfUrl=https://ceur-ws.org/Vol-2022/paper49.pdf |volume=Vol-2022 |authors=Alexander M. Elizarov,Evgeny K. Lipachev,Denis Zuev |dblpUrl=https://dblp.org/rec/conf/rcdl/ElizarovLZ17 }} == Digital Mathematical Libraries: Overview of Implementations and Content Management Services == https://ceur-ws.org/Vol-2022/paper49.pdf
             Digital Mathematical Libraries: Overview
       of Implementations and Content Management Services
                   © A.M. Elizarov                     © E.K. Lipachev                         © D.S. Zuev
                                         Volga Region Federal University,
                                                 Kazan, Russia
            amelizarov@gmail.com                  elipachev@gmail.com                      dzuev11@gmail.com
            Abstract. The paper gives a review of existing projects of implementation of digital mathematical
     libraries. An analysis of existing information systems of digital mathematical libraries is performed using the
     evaluation criteria embedded in the DELOS DLRM model, emphasis is placed to the methods of managing
     mathematical content on the basis of semantic technologies. All projects are in different degrees of
     completeness, the range of services provided is different. We found that most of digital mathematical libraries
     are concentrated on the transfer of the resources to the electronic form and their preservation, rather than on
     the development of semantic services.
            Keywords: digital publishing, library automation, machine-actionable digital library, digital
     mathematics library, DML, WDML.


 1 Introduction                                                       representations of mathematical knowledge; presentation
                                                                      formats; authoring languages and tools; creating
     The Digital Era has changed crucially as the methods             repositories    of     formalized       mathematics,    and
 of research, and the ways in which scientists search,                mathematical digital libraries; mathematical search and
 produce, publish, and disseminate their scientific work.             retrieval; implementing math assistants, tutoring and
 A digital library, a collection of information which is              assessment systems; developing collaboration tools for
 both digitized and organized, gives us power we never                mathematics; creating new tools for detecting re-
 had with traditional libraries. Information and                      purposing material, including plagiarism of others' work
 communication technologies are actively implemented in               and self-plagiarism; creation of interactive documents;
 research and development. Therefore, it became possible              developing deduction systems. The solution of this task
 to use the entire volume of accumulated scientific                   requires formalization of mathematical statements and
 knowledge in conducting new research. This requires                  proofs [9].
 creation of complex of technologies that ensure                          At present, research activities in the field of
 management of available knowledge, the organization                  mathematics are associated with the use of modern
 has effective access to this knowledge, as well as sharing           information technology (cloud, semantic, etc.). These
 and multiple use of new kinds of knowledge structures.               technologies are used in research of distributed scientific
 In mathematics also accumulated considerable                         teams, preparation and dissemination of mathematical
 experience in using of electronic mathematical content               knowledge in an electronic form. At present, a new type
 within the various projects on creation of mathematical              of digital library is being formed, connected with the
 digital libraries (see, e. g., [1]).                                 integration of mathematical knowledge into the scientific
     Since inception of the first scientific information              information space, see. [1,10,11]. This type of
 systems, mathematicians have been involved in the full               information system is called Digital Mathematical
 cycle of software product development, from idea to                  Library (DML), a number of global projects are
 implementation. Well-known examples are an open                      implemented, such as European Digital Mathematical
 source system TEX and commercial systems Wolfram                     Library or World Digital Mathematical Library [12–14].
 Mathematica and Wolfram Alpha, led by Stephen                        More details about goals, functions and current results
 Wolfram according to his principles of computational                 are listed below, in Section 3.
 knowledge theory [2, 3]. Tools for mathematical content                  Implementation of digital mathematical libraries
 management are developed with the help of communities                involves the development of special tools and continuous
 of mathematicians, e.g. MathJax by American                          improvement of their functionality. An example is the
 Mathematical Society, information system Math-Net.Ru                 Open Journal Systems (OJS, https://pkp.sfu.ca/ojs/). The
 is developed at the Steklov Mathematical Institute of the            platform is used in many projects, particularly in
 Russian Academy of Sciences [4] and the collection of                Lobachevskii          Journal         of        Mathematics
 publicly available preprints arXiv.org (https://arxiv.org/).         (http://ljm.kpfu.ru/), one of the first digital mathematical
     Main challenges of mathematical knowledge                        journals [15].
 management (MKM) are discussed in [5–9], the most                        In our work, we try to look more deeply into world
 urgent tasks are outlined. Such tasks are: modeling                  largest DML to outline current status of described
                                                                      projects and to investigate services and functions that
Proceedings of the XIX International Conference                       provide these digital mathematical libraries.
“Data Analytics and Management in Data Intensive
Domains” (DAMDID/RCDL’2017), Moscow, Russia,
October 10–13, 2017

                                                                317
2 Mathematical Libraries and DELOS                                   approach to information objects organization lies in the
Digital Library Reference Model                                      ideology of WDML. We use the same approach in
                                                                     creating a digital mathematical library Lobachevskii
2.1 Criteria for investigation                                       DML, which is based on mathematical collections of the
                                                                     Kazan Federal University [18].
    Firstly, we need to establish common criteria and
                                                                         Usually digital library consists of collections, and
main features and functions that we will look at.
                                                                     collections in turn from documents or information
    In DELOS Digital Library Reference Model [16, 17]
three basic concepts are distinguished for defining what             resources (objects). In 1990–2000 there was a large
                                                                     number of studies carried out on the definition,
is called a digital library (DL):
                                                                     architectural and technical aspects of DL systems.
    • DL – a (potentially virtual) organization that
                                                                     Finally, it is necessary to mention the creation of the DL
comprehensively collects, manages, and preserves for the
                                                                     manifesto in the DELOS project, which resulted in the
long term rich digital content and offers to its user
communities specialized functionality on that content, of            creation of a reference model for DL [16, 17].With the
measurable quality, and according to prescribed policies;            development of Semantic Web technologies, it became
                                                                     interesting to investigate the semantics of resources and
    • DL system – a software system that is based on an
                                                                     their links placed in libraries, see, for example, [18]. In
architecture and provides all functionality that is required
by a particular Digital Library. Users interact with a               this case, an information object can already be considered
Digital Library through the corresponding DL system;                 not only as a document, but as its certain parts – abstract,
    • DL management system (DLMS) – a generic                        keywords, bibliography, citations, comments of authors
                                                                     or readers.
software system that provides the appropriate software
                                                                         From the end user's point of view, DL must satisfy the
infrastructure to both produce a basic DL system that
                                                                     user’s expectations. The document itself as an elementary
incorporates all functionality that is considered
                                                                     information object may not be interesting at all. It is much
foundational for Digital Libraries and integrate additional
                                                                     in demand to search for information on a particular entity
software offering more refined, specialized, or advanced
functionality. An intrinsic part of DLMS functionality is            or subject mentioned in the document. At the same time,
related to administrative services that are used to choose           much more interesting to find all possible resources
                                                                     where different versions of mentioned subjects,
the appropriate subset of its functionality, e.g., through
                                                                     especially in cases when various interpretations and
relevant parameters of its components, and then install,
                                                                     definitions are possible. For example, there are a number
deploy, and (re)configure a DL system.
                                                                     of definitions of the concept of “digital library” and the
    A DLMSis “system software”. As in several other
                                                                     user studying this topic will certainly be interested in all
domains (e.g., operating systems, databases, user
                                                                     references to the definition of the digital library from
interfaces), such kernel software may be used as a
                                                                     different sources. Thus, we observe a change in the
foundation to produce Digital Library systems.
                                                                     elementary information object. The electronic document
    While the concept of DL is intended to capture an
                                                                     fragmented into smaller information objects and all
abstract system that consists of both physical and virtual
components, the remaining two capture concrete                       services of a library deal with such objects and manage
software systems. For every DL, there is a unique DL                 the relationships between them. In mathematics, such
                                                                     elementary objects can be, for example, theorems,
system in operation (possibly consisting of many
                                                                     lemmas, definitions or formulas, research of which is
interconnected smaller DL systems in the most general
case), where as all DL systems are based on a handful of             much more informative on a number of sources. The
DLMSs.                                                               services of any DML should provide such an opportunity.
                                                                     All this functionality lies in WDML architecture. Its
    In the role-based aspect, the DELOS DLRM model
                                                                     implementation became possible only with the
consider following types of users: the end user of the DL;
                                                                     development of semantic technologies and the transfer of
the developer of DL; the system administrator of DL and
                                                                     library content into digital form with metadata. Now,
the developer of applications for DL and, four levels of
                                                                     there is no technical problems in maintaining such
user views and expectations are formed. In addition, the
                                                                     approach to the organization of DL.
model identifies six key areas, each of which introduces
and defines its own entities and their properties:                       During our research, we will take into account this
architecture, information space, functionality, users,               transformation of the approach to the organization of
policy and quality of services provided. These areas can             information objects. Note that the change in the approach
be considered as evaluation criteria and, by virtue of their         to the organization of DL does not affect the selected
                                                                     criteria for investigation.
universality, can be used to analyze almost any
information system.                                                  3 Functionality of Digital Mathematical
    We will carry out an analysis of existing digital
                                                                     Libraries
mathematical libraries, performed using the evaluation
criteria embedded in the DELOS DLRM model.                               Below is a brief review of existing digital
                                                                     mathematical libraries. The largest projects are “All-
2.2 Differences between approaches
                                                                     Russian Mathematical Portal Math-Net.RU”,”Centre de
   It is interesting to stop at the discussion at the                diffusion de revues académiques mathématiques”,
approaches of the definition of elementary objects with              “Czech Digital Mathematics Library”, “The Polish
which digital library works. In particular, an interesting           Digital    Mathematics       Library”,      “Göttinger




                                                               318
Digitalisierungs Zentrum”, “Numérisation de doc-                        documentation. This DML is not so large – contains 9
uments anciens mathématiques”, Zentralblatt MATH,                       French math journals, 1 book and 7 proceedings of
“Bulgarian Digital Mathematics Library” and “The                        seminars and conferences.
European Digital Mathematical Library”. It should be                        The CEDRAM websites offer two ways of consulting
outlined, that all projects are in different degrees of                 the hosted articles: quick and advanced search. Search
completeness, the range of services provided is also                    functions provide search by keywords, author, title,
different.                                                              bibliography and full text search. Quick search searches
                                                                        in all fields except full text. Advanced search interface
3.1 Math-Net.ru
                                                                        offers several types of research, more or less
     All-Russian Mathematical Portal Math-Net.ru [4, 20–                complicated. The full entry of articles produced for
22] combines both a digital mathematical library and a                  CEDRAM contains abstracts and bibliographical
publishing system for mathematical texts. It is a web                   references.
portal developed by the V. A. Steklov Institute of                          All online records exist in two formats, which are
Mathematics, Russian Academy of Sciences.                               only different by the way they display mathematical
     The key component of the portal – the “Journals”                   formulas in titles, abstracts, keywords or references:
section links Russian periodicals in the field of                       MathML or TeX and have stable url link.
mathematical sciences to a single information system.                       XHTML+MathML display is best for reading and
Currently contains more than 120 journals with nearly                   browsing, but there are some problems with viewing in
200 thousand publications. Information about the article                browsers, that need to be pre-configured to work
includes a bibliographic description, an annotation, lists              correctly with MathML. The HTML+TeX version used
of literature and a file with the full text of the article. The         for compatibility for users who do not have an
portal presented in two languages – Russian and English.                environment capable of displaying MathML. Now
     The most interesting part is the functionality of the              CEDRAM provide following services [12, 13, 23]:
portal. The portal provides the ability to search for                    • production workflow of journals;
publications and links on the bibliographic description                 •    dedicated web site for each journal;
and keywords in the title, annotation or text. As result of
the search, an abstract, article IDs (DOI, resource                     •    provides creation and maintenance of LATEX
references in abstract databases, URIs), a citation pattern,                 styles (using a specific class);
classifier values are issued. There are no recommender                  •    production of PDFs for print and web with
service, in fact all semantic services work with a                           XML/MathML metadata;
bibliographic description of the resource. MiRef module                 •     DOI registration (Crossref), reference linking
is used to form correctly the description and links to                        (MSN, ZBM, mini-DML, Crossref);
resources. The module is designed to automatically place
links to various publications databases in the literature                • provides publishing platform for mathematical
list. The format of the links must satisfy the rules of the                   articles based on Open Journal System (Public
amsbib package and should be entered in the LaTeX                             Knowledge Project, https://pkp.sfu.ca/ojs/);
format.                                                                  • all resources archived in partner project - the French
     Registered users can create personal pages, manage                       digital math library NUMDAM.
personal collections of publications, authors get access to                 Policy and quality of services. Starting 2017 all
the full texts of their articles, authors can send the                  CEDRAM journals are open access. Access to the
manuscript to the editorial office of the journal                       database containing the bibliographical references of all
electronically, and track the process of its workflow in                the articles of all participating journals is totally free. The
the editorial office.                                                   database itself is the property of Cellule Mathdoc, and
     Statistics on popular authors and resources are                    contains elements covered by copyright. CEDRAM has
maintained, infometric indicators for resources located                 OAI-PMH server, which can be used for systematic
on the portal are calculated.                                           download of metadata in various schemas. Files of the
     The policy for accessing the full texts of articles is             full texts are the property of the journals and it is
determined by the publisher of the paper. Access for any                necessary to refer to the policy of each of them. Also
other information is free.                                              there are some restrictions of full copying and indexing
3.2 CEDRAM                                                              by web robots.

    The center for diffusion of academic mathematical                   3.3  Numerisation  de  Documents                     Anciens
journals (CEDRAM, Centre de Diffusion de Revues                         Mathematiques (NUMDAM)
Académiques Mathématiques) is a web portal for                              The French digital math library NUMDAM [12–14,
common access to a set of mathematical journals [23],                   24] started as a digitisation program for a pilot of 6
available in French and English. CEDRAM’s mission is                    journals. Now it contains more than 57000 articles in 76
to provide a large distribution of their current volumes,               periodicals, 373 books in 4 collections, 263 theses.
and range from help for producing journals according to                     The NUMDAM is the reference French digital
the best standards for electronic publishing to long lasting            mathematics library set up by Cellule MathDoc with the
archiving. CEDRAM is a service of the Cellule MathDoc                   assistance of a network of partners.
(UMS 5638 of CNRS and Université Joseph Fourier)                            From 2007 onwards, publishers send digital born
which completes its important offer in mathematical                     articles into DML. Collections are normally indexed



                                                                  319
within one year of publication, and full texts are freely           validation service get all metadata including abstracts,
downloadable at the end of a period of time set by                  keywords and references transformed into representation
agreement upon each title.                                          using MathML [27].
    The NUMDAM program is designed to support                           End-users cannot submit any resource, everything can
academic publishers and provide the research community              be submitted only through editorial board of journals,
with a sustainable, reliable and easy-to-use library. The           also there is no any personal area for users.
research and dissemination platform was completely                      Search and navigation. As others DMLs DML-CZ
redesigned in 2016. Now portal is available on two                  allows to search by title, author of publications. Also
languages –English and French, formulas can displayed               avaliable search by language or by Zentrablatt MATH
in TeX or in graphical form using MathJax.                          and MathSciNet identifiers. Browse functions provide
    System provide following functions: search and                  navigation through sorted list of resources (authors,
navigation by title, author, references or in full text of          journals etc.).
resources. During search all statistics, related to the                 The most interesting function is search of related
search topic is displayed – co-authors, journals and years          articles (finding similarities between papers). This
of publication. Browse functions provide navigation                 service tries to find similar papers using three methods:
through sorted list of resources (authors, journals etc.).          “Term frequency–Inverse document frequency” (TF-
    Full texts available in PDF and DJVU formats. Each              IDF, see, e. g. [28]), the “Random Projections” or method
article in NUMDAM is available via a stable URL. This               that is built on TF-IDF and simplifies the computations
URL is a compact address, designed to remain valid in               by projecting vectors onto a subspace of lower
the long term. It is displayed in the web page of the               dimensionality [28] and with using “Latent Semantic
article, on the first page of PDF or DjVu files and by the          Indexing” (LSI, [29]). Last method gives the most
OAI-PMH server.                                                     accurate results up to 90%.
    There is no any user registration. All functions have               Policies and quality of service. The digitized journal
open access. NUMDAM only disseminate resources that                 and proceedings papers are displayed with the agreement
already published in journals, books or theses but                  of the publisher who owns the digital data. The digitized
submission process of resources is not clear. Metadata              monographs are displayed with the agreement of the
extraction made only for bibliography. Any additional               author and/or the publisher while the digital data are
services like formula search or recommender system are              property of the Institute of Mathematics CAS. The
absent.                                                             database itself, in particular the bibliographic data, are
    The full text of most recent articles is generally not          property of the Institute of Mathematics CAS.DML-CZ
available. The journals whose archives are on this portal           presents full texts articles and book chapters in PDF
have accepted the principle of “a moving wall”. This is a           format, equipped with enhanced metadata including
time interval between the publication of a volume (in               bibliographical references linked to Zentrablatt MATH
paper or electronic form, delivered to subscribers) and             and MathSciNet. The digital born documents are being
the availability of the full text on the NUMDAM server.             obtained from the original sources provided by
Generally, moving-wall for most of journal in                       publishers. The presented page content and format
NUMDAM is equal to 5 years.                                         corresponds to the original one. Journals are presented
                                                                    and accessed according to the terms of a contract with the
3.4 The Czech Digital Mathematics Library (DML-
                                                                    publisher. The digital documents displayed in the DML-
CZ)
                                                                    CZ are authorized with electronic stamps.
    The Czech Digital Mathematics Library (DML-CZ)
                                                                    3.5 The Polish Digital Mathematical Library
[25, 26] has been developed in order to preserve in a
digital form the content of major part of mathematical                  The Polish Digital Mathematical Library (DML-PL,
literature that has ever been published in the Czech lands,         [30]) has existed since 2002. The library holds full texts
and to provide a free access to the digital content and             of polish mathematical journals and books. The major
bibliographical data. DML-CZ resulted from the project              part of the collection are archive issues of mathematical
no. 1ET200190513 supported by the Czech Academy of                  journals published before World War II. Library consists
Sciences (CAS) in the R&D programme Information                     of 550 books and 36 journals, but only 3 journals provide
Society, and operated by the Institute of Mathematics               access to full text of articles. Portal of DML-PL provide
CAS. Project seems to be finished in 2010 and now is in             search by attributes and navigation through sorted lists of
stable form.                                                        authors, books and journals.
    Functionality. Editors of all journals included in                  Brief explanation of the project is given in [31], but
DML-CZ are using tools and work flows that have been                nowadays it seems that project is already finished. On the
tailored to their individual publishing practice and that           web portal of library there is no additional information
enable them to produce inputs for DML-CZ in a                       about current status. Any information about semantic
semiautomatic way. The formal consistency and integrity             functions or metadata extraction from resources is
of the data are controlled by several validating                    missing.
procedures that have been developed in the project.
                                                                    3.6 GDZ–Gottingen Digitization Centre
    There are some automated procedures for validation
of data of new journal issues but all of them are archived              The task of the GDZ [32, 33] is to record data such as
in DML-CZ for internal use and development. Based on                prints, manuscripts and illustrations and to preserve
limiting the name space of allowed TEX macros,                      them. Main aim of the project is conversion of resources



                                                              320
into digital form. This is multidisciplinary library, that         including text, images, moving images, mpegs and data
contains not only mathematical collections but also                sets. All functionality of DSpace software is clear and we
history of Law, history of the Humanities and the                  will not describe it in this paper. For example, additional
Sciences, travel and North American literature and other           information about DSpace can be found in [17, 37].
collections. Mathematical collections have about 7000
                                                                   3.9 European Digital Mathematics Library
resources and also have some Russian resources. Library
contains more than 15 million digitized pages.                         The European Digital Library (EuDML) was a project
    Portal provides search in metadata and full text of            partly funded by the European Commission. EuDML
resources and browse functions. Many resources are                 [12–14, 38, 39] is an aggregation and indexing services
historical, not modern, main aim of the project is to              with was established under The EuDML Initiative and
digitize and preserve resources. All resources have full           promoted by European Mathematical Society. EuDML
texts and can be viewed page by page or in structured              assemble as much as possible of the digital mathematical
mode. Metadata of any resource contain stable URL of               corpus in order to make it available online, with eventual
resource, metadata can be downloaded in METS format.               open access, in the form of an authoritative and enduring
                                                                   digital collection, growing continuously with publisher
3.7 Zentralblatt MATH
                                                                   supplied new content, augmented with sophisticated
    Zentralblatt MATH (zbMATH, [34]) is abstracting                search interfaces and interoperability services, developed
and reviewing service in pure and applied mathematics.             and curated by a network of institutions.
It is hosted by the Berlin office ofFIZ Karlsruhe                      The system, presented in the diagram in Figure 1,
– Leibniz Institute for Information Infrastructure GmbH            conceptually consists of a metadata repository, a search
(FIZ Karlsruhe) and distributed by Springer. The                   engine, a metadata enhancer, an association analyser,
zbMATH database contains more than 3.5 million                     annotation and accessibility functions and of course the
bibliographic entries with reviews or abstracts currently          interfaces [38].
drawn from more than 3,000 journals and serials, and
170000 books. zbMATH is not a digital library itself, it
is an indexing service and provides easy access to
bibliographic data, reviews and abstracts from all areas
of pure mathematics as well as applications, in particular
to the natural sciences, computer science, economics and
engineering.
    Search functions provide search for documents,
authors and journals. Search can be done in one line, or
in structured form using attributes such as title, author,
subject, source, keywords etc. Service also provide full-
text formula search for indexed arXiv documents
[35].The zbMATH formula search uses the
MathWebSearchsystem, which is a content-based search               Figure 1 EuDML architecture
engine for MathML formula based on substitution tree
                                                                       The metadata repository provides the central point of
indexing.
                                                                   reference for all the managed contents. It will work with
    Portal offer three ways of displaying mathematical             an OAI-PMH harvester to ingest repositories’ content
formulas – MathML, MathJax and LaTeX. The XML-                     descriptions, maps the metadata into the internal EuDML
based MathML is the solution recommended by W3C for                schema. The performance and the quality of responses of
displaying mathematical content on the web and is set as
                                                                   the search service directly influence user experience.
default within zbMATH. Mathematical Reviews and
                                                                   Therefore, particularly this service has to be reliable,
zbMATH maintain the Mathematics Subject                            scalable and customized to fulfill user expectations.
Classification (MSC), a classification scheme for                      The metadata enhancer function consist in a
mathematics. It is used by reviewing services to                   collection of tools that each contribute to expand or
categorize items in the mathematical sciences literature.          complete the existing items’ metadata, depending on the
The database of service contains about 2.1 million direct          improvements needed. These range from applying OCR
links to electronic versions of the indexed publications,          over full texts, adding key words or multilingual
to the publishers’ websites and/or to electronic libraries         metadata by merging information from different
with open access to the full texts.                                databases when an item happens to have such non-
3.8 Bulgarian Digital Mathematics Library                          redundant description, generating MathML for
                                                                   mathematical expressions, etc. The association analyzer
    Bulgarian Digital Mathematics Library, BulDML is a             detects, analyses and records relations between
digital repository at Institute of Mathematics and                 individual items. The annotation component provides
Informatics of Bulgarian Academy of Sciences. Library              mechanisms to attach new material to individual items in
has 7 mathematical journals, 4 book series and                     the repositories and maintain this new material. The
proceedings in its repository. In fact, BulDML is an               accessibility component provides support for enhanced
institutional repository and is built on open-source               accessibility of items, if required, before presentation to
DSpace software [36]. As known, DSpace preserves and               end users. Finally, the user and system interfaces provide
enables open access to all types of digital content



                                                             321
access to the collected resources on different levels both            [3] Wolfram, S.: An elementary introduction to the
to human and machine users. Now EuDML offers several                      Wolfram Language. Wolfram Media, Inc. (2015)
service interfaces that allow other applications to connect           [4] Chebukov, D.E., Izaak, A.D., Misyurina, O.G.,
with the service. These are OAI-PMH server, REST                          Pupyrev, Yu.A., Zhizhchenko, A.B.: Math-Net.Ru
services, OpenSearch service, which allow to query                        as a Digital Archive of the Russian Mathematical
library index in machine way and annotation retrieval                     Knowledge from the XIX Century to Today.
services in JSON.                                                         Intelligent Computer Mathematics, Lecture Notes
    EuDML aims to be an open source of trusted                            in Comput. Sci., 7961, pp. 344-348, Springer
mathematical knowledge. That is why it has some                           (2013), doi: 10.1007/978-3-642-39320-4_26
policies:                                                             [5] Carette, J., Farmer, W.M.: A Review of
• All texts must have been scientifically validated and                   Mathematical Knowledge Management. In
    formally published;                                                   Intelligent Computer Mathematics. Lecture Notes
• All items must be open access after a finite embargo                    in Computer Science, 5625. pp. 233-246 (2009)
  period. Once documents contributed to the library are               [6] Ion, P.D.F.: Mathematics and the World Wide Web.
  made open access due to this policy, they cannot                        In Intelligent Computer Mathematics. Lecture
  revert to close access later on;                                        Notes in Computer Science, 7961, pp. 230-245
• The digital full text of each item contributed to library               (2013)
  must be archived physically at one of the EuDML                     [7] Lange, C.: Enabling Collaboration on Semiformal
  member institutions.                                                    Mathematical Knowledge by Semantic Web
  All DMLs, described above except All-Russian                            Integration. Ph. D. Thesis, Jacobs University
Mathematical Portal Math-Net.RU are partners of                           Bremen (2011)
EuDML.                                                                [8] Elizarov, A.M., Lipachev, E.K., Nevzorova, O.A.,
                                                                          Solov’ev, V.D.: Methods and Means for Semantic
4 Conclusion                                                              Structuring of Electronic Mathematical Documents.
    In order to outline all differences of observed projects              Doklady Mathematics, 90 (1), pp. 521-524 (2014),
we created comparison Table 1 listed below. Note that,                    doi: 10.1134/S1064562414050275
we excluded from table two DMLs due to following.                     [9] Elizarov, A., Kirillovich, A., Lipachev, E.,
BulDML is and built on open-source DSpace software,                       Nevzorova, O., Solovyev, V., and Zhiltsov N.:
so all functionality of it is clear, for DML-PL we could                  Mathematical         Knowledge        Representation:
not find any working portal in order to study it more                     Semantic Models and Formalisms. Lobachevskii J.
deeply.                                                                   of Mathematics, 35 (4), pp. 347-353 (2014),
    In all the projects studied, emphasis is placed on the                doi:10.1134/S1995080214040143
transfer of the resources themselves to the electronic               [10] Elizarov A., Kirillovich A., Lipachev E.,
form, rather than on the development of semantic                          Nevzorova O. (2017) Digital Ecosystem OntoMath:
services. Only a few portals have a mathematical formula                  Mathematical        Knowledge       Analytics    and
search, and only one has a recommender service.                           Management. In: Kalinichenko L., Kuznetsov S.,
                                                                          Manolopoulos Y. (eds) Data Analytics and
    After the analysis done it is clear that there are only
                                                                          Management in Data Intensive Domains.
two types of repository systems: the first is actually
                                                                          DAMDID/RCDL 2016. Communications in
DML, which preserve the resources themselves, the
                                                                          Computer and Information Science, 706, pp. 33-46
second is indexing and aggregating services that do not
                                                                          (2017), doi: 10.1007/978-3-319-57135-5_3
have their own database of electronic documents, but
provide a wide range of convenient search capabilities.              [11] Elizarov, A.M., Kirilovich, A.V., Lipachev, E.K.,
                                                                          Nevzorova, O.A.: Mathematical Knowledge
    This work was funded by the subsidy allocated to                      Management: Ontological Models and Digital
Kazan Federal University for the state assignment in the                  Technology. CEUR Workshop Proceedings, 1752,
sphere of scientific activities, grant agreement no.                      pp.     44-50      (2016),    http://ceur-ws.org/Vol-
1.2368.2017) and with partial financial support of the                    1752/paper08.pdf
Russian Foundation for Basic Research and the
                                                                     [12] Bouche, T.: Towards a World Digital Library:
Government of the Republic of Tatarstan, within the
framework of scientific projects Nos. 15-07-08522, 15-                    Mathdoc, Numdam and EuDML Experiences.
47-02472.                                                                 UMI, La Sapienza, Roma (2016), http://
                                                                          www.mat.uniroma1.it/sites/default/import-
References                                                                files/biblioteca/SEMINARIO2016/bouche.pdf
                                                                     [13] Bouche, T.: Digital Mathematics Libraries: The
[1] Borwein, J.M., Rocha, E.M., Rodrigues, J.F.
                                                                          good, the bad, the ugly. Mathematics in Computer
    Communicating Mathematics in the Digital Era, pp.
                                                                          Science,      (3),    pp. 227-241      (2010),   doi:
    3-21. A K Peters, Ltd. MKM-IG. Mathematical
                                                                          10.1007/s11786-010-0029-2
    Knowledge      Management     (2008).     http://
    www.mkm-ig.org/                                                  [14] Bouche, T.: Reviving the Free Public Scientific
                                                                          Library in the Digital Age? The EuDML Project. In:
[2] Wolfram, S.: A New Kind of Science. Wolfram
                                                                          Kaiser, K., Krantz, S., Wegner, B. (eds.): Topics
    Media, Inc. (2002)
                                                                          and Issues in Electronic Publishing, JMM, Special



                                                               322
     Session, San Diego, January 2013, pp. 57-80                    [24] NUMDAM. www.numdam.org
     (2013),                          http://www.emis.de/           [25] The Czech Digital Mathematics Library (DML-
     proceedings/TIEP2013/05bouche.pdf                                   CZ), http://www.dml.cz/
[15] Elizarov, A.M., Zuev, D.S., Lipachev, E.K.:                    [26] The Czech Digital Mathematics Library. Project
     Mathematical Content Semantic Markup Methods                        Funded by the Academy of Sciences of the Czech
     and Open Scientific E-Journals Management                           Republic, 2005–2009. http://project.dml. cz
     Systems. In: Klinov, P., Mouromtsev, D. (eds.)                 [27] Rákosník, J.: Recent Development of the DML-CZ
     KESW 2014. CCIS, 468, pp. 242-251 (2014), doi:                      and Its Current State. In Proc. of DML 2011:
     10.1007/978-3-319-11716-4 22 29                                     Towards a Digital Mathematics Library. Bertinoro,
[16] Candela, L., Athanasopoulos, G., Castelli, D., El                   Italy, July 20–21st (2011)
     Raheb, K., Innocenti, P., Ioannidis, Y., Katifori, A.,         [28] Rajaraman, A.; Ullman, J. D.: Data Mining (2011).
     Nika, A., Vullo, G., Ross, S.: The Digital Library                  doi:10.1017/CBO9781139058452.002
     Reference Model. FP7-ICT-2007-3. Cultural
                                                                    [29] Deerwester, S., Dumais, S., Landauer, T., Furnas,
     Heritage and Technology Enhanced Learning
                                                                         G., Beck, L.: Improving Information Retrieval with
     (2011)
                                                                         Latent Semantic Indexing. Proc. of the 51st Annual
[17] Candela, L., Castelli, D., Fuhr, N., Ioannidis, Y.,                 Meeting of the American Society for Information
     Klas, C.-P., Pagano, P., Ross, S., Saidis, C., Schek,               Science, 25, pp. 36-40 (1988)
     H.-J., Schuldt, H., Springmann, M.: Current Digital
                                                                    [30] The Polish Digital Mathematics Library,
     Library Systems: User Requirements vs Provided
                                                                         http://pldml.icm.edu.pl/
     Functionality. IST-2002-2.3.1.12. Technology-
     enhanced Learning and Access to Cultural Heritage              [31] Zamlynska, K., Tarkowski, A., Rosiek, T.:
     (2006)                                                              Evolution of the Mathematical Collection of the
                                                                         Polish Virtual Library of Science. Mathematics in
[18] Elizarov, A.M., Lipachev, E.K.: Lobachevskii
                                                                         computer Science, (3), pp. 265-278 (2010), doi:
     DML: Towards a Semantic Digital Mathema-tical
                                                                         10.1007/s11786-010-0029-2
     Library of Kazan University, 2017 (in press),
     DAMDID-2017 proceedings                                        [32] Gottingen Digitalisierungs Zentrum. http://gdz.
                                                                         sub.uni-goettingen.de/gdz/
[19] Kogalovskiy, M.R., Parinov, S.I.: Klassifikatsiya i
     ispol'zovaniye semanticheskikh svyazey mezh-du                 [33] Gottingen digitization Centre https://www.sub. uni-
     informatsionnymi ob’yektami v nauchnykh                             goettingen.de/en/copying-digitising/ goettingen-
     elektronnykh bibliotekakh. Inform. i yee primen., 3                 digitisation-centre/
     (6), pp. 32-42 (2012)                                          [34] Zentralblatt MATH. https://zbmath.org/
[20] All-Russian Mathematical Portal Math-Net.Ru.                   [35] Muller, F., Teschke, O.: Full Text Formula Search
     http://www.mathnet.ru/                                              in zbMATH, EMS Newsletter (2016)
[21] Zhizhchenko, A.B., Izaak, A.D.: The Information                [36] Bulgarian Digital Mathematics Library. http://sci-
     System        Math-Net.Ru.        Application       of              gems.math.bas.bg/jspui/
     Contemporary Technologies in the Scientific Work               [37] DSpace, www.dspace.org
     of Mathematicians. Russian Math. Surveys, 62 (5),              [38] Sylwestrzak, W., Borbinha, J., Bouche, T.,
     pp. 943-966 (2007), http://dx. doi.org/10.1070/                     Nowinski, A., Sojka P.: EuDML – Towards the
     RM2007v062n05ABEH004455                                             European Digital Mathematics Library. In: Sojka,
[22] Zhizhchenko, A.B., Izaak, A.D.: The Information                     P. (ed.) Towards a Digital Mathematics Library.
     System Math-Net.Ru. Current State and Prospects.                    Paris, July 7–8th, 2010, pp. 11-26. Masaryk
     The Impact Factors of Russian Mathematics                           University         Press,       Brno        (2010),
     Journals. Russian Math. Surveys, 64 (4), pp. 775-                   http://dml.cz/bitstream/handle/10338.dmlcz/70256
     784       (2009),     http://dx.doi.org/    10.1070/                9/DML_003-2010-1_5.pdf
     RM2009v064n04ABEH004638                                        [39] EuDML, www.eudml.org
[23] CEDRAM. www.cedram.org




                                                              323
                                                                                      Table 1 Comparison table of DML projects
 DML             Math-Net.ru                      CEDRAM                            NUMDAM                                  DML-CZ                              GDZ                     zbMATH                    EuDML
Criteria
Inform     There is an object              DML contains 9 French          Contains more than 57000            The digitized journal and proceedings     This is                  The database contains        This is an
ation      hierarchy. Collections          math journals, 1 book          articles in 76 periodicals, 373     papers are displayed with the             multidisciplinary        more than 3.5 million        aggregation and
space      split into journals, issues,    and 7 proceedings of           books in 4 collections, 263         agreement of the publisher who owns       library, that contains   bibliographic entries with   indexing service.
           articles and so on.             seminars and                   theses.                             the digital data.                         not only                 reviews or abstracts         EuDML
           Currently contains more         conferences. All               Full texts available in PDF and     DML-CZ presents full texts articles       mathematical             currently drawn from         assemble the
           than 120 journals with          CEDRAM journals are            DJVU formats. Each article in       and book chapters in PDF format,          collections but also     more than 3,000 journals     digital
           nearly 200 thousand             open access. Access to         NUMDAM is available via a           equipped with enhanced metadata           history of Law,          and serials, and 170,000     mathematical
           publications. Information       the database containing        stable URL.                         including bibliographical references.     history of the           books. The database of       corpus in order to
           about the article includes      the bibliographical                                                The digital born documents are being      Humanities and the       service contains about 2.1   make it available
           a bibliographic                 references of all the                                              obtained from the original sources        Sciences, travel and     million direct links to      online.
           description, an                 articles of all                                                    provided by publishers.                   North American           electronic versions of the
           annotation, lists of            participating journals is                                                                                    literature and other     indexed publications, to
           literature and a file with      totally free.                                                                                                collections.             the publishers’ websites
           the full text of the article.   The full entry of articles                                                                                   Mathematical             and/or to electronic
                                           contains abstracts and                                                                                       collections have         libraries with open access
                                           bibliographical                                                                                              about 7000 resources     to the full texts.
                                           references.                                                                                                  and also have some
                                                                                                                                                        Russian resources.
                                                                                                                                                        Library contains
                                                                                                                                                        more than 15 million
                                                                                                                                                        digitized pages.
Functio    The portal provides the         CEDRAM has OAI-PMH             NUMDAM has an OAI-PMH               Editors of all journals are using tools   Portal provides          Search functions provide     EuDML offers
nality     ability to search for           server, which can be used      server, thus allowing sharing of    and workflows that enable them to         search in metadata       search for documents,        several service
           publications and links on       for systematic download        metadata and better visibility of   produce inputs in a semiautomatic         and full text of         authors and journals.        interfaces that
           the bibliographic               of metadata in various         collections.                        way. The formal consistency and           resources and browse     Search can be done in        allow other
           description and keywords        schemas.                       System provide following            integrity of the data are controlled by   functions. All           one line, or in structured   applications to
           in the title, annotation or     Search functions provide       functions: search and               several validating procedures that        resources have full      form using attributes.       connect with the
           text. As result of the          search by keywords,            navigation by title, author,        have been developed in the project.       texts and can be         Service also provide full-   service. These
           search, an abstract, article    author, title, bibliography    references or in full text of       There are some automated procedures       viewed page by page      text formula search for      are OAI-PMH
           IDs (DOI, resource              and full text search.          resources. During search all        for validation of data of new journal     or in structured         indexed arXiv                server, REST
           references in abstract          Quick search searches in       statistics, related to the search   issues but all of them are for internal   mode. Metadata of        documents. The               services,
           databases, URIs), a             all fields except full text.   topic is displayed – co-authors,    use and development.                      any resource contain     zbMATH formula search        OpenSearch
           citation pattern, classifier    Advanced search                journals and years of               DML-CZ allows to search by title,         stable URL of            uses the MathWebSearch       service, which
           values are issued. There        interface offers several       publication. Browse functions       author of publications, by language or    resource, metadata       system. zbMATH               allow to query
           are no recommender              types of research, more        provide navigation through          by zbMATH and MathSciNet                  can be downloaded        maintain a classification    library index in
           service, in fact all            or less complicated. The       sorted list of resources.           identifiers. Browse functions provide     in METS format.          scheme for mathematics.      machine way and
           semantic services work          full entry of articles         Metadata extraction made only       navigation through sorted list of                                                               annotation
           with a bibliographic            produced for CEDRAM            for bibliography. Any               resources. There is search of related                                                           retrieval services
           description of the              contains abstracts and         additional services like formula    articles.                                                                                       in JSON.
           resource                        bibliographical                search or recommender system
                                           references.                    are absent.




                                                                                                              324
Users     There are role model of       No any user registration      No any user registration          No any user registration.                 No any user              There is a personal area     No any user
          users, everybody can                                                                          End-users cannot submit any               registration.            for users – for reviewers,   registration.
          register and create own                                                                       resource, everything can be submitted                              publishers etc.
          personal area.                                                                                only through editorial board of
          Registered users can                                                                          journals, also there is no any personal
          create personal pages,                                                                        area for users.
          manage personal
          collections of
          publications, authors get
          access to the full texts of
          their articles.
Quality   System is available in        Portal is available in        Portal available on English and   Project was finished in 2010 and now      Portal is available in   Portal offers three ways     EuDML has
of        two languages. The            English and French. Files     French. Formulas can be           it is in a stable form.                   German and English.      of displaying                some policies: all
service   policy for accessing the      of the full texts are the     viewed in TeX source code or      Portal is available only in English.      But main aim of the      mathematical formulas –      texts must be
          full texts of articles is     property of the journals.     in compiled, graphical way.                                                 project is to digitize   MathML, MathJax and          scientifically
          determined by the             All online records exist in   NUMDAM only disseminate                                                     and preserve             LaTeX. MathML is set as      validated and
          publisher of the paper.       two formats, which are        resources that were already                                                 resources.               default. Not all services    formally
          Access for any other          only different by the way     published in journals, books or                                                                      of the system are free,      published; all
          information is free.          they display                  theses but submission process                                                                        some of them need to be      items must be
                                        mathematical formulas:        of resources is not clear.                                                                           purchased.                   open access after
                                        MathML or TeX and                                                                                                                                               a finite embargo
                                        have stable url link.                                                                                                                                           period. Once
                                                                                                                                                                                                        documents
                                                                                                                                                                                                        contributed to the
                                                                                                                                                                                                        library are made
                                                                                                                                                                                                        open access due
                                                                                                                                                                                                        to this policy,
                                                                                                                                                                                                        they cannot
                                                                                                                                                                                                        revert to close
                                                                                                                                                                                                        access later on;
                                                                                                                                                                                                        the digital full
                                                                                                                                                                                                        text of each item
                                                                                                                                                                                                        contributed to
                                                                                                                                                                                                        library must be
                                                                                                                                                                                                        archived
                                                                                                                                                                                                        physically at one
                                                                                                                                                                                                        of the member
                                                                                                                                                                                                        institutions.




                                                                                                        325