=Paper=
{{Paper
|id=Vol-2022/paper49
|storemode=property
|title=
Digital Mathematical Libraries: Overview of Implementations and Content Management Services
|pdfUrl=https://ceur-ws.org/Vol-2022/paper49.pdf
|volume=Vol-2022
|authors=Alexander M. Elizarov,Evgeny K. Lipachev,Denis Zuev
|dblpUrl=https://dblp.org/rec/conf/rcdl/ElizarovLZ17
}}
==
Digital Mathematical Libraries: Overview of Implementations and Content Management Services
==
Digital Mathematical Libraries: Overview
of Implementations and Content Management Services
© A.M. Elizarov © E.K. Lipachev © D.S. Zuev
Volga Region Federal University,
Kazan, Russia
amelizarov@gmail.com elipachev@gmail.com dzuev11@gmail.com
Abstract. The paper gives a review of existing projects of implementation of digital mathematical
libraries. An analysis of existing information systems of digital mathematical libraries is performed using the
evaluation criteria embedded in the DELOS DLRM model, emphasis is placed to the methods of managing
mathematical content on the basis of semantic technologies. All projects are in different degrees of
completeness, the range of services provided is different. We found that most of digital mathematical libraries
are concentrated on the transfer of the resources to the electronic form and their preservation, rather than on
the development of semantic services.
Keywords: digital publishing, library automation, machine-actionable digital library, digital
mathematics library, DML, WDML.
1 Introduction representations of mathematical knowledge; presentation
formats; authoring languages and tools; creating
The Digital Era has changed crucially as the methods repositories of formalized mathematics, and
of research, and the ways in which scientists search, mathematical digital libraries; mathematical search and
produce, publish, and disseminate their scientific work. retrieval; implementing math assistants, tutoring and
A digital library, a collection of information which is assessment systems; developing collaboration tools for
both digitized and organized, gives us power we never mathematics; creating new tools for detecting re-
had with traditional libraries. Information and purposing material, including plagiarism of others' work
communication technologies are actively implemented in and self-plagiarism; creation of interactive documents;
research and development. Therefore, it became possible developing deduction systems. The solution of this task
to use the entire volume of accumulated scientific requires formalization of mathematical statements and
knowledge in conducting new research. This requires proofs [9].
creation of complex of technologies that ensure At present, research activities in the field of
management of available knowledge, the organization mathematics are associated with the use of modern
has effective access to this knowledge, as well as sharing information technology (cloud, semantic, etc.). These
and multiple use of new kinds of knowledge structures. technologies are used in research of distributed scientific
In mathematics also accumulated considerable teams, preparation and dissemination of mathematical
experience in using of electronic mathematical content knowledge in an electronic form. At present, a new type
within the various projects on creation of mathematical of digital library is being formed, connected with the
digital libraries (see, e. g., [1]). integration of mathematical knowledge into the scientific
Since inception of the first scientific information information space, see. [1,10,11]. This type of
systems, mathematicians have been involved in the full information system is called Digital Mathematical
cycle of software product development, from idea to Library (DML), a number of global projects are
implementation. Well-known examples are an open implemented, such as European Digital Mathematical
source system TEX and commercial systems Wolfram Library or World Digital Mathematical Library [12–14].
Mathematica and Wolfram Alpha, led by Stephen More details about goals, functions and current results
Wolfram according to his principles of computational are listed below, in Section 3.
knowledge theory [2, 3]. Tools for mathematical content Implementation of digital mathematical libraries
management are developed with the help of communities involves the development of special tools and continuous
of mathematicians, e.g. MathJax by American improvement of their functionality. An example is the
Mathematical Society, information system Math-Net.Ru Open Journal Systems (OJS, https://pkp.sfu.ca/ojs/). The
is developed at the Steklov Mathematical Institute of the platform is used in many projects, particularly in
Russian Academy of Sciences [4] and the collection of Lobachevskii Journal of Mathematics
publicly available preprints arXiv.org (https://arxiv.org/). (http://ljm.kpfu.ru/), one of the first digital mathematical
Main challenges of mathematical knowledge journals [15].
management (MKM) are discussed in [5–9], the most In our work, we try to look more deeply into world
urgent tasks are outlined. Such tasks are: modeling largest DML to outline current status of described
projects and to investigate services and functions that
Proceedings of the XIX International Conference provide these digital mathematical libraries.
“Data Analytics and Management in Data Intensive
Domains” (DAMDID/RCDL’2017), Moscow, Russia,
October 10–13, 2017
317
2 Mathematical Libraries and DELOS approach to information objects organization lies in the
Digital Library Reference Model ideology of WDML. We use the same approach in
creating a digital mathematical library Lobachevskii
2.1 Criteria for investigation DML, which is based on mathematical collections of the
Kazan Federal University [18].
Firstly, we need to establish common criteria and
Usually digital library consists of collections, and
main features and functions that we will look at.
collections in turn from documents or information
In DELOS Digital Library Reference Model [16, 17]
three basic concepts are distinguished for defining what resources (objects). In 1990–2000 there was a large
number of studies carried out on the definition,
is called a digital library (DL):
architectural and technical aspects of DL systems.
• DL – a (potentially virtual) organization that
Finally, it is necessary to mention the creation of the DL
comprehensively collects, manages, and preserves for the
manifesto in the DELOS project, which resulted in the
long term rich digital content and offers to its user
communities specialized functionality on that content, of creation of a reference model for DL [16, 17].With the
measurable quality, and according to prescribed policies; development of Semantic Web technologies, it became
interesting to investigate the semantics of resources and
• DL system – a software system that is based on an
their links placed in libraries, see, for example, [18]. In
architecture and provides all functionality that is required
by a particular Digital Library. Users interact with a this case, an information object can already be considered
Digital Library through the corresponding DL system; not only as a document, but as its certain parts – abstract,
• DL management system (DLMS) – a generic keywords, bibliography, citations, comments of authors
or readers.
software system that provides the appropriate software
From the end user's point of view, DL must satisfy the
infrastructure to both produce a basic DL system that
user’s expectations. The document itself as an elementary
incorporates all functionality that is considered
information object may not be interesting at all. It is much
foundational for Digital Libraries and integrate additional
in demand to search for information on a particular entity
software offering more refined, specialized, or advanced
functionality. An intrinsic part of DLMS functionality is or subject mentioned in the document. At the same time,
related to administrative services that are used to choose much more interesting to find all possible resources
where different versions of mentioned subjects,
the appropriate subset of its functionality, e.g., through
especially in cases when various interpretations and
relevant parameters of its components, and then install,
definitions are possible. For example, there are a number
deploy, and (re)configure a DL system.
of definitions of the concept of “digital library” and the
A DLMSis “system software”. As in several other
user studying this topic will certainly be interested in all
domains (e.g., operating systems, databases, user
references to the definition of the digital library from
interfaces), such kernel software may be used as a
different sources. Thus, we observe a change in the
foundation to produce Digital Library systems.
elementary information object. The electronic document
While the concept of DL is intended to capture an
fragmented into smaller information objects and all
abstract system that consists of both physical and virtual
components, the remaining two capture concrete services of a library deal with such objects and manage
software systems. For every DL, there is a unique DL the relationships between them. In mathematics, such
elementary objects can be, for example, theorems,
system in operation (possibly consisting of many
lemmas, definitions or formulas, research of which is
interconnected smaller DL systems in the most general
case), where as all DL systems are based on a handful of much more informative on a number of sources. The
DLMSs. services of any DML should provide such an opportunity.
All this functionality lies in WDML architecture. Its
In the role-based aspect, the DELOS DLRM model
implementation became possible only with the
consider following types of users: the end user of the DL;
development of semantic technologies and the transfer of
the developer of DL; the system administrator of DL and
library content into digital form with metadata. Now,
the developer of applications for DL and, four levels of
there is no technical problems in maintaining such
user views and expectations are formed. In addition, the
approach to the organization of DL.
model identifies six key areas, each of which introduces
and defines its own entities and their properties: During our research, we will take into account this
architecture, information space, functionality, users, transformation of the approach to the organization of
policy and quality of services provided. These areas can information objects. Note that the change in the approach
be considered as evaluation criteria and, by virtue of their to the organization of DL does not affect the selected
criteria for investigation.
universality, can be used to analyze almost any
information system. 3 Functionality of Digital Mathematical
We will carry out an analysis of existing digital
Libraries
mathematical libraries, performed using the evaluation
criteria embedded in the DELOS DLRM model. Below is a brief review of existing digital
mathematical libraries. The largest projects are “All-
2.2 Differences between approaches
Russian Mathematical Portal Math-Net.RU”,”Centre de
It is interesting to stop at the discussion at the diffusion de revues académiques mathématiques”,
approaches of the definition of elementary objects with “Czech Digital Mathematics Library”, “The Polish
which digital library works. In particular, an interesting Digital Mathematics Library”, “Göttinger
318
Digitalisierungs Zentrum”, “Numérisation de doc- documentation. This DML is not so large – contains 9
uments anciens mathématiques”, Zentralblatt MATH, French math journals, 1 book and 7 proceedings of
“Bulgarian Digital Mathematics Library” and “The seminars and conferences.
European Digital Mathematical Library”. It should be The CEDRAM websites offer two ways of consulting
outlined, that all projects are in different degrees of the hosted articles: quick and advanced search. Search
completeness, the range of services provided is also functions provide search by keywords, author, title,
different. bibliography and full text search. Quick search searches
in all fields except full text. Advanced search interface
3.1 Math-Net.ru
offers several types of research, more or less
All-Russian Mathematical Portal Math-Net.ru [4, 20– complicated. The full entry of articles produced for
22] combines both a digital mathematical library and a CEDRAM contains abstracts and bibliographical
publishing system for mathematical texts. It is a web references.
portal developed by the V. A. Steklov Institute of All online records exist in two formats, which are
Mathematics, Russian Academy of Sciences. only different by the way they display mathematical
The key component of the portal – the “Journals” formulas in titles, abstracts, keywords or references:
section links Russian periodicals in the field of MathML or TeX and have stable url link.
mathematical sciences to a single information system. XHTML+MathML display is best for reading and
Currently contains more than 120 journals with nearly browsing, but there are some problems with viewing in
200 thousand publications. Information about the article browsers, that need to be pre-configured to work
includes a bibliographic description, an annotation, lists correctly with MathML. The HTML+TeX version used
of literature and a file with the full text of the article. The for compatibility for users who do not have an
portal presented in two languages – Russian and English. environment capable of displaying MathML. Now
The most interesting part is the functionality of the CEDRAM provide following services [12, 13, 23]:
portal. The portal provides the ability to search for • production workflow of journals;
publications and links on the bibliographic description • dedicated web site for each journal;
and keywords in the title, annotation or text. As result of
the search, an abstract, article IDs (DOI, resource • provides creation and maintenance of LATEX
references in abstract databases, URIs), a citation pattern, styles (using a specific class);
classifier values are issued. There are no recommender • production of PDFs for print and web with
service, in fact all semantic services work with a XML/MathML metadata;
bibliographic description of the resource. MiRef module • DOI registration (Crossref), reference linking
is used to form correctly the description and links to (MSN, ZBM, mini-DML, Crossref);
resources. The module is designed to automatically place
links to various publications databases in the literature • provides publishing platform for mathematical
list. The format of the links must satisfy the rules of the articles based on Open Journal System (Public
amsbib package and should be entered in the LaTeX Knowledge Project, https://pkp.sfu.ca/ojs/);
format. • all resources archived in partner project - the French
Registered users can create personal pages, manage digital math library NUMDAM.
personal collections of publications, authors get access to Policy and quality of services. Starting 2017 all
the full texts of their articles, authors can send the CEDRAM journals are open access. Access to the
manuscript to the editorial office of the journal database containing the bibliographical references of all
electronically, and track the process of its workflow in the articles of all participating journals is totally free. The
the editorial office. database itself is the property of Cellule Mathdoc, and
Statistics on popular authors and resources are contains elements covered by copyright. CEDRAM has
maintained, infometric indicators for resources located OAI-PMH server, which can be used for systematic
on the portal are calculated. download of metadata in various schemas. Files of the
The policy for accessing the full texts of articles is full texts are the property of the journals and it is
determined by the publisher of the paper. Access for any necessary to refer to the policy of each of them. Also
other information is free. there are some restrictions of full copying and indexing
3.2 CEDRAM by web robots.
The center for diffusion of academic mathematical 3.3 Numerisation de Documents Anciens
journals (CEDRAM, Centre de Diffusion de Revues Mathematiques (NUMDAM)
Académiques Mathématiques) is a web portal for The French digital math library NUMDAM [12–14,
common access to a set of mathematical journals [23], 24] started as a digitisation program for a pilot of 6
available in French and English. CEDRAM’s mission is journals. Now it contains more than 57000 articles in 76
to provide a large distribution of their current volumes, periodicals, 373 books in 4 collections, 263 theses.
and range from help for producing journals according to The NUMDAM is the reference French digital
the best standards for electronic publishing to long lasting mathematics library set up by Cellule MathDoc with the
archiving. CEDRAM is a service of the Cellule MathDoc assistance of a network of partners.
(UMS 5638 of CNRS and Université Joseph Fourier) From 2007 onwards, publishers send digital born
which completes its important offer in mathematical articles into DML. Collections are normally indexed
319
within one year of publication, and full texts are freely validation service get all metadata including abstracts,
downloadable at the end of a period of time set by keywords and references transformed into representation
agreement upon each title. using MathML [27].
The NUMDAM program is designed to support End-users cannot submit any resource, everything can
academic publishers and provide the research community be submitted only through editorial board of journals,
with a sustainable, reliable and easy-to-use library. The also there is no any personal area for users.
research and dissemination platform was completely Search and navigation. As others DMLs DML-CZ
redesigned in 2016. Now portal is available on two allows to search by title, author of publications. Also
languages –English and French, formulas can displayed avaliable search by language or by Zentrablatt MATH
in TeX or in graphical form using MathJax. and MathSciNet identifiers. Browse functions provide
System provide following functions: search and navigation through sorted list of resources (authors,
navigation by title, author, references or in full text of journals etc.).
resources. During search all statistics, related to the The most interesting function is search of related
search topic is displayed – co-authors, journals and years articles (finding similarities between papers). This
of publication. Browse functions provide navigation service tries to find similar papers using three methods:
through sorted list of resources (authors, journals etc.). “Term frequency–Inverse document frequency” (TF-
Full texts available in PDF and DJVU formats. Each IDF, see, e. g. [28]), the “Random Projections” or method
article in NUMDAM is available via a stable URL. This that is built on TF-IDF and simplifies the computations
URL is a compact address, designed to remain valid in by projecting vectors onto a subspace of lower
the long term. It is displayed in the web page of the dimensionality [28] and with using “Latent Semantic
article, on the first page of PDF or DjVu files and by the Indexing” (LSI, [29]). Last method gives the most
OAI-PMH server. accurate results up to 90%.
There is no any user registration. All functions have Policies and quality of service. The digitized journal
open access. NUMDAM only disseminate resources that and proceedings papers are displayed with the agreement
already published in journals, books or theses but of the publisher who owns the digital data. The digitized
submission process of resources is not clear. Metadata monographs are displayed with the agreement of the
extraction made only for bibliography. Any additional author and/or the publisher while the digital data are
services like formula search or recommender system are property of the Institute of Mathematics CAS. The
absent. database itself, in particular the bibliographic data, are
The full text of most recent articles is generally not property of the Institute of Mathematics CAS.DML-CZ
available. The journals whose archives are on this portal presents full texts articles and book chapters in PDF
have accepted the principle of “a moving wall”. This is a format, equipped with enhanced metadata including
time interval between the publication of a volume (in bibliographical references linked to Zentrablatt MATH
paper or electronic form, delivered to subscribers) and and MathSciNet. The digital born documents are being
the availability of the full text on the NUMDAM server. obtained from the original sources provided by
Generally, moving-wall for most of journal in publishers. The presented page content and format
NUMDAM is equal to 5 years. corresponds to the original one. Journals are presented
and accessed according to the terms of a contract with the
3.4 The Czech Digital Mathematics Library (DML-
publisher. The digital documents displayed in the DML-
CZ)
CZ are authorized with electronic stamps.
The Czech Digital Mathematics Library (DML-CZ)
3.5 The Polish Digital Mathematical Library
[25, 26] has been developed in order to preserve in a
digital form the content of major part of mathematical The Polish Digital Mathematical Library (DML-PL,
literature that has ever been published in the Czech lands, [30]) has existed since 2002. The library holds full texts
and to provide a free access to the digital content and of polish mathematical journals and books. The major
bibliographical data. DML-CZ resulted from the project part of the collection are archive issues of mathematical
no. 1ET200190513 supported by the Czech Academy of journals published before World War II. Library consists
Sciences (CAS) in the R&D programme Information of 550 books and 36 journals, but only 3 journals provide
Society, and operated by the Institute of Mathematics access to full text of articles. Portal of DML-PL provide
CAS. Project seems to be finished in 2010 and now is in search by attributes and navigation through sorted lists of
stable form. authors, books and journals.
Functionality. Editors of all journals included in Brief explanation of the project is given in [31], but
DML-CZ are using tools and work flows that have been nowadays it seems that project is already finished. On the
tailored to their individual publishing practice and that web portal of library there is no additional information
enable them to produce inputs for DML-CZ in a about current status. Any information about semantic
semiautomatic way. The formal consistency and integrity functions or metadata extraction from resources is
of the data are controlled by several validating missing.
procedures that have been developed in the project.
3.6 GDZ–Gottingen Digitization Centre
There are some automated procedures for validation
of data of new journal issues but all of them are archived The task of the GDZ [32, 33] is to record data such as
in DML-CZ for internal use and development. Based on prints, manuscripts and illustrations and to preserve
limiting the name space of allowed TEX macros, them. Main aim of the project is conversion of resources
320
into digital form. This is multidisciplinary library, that including text, images, moving images, mpegs and data
contains not only mathematical collections but also sets. All functionality of DSpace software is clear and we
history of Law, history of the Humanities and the will not describe it in this paper. For example, additional
Sciences, travel and North American literature and other information about DSpace can be found in [17, 37].
collections. Mathematical collections have about 7000
3.9 European Digital Mathematics Library
resources and also have some Russian resources. Library
contains more than 15 million digitized pages. The European Digital Library (EuDML) was a project
Portal provides search in metadata and full text of partly funded by the European Commission. EuDML
resources and browse functions. Many resources are [12–14, 38, 39] is an aggregation and indexing services
historical, not modern, main aim of the project is to with was established under The EuDML Initiative and
digitize and preserve resources. All resources have full promoted by European Mathematical Society. EuDML
texts and can be viewed page by page or in structured assemble as much as possible of the digital mathematical
mode. Metadata of any resource contain stable URL of corpus in order to make it available online, with eventual
resource, metadata can be downloaded in METS format. open access, in the form of an authoritative and enduring
digital collection, growing continuously with publisher
3.7 Zentralblatt MATH
supplied new content, augmented with sophisticated
Zentralblatt MATH (zbMATH, [34]) is abstracting search interfaces and interoperability services, developed
and reviewing service in pure and applied mathematics. and curated by a network of institutions.
It is hosted by the Berlin office ofFIZ Karlsruhe The system, presented in the diagram in Figure 1,
– Leibniz Institute for Information Infrastructure GmbH conceptually consists of a metadata repository, a search
(FIZ Karlsruhe) and distributed by Springer. The engine, a metadata enhancer, an association analyser,
zbMATH database contains more than 3.5 million annotation and accessibility functions and of course the
bibliographic entries with reviews or abstracts currently interfaces [38].
drawn from more than 3,000 journals and serials, and
170000 books. zbMATH is not a digital library itself, it
is an indexing service and provides easy access to
bibliographic data, reviews and abstracts from all areas
of pure mathematics as well as applications, in particular
to the natural sciences, computer science, economics and
engineering.
Search functions provide search for documents,
authors and journals. Search can be done in one line, or
in structured form using attributes such as title, author,
subject, source, keywords etc. Service also provide full-
text formula search for indexed arXiv documents
[35].The zbMATH formula search uses the
MathWebSearchsystem, which is a content-based search Figure 1 EuDML architecture
engine for MathML formula based on substitution tree
The metadata repository provides the central point of
indexing.
reference for all the managed contents. It will work with
Portal offer three ways of displaying mathematical an OAI-PMH harvester to ingest repositories’ content
formulas – MathML, MathJax and LaTeX. The XML- descriptions, maps the metadata into the internal EuDML
based MathML is the solution recommended by W3C for schema. The performance and the quality of responses of
displaying mathematical content on the web and is set as
the search service directly influence user experience.
default within zbMATH. Mathematical Reviews and
Therefore, particularly this service has to be reliable,
zbMATH maintain the Mathematics Subject scalable and customized to fulfill user expectations.
Classification (MSC), a classification scheme for The metadata enhancer function consist in a
mathematics. It is used by reviewing services to collection of tools that each contribute to expand or
categorize items in the mathematical sciences literature. complete the existing items’ metadata, depending on the
The database of service contains about 2.1 million direct improvements needed. These range from applying OCR
links to electronic versions of the indexed publications, over full texts, adding key words or multilingual
to the publishers’ websites and/or to electronic libraries metadata by merging information from different
with open access to the full texts. databases when an item happens to have such non-
3.8 Bulgarian Digital Mathematics Library redundant description, generating MathML for
mathematical expressions, etc. The association analyzer
Bulgarian Digital Mathematics Library, BulDML is a detects, analyses and records relations between
digital repository at Institute of Mathematics and individual items. The annotation component provides
Informatics of Bulgarian Academy of Sciences. Library mechanisms to attach new material to individual items in
has 7 mathematical journals, 4 book series and the repositories and maintain this new material. The
proceedings in its repository. In fact, BulDML is an accessibility component provides support for enhanced
institutional repository and is built on open-source accessibility of items, if required, before presentation to
DSpace software [36]. As known, DSpace preserves and end users. Finally, the user and system interfaces provide
enables open access to all types of digital content
321
access to the collected resources on different levels both [3] Wolfram, S.: An elementary introduction to the
to human and machine users. Now EuDML offers several Wolfram Language. Wolfram Media, Inc. (2015)
service interfaces that allow other applications to connect [4] Chebukov, D.E., Izaak, A.D., Misyurina, O.G.,
with the service. These are OAI-PMH server, REST Pupyrev, Yu.A., Zhizhchenko, A.B.: Math-Net.Ru
services, OpenSearch service, which allow to query as a Digital Archive of the Russian Mathematical
library index in machine way and annotation retrieval Knowledge from the XIX Century to Today.
services in JSON. Intelligent Computer Mathematics, Lecture Notes
EuDML aims to be an open source of trusted in Comput. Sci., 7961, pp. 344-348, Springer
mathematical knowledge. That is why it has some (2013), doi: 10.1007/978-3-642-39320-4_26
policies: [5] Carette, J., Farmer, W.M.: A Review of
• All texts must have been scientifically validated and Mathematical Knowledge Management. In
formally published; Intelligent Computer Mathematics. Lecture Notes
• All items must be open access after a finite embargo in Computer Science, 5625. pp. 233-246 (2009)
period. Once documents contributed to the library are [6] Ion, P.D.F.: Mathematics and the World Wide Web.
made open access due to this policy, they cannot In Intelligent Computer Mathematics. Lecture
revert to close access later on; Notes in Computer Science, 7961, pp. 230-245
• The digital full text of each item contributed to library (2013)
must be archived physically at one of the EuDML [7] Lange, C.: Enabling Collaboration on Semiformal
member institutions. Mathematical Knowledge by Semantic Web
All DMLs, described above except All-Russian Integration. Ph. D. Thesis, Jacobs University
Mathematical Portal Math-Net.RU are partners of Bremen (2011)
EuDML. [8] Elizarov, A.M., Lipachev, E.K., Nevzorova, O.A.,
Solov’ev, V.D.: Methods and Means for Semantic
4 Conclusion Structuring of Electronic Mathematical Documents.
In order to outline all differences of observed projects Doklady Mathematics, 90 (1), pp. 521-524 (2014),
we created comparison Table 1 listed below. Note that, doi: 10.1134/S1064562414050275
we excluded from table two DMLs due to following. [9] Elizarov, A., Kirillovich, A., Lipachev, E.,
BulDML is and built on open-source DSpace software, Nevzorova, O., Solovyev, V., and Zhiltsov N.:
so all functionality of it is clear, for DML-PL we could Mathematical Knowledge Representation:
not find any working portal in order to study it more Semantic Models and Formalisms. Lobachevskii J.
deeply. of Mathematics, 35 (4), pp. 347-353 (2014),
In all the projects studied, emphasis is placed on the doi:10.1134/S1995080214040143
transfer of the resources themselves to the electronic [10] Elizarov A., Kirillovich A., Lipachev E.,
form, rather than on the development of semantic Nevzorova O. (2017) Digital Ecosystem OntoMath:
services. Only a few portals have a mathematical formula Mathematical Knowledge Analytics and
search, and only one has a recommender service. Management. In: Kalinichenko L., Kuznetsov S.,
Manolopoulos Y. (eds) Data Analytics and
After the analysis done it is clear that there are only
Management in Data Intensive Domains.
two types of repository systems: the first is actually
DAMDID/RCDL 2016. Communications in
DML, which preserve the resources themselves, the
Computer and Information Science, 706, pp. 33-46
second is indexing and aggregating services that do not
(2017), doi: 10.1007/978-3-319-57135-5_3
have their own database of electronic documents, but
provide a wide range of convenient search capabilities. [11] Elizarov, A.M., Kirilovich, A.V., Lipachev, E.K.,
Nevzorova, O.A.: Mathematical Knowledge
This work was funded by the subsidy allocated to Management: Ontological Models and Digital
Kazan Federal University for the state assignment in the Technology. CEUR Workshop Proceedings, 1752,
sphere of scientific activities, grant agreement no. pp. 44-50 (2016), http://ceur-ws.org/Vol-
1.2368.2017) and with partial financial support of the 1752/paper08.pdf
Russian Foundation for Basic Research and the
[12] Bouche, T.: Towards a World Digital Library:
Government of the Republic of Tatarstan, within the
framework of scientific projects Nos. 15-07-08522, 15- Mathdoc, Numdam and EuDML Experiences.
47-02472. UMI, La Sapienza, Roma (2016), http://
www.mat.uniroma1.it/sites/default/import-
References files/biblioteca/SEMINARIO2016/bouche.pdf
[13] Bouche, T.: Digital Mathematics Libraries: The
[1] Borwein, J.M., Rocha, E.M., Rodrigues, J.F.
good, the bad, the ugly. Mathematics in Computer
Communicating Mathematics in the Digital Era, pp.
Science, (3), pp. 227-241 (2010), doi:
3-21. A K Peters, Ltd. MKM-IG. Mathematical
10.1007/s11786-010-0029-2
Knowledge Management (2008). http://
www.mkm-ig.org/ [14] Bouche, T.: Reviving the Free Public Scientific
Library in the Digital Age? The EuDML Project. In:
[2] Wolfram, S.: A New Kind of Science. Wolfram
Kaiser, K., Krantz, S., Wegner, B. (eds.): Topics
Media, Inc. (2002)
and Issues in Electronic Publishing, JMM, Special
322
Session, San Diego, January 2013, pp. 57-80 [24] NUMDAM. www.numdam.org
(2013), http://www.emis.de/ [25] The Czech Digital Mathematics Library (DML-
proceedings/TIEP2013/05bouche.pdf CZ), http://www.dml.cz/
[15] Elizarov, A.M., Zuev, D.S., Lipachev, E.K.: [26] The Czech Digital Mathematics Library. Project
Mathematical Content Semantic Markup Methods Funded by the Academy of Sciences of the Czech
and Open Scientific E-Journals Management Republic, 2005–2009. http://project.dml. cz
Systems. In: Klinov, P., Mouromtsev, D. (eds.) [27] Rákosník, J.: Recent Development of the DML-CZ
KESW 2014. CCIS, 468, pp. 242-251 (2014), doi: and Its Current State. In Proc. of DML 2011:
10.1007/978-3-319-11716-4 22 29 Towards a Digital Mathematics Library. Bertinoro,
[16] Candela, L., Athanasopoulos, G., Castelli, D., El Italy, July 20–21st (2011)
Raheb, K., Innocenti, P., Ioannidis, Y., Katifori, A., [28] Rajaraman, A.; Ullman, J. D.: Data Mining (2011).
Nika, A., Vullo, G., Ross, S.: The Digital Library doi:10.1017/CBO9781139058452.002
Reference Model. FP7-ICT-2007-3. Cultural
[29] Deerwester, S., Dumais, S., Landauer, T., Furnas,
Heritage and Technology Enhanced Learning
G., Beck, L.: Improving Information Retrieval with
(2011)
Latent Semantic Indexing. Proc. of the 51st Annual
[17] Candela, L., Castelli, D., Fuhr, N., Ioannidis, Y., Meeting of the American Society for Information
Klas, C.-P., Pagano, P., Ross, S., Saidis, C., Schek, Science, 25, pp. 36-40 (1988)
H.-J., Schuldt, H., Springmann, M.: Current Digital
[30] The Polish Digital Mathematics Library,
Library Systems: User Requirements vs Provided
http://pldml.icm.edu.pl/
Functionality. IST-2002-2.3.1.12. Technology-
enhanced Learning and Access to Cultural Heritage [31] Zamlynska, K., Tarkowski, A., Rosiek, T.:
(2006) Evolution of the Mathematical Collection of the
Polish Virtual Library of Science. Mathematics in
[18] Elizarov, A.M., Lipachev, E.K.: Lobachevskii
computer Science, (3), pp. 265-278 (2010), doi:
DML: Towards a Semantic Digital Mathema-tical
10.1007/s11786-010-0029-2
Library of Kazan University, 2017 (in press),
DAMDID-2017 proceedings [32] Gottingen Digitalisierungs Zentrum. http://gdz.
sub.uni-goettingen.de/gdz/
[19] Kogalovskiy, M.R., Parinov, S.I.: Klassifikatsiya i
ispol'zovaniye semanticheskikh svyazey mezh-du [33] Gottingen digitization Centre https://www.sub. uni-
informatsionnymi ob’yektami v nauchnykh goettingen.de/en/copying-digitising/ goettingen-
elektronnykh bibliotekakh. Inform. i yee primen., 3 digitisation-centre/
(6), pp. 32-42 (2012) [34] Zentralblatt MATH. https://zbmath.org/
[20] All-Russian Mathematical Portal Math-Net.Ru. [35] Muller, F., Teschke, O.: Full Text Formula Search
http://www.mathnet.ru/ in zbMATH, EMS Newsletter (2016)
[21] Zhizhchenko, A.B., Izaak, A.D.: The Information [36] Bulgarian Digital Mathematics Library. http://sci-
System Math-Net.Ru. Application of gems.math.bas.bg/jspui/
Contemporary Technologies in the Scientific Work [37] DSpace, www.dspace.org
of Mathematicians. Russian Math. Surveys, 62 (5), [38] Sylwestrzak, W., Borbinha, J., Bouche, T.,
pp. 943-966 (2007), http://dx. doi.org/10.1070/ Nowinski, A., Sojka P.: EuDML – Towards the
RM2007v062n05ABEH004455 European Digital Mathematics Library. In: Sojka,
[22] Zhizhchenko, A.B., Izaak, A.D.: The Information P. (ed.) Towards a Digital Mathematics Library.
System Math-Net.Ru. Current State and Prospects. Paris, July 7–8th, 2010, pp. 11-26. Masaryk
The Impact Factors of Russian Mathematics University Press, Brno (2010),
Journals. Russian Math. Surveys, 64 (4), pp. 775- http://dml.cz/bitstream/handle/10338.dmlcz/70256
784 (2009), http://dx.doi.org/ 10.1070/ 9/DML_003-2010-1_5.pdf
RM2009v064n04ABEH004638 [39] EuDML, www.eudml.org
[23] CEDRAM. www.cedram.org
323
Table 1 Comparison table of DML projects
DML Math-Net.ru CEDRAM NUMDAM DML-CZ GDZ zbMATH EuDML
Criteria
Inform There is an object DML contains 9 French Contains more than 57000 The digitized journal and proceedings This is The database contains This is an
ation hierarchy. Collections math journals, 1 book articles in 76 periodicals, 373 papers are displayed with the multidisciplinary more than 3.5 million aggregation and
space split into journals, issues, and 7 proceedings of books in 4 collections, 263 agreement of the publisher who owns library, that contains bibliographic entries with indexing service.
articles and so on. seminars and theses. the digital data. not only reviews or abstracts EuDML
Currently contains more conferences. All Full texts available in PDF and DML-CZ presents full texts articles mathematical currently drawn from assemble the
than 120 journals with CEDRAM journals are DJVU formats. Each article in and book chapters in PDF format, collections but also more than 3,000 journals digital
nearly 200 thousand open access. Access to NUMDAM is available via a equipped with enhanced metadata history of Law, and serials, and 170,000 mathematical
publications. Information the database containing stable URL. including bibliographical references. history of the books. The database of corpus in order to
about the article includes the bibliographical The digital born documents are being Humanities and the service contains about 2.1 make it available
a bibliographic references of all the obtained from the original sources Sciences, travel and million direct links to online.
description, an articles of all provided by publishers. North American electronic versions of the
annotation, lists of participating journals is literature and other indexed publications, to
literature and a file with totally free. collections. the publishers’ websites
the full text of the article. The full entry of articles Mathematical and/or to electronic
contains abstracts and collections have libraries with open access
bibliographical about 7000 resources to the full texts.
references. and also have some
Russian resources.
Library contains
more than 15 million
digitized pages.
Functio The portal provides the CEDRAM has OAI-PMH NUMDAM has an OAI-PMH Editors of all journals are using tools Portal provides Search functions provide EuDML offers
nality ability to search for server, which can be used server, thus allowing sharing of and workflows that enable them to search in metadata search for documents, several service
publications and links on for systematic download metadata and better visibility of produce inputs in a semiautomatic and full text of authors and journals. interfaces that
the bibliographic of metadata in various collections. way. The formal consistency and resources and browse Search can be done in allow other
description and keywords schemas. System provide following integrity of the data are controlled by functions. All one line, or in structured applications to
in the title, annotation or Search functions provide functions: search and several validating procedures that resources have full form using attributes. connect with the
text. As result of the search by keywords, navigation by title, author, have been developed in the project. texts and can be Service also provide full- service. These
search, an abstract, article author, title, bibliography references or in full text of There are some automated procedures viewed page by page text formula search for are OAI-PMH
IDs (DOI, resource and full text search. resources. During search all for validation of data of new journal or in structured indexed arXiv server, REST
references in abstract Quick search searches in statistics, related to the search issues but all of them are for internal mode. Metadata of documents. The services,
databases, URIs), a all fields except full text. topic is displayed – co-authors, use and development. any resource contain zbMATH formula search OpenSearch
citation pattern, classifier Advanced search journals and years of DML-CZ allows to search by title, stable URL of uses the MathWebSearch service, which
values are issued. There interface offers several publication. Browse functions author of publications, by language or resource, metadata system. zbMATH allow to query
are no recommender types of research, more provide navigation through by zbMATH and MathSciNet can be downloaded maintain a classification library index in
service, in fact all or less complicated. The sorted list of resources. identifiers. Browse functions provide in METS format. scheme for mathematics. machine way and
semantic services work full entry of articles Metadata extraction made only navigation through sorted list of annotation
with a bibliographic produced for CEDRAM for bibliography. Any resources. There is search of related retrieval services
description of the contains abstracts and additional services like formula articles. in JSON.
resource bibliographical search or recommender system
references. are absent.
324
Users There are role model of No any user registration No any user registration No any user registration. No any user There is a personal area No any user
users, everybody can End-users cannot submit any registration. for users – for reviewers, registration.
register and create own resource, everything can be submitted publishers etc.
personal area. only through editorial board of
Registered users can journals, also there is no any personal
create personal pages, area for users.
manage personal
collections of
publications, authors get
access to the full texts of
their articles.
Quality System is available in Portal is available in Portal available on English and Project was finished in 2010 and now Portal is available in Portal offers three ways EuDML has
of two languages. The English and French. Files French. Formulas can be it is in a stable form. German and English. of displaying some policies: all
service policy for accessing the of the full texts are the viewed in TeX source code or Portal is available only in English. But main aim of the mathematical formulas – texts must be
full texts of articles is property of the journals. in compiled, graphical way. project is to digitize MathML, MathJax and scientifically
determined by the All online records exist in NUMDAM only disseminate and preserve LaTeX. MathML is set as validated and
publisher of the paper. two formats, which are resources that were already resources. default. Not all services formally
Access for any other only different by the way published in journals, books or of the system are free, published; all
information is free. they display theses but submission process some of them need to be items must be
mathematical formulas: of resources is not clear. purchased. open access after
MathML or TeX and a finite embargo
have stable url link. period. Once
documents
contributed to the
library are made
open access due
to this policy,
they cannot
revert to close
access later on;
the digital full
text of each item
contributed to
library must be
archived
physically at one
of the member
institutions.
325