Digital Mathematical Libraries: Overview of Implementations and Content Management Services

© A.M. Elizarov © E.K. Lipachev © D.S. Zuev
Volga Region Federal University, Kazan, Russia
amelizarov@gmail.com elipachev@gmail.com dzuev11@gmail.com

Abstract. The paper reviews existing projects that implement digital mathematical libraries. The information systems of these libraries are analyzed using the evaluation criteria embedded in the DELOS DLRM model, with emphasis on methods of managing mathematical content on the basis of semantic technologies. The projects are at different degrees of completeness, and the range of services they provide differs. We found that most digital mathematical libraries concentrate on transferring resources to electronic form and preserving them, rather than on developing semantic services.

Keywords: digital publishing, library automation, machine-actionable digital library, digital mathematics library, DML, WDML.

Proceedings of the XIX International Conference “Data Analytics and Management in Data Intensive Domains” (DAMDID/RCDL’2017), Moscow, Russia, October 10–13, 2017

1 Introduction

The Digital Era has crucially changed both the methods of research and the ways in which scientists search for, produce, publish, and disseminate their scientific work. A digital library, a collection of information which is both digitized and organized, gives us power we never had with traditional libraries. Information and communication technologies are actively implemented in research and development. It has therefore become possible to use the entire volume of accumulated scientific knowledge in conducting new research. This requires a complex of technologies that ensure the management of available knowledge, the organization of effective access to this knowledge, and the sharing and multiple use of new kinds of knowledge structures.

Mathematics has also accumulated considerable experience in using electronic mathematical content within various projects for creating mathematical digital libraries (see, e.g., [1]).

Since the inception of the first scientific information systems, mathematicians have been involved in the full cycle of software product development, from idea to implementation. Well-known examples are the open source system TeX and the commercial systems Wolfram Mathematica and Wolfram Alpha, led by Stephen Wolfram according to his principles of computational knowledge theory [2, 3]. Tools for mathematical content management are developed with the help of communities of mathematicians. Examples are MathJax by the American Mathematical Society, the information system Math-Net.Ru developed at the Steklov Mathematical Institute of the Russian Academy of Sciences [4], and the collection of publicly available preprints arXiv.org (https://arxiv.org/).

The main challenges of mathematical knowledge management (MKM) are discussed in [5–9], where the most urgent tasks are outlined. These tasks are: modeling representations of mathematical knowledge; presentation formats; authoring languages and tools; creating repositories of formalized mathematics and mathematical digital libraries; mathematical search and retrieval; implementing math assistants, tutoring and assessment systems; developing collaboration tools for mathematics; creating new tools for detecting repurposed material, including plagiarism of others' work and self-plagiarism; creating interactive documents; developing deduction systems. The solution of this task requires formalization of mathematical statements and proofs [9].

At present, research activities in mathematics are associated with the use of modern information technologies (cloud, semantic, etc.). These technologies are used in the research of distributed scientific teams and in the preparation and dissemination of mathematical knowledge in electronic form. A new type of digital library is being formed, connected with the integration of mathematical knowledge into the scientific information space; see [1, 10, 11]. This type of information system is called a Digital Mathematical Library (DML). A number of global projects are being implemented, such as the European Digital Mathematical Library and the World Digital Mathematical Library [12–14]. More details about their goals, functions and current results are given below, in Section 3.

Implementation of digital mathematical libraries involves the development of special tools and continuous improvement of their functionality. An example is Open Journal Systems (OJS, https://pkp.sfu.ca/ojs/). The platform is used in many projects, in particular in the Lobachevskii Journal of Mathematics (http://ljm.kpfu.ru/), one of the first digital mathematical journals [15].

In our work, we look more deeply into the world's largest DMLs in order to outline the current status of these projects and to investigate the services and functions that these digital mathematical libraries provide.

2 Mathematical Libraries and DELOS Digital Library Reference Model

2.1 Criteria for investigation

First, we need to establish common criteria and the main features and functions that we will look at.

The DELOS Digital Library Reference Model [16, 17] distinguishes three basic concepts for defining what is called a digital library (DL):

• DL: a (potentially virtual) organization that comprehensively collects, manages, and preserves for the long term rich digital content and offers to its user communities specialized functionality on that content, of measurable quality, and according to prescribed policies;

• DL system: a software system that is based on an architecture and provides all functionality that is required by a particular Digital Library. Users interact with a Digital Library through the corresponding DL system;

• DL management system (DLMS): a generic software system that provides the appropriate software infrastructure to both produce a basic DL system that incorporates all functionality that is considered foundational for Digital Libraries and integrate additional software offering more refined, specialized, or advanced functionality. An intrinsic part of DLMS functionality is related to administrative services that are used to choose the appropriate subset of its functionality, e.g., through relevant parameters of its components, and then install, deploy, and (re)configure a DL system.

A DLMS is “system software”. As in several other domains (e.g., operating systems, databases, user interfaces), such kernel software may be used as a foundation to produce Digital Library systems.

While the concept of DL is intended to capture an abstract system that consists of both physical and virtual components, the remaining two capture concrete software systems. For every DL, there is a unique DL system in operation (possibly consisting of many interconnected smaller DL systems in the most general case), whereas all DL systems are based on a handful of DLMSs.

In the role-based aspect, the DELOS DLRM model considers the following types of users: the end user of the DL, the developer of the DL, the system administrator of the DL, and the developer of applications for the DL. Thus four levels of user views and expectations are formed. In addition, the model identifies six key areas, each of which introduces and defines its own entities and their properties: architecture, information space, functionality, users, policy and quality of services provided. These areas can be considered as evaluation criteria and, by virtue of their universality, can be used to analyze almost any information system.

We will analyze existing digital mathematical libraries using the evaluation criteria embedded in the DELOS DLRM model.

2.2 Differences between approaches

It is worth dwelling on the approaches to defining the elementary objects with which a digital library works. In particular, an interesting approach to the organization of information objects lies in the ideology of WDML. We use the same approach in creating the digital mathematical library Lobachevskii DML, which is based on the mathematical collections of Kazan Federal University [18].

Usually a digital library consists of collections, and collections in turn consist of documents or information resources (objects). In 1990–2000 a large number of studies were carried out on the definition and on the architectural and technical aspects of DL systems. Finally, it is necessary to mention the creation of the DL manifesto in the DELOS project, which resulted in a reference model for DL [16, 17]. With the development of Semantic Web technologies, it became interesting to investigate the semantics of the resources placed in libraries and of the links between them; see, for example, [18]. In this case, an information object can be not only a document as a whole but also one of its parts: abstract, keywords, bibliography, citations, comments of authors or readers.

From the end user's point of view, a DL must satisfy the user's expectations. The document itself, as an elementary information object, may not be interesting at all. Searching for information on a particular entity or subject mentioned in the document is much more in demand. At the same time, it is much more interesting to find all possible resources where different versions of the mentioned subjects occur, especially in cases when various interpretations and definitions are possible. For example, there are a number of definitions of the concept of “digital library”, and a user studying this topic will certainly be interested in all references to the definition of the digital library from different sources. Thus, we observe a change in the elementary information object. The electronic document is fragmented into smaller information objects, and all services of a library deal with such objects and manage the relationships between them. In mathematics, such elementary objects can be, for example, theorems, lemmas, definitions or formulas, the study of which is much more informative when based on a number of sources. The services of any DML should provide such an opportunity. All this functionality lies in the WDML architecture. Its implementation became possible only with the development of semantic technologies and the transfer of library content into digital form with metadata. Now there are no technical problems in maintaining such an approach to the organization of a DL.

During our research, we take into account this transformation of the approach to the organization of information objects. Note that the change in the approach to the organization of DL does not affect the selected criteria for investigation.

3 Functionality of Digital Mathematical Libraries

Below is a brief review of existing digital mathematical libraries. The largest projects are “All-Russian Mathematical Portal Math-Net.RU”, “Centre de diffusion de revues académiques mathématiques”, “Czech Digital Mathematics Library”, “The Polish Digital Mathematics Library”, “Göttinger Digitalisierungs Zentrum”, “Numérisation de documents anciens mathématiques”, Zentralblatt MATH, “Bulgarian Digital Mathematics Library” and “The European Digital Mathematical Library”. It should be noted that the projects are at different degrees of completeness, and the range of services provided also differs.

3.1 Math-Net.ru

All-Russian Mathematical Portal Math-Net.ru [4, 20–22] combines both a digital mathematical library and a publishing system for mathematical texts. It is a web portal developed by the V. A. Steklov Institute of Mathematics, Russian Academy of Sciences.

The key component of the portal, the “Journals” section, links Russian periodicals in the field of mathematical sciences into a single information system. It currently contains more than 120 journals with nearly 200 thousand publications. Information about an article includes a bibliographic description, an abstract, reference lists and a file with the full text of the article. The portal is presented in two languages, Russian and English.

The most interesting part is the functionality of the portal. The portal provides the ability to search for publications and links by bibliographic description and by keywords in the title, abstract or text. The search returns an abstract, article IDs (DOI, references to the resource in abstract databases, URIs), a citation pattern, and classifier values. There is no recommender service; in fact, all semantic services work with the bibliographic description of the resource. The MiRef module is used to form the description and the links to resources correctly. The module is designed to automatically place links to various publication databases in the reference list. The format of the links must satisfy the rules of the amsbib package, and they should be entered in LaTeX format.

Registered users can create personal pages and manage personal collections of publications. Authors get access to the full texts of their articles, can send a manuscript to the editorial office of a journal electronically, and can track its progress through the editorial workflow.

Statistics on popular authors and resources are maintained, and infometric indicators for resources located on the portal are calculated.

The policy for accessing the full texts of articles is determined by the publisher of the paper. Access to all other information is free.

3.2 CEDRAM

The center for diffusion of academic mathematical journals (CEDRAM, Centre de Diffusion de Revues Académiques Mathématiques) is a web portal for common access to a set of mathematical journals [23], available in French and English. CEDRAM's mission is to provide wide distribution of their current volumes; its activities range from help in producing journals according to the best standards for electronic publishing to long-lasting archiving. CEDRAM is a service of the Cellule MathDoc (UMS 5638 of CNRS and Université Joseph Fourier), which completes its important offer in mathematical documentation. This DML is not very large: it contains 9 French mathematical journals, 1 book and 7 proceedings of seminars and conferences.

The CEDRAM websites offer two ways of consulting the hosted articles: quick and advanced search. The search functions support search by keywords, author, title and bibliography, as well as full text search. Quick search looks in all fields except the full text. The advanced search interface offers several types of search, more or less complicated. The full entry of articles produced for CEDRAM contains abstracts and bibliographical references.

All online records have a stable URL and exist in two formats, which differ only in the way they display mathematical formulas in titles, abstracts, keywords or references: MathML or TeX. The XHTML+MathML display is best for reading and browsing, but there are some problems with viewing in browsers, which need to be pre-configured to work correctly with MathML. The HTML+TeX version is used for compatibility, for users who do not have an environment capable of displaying MathML. CEDRAM now provides the following services [12, 13, 23]:

• production workflow of journals;
• a dedicated web site for each journal;
• creation and maintenance of LaTeX styles (using a specific class);
• production of PDFs for print and web with XML/MathML metadata;
• DOI registration (Crossref) and reference linking (MSN, ZBM, mini-DML, Crossref);
• a publishing platform for mathematical articles based on Open Journal Systems (Public Knowledge Project, https://pkp.sfu.ca/ojs/);
• archiving of all resources in a partner project, the French digital math library NUMDAM.

Policy and quality of services. Starting in 2017, all CEDRAM journals are open access. Access to the database containing the bibliographical references of all the articles of all participating journals is totally free. The database itself is the property of Cellule MathDoc and contains elements covered by copyright. CEDRAM has an OAI-PMH server, which can be used for systematic download of metadata in various schemas. Files of the full texts are the property of the journals, and it is necessary to refer to the policy of each of them. There are also some restrictions on full copying and on indexing by web robots.

3.3 Numérisation de Documents Anciens Mathématiques (NUMDAM)

The French digital math library NUMDAM [12–14, 24] started as a digitisation program for a pilot of 6 journals. It now contains more than 57000 articles in 76 periodicals, 373 books in 4 collections, and 263 theses.

NUMDAM is the reference French digital mathematics library, set up by Cellule MathDoc with the assistance of a network of partners.

From 2007 onwards, publishers send born-digital articles to the DML. Collections are normally indexed within one year of publication, and full texts are freely downloadable at the end of a period of time set by agreement for each title.

The NUMDAM program is designed to support academic publishers and provide the research community with a sustainable, reliable and easy-to-use library. The research and dissemination platform was completely redesigned in 2016. The portal is now available in two languages, English and French, and formulas can be displayed in TeX or in graphical form using MathJax.

The system provides the following functions: search and navigation by title, author, references or in the full text of resources. During a search, statistics related to the search topic are displayed: co-authors, journals and years of publication. Browse functions provide navigation through sorted lists of resources (authors, journals, etc.). Full texts are available in PDF and DjVu formats. Each article in NUMDAM is available via a stable URL. This URL is a compact address, designed to remain valid in the long term. It is displayed on the web page of the article, on the first page of the PDF or DjVu files, and by the OAI-PMH server.

There is no user registration. All functions are openly accessible. NUMDAM only disseminates resources that have already been published in journals, books or theses, but the submission process for resources is not clear. Metadata extraction is performed only for the bibliography. Additional services such as formula search or a recommender system are absent.

The full text of the most recent articles is generally not available. The journals whose archives are on this portal have accepted the principle of “a moving wall”. This is a time interval between the publication of a volume (in paper or electronic form, delivered to subscribers) and the availability of the full text on the NUMDAM server. For most journals in NUMDAM, the moving wall is 5 years.

3.4 The Czech Digital Mathematics Library (DML-CZ)

The Czech Digital Mathematics Library (DML-CZ) [25, 26] has been developed in order to preserve in digital form the content of a major part of the mathematical literature that has ever been published in the Czech lands, and to provide free access to the digital content and bibliographical data. DML-CZ resulted from project no. 1ET200190513, supported by the Czech Academy of Sciences (CAS) in the R&D programme Information Society, and is operated by the Institute of Mathematics CAS. The project seems to have been finished in 2010 and is now in a stable form.

Functionality. Editors of all journals included in DML-CZ use tools and workflows that have been tailored to their individual publishing practice and that enable them to produce inputs for DML-CZ in a semiautomatic way. The formal consistency and integrity of the data are controlled by several validating procedures that have been developed in the project. There are some automated procedures for validation of the data of new journal issues, but all of them are archived in DML-CZ for internal use and development. Based on limiting the name space of allowed TeX macros, the validation service gets all metadata, including abstracts, keywords and references, transformed into a representation using MathML [27].

End users cannot submit any resource; everything can be submitted only through the editorial boards of journals. There is also no personal area for users.

Search and navigation. Like other DMLs, DML-CZ allows searching by title and author of publications. Search by language or by Zentralblatt MATH and MathSciNet identifiers is also available. Browse functions provide navigation through sorted lists of resources (authors, journals, etc.).

The most interesting function is the search for related articles (finding similarities between papers). This service tries to find similar papers using three methods: “Term frequency–Inverse document frequency” (TF-IDF, see, e.g., [28]); “Random Projections”, a method that is built on TF-IDF and simplifies the computations by projecting vectors onto a subspace of lower dimensionality [28]; and “Latent Semantic Indexing” (LSI, [29]). The last method gives the most accurate results, up to 90%.

Policies and quality of service. The digitized journal and proceedings papers are displayed with the agreement of the publisher who owns the digital data. The digitized monographs are displayed with the agreement of the author and/or the publisher, while the digital data are the property of the Institute of Mathematics CAS. The database itself, in particular the bibliographic data, is the property of the Institute of Mathematics CAS. DML-CZ presents full texts of articles and book chapters in PDF format, equipped with enhanced metadata including bibliographical references linked to Zentralblatt MATH and MathSciNet. The born-digital documents are obtained from the original sources provided by publishers. The presented page content and format correspond to the original. Journals are presented and accessed according to the terms of a contract with the publisher. The digital documents displayed in DML-CZ are authorized with electronic stamps.

3.5 The Polish Digital Mathematical Library

The Polish Digital Mathematical Library (DML-PL, [30]) has existed since 2002. The library holds full texts of Polish mathematical journals and books. The major part of the collection consists of archive issues of mathematical journals published before World War II. The library consists of 550 books and 36 journals, but only 3 journals provide access to the full text of articles. The DML-PL portal provides search by attributes and navigation through sorted lists of authors, books and journals.

A brief explanation of the project is given in [31], but it now seems that the project is already finished. The web portal of the library gives no additional information about its current status. Any information about semantic functions or metadata extraction from resources is missing.

3.6 GDZ–Göttingen Digitization Centre

The task of the GDZ [32, 33] is to record data such as prints, manuscripts and illustrations and to preserve them. The main aim of the project is the conversion of resources into digital form. This is a multidisciplinary library that contains not only mathematical collections but also collections on the history of law, the history of the humanities and the sciences, travel and North American literature, and others. The mathematical collections hold about 7000 resources, including some Russian resources. The library contains more than 15 million digitized pages.

The portal provides search in the metadata and full text of resources, as well as browse functions. Many resources are historical rather than modern, since the main aim of the project is to digitize and preserve resources. All resources have full texts and can be viewed page by page or in structured mode. The metadata of any resource contain a stable URL of the resource and can be downloaded in METS format.

3.7 Zentralblatt MATH

Zentralblatt MATH (zbMATH, [34]) is an abstracting and reviewing service in pure and applied mathematics. It is hosted by the Berlin office of FIZ Karlsruhe – Leibniz Institute for Information Infrastructure GmbH (FIZ Karlsruhe) and distributed by Springer. The zbMATH database contains more than 3.5 million bibliographic entries with reviews or abstracts, currently drawn from more than 3,000 journals and serials and 170000 books. zbMATH is not a digital library itself; it is an indexing service that provides easy access to bibliographic data, reviews and abstracts from all areas of pure mathematics as well as applications, in particular to the natural sciences, computer science, economics and engineering.

Search functions cover documents, authors and journals. Search can be done in one line or in structured form using attributes such as title, author, subject, source, keywords, etc. The service also provides full-text formula search for indexed arXiv documents [35]. The zbMATH formula search uses the MathWebSearch system, which is a content-based search engine for MathML formulas based on substitution tree indexing.

The portal offers three ways of displaying mathematical formulas: MathML, MathJax and LaTeX. The XML-based MathML is the solution recommended by W3C for displaying mathematical content on the web and is set as the default within zbMATH. Mathematical Reviews and zbMATH maintain the Mathematics Subject Classification (MSC), a classification scheme for mathematics. It is used by reviewing services to categorize items in the mathematical sciences literature. The database of the service contains about 2.1 million direct links to electronic versions of the indexed publications, to the publishers' websites and/or to electronic libraries with open access to the full texts.

3.8 Bulgarian Digital Mathematics Library

The Bulgarian Digital Mathematics Library (BulDML) is a digital repository at the Institute of Mathematics and Informatics of the Bulgarian Academy of Sciences. The library has 7 mathematical journals, 4 book series and proceedings in its repository. In fact, BulDML is an institutional repository built on the open-source DSpace software [36]. DSpace preserves and enables open access to all types of digital content, including text, images, moving images, mpegs and data sets. The functionality of the DSpace software is well known, and we do not describe it in this paper. Additional information about DSpace can be found, for example, in [17, 37].

3.9 European Digital Mathematics Library

The European Digital Mathematics Library (EuDML) was a project partly funded by the European Commission. EuDML [12–14, 38, 39] is an aggregation and indexing service that was established under the EuDML Initiative and is promoted by the European Mathematical Society. EuDML assembles as much as possible of the digital mathematical corpus in order to make it available online, with eventual open access, in the form of an authoritative and enduring digital collection, growing continuously with publisher-supplied new content, augmented with sophisticated search interfaces and interoperability services, and developed and curated by a network of institutions.

The system, presented in the diagram in Figure 1, conceptually consists of a metadata repository, a search engine, a metadata enhancer, an association analyzer, annotation and accessibility functions and, of course, the interfaces [38].

Figure 1 EuDML architecture

The metadata repository provides the central point of reference for all the managed content. It works with an OAI-PMH harvester to ingest repositories' content descriptions and maps the metadata into the internal EuDML schema. The performance and the quality of responses of the search service directly influence user experience. Therefore, this service in particular has to be reliable, scalable and customized to fulfill user expectations.

The metadata enhancer function consists of a collection of tools that each contribute to expanding or completing the existing items' metadata, depending on the improvements needed. These range from applying OCR to full texts, through adding keywords or multilingual metadata by merging information from different databases when an item happens to have such non-redundant descriptions, to generating MathML for mathematical expressions, etc. The association analyzer detects, analyses and records relations between individual items. The annotation component provides mechanisms to attach new material to individual items in the repositories and to maintain this new material. The accessibility component provides support for enhanced accessibility of items, if required, before presentation to end users. Finally, the user and system interfaces provide access to the collected resources on different levels, both to human and machine users. EuDML now offers several service interfaces that allow other applications to connect with the service. These are an OAI-PMH server, REST services and an OpenSearch service, which allow the library index to be queried by machine, and annotation retrieval services in JSON.

EuDML aims to be an open source of trusted mathematical knowledge. That is why it has some policies:

• All texts must have been scientifically validated and formally published;
• All items must be open access after a finite embargo period. Once documents contributed to the library are made open access due to this policy, they cannot revert to closed access later on;
• The digital full text of each item contributed to the library must be archived physically at one of the EuDML member institutions.

All DMLs described above, except the All-Russian Mathematical Portal Math-Net.RU, are partners of EuDML.

4 Conclusion

In order to outline the differences between the observed projects, we created the comparison Table 1 listed below. Note that we excluded two DMLs from the table for the following reasons. BulDML is built on the open-source DSpace software, so its functionality is well known; for DML-PL, we could not find any working portal in order to study it more deeply.

In all the projects studied, emphasis is placed on the transfer of the resources themselves to electronic form, rather than on the development of semantic services. Only a few portals have a mathematical formula search, and only one has a recommender service.

The analysis shows that there are only two types of repository systems: the first is DMLs proper, which preserve the resources themselves; the second is indexing and aggregating services, which do not have their own database of electronic documents but provide a wide range of convenient search capabilities.

This work was funded by the subsidy allocated to Kazan Federal University for the state assignment in the sphere of scientific activities (grant agreement no. 1.2368.2017) and with partial financial support of the Russian Foundation for Basic Research and the Government of the Republic of Tatarstan, within the framework of scientific projects Nos. 15-07-08522, 15-47-02472.

References

[1] Borwein, J.M., Rocha, E.M., Rodrigues, J.F.: Communicating Mathematics in the Digital Era, pp. 3-21. A K Peters, Ltd. MKM-IG. Mathematical Knowledge Management (2008). http://www.mkm-ig.org/
[2] Wolfram, S.: A New Kind of Science. Wolfram Media, Inc. (2002)
[3] Wolfram, S.: An elementary introduction to the Wolfram Language. Wolfram Media, Inc. (2015)
[4] Chebukov, D.E., Izaak, A.D., Misyurina, O.G., Pupyrev, Yu.A., Zhizhchenko, A.B.: Math-Net.Ru as a Digital Archive of the Russian Mathematical Knowledge from the XIX Century to Today. Intelligent Computer Mathematics, Lecture Notes in Comput. Sci., 7961, pp. 344-348, Springer (2013), doi: 10.1007/978-3-642-39320-4_26
[5] Carette, J., Farmer, W.M.: A Review of Mathematical Knowledge Management. In Intelligent Computer Mathematics. Lecture Notes in Computer Science, 5625, pp. 233-246 (2009)
[6] Ion, P.D.F.: Mathematics and the World Wide Web. In Intelligent Computer Mathematics. Lecture Notes in Computer Science, 7961, pp. 230-245 (2013)
[7] Lange, C.: Enabling Collaboration on Semiformal Mathematical Knowledge by Semantic Web Integration. Ph. D. Thesis, Jacobs University Bremen (2011)
[8] Elizarov, A.M., Lipachev, E.K., Nevzorova, O.A., Solov’ev, V.D.: Methods and Means for Semantic Structuring of Electronic Mathematical Documents. Doklady Mathematics, 90 (1), pp. 521-524 (2014), doi: 10.1134/S1064562414050275
[9] Elizarov, A., Kirillovich, A., Lipachev, E., Nevzorova, O., Solovyev, V., Zhiltsov, N.: Mathematical Knowledge Representation: Semantic Models and Formalisms. Lobachevskii J. of Mathematics, 35 (4), pp. 347-353 (2014), doi: 10.1134/S1995080214040143
[10] Elizarov, A., Kirillovich, A., Lipachev, E., Nevzorova, O.: Digital Ecosystem OntoMath: Mathematical Knowledge Analytics and Management. In: Kalinichenko, L., Kuznetsov, S., Manolopoulos, Y. (eds.) Data Analytics and Management in Data Intensive Domains. DAMDID/RCDL 2016. Communications in Computer and Information Science, 706, pp. 33-46 (2017), doi: 10.1007/978-3-319-57135-5_3
[11] Elizarov, A.M., Kirilovich, A.V., Lipachev, E.K., Nevzorova, O.A.: Mathematical Knowledge Management: Ontological Models and Digital Technology. CEUR Workshop Proceedings, 1752, pp. 44-50 (2016), http://ceur-ws.org/Vol-1752/paper08.pdf
[12] Bouche, T.: Towards a World Digital Library: Mathdoc, Numdam and EuDML Experiences. UMI, La Sapienza, Roma (2016), http://www.mat.uniroma1.it/sites/default/import-files/biblioteca/SEMINARIO2016/bouche.pdf
[13] Bouche, T.: Digital Mathematics Libraries: The good, the bad, the ugly. Mathematics in Computer Science, (3), pp. 227-241 (2010), doi: 10.1007/s11786-010-0029-2
[14] Bouche, T.: Reviving the Free Public Scientific Library in the Digital Age? The EuDML Project. In: Kaiser, K., Krantz, S., Wegner, B. (eds.): Topics and Issues in Electronic Publishing, JMM, Special Session, San Diego, January 2013, pp. 57-80 (2013), http://www.emis.de/proceedings/TIEP2013/05bouche.pdf
[15] Elizarov, A.M., Zuev, D.S., Lipachev, E.K.: Mathematical Content Semantic Markup Methods and Open Scientific E-Journals Management Systems. In: Klinov, P., Mouromtsev, D. (eds.) KESW 2014. CCIS, 468, pp. 242-251 (2014), doi: 10.1007/978-3-319-11716-4_22
[16] Candela, L., Athanasopoulos, G., Castelli, D., El Raheb, K., Innocenti, P., Ioannidis, Y., Katifori, A., Nika, A., Vullo, G., Ross, S.: The Digital Library Reference Model. FP7-ICT-2007-3. Cultural Heritage and Technology Enhanced Learning (2011)
[17] Candela, L., Castelli, D., Fuhr, N., Ioannidis, Y., Klas, C.-P., Pagano, P., Ross, S., Saidis, C., Schek, H.-J., Schuldt, H., Springmann, M.: Current Digital Library Systems: User Requirements vs Provided Functionality. IST-2002-2.3.1.12. Technology-enhanced Learning and Access to Cultural Heritage (2006)
[18] Elizarov, A.M., Lipachev, E.K.: Lobachevskii DML: Towards a Semantic Digital Mathematical Library of Kazan University, 2017 (in press), DAMDID-2017 proceedings
[19] Kogalovskiy, M.R., Parinov, S.I.: Klassifikatsiya i ispol'zovaniye semanticheskikh svyazey mezhdu informatsionnymi ob’yektami v nauchnykh elektronnykh bibliotekakh. Inform. i yee primen., 3 (6), pp. 32-42 (2012)
[20] All-Russian Mathematical Portal Math-Net.Ru.
[24] NUMDAM. www.numdam.org
[25] The Czech Digital Mathematics Library (DML-CZ), http://www.dml.cz/
[26] The Czech Digital Mathematics Library. Project Funded by the Academy of Sciences of the Czech Republic, 2005–2009. http://project.dml.cz
[27] Rákosník, J.: Recent Development of the DML-CZ 29 Towards a Digital Mathematics Library. Bertinoro, Italy, July 20–21st (2011)
[28] Rajaraman, A., Ullman, J.D.: Data Mining (2011), doi: 10.1017/CBO9781139058452.002
[29] Deerwester, S., Dumais, S., Landauer, T., Furnas, G., Beck, L.: Improving Information Retrieval with Latent Semantic Indexing. Proc. of the 51st Annual Meeting of the American Society for Information Science, 25, pp. 36-40 (1988)
[30] The Polish Digital Mathematics Library, http://pldml.icm.edu.pl/
[31] Zamlynska, K., Tarkowski, A., Rosiek, T.: Evolution of the Mathematical Collection of the Polish Virtual Library of Science. Mathematics in Computer Science, (3), pp. 265-278 (2010), doi: 10.1007/s11786-010-0029-2
[32] Göttinger Digitalisierungs Zentrum. http://gdz.sub.uni-goettingen.de/gdz/
[33] Göttingen Digitization Centre. https://www.sub.uni-goettingen.de/en/copying-digitising/goettingen-digitisation-centre/
[34] Zentralblatt MATH. https://zbmath.org/
[35] Muller, F., Teschke, O.: Full Text Formula Search http://www.mathnet.ru/ in zbMATH, EMS Newsletter (2016) [21] Zhizhchenko, A.B., Izaak, A.D.: The Information [36] Bulgarian Digital Mathematics Library. http://sci- System Math-Net.Ru. Application of gems.math.bas.bg/jspui/ Contemporary Technologies in the Scientific Work [37] DSpace, www.dspace.org of Mathematicians. Russian Math. Surveys, 62 (5), [38] Sylwestrzak, W., Borbinha, J., Bouche, T., pp. 943-966 (2007), http://dx. doi.org/10.1070/ Nowinski, A., Sojka P.: EuDML – Towards the RM2007v062n05ABEH004455 European Digital Mathematics Library. In: Sojka, [22] Zhizhchenko, A.B., Izaak, A.D.: The Information P. (ed.) Towards a Digital Mathematics Library. System Math-Net.Ru. Current State and Prospects. Paris, July 7–8th, 2010, pp. 11-26. Masaryk The Impact Factors of Russian Mathematics University Press, Brno (2010), Journals. Russian Math. Surveys, 64 (4), pp. 775- http://dml.cz/bitstream/handle/10338.dmlcz/70256 784 (2009), http://dx.doi.org/ 10.1070/ 9/DML_003-2010-1_5.pdf RM2009v064n04ABEH004638 [39] EuDML, www.eudml.org [23] CEDRAM. www.cedram.org 323 Table 1 Comparison table of DML projects DML Math-Net.ru CEDRAM NUMDAM DML-CZ GDZ zbMATH EuDML Criteria Inform There is an object DML contains 9 French Contains more than 57000 The digitized journal and proceedings This is The database contains This is an ation hierarchy. Collections math journals, 1 book articles in 76 periodicals, 373 papers are displayed with the multidisciplinary more than 3.5 million aggregation and space split into journals, issues, and 7 proceedings of books in 4 collections, 263 agreement of the publisher who owns library, that contains bibliographic entries with indexing service. articles and so on. seminars and theses. the digital data. not only reviews or abstracts EuDML Currently contains more conferences. 
All Full texts available in PDF and DML-CZ presents full texts articles mathematical currently drawn from assemble the than 120 journals with CEDRAM journals are DJVU formats. Each article in and book chapters in PDF format, collections but also more than 3,000 journals digital nearly 200 thousand open access. Access to NUMDAM is available via a equipped with enhanced metadata history of Law, and serials, and 170,000 mathematical publications. Information the database containing stable URL. including bibliographical references. history of the books. The database of corpus in order to about the article includes the bibliographical The digital born documents are being Humanities and the service contains about 2.1 make it available a bibliographic references of all the obtained from the original sources Sciences, travel and million direct links to online. description, an articles of all provided by publishers. North American electronic versions of the annotation, lists of participating journals is literature and other indexed publications, to literature and a file with totally free. collections. the publishers’ websites the full text of the article. The full entry of articles Mathematical and/or to electronic contains abstracts and collections have libraries with open access bibliographical about 7000 resources to the full texts. references. and also have some Russian resources. Library contains more than 15 million digitized pages. Functio The portal provides the CEDRAM has OAI-PMH NUMDAM has an OAI-PMH Editors of all journals are using tools Portal provides Search functions provide EuDML offers nality ability to search for server, which can be used server, thus allowing sharing of and workflows that enable them to search in metadata search for documents, several service publications and links on for systematic download metadata and better visibility of produce inputs in a semiautomatic and full text of authors and journals. 
interfaces that the bibliographic of metadata in various collections. way. The formal consistency and resources and browse Search can be done in allow other description and keywords schemas. System provide following integrity of the data are controlled by functions. All one line, or in structured applications to in the title, annotation or Search functions provide functions: search and several validating procedures that resources have full form using attributes. connect with the text. As result of the search by keywords, navigation by title, author, have been developed in the project. texts and can be Service also provide full- service. These search, an abstract, article author, title, bibliography references or in full text of There are some automated procedures viewed page by page text formula search for are OAI-PMH IDs (DOI, resource and full text search. resources. During search all for validation of data of new journal or in structured indexed arXiv server, REST references in abstract Quick search searches in statistics, related to the search issues but all of them are for internal mode. Metadata of documents. The services, databases, URIs), a all fields except full text. topic is displayed – co-authors, use and development. any resource contain zbMATH formula search OpenSearch citation pattern, classifier Advanced search journals and years of DML-CZ allows to search by title, stable URL of uses the MathWebSearch service, which values are issued. There interface offers several publication. Browse functions author of publications, by language or resource, metadata system. zbMATH allow to query are no recommender types of research, more provide navigation through by zbMATH and MathSciNet can be downloaded maintain a classification library index in service, in fact all or less complicated. The sorted list of resources. identifiers. Browse functions provide in METS format. scheme for mathematics. 
machine way and semantic services work full entry of articles Metadata extraction made only navigation through sorted list of annotation with a bibliographic produced for CEDRAM for bibliography. Any resources. There is search of related retrieval services description of the contains abstracts and additional services like formula articles. in JSON. resource bibliographical search or recommender system references. are absent. 324 Users There are role model of No any user registration No any user registration No any user registration. No any user There is a personal area No any user users, everybody can End-users cannot submit any registration. for users – for reviewers, registration. register and create own resource, everything can be submitted publishers etc. personal area. only through editorial board of Registered users can journals, also there is no any personal create personal pages, area for users. manage personal collections of publications, authors get access to the full texts of their articles. Quality System is available in Portal is available in Portal available on English and Project was finished in 2010 and now Portal is available in Portal offers three ways EuDML has of two languages. The English and French. Files French. Formulas can be it is in a stable form. German and English. of displaying some policies: all service policy for accessing the of the full texts are the viewed in TeX source code or Portal is available only in English. But main aim of the mathematical formulas – texts must be full texts of articles is property of the journals. in compiled, graphical way. project is to digitize MathML, MathJax and scientifically determined by the All online records exist in NUMDAM only disseminate and preserve LaTeX. MathML is set as validated and publisher of the paper. two formats, which are resources that were already resources. default. 
Not all services formally Access for any other only different by the way published in journals, books or of the system are free, published; all information is free. they display theses but submission process some of them need to be items must be mathematical formulas: of resources is not clear. purchased. open access after MathML or TeX and a finite embargo have stable url link. period. Once documents contributed to the library are made open access due to this policy, they cannot revert to close access later on; the digital full text of each item contributed to library must be archived physically at one of the member institutions. 325
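Table 1 lists OAI-PMH servers for CEDRAM, NUMDAM and EuDML. As a minimal illustration of how a client might harvest metadata from such a server, the Python sketch below builds a ListRecords request URL and parses a response in the standard oai_dc format. The base URL https://example.org/oai and the sample record are placeholders invented for this example, not endpoints or data taken from any of the libraries reviewed; the verb, argument names and XML namespaces are those of the OAI-PMH 2.0 and Dublin Core specifications. No network access is performed.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical endpoint: replace with the base URL published by the library.
BASE_URL = "https://example.org/oai"

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build the URL of a ListRecords request."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it replaces all others.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": rec.findtext("oai:header/oai:identifier", namespaces=NS),
            "titles": [t.text for t in rec.iterfind(".//dc:title", NS)],
            "creators": [c.text for c in rec.iterfind(".//dc:creator", NS)],
        })
    # An empty or missing token means the list is complete.
    token = root.findtext(".//oai:resumptionToken", namespaces=NS)
    return records, (token or None)


# Invented sample response, used only to exercise the parser offline.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A sample article</dc:title>
          <dc:creator>Doe, J.</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    print(list_records_url(BASE_URL))
    records, token = parse_list_records(SAMPLE)
    print(records, token)
```

A real harvester would fetch each URL, call parse_list_records on the response body, and repeat with the returned resumption token until none is returned.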