<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Open Access Portal to Vernadsky State Geological Museum of Russian Academy of Sciences Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kirill A. Platonov</string-name>
          <email>k.platonov@sgm.ru</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vera V. Naumova</string-name>
          <email>v.naumova@sgm.ru</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vitaliy S. Eremenko</string-name>
          <email>vitaer@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sergey E. Dyakov</string-name>
          <email>sergdkv@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Iraida A. Starodubtseva</string-name>
          <email>i.starodubtseva@sgm.ru</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Automation and</institution>
          ,
          <addr-line>Control Processes FEB RAS, Vladivostok</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Vernadsky State Geological</institution>
          ,
          <addr-line>Museum RAS, Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In article share data of the Vernadsky State Geological Museum of RAS, one of the oldest natural-science museum of Russia, which began to be created in 1759, are described and more than 250 years are replenished thanks to scientists, patrons and collectors. Development of the Portal of open access to data of the Museum is discussed. The Portal provides data sets to users on the Internet. It allows any Internet user to research, load and use data for researches and the educational purposes. The standards of metadata and protocols can be used further for integration of data SGM RAS into the Russian and world networks of natural science museum data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>The application of information and communication technologies in the second decade of XXI century is the
necessity not only for large museums, but also for quite small museums, even municipal. Information
technologies allow effectively solving many traditional museum problems, including the main museum missions
such as storage, research and exposition of museum collections. It is necessary to mention Internet as a
powerful tool of museum communication with the world, visitors, including virtual. It is an active
implementation of new communication forms, not only through web site, but also through social media. The
purpose of such communication is always museum audience expansion and guarantee to the maximum number
of people, wherever they are, access to the collections [Ana14].</p>
      <p>The basic tendencies in museum information activity are, from one side, intensification of “internal”
museum service functions execution (development and implementation of automated record, storage and
analysis information systems about museum objects), from other side, the increase of museum expositions
interactivity by interactive elements, augmented and virtual systems application and, from the third side, an
attraction of public attention to the museum activity through development of “external” information space,
providing museum information communication as with public as also with scientific and industrial
organizations.</p>
      <p>The information technologies are even more important for natural history museums comparing with other
museums. The natural history museum collections have enormous information about natural objects, both,
exiting and lost (extinct forms of natural life, mined out deposits etc.) This requires the development of
information systems additional functions, as existing ones (for example KAMIS) don’t meet the natural history
museums requirements.</p>
      <p>This article describes the application and development issues of information technologies in natural history
museums on the example of Vernadsky State Geological Museum RAS. The main directions of Vernadsky State
Geological Museum RAS activity are [Mal14]:
1. preservation and augmentation of collections, numbering more than 250 thousands minerals, rocks
and fossils;
2. modern scientific research; assistance in expansion of scientific and industrial organizations
cooperation;
3. educational and cultural activity for a wide public.</p>
      <p>The first step of Internet usage as a tool of museum communication with the world is Natural History
museum data open access portal development.</p>
      <p>The term “open access” was for the first time mentioned at the Budapest conference on open access in
February 2002. The meaning is not practically changed: Open Access is a free, immediate, permanent, fulltext,
online access to scientific publications.</p>
      <p>The seminar “Information Technologies and Dublin Core Metadata Group operation” within the frames of
69th IFLA General Conference defines the main principals of “Open Archive” ideology: world-wide
consolidation of scientific materials archives; open access to archives (metadata); consistent archives and
information providers interfaces; usage simplicity; application of existing standards - HTTP, XML, Dublin
Core, MARC, MARCXML.</p>
      <p>The open access systems to scientific publications, to scientific information archives and to natural history
museums data are developed in the present moment for intensification of scientific research and scientific
communication development.</p>
      <p>Data Portal of Natural History Museum, London; Yale Peabody Museum of Natural History, New Haven;
Australian Museum, Sydney and others are the examples of Natural history museums data open access systems.</p>
      <p>The funds of the one of the oldest Russian natural history museums Vernadsky State Geological Museum
(SGM RAS) started to form in 1759. and for more than 250 years have been supplemented by scientists, patrons
and collectors. Minerals, rocks and ores, meteorite, invertebrate fossils, vertebrates and fossil flora are store
here. The collections are distributed according to the corresponding inventory: “Minerals”, “Rocks and Ores”
“Fossil invertebrate”, “Fossil vertebrate”, “Paleoflora”, “Stone products” and “Written sources to museum
objects and collections”.</p>
      <p>Vernadsky State Geological Museum RAS (SGM RAS) (Moscow, Russia) has started work on development
and implementation of The Museum Open Data Portal in 2017 (Figure 1).</p>
    </sec>
    <sec id="sec-2">
      <title>2 Open Data Portal</title>
      <p>The Portal provides the user with the set of data online. It allows each scientist or professor/student to study,
download or use SGM RAS data for research or education purposes.</p>
      <p>The first step in this direction is the reorganization of the museum 's internal database. The base was created
on Visual FoxFro in 1995. It is available only to museum employees with different levels of access to data. Its
main task is to store and record data on the museum 's collections. The reorganization of the DB made it
possible to correct the old data. When new data is entered, the samples are checked for uniqueness and possible
errors are monitored. The search in the database is possible both by any of the available values and by any
combination thereof. A variety of reports allow you to fully perform accounting functions. Verified data from
the internal database, through file import, is sent to the Internet portal.</p>
      <p>Within the framework of these works, photographs of stock data and scans of printed materials are
organized. The Python programming language has implemented a program that allows to find objects in photos
of exhibits and calculate the dimensions of these objects in pixels according to the data from the SGM database
and then place the "ruler" on the image, which indicates to the user the real dimensions of the object.</p>
      <p>Standards are one of the key components of museum data infrastructure. The introduction of standards
ensures compatibility at the data and software level, avoids information loss and opens up new possibilities for
data integration and joint processing (Antopolsky et al., 2012). The task being solved uses a Dublin Core-based
format.</p>
    </sec>
    <sec id="sec-3">
      <title>3 Software platform</title>
      <p>The Portal provides an access to museum fund information online. Each collected is presented as a set of data.
The data sets are a new digital data representation, containing the whole collection information.</p>
      <p>The form of collection representation can be different: Internet references, pictures, exhibits description,
written sources to museum objects and collections.</p>
      <p>The exhibits descriptions are presented in table form. Internet users at the present moment have access to 6
basic data sets, the exhibits in which are described by 150 parameters. The collection is available for users in
two modes “Initial view” and “Observer”</p>
      <p>The “Observer” mode presents information in three ways: tables, diagram, map. “Table” mode reflects the
whole list of exhibits. The search block and filters are fixed in the top right part. The search allows looking for
exhibits in each field. The filters allow choosing one or several fixed values typical for exhibits. The search and
filters complement each other.</p>
      <p>The navigation block is in the center of the Control Panel allowing to switch pages and to change the
quantity of records on one page. The switch into “Map” mode represents chosen exhibits by marks on
interactive map. The switch on mark shows the exhibit information.</p>
    </sec>
    <sec id="sec-4">
      <title>5 Search system</title>
      <p>The built-in CKAN Storage and Organization Search System offers good array search capabilities. A study of
the system found that the set of CKAN data search and processing operations actually only includes direct
comparison operations.</p>
      <p>We offer a solution for organizing a thematic search for all collections at the same time.</p>
      <p>Search queries are built sequentially. At the first stage there is a search for information in the categories:
"Who," What, "When," Where. " Search is carried out on:
• the author of the collection;
• the name;
• the age;
• the geographical location.</p>
      <p>It is based on the use of the CKAN API system to obtain a list of data arrays and their attributes, to obtain
the csv arrays of interest, to define the csv variety, and to search directly for the resulting data array. This
solution is extremely flexible, reliable, and extensible, while search system performance remains satisfactory
and memory consumption is minimal through the use of Python generators.</p>
      <p>Having received a large number of results in the initial search, we have to clarify it. Search results are
currently refined, usually by facet search methods. The results found are immediately distributed to groups
(facets) according to predefined characteristics. Often their number in each facet is counted, and the user
immediately understands in which direction the search should be narrowed.</p>
      <p>The search in the System is available in two ways for users. In the first case, in reply to a user text inquiry,
the system gives a list of collections containing the search phrase in its description. The general list has a
collection name, brief description and resources types, attached to this collection: table information, exhibits
images, publications and monographies describing this collection. The second case suggests thematic search on
each collection simultaneously. The search inquiries are built consistently. “Who”, “What”, “When”, “Where”
categories are available on the first stage.</p>
      <p>The category “What’ provides search on the name of the deposits, minerals, rocks and ores. The selection of
the value is available out of the specified list. In reply to the request the Portal displays a list of all collections
containing the search phrase.</p>
      <p>The category “Where” is a common geographical search. The coordinates are to be set by hand or to set a
region on an interactive map. The search on countries, geographical regions, districts etc. is also available.</p>
      <p>The category “When” allows to search on stratigraphic age. The category “Who” searches on registration
authors, donators or collection authors. The system allows a complicated serial information search request.</p>
    </sec>
    <sec id="sec-5">
      <title>6 Conclusion</title>
      <p>Vernadsky State Geological Museum RAS is planning to assign DOI names to its Open Data Portal in nearest
future. It will give possibility to achieve uniqueness of each data set on Open Data Portal. The data sets with
DOI names will be identified and will have constant reference for Internet and scientific publications.</p>
      <p>The authors of the project assume that the standards of metadata and protocols applied in the construction of
the Open Data Portal of the Russian Academy of Sciences will be used in the future to integrate the stock data of
the Russian Academy of Sciences into the world network of natural science data.</p>
      <p>The basic principles of development of Vernadsky State Geological Museum Open Data Portal are described
in Sergey Cherkasov, Vera Naumova and others article [Che18].</p>
      <p>The Portal test version is available at the http://data.sgm.ru</p>
      <p>The Help with Portal operation is available at the http://data.sgm.ru/help/.
[Ana14]
[Che18]</p>
      <p>Analytical brief “New information and communication technologies in museums developments”
within the frames of UNESCO/MFGS project “Museum management – XXI century: thematic
regional trainings UNESCO/ICOM and meetings of SNG countries experts on development of
potential in the area of museum management” (2011-2014), Moscow, 2014, 21 p.</p>
      <p>Malyshev Yu.N. A New Stage in Museum Life // Gornaya Promyshlenost, 2014, №4 (116), p. 124
Cherkasov S.V., Naumova V.V., Platonov K.A., Dyakov S.E., Eremenko V.S., Patuk M.I.,
Starodubtseva I.A. The Basic Principals Of Development Of Open Access To Data Of Vernadsky
State Geological Museum Of RAS // Information resources of Russia, 2018, №4 – p.9-14</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>[Mal14]</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>