<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Design and Development of Information System of Scientific Activity Indicators</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Aleksandr Spivakovsky</string-name>
          <email>Spivakovsky@kspu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maksym Vinnyk</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yulia Tarasich</string-name>
          <email>YuTarasich@kspu.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maksym Poltoratskiy</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Key Terms: ICTCInfrastructure</institution>
          ,
          <addr-line>ICTComponent, InformationTechnology, WebService</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Kherson State University</institution>
          ,
          <addr-line>27, 40 rokiv Zhovtnya St., 73000 Kherson</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>21</fpage>
      <lpage>24</lpage>
      <abstract>
        <p>The article provides a brief overview of the most popular information systems for evaluating the scientific activity of scientists. We describe our vision of the functional capabilities of a system that processes the scientometric indicators of a research team, an organization and its units on the basis of scientific profiles in existing scientometric and bibliometric systems. An example of the implemented solution is presented, with the authors' description of its components, basic algorithms and the technologies used.</p>
      </abstract>
      <kwd-group>
        <kwd>scientific activity</kwd>
        <kwd>information systems</kwd>
        <kwd>scientometric systems</kwd>
        <kwd>bibliometric systems</kwd>
        <kwd>scientometric indicators</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Scientific information is a special kind of information that affects the development of
every sector of modern society. Scientific information can be analyzed at several
levels, such as information on research teams, scientific collections,
scientists, scientific works and more. The elementary yet objective component, in our
opinion, is the scientific activity of the individual scientist. Today there are many information
systems that attempt to provide methods and technologies for processing and storing
information on the activities of scientists.</p>
      <p>The most prominent services with rapidly growing impact are Google Scholar,
Scopus, ORCID, Academia.edu, ResearchGate, Mendeley, arXiv.org, cs2n, Epernicus,
Myexperiment, Network.nature and Science community.</p>
      <p>These services help to satisfy the needs of the scientific community, which in
turn positively influences scientific and technical progress and creates a new
paradigm of scientific research. A large number of recently created scientometric
services make it possible to assess the relevance of a scientist's research results. Having
these measurements at hand opens up new opportunities and prospects. In this article
we consider the existing information systems for processing scientific activities
(Section 2), describe our own vision of the system and the capabilities to be designed and
developed (Section 3), and present the basic methods and technologies used for its
implementation (Section 4).</p>
    </sec>
    <sec id="sec-2">
      <title>Related works</title>
      <p>Having analyzed the information systems that deal with the activities of scientists,
research groups, publishers, etc., we consider the most interesting projects below.</p>
      <p>
        Bibliometrics of Ukrainian Science. The pilot project of the information-analytical
system "Bibliometrics of Ukrainian Science" is implemented by the Department of
Bibliometrics and Scientometrics of information and analytical support of the Vernadsky
National Library [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>The system "Bibliometrics of Ukrainian Science" presents the profiles
of Ukrainian scientists who have made information about their publications available on
the Internet; it is the national component of the Ranking of Scientists project
(Cybermetrics Lab).</p>
      <p>
        The information resources of the system are formed by processing the bibliometric
profiles that scientists create on the Google Scholar platform, which contain the results of
their publication activity, together with bibliometric indicators from Scopus, Web of Science
and the Ranking Web of Research Centers. The values of the Hirsch index
in the scientists' bibliometric profiles are updated monthly; the values of the other
indicators are updated quarterly. (A scholar has Hirsch index h if h of his or her publications
have each been cited at least h times.) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
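      <p>The definition of the Hirsch index given above can be illustrated with a minimal sketch. It is not taken from any of the systems described here; the function name and the sample citation counts are assumptions made for illustration only.</p>
      <preformat><![CDATA[
```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    # Rank the papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank  # the first `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Illustrative citation counts for five papers (not real data).
print(h_index([10, 8, 5, 4, 3]))
```
]]></preformat>
      <p>For the sample list, four papers have at least four citations each, but there are not five papers with at least five, so the function returns 4.</p>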
      <p>
        Scopus. Scopus is the world's largest single abstract database; it indexes
more than 17,000 scientific, technical and medical journals from about 4,000
international publishers [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>The Scopus system is designed to support an efficient workflow for researchers, helping
them to: find new articles in their area of specialization; find information about
an author; analyze publication activity in a subject area; track citations; view the
h-index; identify the most cited articles and authors; assess the relevance of a study.</p>
      <p>
        Scopus enables researchers to combine their articles under a single profile [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>Google Scholar. Google Scholar is a freely accessible search engine that indexes
the full text of scientific publications of all formats and disciplines.</p>
      <p>
        Google Scholar performs not only an informational but also a scientometric function. In
the list of search results, the "Cited by" hyperlink shows how
many and which documents in the Google Scholar database cite a publication.
The number in "Cited by" reflects the degree of authority and visibility of the
publication [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Web of Science. Web of Science is an established international scientific
citation database provided by Thomson Reuters. It makes it
possible to search among 12,000 journals and 148,000 conference proceedings in the
natural and social sciences, humanities and arts, and so to obtain the most
relevant information on a given query. In addition to search, Web of Science establishes
reference links between specific studies through the materials they cite, and thematic
links between articles established by reputable researchers working in the field. It is the
most extensive database of abstracts. It is available by subscription [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <sec id="sec-2-1">
        <title>Russian Science Citation Index (RSCI)</title>
        <p>
          RSCI is a national information-analytical system that accumulates more than
2 million publications of Russian authors,
as well as information about citations of these publications from more than 3,000
Russian journals. It is designed not only to support research with
up-to-date reference and bibliographic information, but is also a powerful tool for
evaluating the impact and effectiveness of research organizations, scientists, the
level of scientific journals, etc. [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>
          Earlier, a research team of Kherson State University (KSU), which included the
authors of this article, took part in a number of international and national projects
aimed at developing and implementing analytical information systems and services
for scientific and management processes [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
        </p>
        <p>
          In addition, this article continues the authors' previous work [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ],
which addressed the openness of the scientific activities of Ukrainian scientists,
as well as the construction of an open scientific training system, one of whose main
elements is a system for processing scientometric information.
        </p>
        <p>
          The authors also studied the technical component of the
implementation of feedback services at KSU [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ], as well as the formation of the ICT
infrastructure of a higher education institution [
          <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Vision of the system. Criteria</title>
      <p>Analysis of the information systems described in Section 2 (Table 1) once again confirms
the need for a system that would allow consolidated ratings of
scientists, research groups and organizations to be built automatically.</p>
      <p>Why consolidated? A significant part of the scientometric databases and systems
found in the scientific world are closed and, accordingly, assess only the
academic publications that they index, while the rest of a scientist's work
is left out of the assessment. For example, Scopus indices are taken into account, whereas
the indicators of other scientometric databases are not always treated as significant.</p>
      <p>In addition, the scientific activity of a group of scientists or of a
specific organization currently has to be analyzed manually. The only option for partially
automating it is to rate the organization's profile in Google Scholar (as
the system "Bibliometrics of Ukrainian Science" does). But what should be done if this profile
has not been created? Or if not all of the scientists who work in the organization or belong to the
research team, and their articles, are included in the profile?</p>
      <p>Thus, the main task in building our system is to enable
automatic processing of the scientometric and bibliometric indicators of research groups
and organizations on the basis of scientific profiles in known
scientometric databases and systems, including automatic search for these profiles and their analysis.
The user can view general information about the scientific activity of a scientist,
a research group, a particular university or scientific organization, as well as the consolidated
rating. A scientist registered in the system can receive notifications about changes
in his or her scientometric indicators. The system administrator can generate a general
statistical report for their organization.</p>
      <sec id="sec-3-1">
        <title>Assumptions and Constraints</title>
        <p>The current version of the system can process a scientist's
indicators from Scopus and Google Scholar data. An algorithm for automatically searching
for links to the profiles of Ukrainian scientists has been developed; an algorithm for automatically
distributing scientists' profiles according to the name of the organization in which they work
has been implemented; ratings of departments, faculties and research
teams are generated automatically; and e-mail messages can be sent to scientists about changes
in their academic indices.</p>
        <p>The scientometric indices on which the ratings in the system are based are:
1. h-index (Scopus and Google Scholar). The largest number h such that h of the
indexed papers have each been cited at least h times;
2. citations (Scopus and Google Scholar). The total number of citations of the documents that
are indexed by the system;
3. i10-index (Google Scholar). The number of documents that have ten
or more citations.</p>
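        <p>The citation count and the i10-index can be derived from the same per-document citation counts. The sketch below is illustrative only: the function names and the sample data are assumptions, and the i10-index is taken in its usual Google Scholar sense, the number of documents with ten or more citations.</p>
        <preformat><![CDATA[
```python
def total_citations(citations):
    """Total number of citations over all indexed documents."""
    return sum(citations)


def i10_index(citations):
    """Number of documents that have ten or more citations."""
    return sum(1 for cites in citations if cites >= 10)


# Illustrative per-document citation counts (not real data).
papers = [42, 17, 10, 9, 3, 0]
print(total_citations(papers))
print(i10_index(papers))
```
]]></preformat>
        <p>For the sample list the total is 81 citations, and three documents reach the ten-citation threshold.</p>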
        <p>At present, about 3,000 Scopus profiles of scientists have been processed by the
system, of which 680 have been identified as profiles of Ukrainian scientists.
Automatic processing of the profiles found made it possible to construct a rating of Ukrainian
scientists by their Scopus indices. By sorting the results according to the scientists'
affiliation with a university (e.g. KSU), the system can automatically generate
ratings of the chairs, faculties and research groups of the university (Fig. 2).
As of 10.02.16, the scholars with the highest number of publications are Oleg
Shishkin (581), Leonid Levchuk (463) and Vladimir Gun'ko (322).</p>
        <p>The analysis of the scientific activity of KSU scientists shows that the greatest number
of publications belongs to the teachers of the Chair of Informatics, Software Engineering and
Economic Cybernetics (98), while the highest h-index belongs to the teachers of the Chair of
Botany (5).</p>
        <p>Constructing similar ratings from Google Scholar data is currently
possible only when links to the profiles are available because, unlike in Scopus, authors
must register in the system themselves. This makes searching for scientists more
complicated. We have therefore processed the records whose links were provided by the
University's scientists. Indicators are now available for viewing and analysis for
scientists of the Faculty of Physics, Mathematics and Informatics of KSU, the Faculty of
Pre-School and Elementary Education of KSU, and the general Chair of Philosophy and Social
and Humanities Sciences.</p>
        <p>The next stage of development and improvement of the system will include:
─ automatic integration and analysis of information on the scientometric indicators
of a scientist whose profile is duplicated in a scientometric database;
─ improving the algorithm for processing the scientometric indicators of
organizations and research teams whose names are misspelled or have changed;
─ improving the algorithm for finding links to scientists' profiles according to
the country they belong to;
─ the ability to automatically compare the indices of scientific activity of scientists,
research groups, organizations and their structural divisions.</p>
      </sec>
      <sec id="sec-3-2">
        <title>Analysis of the use</title>
        <p>Two user groups are defined in the system: the system administrator
on the institution's side, and the user.</p>
        <p>The "user" category comprises the staff of institutions, scientists and other
Internet users, who can view the information provided on the system's Web site.</p>
        <p>As an example, consider in detail how the system works with Scopus.
The parser takes a link to a scientific profile from the system database and
loads the corresponding Scopus page. Two parallel threads are then started: one
processes the scientist's scientometric indicators, the other processes information about
his or her articles. Once the whole page has been processed, the system database is queried
for a record with the name under consideration. If the name
is found, the information about the scientist's scientometric indicators and publications
is updated. Otherwise, a record for the author is created in the database with a
unique identifier, and the information about his or her articles and scientometric indices
is entered into the appropriate tables. After the whole database has been updated, the
administrator and the scientists registered in the system receive an e-mail with information
about the changes in their indices.</p>
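        <p>The update-or-create flow described above can be sketched as follows. This is not the system's actual code: the in-memory dictionary stands in for the system database, the page is assumed to be already downloaded and represented as a dictionary, and all function and field names are hypothetical.</p>
        <preformat><![CDATA[
```python
from concurrent.futures import ThreadPoolExecutor
from itertools import count

# Hypothetical in-memory stand-in for the system database: name -> record.
database = {}
_next_id = count(1)


def extract_indicators(page):
    """Hypothetical parser step: pull the scientometric indicators from a page."""
    return {"h_index": page["h_index"], "citations": page["citations"]}


def extract_articles(page):
    """Hypothetical parser step: pull the list of article titles from a page."""
    return list(page["articles"])


def process_profile(page):
    """Process one profile page and return the indicators that changed."""
    # Run the two processing steps in parallel, as the text describes.
    with ThreadPoolExecutor(max_workers=2) as pool:
        indicators_future = pool.submit(extract_indicators, page)
        articles_future = pool.submit(extract_articles, page)
        indicators = indicators_future.result()
        articles = articles_future.result()

    record = database.get(page["name"])
    if record is None:
        # Unknown author: create a record with a new unique identifier.
        database[page["name"]] = {
            "id": next(_next_id),
            "indicators": indicators,
            "articles": articles,
        }
        return dict(indicators)

    # Known author: update the record and report only what changed.
    changed = {
        key: value
        for key, value in indicators.items()
        if record["indicators"].get(key) != value
    }
    record["indicators"] = indicators
    record["articles"] = articles
    return changed


# Illustrative page data (not a real profile).
page = {"name": "A. Author", "h_index": 3, "citations": 40, "articles": ["Paper 1"]}
print(process_profile(page))  # first run creates the record
page["citations"] = 45
print(process_profile(page))  # second run reports only the changed indicator
```
]]></preformat>
        <p>The dictionary of changed indicators returned by each call is the kind of information that could feed the e-mail notifications mentioned in the text.</p>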
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Tools and Technologies</title>
      <p>Developing the solution required the use of the following products and technologies:
─ JSON. It is used in the system for exchanging data with third-party systems.
Thus, our system can be a source of data for other resources. It implements the data
exchange via JSON requests.
─ ASP.NET and Entity Framework. They are used to implement the Web site of the system.
─ The Html Agility Pack library. This library is used for processing Scopus pages. It
uses XPath queries and then adds the results to the database. The previous
version of the system used regular expressions. The use of Html Agility Pack
has had a considerable effect on the system's performance.</p>
      <p>
        One of the most important algorithms used in the system is the Levenshtein algorithm [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>This algorithm is used to determine which organization a
scientist belongs to, a problem that arises when an organization's name changes,
when the name is misspelled in an article, when a scientist changes his or her place of work, etc.</p>
      <p>Let us consider the algorithm in more detail.</p>
      <p>Fuzzy search algorithms, also known as similarity search or fuzzy string
search, are the basis of spell-checking systems and of full-fledged search engines such as
Google or Yandex. Fuzzy search is an extremely useful feature of any search
engine. However, implementing it efficiently is much more complex than
implementing a simple exact-match search. The most commonly used metric is the
Levenshtein distance, or edit distance; algorithms for calculating it are widely
available.</p>
      <p>Thus, we compare the organization affiliation field specified for the author in
Scopus with the set of possible organization names in the system database. This
allows for possible errors in the name.</p>
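      <p>The comparison described above can be sketched with a standard dynamic-programming computation of the Levenshtein distance. This is a minimal illustration rather than the system's implementation: the function names, the case-insensitive comparison and the acceptance threshold of 0.3 are assumptions chosen for the example.</p>
      <preformat><![CDATA[
```python
def levenshtein(a, b):
    """Edit distance: minimum number of insertions, deletions and substitutions."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def closest_organization(affiliation, known_names, max_ratio=0.3):
    """Return the best-matching known name, or None if nothing is close enough."""
    query = affiliation.casefold().strip()
    best_name, best_distance = None, None
    for name in known_names:
        distance = levenshtein(query, name.casefold())
        if best_distance is None or distance < best_distance:
            best_name, best_distance = name, distance
    if best_name is None:
        return None
    # Accept the match only if the distance is small relative to the name length.
    # The 0.3 ratio is an assumed threshold, not a value from the article.
    limit = max_ratio * max(len(query), len(best_name))
    return best_name if best_distance <= limit else None


# Illustrative organization names.
names = ["Kherson State University", "Vernadsky National Library"]
print(levenshtein("kitten", "sitting"))
print(closest_organization("Kherson State Univercity", names))
print(closest_organization("Unrelated Institute", names))
```
]]></preformat>
      <p>In this sketch a misspelled affiliation such as "Kherson State Univercity" is one substitution away from the stored name and is matched, while an unrelated name exceeds the threshold and yields no match.</p>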
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>In this work we have developed a system for processing the scientometric indicators of a scientist
on the basis of his or her profile in the Scopus and Google Scholar systems. The main difference
between our system and others is the ability to automatically build
rankings of the research teams, organizations and units to which the scientist belongs. In
addition, an algorithm for automatically searching for and grouping scientists' profiles by
their affiliation with a particular country or organization has already been developed.</p>
      <p>The personal profile of each scientist gathers information about his or her scientometric
and bibliometric indicators, contains a list of publications, and displays statistics of scientific
work: changes in the number of publications, citations, h-index, etc. A graphical display of
the dynamics of scientific work has been implemented for research teams,
organizations and their subdivisions.</p>
      <p>Today the system is used to calculate indicators of the scientific activity of Kherson
State University and its structural units: departments, faculties, the Specialized
Academic Council, etc.</p>
      <p>We see the next stage in the development of the system as implementing its
interaction with other scientometric systems and databases. We also consider one of the most
important and necessary features to be the ability to
compare several organizations, research groups and scientists.</p>
      <p>The implementation of the algorithm for automatically searching for links to
scientific profiles by country and organization, together with
improvements in the algorithm's efficiency, allows us to speak of the possibility of
sampling and processing large amounts of information. Thus, the next version of the
system is expected to build a data warehouse on the principles of Big Data and
MapReduce. That, in turn, will make it possible to generate ratings of the scientific activity of scientists,
research groups and organizations with minimal expenditure of resources and time.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>1. Bibliometrics of Ukrainian Science, Http://nbuviap.gov.ua/bpnu/index.php?page_sites=pro_proect</mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>2. Abstracts database Scopus, http://health.elsevier.ru/electronic/product_scopus/</mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>3. Scientometric database, http://www.nbuv.gov.ua/node/1367</mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Science</given-names>
            <surname>Citation</surname>
          </string-name>
          <article-title>Index for scientists</article-title>
          , http://index.petrsu.ru
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Spivakovsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinnik</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tarasich</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Web Indicators of ICT Use in the Work of Ukrainian Dissertation Committees and Graduate Schools as Element of Open Science</article-title>
          . In: Yakovyna,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Mayr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.C.</given-names>
            ,
            <surname>Nikitchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Zholtkevych</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Spivakovsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Batsakis</surname>
          </string-name>
          ,
          <string-name>
            <surname>S. (Eds.) ICTERI</surname>
          </string-name>
          <year>2015</year>
          .
          <article-title>CCIS</article-title>
          , vol.
          <volume>594</volume>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>19</lpage>
          . Springer, Heidelberg (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Spivakovsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klymenko</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Litvinenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>The problem of architecture design in a context of partially known requirements of complex web based application “KSU Feedback”</article-title>
          .
          <source>Inf. Technol. Educ</source>
          .
          <volume>15</volume>
          ,
          <fpage>83</fpage>
          -
          <lpage>95</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Spivakovsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinnik</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tarasich</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>To the Problem of ICT Management in Higher Educational Institutions</article-title>
          .
          <source>Inf. Technol. Learn. Tools</source>
          <volume>39</volume>
          ,
          <fpage>99</fpage>
          -
          <lpage>116</lpage>
          (
          <year>2014</year>
          ). (In Ukrainian)
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Spivakovska</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Osipova</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vinnik</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tarasich</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Information competence of university students in Ukraine: development status and prospects</article-title>
          . In: Ermolayev,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Mayr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.C.</given-names>
            ,
            <surname>Nikitchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Spivakovsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Zholtkevych</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. (eds.) ICTERI</surname>
          </string-name>
          <year>2014</year>
          .
          <article-title>CCIS</article-title>
          , vol.
          <volume>469</volume>
          , pp.
          <fpage>194</fpage>
          -
          <lpage>216</lpage>
          . Springer, Heidelberg (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Levenshtein</surname>
          </string-name>
          , V.:
          <article-title>Binary codes with correction for deletions, insertions and substitutions of character</article-title>
          .
          <source>Reports, USSR Academy of Sciences 163.4</source>
          (
          <year>1965</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>10. International Projects, http://www.kspu.edu/About/DepartmentAndServices/DSAICI/internationalprojects</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>