<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Information system for diagnosing pollutants based on data analysis algorithms1</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yurii I. Molorodov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of commputational technologies SB RAS ICT SB RAS</institution>
          ,
          <addr-line>Novosibirsk</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2011</year>
      </pub-date>
      <volume>32</volume>
      <issue>6</issue>
      <fpage>55</fpage>
      <lpage>59</lpage>
      <abstract>
        <p>The paper shows the influence of factors, processes and consequences of pollution of components of urban ecosystems on water pollution and hydro chemical composition of water of small rivers and the Ob river basin. The structure of the prototype of an interactive information system (IAS), which provides for the possibility of obtaining a forecast for the next five years, is described. For at least two millennia, water quality has been steadily deteriorating and reaches levels of pollution where the use of water for various purposes is severely restricted or the water may be harmful to humans. This deterioration is related to socio-economic development within the river basin, but atmospheric transport of pollutants over long distances has now changed this picture: even remote areas can be exposed to indirect pollution. The problem of pollution of urbanized territories is relevant all over the world. It is most acute in large industrial cities and their agglomerations, where, as a result of strong technogenic pollution of the natural environment, urbangeochemical anomalies are formed with a high accumulation of organic and inorganic pollutants in ecosystems. Cities serve as powerful sources of migration from air and water flows of pollutants to geochemically associated territories, where pollutants are included in regional and global biogeochemical cycles. Sources and nature of pollution, water resources and the environment in General, cities are diverse and depend on the specifics of industrial production, natural conditions, etc. therefore, scientific and practical interest in studying the factors, processes and consequences of pollution of components of urban ecosystems does not weaken. The results of these studies are necessary for a comprehensive assessment of environmental risks of urban ecosystem functioning, development of measures to improve the quality of life in cities and reduce their negative impact on the environment.</p>
      </abstract>
      <kwd-group>
        <kwd>pollution</kwd>
        <kwd>pollutants</kwd>
        <kwd>information resources</kwd>
        <kwd>control of pollution consequences</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>Data sources for analysis and operation</title>
      <p>Specialists of the Ministry of natural resources and ecology of the Novosibirsk region (MNR&amp;E NR) regularly
monitor the quality of the atmospheric environment and measure the degree of pollution of the atmosphere and water
of water bodies. They state the increase in pollution every year. These circumstances were the reason for the start of
work on creating an interactive information analytical system (IAS). Data from long-term observations for the period
2006 to 2018 are stored on the MNR&amp;E NR`s Web resource, which is open to the public (https://dlh.nso.ru/). They are
the result of monitoring changes in pollutants in water resources, in order to further analyze quantitative and qualitative
indicators of the state of surface water bodies in the Novosibirsk region.</p>
      <p>The goal of our work is to develop an architecture and create a pilot version of the IAS designed for integration and
advanced processing of data processing of field observations, measurements and laboratory studies.is carried out.</p>
    </sec>
    <sec id="sec-3">
      <title>Leading technologies and standards</title>
      <p>Currently, there are few rivers in the world, that are not polluted by human waste. Fertilizers and pesticides from
agricultural land enter rivers with sewage. And, they also get water from sewers and drainage ditches. Some factories
drain streams of dirty water into rivers and lakes. Pollution of rivers and lakes with nitrate fertilizers is growing on the
planet almost every week. Unfortunately, even if you ban the use of nitrate fertilizers tomorrow, the situation will
worsen. Nitrates slowly, for many years, seep through the earth into the beds of rivers or lakes. Dirty sewage and
fertilizers enter lakes and reservoirs and cause rapid growth of ooze-algae that suffocate river fauna and flora.</p>
      <p>The source of problems from the past include pathogens, oxygen balance, eutrophication, and heavy metals.
Currently, they are recognized, investigated, and the necessary measures to prevent pollution are identified.</p>
      <p>Current time problems are of a different nature. These are, on the one hand, traditional point and more extensive
sources of pollution (nitrates) and widespread pollution problems (synthetic organic matter), and, on the other hand,
problems of the "third generation" associated with global cycles (acid rain, climate change) [2].</p>
      <p>In the past, water pollution in developing countries was mainly caused by the discharge of untreated wastewater.
These problems are now more complex as a result of the production of hazardous waste products and the rapidly
increasing use of pesticides in agriculture. In fact, water pollution is now a more serious problem in some developing
countries, at least in developing new industries, than in developed countries. Unfortunately, developing countries
generally lag far behind in controlling their main sources of pollution. As a consequence, the state of the environment
in developing countries is constantly deteriorating. There are a huge number of microbial agents, elements and chemical
compounds that cause water pollution. They can be divided into the following categories: microorganisms, organic
compounds that can be destroyed biologically, suspended particles, nitrates, salts, heavy metals, fertilizers, and organic
micro-pollutants [3].
3.1</p>
      <sec id="sec-3-1">
        <title>Proposed approach for solving the problem</title>
        <p>To ensure effective measures to address the problem of water pollution, a spatial and temporal analysis of the level
of water pollution is necessary, including the study of toxicity indicators based on geoinformation technologies. The
analysis procedure includes taking samples, identifying the level of contamination, and examining the data obtained. A
modern interactive information system has been created, to work with the collected material. Using the Leaflet library
for visualization and analysis [7]. In addition to modern tools for diagnosing pollutants, algorithms for working with an
interactive map, have been developed based on data analysis algorithms, with the ability to display sample collection
sites and information about research results.</p>
        <p>Leaflet is an open source library written in JavaScript designed for displaying maps on websites. Supports most
mobile and desktop platforms that support HTML5 and CSS3.</p>
        <p>Along with OpenLayers and Google Maps, the API is one of the Most popular JavaScript map libraries used on
major sites such as Flickr, Foursquare, Craigslist, and Google maps. Data.gov, IGN, Wikimedia projects,
OpenStreetMap, Meetup, WSJ, MapBox, CloudMade, CartoDB, and others.</p>
        <p>Leaflet allows a developer who is not familiar with GIS to easily display raster maps consisting of small tile
fragments, with possibly additional layers overlaid on top of the main one. Layers can be interactive, such as displaying
a hint when you click on a marker.</p>
        <p>In addition to displaying information on a geographical map, sometimes there is a need for statistical analysis
accumulated over several years. The Chart JS library is used for this purpose (https://www.chartjs.org/) written in
JavaScript. It provides a variety of pre-created charts, such as combined histograms, bar charts, sector charts,
geosystems, and others.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Information system architecture and its functioning</title>
      <p>The architecture of an information system is defined by its components, their functions, and their interaction. The
system is based on client-server technology and consists of a client part, a server, and a JSON data structure. A
clientserver is a computing or network architecture in which tasks, or network load, are distributed between service providers,
called servers, and service customers, called clients. In the described architecture, the client is the browser (user), and
the server is the web server. The system logic is distributed between the server and the client, data is stored in JSON
format, and information is exchanged over the network. The advantage of this approach is that clients do not depend
on the user's specific operating system, so the system is a cross-platform service.</p>
      <p>An important component of the system is the client part of the application (Fig. 1). The map service is based on an
interactive map from the Leaflet library (http://leafletjs.com).</p>
      <p>It was decided to use JSON, as the data format for exchange between the client and the server, since:
• has built-in support in popular browsers released after 2009;
• native JavaScript format.</p>
      <p>Data in JSON format, according to RFC 4627, can be: JavaScript objects, arrays, quoted strings, numbers, Boolean
values, null.</p>
      <sec id="sec-4-1">
        <title>The format of the data structure JSON:</title>
      </sec>
      <sec id="sec-4-2">
        <title>Id: Number, Name: String, Coordinate: Array,</title>
      </sec>
      <sec id="sec-4-3">
        <title>UKISV: Object {Years: Array, Value years: Array},</title>
      </sec>
      <sec id="sec-4-4">
        <title>Description: String …</title>
        <p>To get an object view, it is enough to use native JavaScript functions for converting from a string view to an
object view, using the deserialization operation.</p>
        <p>Data in JSON format, according to RFC 4627, can be: JavaScript objects, arrays, quoted strings, numbers,
Boolean values, null.
4.1</p>
        <sec id="sec-4-4-1">
          <title>Main modules of the information system</title>
          <p>The IAS functionality provides a set of software implementations made in the form of separate modules. Let's
highlight 8 of them (Fig. 2).</p>
          <p>1. The research data processing module unloads the structure from a JSON file.
2. The geographical data processing module uploads the coordinates of the Novosibirsk region borders from a</p>
          <p>GeoJSON file.
3. The data processing module on the server is responsible for uploading data from the data structure.
4. User tracking module-reacts to the user's action and selects the desired action depending on the logical value.
5. The visualization module is responsible for displaying information on the screen.
6. The report module is responsible for creating and processing dynamic reports.</p>
          <p>7. The query module is a component responsible for displaying information about the dynamics of changes in water
pollution from time to time.</p>
          <p>8. The forecast module is responsible for displaying information based on the research forecast for five years ahead.</p>
          <p>Each module performs an independent function. If you need to make changes to a module, you just need to change
a certain function. Any module can be expanded, and improved, and new functionality can be added.
category «А»
category «Б»
category «В»
category «Г»</p>
          <p>Characteristics
conditionally pure
slightly polluted
contaminated
contaminated
very polluted
dirty
dirty
dirty
very dirty
very dirty
extremely dirty</p>
          <p>ICIWP without regard CWP*</p>
          <p>1
(1-2)
(2-4)
(2-3)
(3-4)
(4-11)
(4-6)
(6-8)
(8-10)
(8-11)
(11- ∞)
*Critical water pollution indicators (CWP).</p>
          <p>With the help of the geographical data processing module, the border of the Novosibirsk region is formed. The
boundary coordinates are stored in a special data structure of the GeoJSON format. After processing, the visualization
module adds borders to the map through layers.</p>
          <p>The forecasting module. This module processes data series and analyzes them using linear approximation. The
forecast is made using the least squares method.</p>
          <p>Approximation of a function f(x) is called finding a function (approximating function) g(x) that is close to the
given function. Criteria for proximity of functions can be different [8].</p>
          <p>If the approximation is based on a discrete set of points, the approximation is called point or discrete. If the
approximation is performed on a continuous set of points (segment), the approximation is called continuous or integral.
An example of such an approximation is the decomposition of a function into a Taylor series, that is, replacing some
function with a power polynomial.</p>
          <p>The most common type of point approximation is interpolation-finding intermediate values of a quantity from an
existing discrete set of known values. Let be a discrete set of points called interpolation nodes, as well as the values of
the function at these points. You need to build a function g(x) that passes closest to all the specified nodes. Thus, the
proximity criterion of the function is g(xi)=yi.</p>
          <p>The function g(x) is usually chosen as a polynomial, which is called an interpolation polynomial. If the polynomial
is one for the entire interpolation area, it is said that the interpolation is global. If the polynomials are different between
different nodes, we talk about piecewise or local interpolation. Having found an interpolation polynomial, we can
calculate the values of the function between nodes, as well as determine the value of the function even beyond the
specified interval (extrapolation) [9].</p>
          <p>User help. To launch the web application, the user enters the URL of the information system in the string. On the
client side, the browser renders the resulting code and presents the user with a set of visual controls: an interactive map,
navigation tools, zooming, and switching the map type (Fig. 3).</p>
          <p>The interactive map is the main component of the system interface. Interactive mode allows you to easily move
around the map and change its scale. Detailed information is displayed through the user interface, which is graphically
and logically divided into three Windows. The window on the left shows the map and the scale that can be changed,
for this purpose, the corresponding buttons are created in the upper left window. There are markers on the map that
mark the places where research works are carried out. Clicking on the marker results in information about the data
received on the site (Fig. 3).</p>
          <p>The upper-right window shows a hierarchical list of all regions of Russia and Kazakhstan. Here you can select a
country, field work area and get information about it. Objects located on the map have different attribute information
that POPs up as a window when you click on the marker (Fig. 3). This is the physical name of the object, its
administrative name, research period, coordinates, quality class according to ICIWP, a brief description of the reservoir.
By clicking on the "Make a forecast" selection window (Fig. 4), it is possible to predict the level of contamination based
on previous samples.</p>
          <p>After the analysis of the information the system will display the result as a "trend Line" (trendline is a geometric
display of the average values of the analyzed indicators), also when you hover over the graph, you detailed rates ICIWP
and the level of pollution (figure 5).</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>A software package was developed based on the modern Leaflet library and methods of mathematical analysis of
observation data to work with the results of long-term monitoring of the ecological state of water bodies in the
Novosibirsk region. It allows you to quickly and efficiently analyze time series of data using statistical analysis
algorithms, which indicates its great potential for scientific research. A software package was created based on the
modern Leaflet library and methods of mathematical analysis of observation data to work with the results of long-term
monitoring of the ecological state of water bodies in the Novosibirsk region. It allows you to quickly and effectively
analyze time series of data using statistical analysis algorithms, which indicates its great potential for scientific research.
Acknowledgements. A project 2.3 “Creating models, algorithms, and integrated systems” supported this work.</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>