=Paper=
{{Paper
|id=Vol-2534/18_short_paper
|storemode=property
|title=Geoinformation Technologies and Software for Operational Assessment of Air Pollution
|pdfUrl=https://ceur-ws.org/Vol-2534/18_short_paper.pdf
|volume=Vol-2534
|authors=Alexey A. Kadochnikov,Alexey V. Tokarev,Oleg E. Yakubailik
}}
==Geoinformation Technologies and Software for Operational Assessment of Air Pollution==
Geoinformation Technologies and Software for Operational Assessment of Air Pollution Alexey A. Kadochnikov1,2, Alexey V. Tokarev1,2, Oleg E. Yakubailik1,2,3 1 Federal Research Center Krasnoyarsk Science Center of the SB RAS, Krasnoyarsk, Russia 2 Institute of Computational Modelling SB RAS, Krasnoyarsk, Russia 3 Siberian Federal University, Krasnoyarsk, Russia scorant@icm.krasn.ru, tav@icm.krasn.ru, oleg@icm.krasn.ru Abstract. The features of the development of an environmental monitoring system based on the integration of data from various sources is considered. Operational data collection from automated observation posts for the state of the atmospheric air and meteorological information has been implemented. Technology platform of the geoportal, intended for the collection, processing and presentation of data from various observations, is the system basis of completed development. Keywords: web mapping, GIS, the natural environment, spatial data, monitoring. 1 Introduction Air quality is the most important environmental factor that determines the health of the population and the state of ecosystems. Environmental and air monitoring systems particular are being created and implemented in many cities of the world both abroad and in Russia [1, 2]. A number of federal and regional organizations in Krasnoyarsk monitor the state of air pollution on the basis of their own methods and technologies of data processing [3]. The variety of solutions used, along with interdepartmental and organizational fragmentation, leads to the fact that a comprehensive analysis and rapid assessment of the entire array of recorded information is currently technically difficult and practically not carried out. The data collected are often published late and are not widely understood. The use of international standards would greatly facilitate the collection and analysis of observations from different sources. However, today the situation in Krasnoyarsk and Russia as a whole is far from ideal. In most cases, access to operational observations is limited and there are no specialized tools for obtaining such data. It should be noted that today the regional authorities, together with the Federal services, are trying to create a unified system for monitoring air quality. This example is a "Unified Information System of Monitoring of Pollution of Atmospheric Air of Krasnoyarsk" (http://www.feerc.ru/uisem/krasnoyarsk/). However, it is not possible to analyze operational data with such a system, as the system provides only daily average air pollution data. 2 Methods and tools In the global community the initiative for the establishment of a network of sensors Sensor Web Enablement (SWE) of Open Geospatial Consortium (OGC) has gained prominence due to its maturity and broad support from academics and industry. SWE is committed to providing open standards and protocols to improve communication between different platforms and data providers. The SWE initiative aims to facilitate search, improve accessibility and provide real-time monitoring of various sensors via the Internet [4–7]. Currently, the SWE family consists of seven standards: Sensor Model Language (SensorML), Observations & Measurements (O&M), Transducer Model Language (TML), Sensor Observation Service (SOS), Sensor Planning Service (SPS), Sensor Alert Service (SAS), Web Notification Service (WNS). The most interesting and useful tool for analyzing the environmental situation in Krasnoyarsk would be a tool based on the standard Sensor Observation Service (SOS). This standard describes a service that provides access to measurement results from the sensor or sensor network, provides the software information about the capabilities of the sensor and the quality of the measurements. It also provides access to the parameters of the sensor. In other words, SOS groups a collection of possible heterogeneous sensors and provides their measurements through a standardized service interface. SOS specification defines the operations offered by a specific sensor. The minimum set of methods includes GetCapabilites, DescribeSensor and GetObservation that return information about the observations and Copyright © 2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). measurements supported by SOS. The types of data provided by the sensor and the sensor types themselves can be obtained from the sensor registry. 3 Data collection technology As a result of the analysis of the current situation in Krasnoyarsk, a list of several sources of operational information on the state of atmospheric air, which can be obtained and analyzed, was formed. However, these data have to be extracted by special tools. It should be noted that in most cases the collection of data from external sources can be divided into several groups: 1. Data is presented in the form of web services that transmit information in a structured form, for example, in CSV, JSON, XML formats. Such data is easy to process and check for errors. This block includes data provided within the OGC SOS standard. 2. There are no special web services for receiving data, but there are service services of the resource necessary for its operation. For example, services for plotting or output tabular data in a web interface, tools, data export, etc. Such services can also be used to retrieve the data, but it is necessary to develop a number of tools for data analysis, including the inspection of changes in the structure of the data, changing internal IDs of the format of the output data, the dimensionality of the data, etc. 3. The data is presented in the form of HTML pages of the Internet resource. To obtain such data, you need to analyze the page in order to find some reference text blocks, with which you can always find the necessary information on the page. In this case, a specific text block is extracted from the page and processed further. In some cases, these page fragments can be treated as xml, making it easier to extract information. In some cases, a combination of these groups can be used, because the services may require service information contained in the page itself. Special attention should be paid to structural changes in the received data, developing a series of verification units, including changes in the number of sensors and monitoring stations, the integrity and completeness of the transmitted data, etc. For the operational refinement of the software modules of the collection, the means of logging text information and alerts, for example, by e-mail, should be created. Such tools allow you to receive information about changes on the remote resource and its status constantly. Promptly making corrections to the program code can ensure the integrity and correctness of the information received. There are a number of systems in the world for accessing observational data. For example, the smart Emission platform (https://data.smartemission.nl/data), the OpenSensorHub project (https://opensensorhub.org/), project 52°North (https://52north.org/), istSOS OSGeo project (http://istsos.org/), et al. 4 Results and discussion On the basis of the geoportal of the Institute of Computational Modeling of SB RAS, a unit of scientific and research monitoring was developed for the collection, processing and presentation of data from various observations [8]. Indicators from external data sources are collected on the SensorCollector collection server (http://gis.krasn.ru/sc/). Data sources can be either individual sensors or external databases and information systems through additional adapters. The data collected is recorded in a fact table and stored in multidimensional form with specific dimensions such as date-time, measure, and site. On the basis of primary data, automatic aggregation with different time intervals is performed, calculation of derived indicators is supported. A software interface (API) was implemented to populate and retrieve data, as well as a basic web interface for metadata management and storage navigation. Implemented support for differentiation of access rights to objects of the system depending on the role of the user. The service is implemented in PHP 5 scripting language using Yii framework, PostgreSQL DBMS with PostGIS extension module is used for data storage. The organization of access to observation data is carried out by various applications and services, including viewing tabular data, export, viewing data on maps with the ability to select time intervals and access using generally accepted standards (OGC WMS, SOS). The General scheme of data collection is shown in Figure 1. At the moment, the portal has implemented the collection of operational data from several sources and created the following sections: ‒ Monitoring system of air quality in the city of Krasnoyarsk. ‒ Subsystem and database of the official regional ecological system of municipal authorities. ‒ Subsystem and database of territorial bodies of hydro meteorological service of Russia in Krasnoyarsk region. ‒ Environmental data of independent ecologists and public organizations. ‒ Data of hydrological observations in the Krasnoyarsk region. ‒ et al. Data are also collected from the devices of the network of scientific and research environmental monitoring of the Federal Research Center "Krasnoyarsk Science Center SB RAS". Data is collected at 1 minute intervals from 24 posts using the data access API. Monitoring stations are developed by CityAir company (https://cityair.io/) and allow to obtain data of measurements of particulate matter concentrations in atmospheric air (PM2.5, PM10) and basic meteorological parameters such as temperature, air humidity and pressure. An archive of data obtained from these stations has been accumulated since December 2018. Figure 1. Scheme of data collection observations. The data flow on atmospheric air from the Regional Departmental Information and Analytical System on the State of the Environment of the Krasnoyarsk territory (RDIAS) is set up (http://krasecology.ru/Air/). Observations of air quality are carried out at 11 automated observation posts, 9 of which are located in Krasnoyarsk. To download data from the RDIAS site, a software module with an appropriate "driver" for processing and converting the input data has been prepared, which provides periodic loading of observation data from this system through a web service. At the moment, most of the downloaded data on the fixed observation posts on the remote server is generated every 20 minutes. Data from mobile laboratories appear with a frequency of once a day, after processing them by employees of the regional system. Meteorological data are automatically downloaded from the website of the Federal State Institution "Middle Siberian Department of Hydrometeorology and Environmental Monitoring" (http://meteo.krasnoyarsk.ru/). Meteorological information comes in real time from automatic monitoring posts, it is published on a special website and is updated every 3 hours. Data are collected on the following indicators: atmospheric pressure, humidity, wind direction, wind speed, air temperature. The archive has been maintained since the beginning of 2013. Previously, the site was available daily data on a number of gases (NO2, NO, SO2, CO, hydrofluoride, formaldehyde) and particulate matter PM10. These data were downloaded daily, but since June 2016 free access to these data has been closed. Another source of information is data from the Nebo community project (https://nebo.live/), created by environmental activists from Krasnoyarsk. The project collects data from about 20-30 instruments measuring the particulate matter concentrations in the air (PM2.5) and the main meteorological parameters: temperature and relative humidity. At the end of last year, the owners of the service began to upload archives of data for the past period. However, these archives have gaps in time, but sometimes they allow you to fill in the data missed by the import service. Currently (summer 2019) access to the archives is closed. Special attention was paid to the fact that the data of the Nebo community project on the charts with the average values for the day and month were compared with the maximum one-time maximum permissible concentration of harmful substances, which is incorrect. The site contains a link to the documentation for working with the service API (https://docs.nebo.live/api), but it is not currently available. The software API would make it easier to download data from the site and reduce the number of missing data due to problems with the project site. An adapter has been developed to download daily reports of global coverage (GSOD) from the National Oceanic and Atmospheric Administration (NOAA) of the USA from weather stations of the Krasnoyarsk territory. These data represent records that contain information about temperature, humidity, wind speed and direction, atmospheric pressure, types of precipitation, etc. A data archive has been formed since 2003, services for automatic loading of new information have been created. Special attention was paid to the subsystem of analysis of data received from third-party services. The subsystem analyzes the following events: 1. Unavailability of a web service or site via http protocol or lack of communication. 2. Correctness of the received data (negative or zero values for the data series, coincidence of time in the received data with the requested period, data from the "future", etc.). 3. The unavailability of individual monitoring stations or their indicators. 4. The appearance or disappearance of monitoring stations or indicators on a remote service or website (allows you to quickly add new monitoring stations to the system). The system analyzes the above problems and records all events in a text log on the server, as well as informs the system administrator by e-mail. This subsystem is currently being refined to limit the number of e-mail messages and to send summary information about non-critical errors once in a given period (by default, once a day). This development of the subsystem is due to the fact that part of the services collects data every 10 minutes and each time for the entire service or for each monitoring post separately formed an error message and the number of such messages per hour can be measured in tens of pieces. This is especially true for the stations of CityAir company, because their software is developing and often there are various errors: the data does not come sequentially, some packages have a date with a shift of a week in the future, there are new "virtual" devices on existing stations, etc. To date, in cooperation with the system of operational monitoring data, which is a part of geoportal of Institute of Computational Modeling SB RAS, several web applications have been developed for the publication and analysis of the collected observations that are part of the emerging regional spatial data infrastructure [9]. Figure 2 shows a web application with operational data on all the posts of the CityAir company of the Federal Research Center KSC SB RAS in Krasnoyarsk. The current weather conditions and the level of air pollution (PM2.5) are displayed. Depending on the level of contamination, the indicators are colored. The data is updated on the screen every 20 minutes. Figure 2. Real-time observation data in Krasnoyarsk from CityAir stations. When developing web applications, the analysis of existing solutions related to the presentation of operational information on the state of atmospheric air in the world and Russia was carried out. Among the considered systems there are several examples. Air Quality System based on the data of the United States Environmental Protection Agency (https://gispub.epa.gov/airnow/) provides access to data in the form of Air Quality Index (AQI) [10]. The data on air quality and weather conditions of the AirVisual international system (now part of the IQAir, https://www.airvisual.com/air-quality-map) obtained on the basis of the data from small home stations AirVisual Pro. Non-profit World Air Quality Index project (http://aqicn.org/map/world/) based in China provides access to data in the form of an air quality index for many countries of the world according to the environmental protection Agency methodology. Examples of several regional systems were also considered. One of them is the European system The Copernicus Atmosphere Monitoring Service (https://www.regional.atmosphere.copernicus.eu). Another successful project is the system of the Republican Center for Hydrometeorology, Radioactive Pollution Control and Environmental Monitoring of the Ministry of Natural Resources of the Republic of Belarus (http://rad.org.by/monitoring/air.html). It is also worth mentioning the system "air Quality of Moscow" (http://mosecom.ru/vozdux/) state environmental budget institution "Mosecomonitoring"; system "Environmental portal of St. Petersburg" (http://www.infoeco.ru/index.php?id=53), data published by the Committee on Nature Management, Environmental Protection and Environmental Safety, Air Quality Monitoring in Mumbai, India [11]. Most of these examples provide access only through a web browser or mobile application. Access using standard services (e.g., OGC SOS) is not available. An example of a custom web application built on the components and API of our Institute's geoportal is shown in Figure 3. This development is considered as a prototype of the created regional environmental monitoring system [12]. The map shows data on particulate matter concentrations (PM2.5) in the atmosphere of Krasnoyarsk, obtained from various observation systems. The shape of the icons on the map characterizes the belonging to the data source, and the color reflects the impurity concentration. In addition, the figure shows an example of a graph with Air Quality Index from the stations of the regional system in the city of Krasnoyarsk in general for the first months of 2019. Figure 3. Map with observational data for PM2.5 in Krasnoyarsk on 04.16.2019. 5 Conclusions For operational monitoring and analysis of the environmental situation in Krasnoyarsk, data collection from different sources was organized into a single system "Operational Monitoring Data" of the Institute of Computational Modeling SB RAS. The developed software for importing observations from various resources allows automatic loading of data, regardless of how the access to these data is organized. In case of any changes and errors in the import, e-mail messages are sent, allowing you to quickly make changes to the import program or data on sensors and their composition. In the future, the possibility of organizing access to observational data in Krasnoyarsk with the help of the international standard OGC SOS is being considered. Unfortunately, in Russia this standard is not well developed, the suppliers of monitoring data. References [1] Manuel A. Regueiro, José R.R. Viqueira, Christoph Stasch, José A. Taboada. Semantic mediation of observation datasets through Sensor Observation Services // Future Generation Computer Systems. 2017. 67, pp. 47–56. [2] Azemov D. Saint-Petersburg air pollution monitoring system // Environment of Saint- Petersburg, 2016. 2 (2), pp 8–14. [3] Yakubailik O.E., Kadochnikov A.A., Tokarev A.V. Development of computational technologies and software for assessment of air pollution in Krasnoyarsk // IOP Conference Series: Earth and Environmental Science, 2019, Vol. 211 (1), 012080. [4] Botts M, Percivall G, Reed C, Davidson J 2008 OGC® Sensor Web Enablement: Overview and High Level Architecture. In: Nittel S, Labrinidis A, Stefanidis A (eds) GeoSensor Networks // GSN 2006. Lecture Notes in Computer Science. Vol 4540. Springer, Berlin, Heidelberg. [5] Kamel Boulos et al. Crowdsourcing, citizen sensing and sensor web technologies for public and environmental health surveillance and crisis management: trends, OGC standards and application examples // International Journal of Health Geographics. 2011 10:67. [6] Schmitt R.H. and Voigtmann C. Sensor information as a service – component of networked production // J. Sens. Sens. Syst., 7, 389–402, 2018. [7] Kotsev A, Schade S, Craglia M, Gerboles M, Spinelle L, Signorini M. Next Generation Air Quality Platform: Openness and Interoperability for the Internet of Things // Sensors (Basel, Switzerland) vol. 16,3 403. 2016. [8] Yakubailik O.E., Kadochnikov A.A., Tokarev A.V. Web geographic information system and the hardware and software ensuring rapid assessment of air pollution // Optoelectronics, Instrumentation and Data Processing. 2018. 54 (3), pp 243–249. [9] Kadochnikov A.A., Tokarev A.V., Zavoruev V.V., Yakubailik O.E. Prototype of city environmental monitoring system based on geoportal technologies // IOP Conference Series: Materials Science and Engineering, 2019, Vol. 537, 062052. [10] Technical Assistance Document for the Reporting of Daily Air Quality – the Air Quality Index (AQI) // U.S. Environmental Protection Agency. 2018. https://www3.epa.gov/airnow/aqi-technical-assistance-document- sept2018.pdf [11] Awkash Kumar, Indrani Gupta, Jørgen Brandt, Rakesh Kumar, Anil Kumar Dikshit & Rashmi S. Patil. Air quality mapping using GIS and economic evaluation of health impact for Mumbai City, India // Journal of the Air & Waste Management Association, 2016, 66:5, pp 470-481. [12] Yakubailik O.E. The formation of regional spatial data infrastructure // Journal of Siberian Federal University. Engineering & Technologies, 2016, Vol. 9 (7), pp. 979-986.