Visualization of Sensors’ Data in Time Series Databases for Health Purposes Snezana Savoska 1, Andrijana Bocevska 1 and Hristijan Simonoski 1 1 University “St. Kliment Ohridski”, ul. Partizanska bb, Bitola, 7000, RN Macedonia Abstract The increasing influx of information on pollutants that affect human health leads to collection of huge amounts of data that are recorded in many different formats, servers, for different types of pollutants, for different geographical locations, temporally arranged and most often recorded in time series formats. According to the Law on free access to public information, the institutions are obliged to publish all these data on different types of pollution on public websites and to make them available to the citizens. Institutions most often publish collected and stored data at locations where the measurement sensors are found and present them in visual formats for the current measured data for a certain pollutant, which are understandable to the citizens. Many applications can show at any time the values of pollutants at certain positions. In addition, summary analyzes can be made if the allowed limits for pollutants are known according to the Law on environmental protection. However, there is no organized effort to quantify their common impact on a person during their lifetime, i.e. to assess the risk for each individual depending on their health conditions, chronic disease, if any, or assessment of external pollutants – aero-exposures to the individual at one location over a period. Setting such a premise, especially if a prediction of outcomes could be made, would be very useful if these aero-exposomes were linked to the chronic disease from the patient’s personal health record (PHR), particularly on the respiratory system. The risk assessment of residence in certain areas in a certain period, and the availability of this assessment to his doctor and to him personally, would be very useful because it would protect patient’s health and give recommendations for avoiding locations in periods of stay in certain locations. In this paper, we are trying to give some guidance and propose a model for visual data analysis from time series with environmental data that should help in assessing the risk of residence in some locations at particular time. Data are obtained from measuring sensors, placed in the northwestern region of Republic of North Macedonia. The limits of permitted values of pollutants Information Systems & Grid Technologies: Fifteenth International Conference ISGT’2022, May 27–28, 2022, Sofia, Bulgaria EMAIL: snezana.savoska@uklo.edu.mk (S. Savoska); andrijana.bocevska@uklo.edu.mk (A. Bocevska); hristi- jan171@gmail.com (H. Simonoski) ORCID: 0000-0002-0539-1771 (S. Savoska); 0000-0001-8701-0700 (A. Bocevska) © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) are taken into consideration, according to the Ministry of Environment and physical planning website, as well as data from patient’s PHR who has been diagnosed with chronic diseases of obstructive respiratory system. Keywords Time series data visualization, visual data analysis, aero-exposome, chronic respiratory disease, aero-pollutants sensor’s data 1. Introduction Time-oriented data analysis is an important task in many applications and scenarios. In recent years, many different techniques have been used to visualize this type of data. This diversity makes it difficult for potential users to choose methods or tools that are useful for their specific task. This research analyzes different perspectives on the possibilities of time series environmental data visu- alization and visual data analysis (VDA) for health purposes. With the proposed categorization, some efforts are made to find the best ways to visualize time series data that should be used by citizens and doctors in order to give recommenda- tions for behavior by location and place of residence. The categorization should be used from both users and researchers to identify future tasks in VDA of time series data, especially when that data are related to human health. Because time is an important data dimension, special methods are required to support proper analysis and visualization to explore trends, patterns, and rela- tionships in different types of time-oriented data. The human perceptual system is very sophisticated and specially adapted for perceiving visual patterns. For this reason, visualization is a highly appropriate method of data analysis and has been used successfully for data analysis of time series. When it comes to their use by medical and health professionals, these techniques should be easy to use, providing a wide range of visual analysis capabilities and comparing them with the permitted values for the measured pollutants [1]. Because of these specific uses of time series data visualization for health purposes, in this paper, we propose a framework – a model that should help to increase the comprehensibility of visual time series data analyzes for environ- mental data. Today, these data are collected in huge amounts, on different servers, in different formats and code systems, and the need for their efficient and fast analysis requires the application of fast and efficient methods for preparation and visualization. Visual analysis systems aim to overcome this lack of diversity and dispersion of time series data by applying and combining interactive visualiza- tion and VDA techniques. The paper is organized as follows. Second chapter considers related works connected with visualization of time series data and VDA. The third chapter fo- 318 cuses on the characteristics of time series data and their usability, with emphasis on the use of the chronic disease and their physicians, considering the possibility of creating frame for VDA of time series data. The forth section considers mecha- nisms that can act on public awareness to reduce air pollution and how they can be activated in the society. The fifth chapter presents a way to visually present data and assess the risk for chronic disease by location, by creation of VDA and con- necting these data with their chronic conditions. In this part the tools previously described in the context of mechanisms for raising public awareness to protect the patients’ health, are considered. The sixth chapter proposes a model for VDA of time series for pollutants for the health purposes and patients (citizens) pro- tection and validates the model with a practical example of a pollutant. The last chapter draws conclusions and proposes ways to capitalize the proposed model in the society as one of the mechanisms for public pressure to reduce pollution that should positively affect the improvement of environmental parameters. 2. Related works For a long period, the time has been an interesting phenomenon that is stud- ied by scientists, many times along with the location of the event that has been analyzed. A useful concept for data and information modeling, used in conjunc- tion with cognitive principles, is the pyramidal framework, which is based on 3 perspectives: “where”, “when” and “what” about data [2]. Interpretations form objects at a cognitively higher level of knowledge, classification, and interrela- tionships, taking into account both discrete and continuous phenomena. The most influential theories in the natural sciences are Newton’s concepts of absolute and relative time. When modeling the time in information systems, the goal is not time itself, but to provide model that is most appropriate to represent the parame- ters under consideration of VDA. The time scale, scope, structure and views from different perspectives and the granularity of the data should be considered [2]. Techniques for VDA of time series take into account the following criteria: time together with its primitives, the level of abstraction and variability of data and their representation according to dimensionality and multivariate [3, 4]. Many tools have been created for this purpose. Some well-known visualization techniques that represent time-oriented data consider time points. Such tools and techniques have a specific representa- tion of time, on a time axis. For example, TimeWheel is a multi-axis represen- tation for visualizing multiple variables over time and place a time axis in the center of the screen that can be rotated to bring different attributes into focus, zoom and move across the time axis. Because it uses lines to represent data for each time point, it is only useful for multiple variables related to time points but not to intervals [5]. 319 A technique suitable for time intervals visualization and a high level of detail is PlanningLines, which consists of two encapsulated strips that represent the minimum and maximum duration, limited by two caps that represent the start and end intervals. This technique also solves the problem of time uncertainty related to future planning or different time granularities (as days or hours). This tech- nique also supports interactive zooming and brushing, which is especially useful for fine-grained and large time scale [6]. An example of a tool that uses a time-point visualization technique is ThemeRiver, which represents the number of appearances on certain topics, for example, in print media, where each topic is displayed as a colored stream that changes its width continuously, such as flows through time [7, 8]. It is suitable for presenting quantitative data in time, but it is not suitable for presenting branching time or time with multiple perspectives or multivariate data. Because of this, the need for advanced techniques for effective visualization of multivariate data has been recognized. If trends and patterns derived from multiple scales and univariate time series have to be visualized, it is appropriate to use cluster and calendar-based visual- izations and data sets where colors are used to show data similarities [9]. Ap- propriate distance or similarity measures using data mining techniques provide cluster visualization and set the basis for clustering, neglecting the time context in a certain granularity, which complicates VDA in relation to basic time-oriented tasks [10]. Several researchers have worked on event-based visualization. Event speci- fication is a step in which users describe their interests to find a match with the technique used and should therefore be based on formal descriptions and formu- las of events. Such formulas contain elements of logic, variables, functions, ag- gregate functions, logical operators, and quantifiers that create valid formulas for events [11]. For time-oriented data analysis, it is necessary to have a sequence of types of events that are supported. For this purpose, a user-centric event specifica- tion model has been created that includes direct specification, parameterization and selection and provides expert and experiential visualization for users [11, 12]. When big data time series have to be visualized, many researchers use data mining methods to make different types of time data groupings [13]. Apart from the mentioned time tasks for data mining, other analytical methods are used as statistically combined operators [14] and extensions of the analysis of the main components [15]. There are many efforts to visualize time series with pixels. In this case, time can be represented using other visual variables if there is a mapping of time → space. If there is mapping time → time, the physical dimension time is used to convey the temporal dependence of the data through animations. The difference between these mappings is crucial for VDA, as different tasks and objectives are 320 supported. There are also some hybrid forms, which combine the both mappings, as well as the data characteristics [12, 16]. Interaction and navigation methods are needed for data exploration as well as for parameter space exploration. In addition, it is important that these methods be designed according to certain re- quirements [17]. Direct interaction in the visual representation of the analytics methods, com- bined with the data mining methods provide greater control and better feedback to the analyst. This includes interactive parameterization of visual and analytical methods. The methods intended for navigation through large big data space are crucial for analyzing research-supporting environments. The tasks and goals of the user determine the choice of visualization method. If cycles in data should be identified, techniques that enable visual detection of periodic behavior need to be selected. In this case, techniques for visualizing time-oriented spiral data that have the ability to interact and animate in order to detect previously unknown cycles in the data can be appropriate [18]. Many practical examples of visualization and interaction methods show the requirements for the user to assess the limitations of the specific domain for the problem and therefore to choose the appropriate technique and tool for specific visualization and interaction components [5]. The idea of a time browser imple- mented in the TimeSearcher tool for exploring multiple time series can also be mentioned here [19]. Its purpose is to identify and find known data exploration models with user-defined search tolerance of a model that can allow varying de- grees of accuracy and matching. Exploratory data analysis (EDA) is also one of the rising areas in the recent decades, which is actually a return to the first goals of statistics, discovering and describing patterns, trends and relationships in data. It is more about generating hypotheses and less about hypothesis testing. Although not a strictly visual meth- od, EDA is strongly associated with the usage of visual data representations [20]. Visual representations developed in EDA often are complemented by validation techniques, displaying vast data sets, multiple variables, temporally or spatially visualized with EDA visual data analysis. There are many useful and innovative techniques developed in visualization’s tools for EDA purposes that include GIS and time-dependent mapping visualizations. These dynamic features include ani- mation and interaction. Many of the EDA tasks use methods to research complex numerical data. Some analog techniques are developed in innovative ways to interact and explore abstract or non-numerical data that belong to the information visualization [21]. Information Visualization (InfoVis) deals with the mapping of abstract data that has led to new and important research on use and interaction of graphical representation space to display complex information efficiently and clearly [22]. Many combinations of GIS, EDA, and tool-assisted temporal visual techniques 321 have been made with this concept in terms of using complex geo-spatial and temporal data in practice over the last three decades, as well as tools that support these complex visualizations [17, 18, 19, 21, 22, 23, 24, 25, 26]. 3. Time series data and their characteristics Time-oriented data visualization is not an easy task, although many ap- proaches to this task have been published in the last years. The reason why most of the visualization methods are customized is simple: it is enormously difficult to consider all the aspects involved when visualizing time-oriented data. Time itself has many theoretical and practical aspects, whether working with time as a point on a time axis or time intervals using different sets of time relation- ships. Time is therefore interpreted as a linearly ordered set of time primitives, or it can be assumed that time primitives repeat cyclically. Time-bound data is another concern as it can be multiple, diverse, abstract, spatially referenced, or event-related. Therefore, it is necessary to think carefully when choosing visual techniques and tools for their analysis. If only the characteristics of the data are considered, it is possible to generate expressive visual representations, but usually visual representation requires thinking about the representative and perceptual issues of time series data, especially if they are related to environ- mental data [15]. Time series are simply measurements of tracked events’ parameters, sam- pled, collected in different ways at specific time intervals [2, 17]. The difference between these time series data and other data sets is that they are always related to the time of occurrence and questions that can be asked are about changes over time. A simple way to determine if the database we are working with is a time series or not is to see if one of the axes, or one of the parameters is time or time period. 3.1. Concept of designing frames for time series visual data analysis By proposing a concept for the development of visual data analysis frame- works for the time series of environmental data analysis, a vision of how the visual analysis of this data can be created and the necessary steps can be obtained as well as its components and functionality can be considered. It should not be a specific framework, but should describe the general steps and functionality of the main components involved. Individual components can be integrated into various specific applications in order to create a visual analytical framework or time-oriented data visualization system. For this purpose, it is necessary to firstly model the time-oriented environmental data, perform their computer analysis and make a display in interactive visualization. 322 Time series data come in two forms: regular and irregular [27]. The regular time series consist of measurements collected by software or hardware sensors at regular intervals (e.g., 10 seconds) and are often referred to as metrics. Irregular time series are event driven. Irregular time series sums can be seen as regular time series. For example, the average response time in the application, over an interval of one day, or the display of the average value of pollution per hour during the day are already regular time series. When they are in place of mechanisms to increase the public awareness of the citizens in a country, they are important information, especially if they affect human health [1, 28, 29]. 4. Action mechanisms for public awareness to reduce pollution and how to activate them Environmental reporting through indicators is an ambitious endeavor that should produce a report, a picture of the environment’s state, presented with quantitative and qualitative data obtained through scientifically based measure- ments and analyzes. They should point to the sources, causes, consequences and trends of specific conditions. The prepared environmental indicators are based on numerical data showing the condition, special feature or movement of a certain phenomenon. Indicators can warn about problems and are a useful tool in the en- vironmental reporting process. These indicators answer the key questions for the development of a country’s environmental policy. The indicator is an inevitable tool for monitoring the achievement of sectoral policy objectives – strategies, plans, other documents and the basis for planning an effective policy for environ- mental protection and sustainable development [1]. All indicators from the environmental data set are arranged according to the framework known by the acronym DPSIR (Moving Forces – Pressures – Condi- tion – Implications – Reactions) [1] where each phase conveys its meaning and importance and is clear about creating environmental protection policy. The driv- ing forces are social and economic factors and activities that cause an increase or decrease in environmental pressures. They include economic, transport, social and other activities that affect the environment. Pressures are actually direct an- thropogenic pressures and implications for the environment, such as emissions of pollutants or depletion of natural resources. Implications are the effects that envi- ronmental changes have on the human’s health. Reactions are society’s responses to environmental problems. These may include special measures of the state, such as taxes on the consumption of natural resources and penal provisions of the Law on environmental protection. Our goal will be to provide analysis of time series environmental data from accurate and geo-referenced measured values of the parameters that speak for the environment and to initiate the mechanism of 323 Implications – Reactions that should result in providing a better environment for citizens in the region following the indicators: A – Descriptive indicator (gives an answer to the question “What is happen- ing to the environment and people?”, i.e. describes the current situation) B – Progress indicator (gives an answer to the question “What is the distance between the existing situation and the established goal?”, i.e. it compares the existing state of the environment with the established goals for environmental protection and serves to monitor the progress towards such goals) C – Indicator for the efficiency of environmental protection (gives an answer to the question “Is the quality of the environment improving?”, i.e. describes whether the society improves the quality of its products and processes in terms of resources, emissions and waste per unit of product) D – Indicator of the effectiveness of the policy (gives an answer to the ques- tion “How effectively is the official policy of the country for the protection of the environment implemented?”, i.e. whether and to what extent the official policy of the country is implemented) E – Indicator for the overall well-being (gives an answer to the question “Has our situation completely improved?”, i.e. describes whether and to what extent the country achieves sustainable development or economic development that en- sures social welfare of citizens and environmental protection). In order to detect these conditions on the indicators, visual data analysis for data of the north-western region of Republic of North Macedonia was performed and the data on the quality condition of the environmental parameters in the re- gion were analyzed. 5. Assessment of the risk by patient’s chronic disease by location The analysis of the time series of the data for exposure of living organisms, and especially of humans, requires the generation of a comprehensive spatial- temporal record of exposure, obtained by recording the data obtained from the measurement sensors of environmental parameters [24, 29, 30]. The metadata they contain should describe the data, clarify the limits’ values for all parameters and provide information related to the large amount of data that are collected and available in time series databases. The important platform’s architecture is described, if the purpose is to integrate the diverse data that can be in the form of big data and obtained from sensors for measuring human health parameters. Understanding the effects of modern human exposure to the environment and their impact on human health is an important domain in biomedical research [31]. Air pollution is associated with one of the eight leading causes of death globally and is associated with the emergence of many chronic diseases such as childhood asthma [28]. Although the contributions of these factors may vary, 324 various studies show that at least 50% of human health problems are caused by environmental pollution, a person’s lifestyle, and society’s attitudes toward the environment. The phenomenon of the “individual” is the result of an interaction between his genome and the exosome, which is defined as “the total exposure of an individual throughout his life in the environment” [28, 29]. The assessment of the health risk from these pollutants for each individual can be done if we have data for parameters as pollution of the locations where they reside and his chronic health condition to which the parameters of the environment affect. This is also a complex and ambitious task involving an interdisciplinary approach that includes medical staff (for quantification of the impact of each exposure on each patient’s disease code [24], patient health record data [32], ICD10 classification) as well as data scientists. The data scientists should create an assessment of the factors of environmental impact on the individual using healthcare big data analytics [33]. These data can be seen as the base for utilizing an application with methods and algorithms of artificial intelligence to assess the risk according to predefined parameters or rules. Exposure can be estimated from the sensors’ data for pollutants, with differ- ent parameters and accuracy, measured by different methods (as air pollution with PM10 particles, CO2, CO, N2, etc.). For this purpose, in many locations around the world, centers for measuring these parameters have been set up, because all countries are obliged to measure them and inform their citizens transparently. For example, the Center for Excellence in Health Informatics at Exposure (CEEHI) was established with funding from the National Institute of Biomedical Imaging and Children’s Bioengineering Research, which uses integrated sensors [31]. The monitoring systems program has developed an infrastructure to support sensors based on weather data research [1]. 5.1. Visualization of time series data for the specific locations For the purpose of this paper, a visual data analysis of the time series data of pollutants were made for the cities of Tetovo, Gostivar and Kicevo with PM10 particles as well as in parts in Skopje municipality where sensors for measuring environmental parameters are present. By displaying the data in Power BI tool, users can easily analyze the days when the maximal permitted values for pollut- ants are exceeding and alarm patients with chronic diseases to avoid those loca- tions where the value is higher than the maximum allowed, as shown in Figures 1, 2, 3. Although there are many applications that provide similar data, there is no organized effort to compare these data visually in the current time. However, by displaying visually and transparently these data and setting up locations that are polluted more than the limit of permitted values, it can be expected that this information will be used by patients, doctors and citizens to detect areas of pol- 325 lution that exceed the permissible limits. This visual analysis can have a slider in order to be more flexible and allow to set up new limits of the permitted val- ues for some chronic and more sensitive patients to these pollutions and thus to protect this population who have diagnoses on which these pollutants can have a devastating effect. Other possibility is to list the diagnoses according to the ICD10 classification [33] and to indicate by the doctors which diseases are associated with certain pol- lutions, i.e., which of the pollutants can be fatal for patients with this chronic dis- ease, how pollutants and their parameter’s limit affect patients [24]. Some alerts through media can be published in order to protect the endangered citizen groups, to activate the mechanisms Implication – Reaction [1] and to influence the reac- tions to reduce pollutants. This also envisages recommendations to the chronic disease patients by medical staff to change temporarily the location of residence while a danger from particular pollutants lasts. All of these proposals require broader cooperation with the health sector and municipal and state institutions that can respond in the event of excessive pollutant values. Many examples of visualization in various visual forms can be created to be accessible and transpar- ent, as well as predictions can be made for seasonal exceedances of pollutants such as PM10, PM2.5, CO, CO2, SO2, ozone and other pollutants that disrupt human health and destroy the ecosystem. Figure 1: Visual analysis of pollution with PM10 particles for the month of Oc- tober with a permitted value presented on the Clustered column chart [Gosti- var – light blue, Kicevo – dark blue, Tetovo – brown and Bitola – green] Figure 1 shows a visual data analysis of measured data for a dynamic period chosen by slider, on which the limit for the permitted value of the pollutant is 326 shown. It can be seen that, on certain days, in some of the three cities, this value is exceeded. If the data are taken in real time, visualized with interactive VDA, the resulting visualizations can alert the patients with chronic diseases to distance themselves from these locations or temporarily leave these sites in order to pro- tect their health. Figure 2: Usage of sliders along the X and Y axis for data from November 2021 for the Skopje region [Gazi Baba – light blue, Gjorce Petrov – Dark blue, Karpos – orange, Lisice – purple, Miladinovci – pink, Rectorate – green, Center – yellow. Limit value – horizontally dashed red line] In addition, it can detect pollutants in time and therefore alert the responsible persons to take actions according the REACTION indicator [1]. The next example of VDA shown in Figure 2 provides a comparative analy- sis of PM10 particles pollution by regions in Skopje with a limit of permitted value for pollution. Figure 3 shows a line diagram for regional pollution, and each region is represented with a line in different color. Figure 4 displays VDA for 3 consecutive months on the Dashboard, enriched with slider and pollutant permit- ted limit that can be moved for the patients with chronic diseases. The permis- sible pollution limit for the given parameter is set on the visual display and shows excessive pollution that should trigger the mechanism REACTION of the Minis- try of Environment Protection and INTERVENTION of the relevant authorities. 327 Figure 3: Visual data analysis of the PM10 pollutant for daily time series – Line chart [November 2021 – Skopje region] Figure 4: VDA of 3 consecutive months on the Dashboard with slider and per- mitted limit for PM10 made for proposed model validation These images are understandable to users due to their comparability with the reference values that have to be respected and not be exceeded. It can serve as an easy way to provide information about the days (maybe time) when the permitted limits for the pollutants have been exceeded and to react. 328 6. Proposed model for VDA of time series data for pollutants intended for healthcare staff and patients The experience gained by creating a large number of visualizations of envi- ronmental data from time series was used to extract a model – a framework for VDA of this type of data that includes the use of mobile applications that show air pollution by locations and citizens who have their own PHRs and diagnoses ac- cording to ICD10 classification of diseases [24]. Using environmental time series databases and defined pollutant values for the population and for each chronic obstructive pulmonary disease, each patient adjusts their permissible pollution limits according to their ICD10 diagnosis by PHR [32]. According to the pol- lutant that is analyzed, for each patient, a health risk assessment can be made and furthermore recommendations can be given by the system according to the principle of an expert system for diagnosis of chronic patients based on the IC10 classification, previously defined by specialist doctors [32, 33]. The model in- cludes steps from event specification, pollutant types, and pollutant data loading, permitted limit of values and visual analysis appropriate to the representation, as well as possibilities for setting a filter and warning by the system. The proposed model is shown in Figure 5. Figure 5: Proposed model of visual data analysis of environmental time series data for pollutants and their influence on the human health The model assumes that the citizens (patients) are aware of their chronic condition and use a mobile application that can use the data obtained from serv- ers that collect data on pollutants by time and location, their allowable limits, 329 other sensors included in the data network and laboratory and other measure- ments available. They can also use mobile devices to measure their vital signs life parameters, connected to a mobile application for that purpose. The model as- sumes that the patient has digital health literacy and a culture of self-management of their health [32]. The second part of the model is intended for the creators of Policies and strategies at the municipal and regional level who through the mechanisms IM- PLICATION – REACTION should react and prevent pollutants from causing such harmful consequences for human health. They should also apply the laws and penalties to sanction polluters and create recommendations for future actions of environmental policy makers [1]. The model assumes first to define the events (pollutants) that will be subject to data collection from time series, to make their detailed specification. Metadata structures are then created for intended events to collect and store data. The next phase is connected with the process of data collection, their clarification and re- finement for storing in databases suitable for time series. We validate the usability of the proposed model on many VDA for envi- ronmental pollutants, from different data sources of pollutants stored in time se- ries, in different formats, and with different visualization techniques and tools, as shown in Figure 4. We have to mention that there is possibility of using sensors to measure vital signs of life related to the user’s mobile applications and their connection to pol- lutants’ data, known as exposome. 7. Conclusion The air pollution is a serious risk for citizens and public health everywhere. Therefore, it is necessary to strengthen the capacities for air quality management, especially in the part of preparation and monitoring the air quality of the national and local level with the plans’ improvement. This presupposes the provision of accurate data, obtained from the air quality monitoring system in more locations and the creation of information from air data quality and their public presentation. For this purpose, VDA are created and model for VDA from environmental data time series is proposed. This model can be used as a framework for visu- alization of this type of data applicable in similar situations. With the proposed model, pollutants’ parameters limits can be shown on the screens for different groups of citizens with chronic diseases (as ICD10 codes from PHR), they can be alerted to avoid such locations and furthermore environmental authorities can be alerted to prevent these exceeding of the pollution’s limits. The model applicability is validated with creation of VDA dashboard for PM10 particle pollution from 3 months’ data in time series of measured envi- 330 ronmental data, using visualization according to the user’s preferences that can be changed, made with Power BI. There is also the possibility of using sensors to measure vital signs of life related to the user’s mobile applications and their connection to pollutants’ data, also known as exposome. These data can provide the ability to assess the health risk of each citizen for their diagnosis [32], at that location at time [24], which can be one of the future research goals. As opportunities for interesting visual insights from VDA can be mentioned the possibility provided by Power BI [34] for prediction of measured parameters that can be created according to data from previous years, seasonal trends and trend analysis. In addition, Accuweather [35] weather services can be included as the weather factor for prediction. This prediction can be made using visualiza- tions in the Analytics section, where Forecast should be selected. This opportu- nity would provide forecasts for pollutants based on weather conditions and data about air pollution from previous years, which would be useful for the patients with chronic diseases to make suitable decisions in time, receiving alerts from the system and recommendations from their doctors. One of the future research top- ics that demands serious research using health big data analytics [33]. 8. References [1] Report for environmental condition with indicators, 2020, https://www.mo- epp.gov.mk/wp-content/uploads/2014/11/0301_IndikatorskiIzvestaj_2020. pdf. [2] J. Mennis, D. J. Peuquet, L. Qian, A conceptual framework for incorpo- rating cognitive principles into geographical database representation. Intl. Journal of Geographical Information Science 14 (2020). [3] A. U. Frank, Different Types of “Times” in GIS, in: M. J. Egenhofer, R. G. Golledge (Eds.), Spatial and Temporal Reasoning in Geographic Informa- tion Systems, Oxford University Press, New York, USA (1998). [4] J. F. Roddick, M. Spiliopoulou, A Survey of Temporal Knowledge Discov- ery Paradigms and Methods, IEEE Transactions on Knowledge and Data Engineering 14 (4) (2002). [5] C. Tominski, J. Abello, H. Schumann, Axes-Based Visualizations with Ra- dial Layouts, in Proc. of ACM Symp. on Applied Computing. ACM Press 2004, (SAC04), 1242–1247: ACM. [6] W. Aigner, S. Miksch, B. Thurnher, S. Biffl, PlanningLines: Novel Glyphs for Representing Temporal Uncertainties and their Evaluation, in Proc. of the 9th Intl. Conf. on Information Visualisation (IV05). IEEE Press (2005). [7] S. Havre, E. Hetzler, P. Whitney, L. T. Nowell, ThemeRiver: Visualizing Thematic Changes in Large Document Collections, IEEE Transactions on Visualization and Computer Graphics, vol. 8, no. 1, pp. 9–20 (2002). 331 [8] S. Havre, B. Hetzler, L. Nowell, ThemeRiver: Visualizing Theme Changes Over Time, in Proc. IEEE Symp. on Information Visualization (InfoVis’00), Salt Lake City, USA (2000). [9] J. J. Van Wijk, E. R. Van Selow Cluster and calendar based visualization of time series data, Proceedings 1999 IEEE Symposium on Information Visu- alization (InfoVis’99), 1999, pp. 4-9, doi: 10.1109/INFVIS.1999.801851 (1999). [10] T. Nocke, H. Schumann, U. B¨ohm, M. Flechsig, Information Visualization Supporting Modeling and Evaluation Tasks for Climate Models, in Proc. of Winter Simulation (2003). [11] R. Sadri, C. Zaniolo, A. Zarkesh, J. Adibi, Expressing and optimizing se- quence queries in database systems. ACM Trans. Database Syst. 29 (2004): 282–318. [12] D. A. Keim, Designing pixel-oriented visualization techniques: Theory and applications. IEEE Trans. on Visualization and Computer Graphics 06 (1): 59–78 (2000). [13] S. Laxman, P. S. Sastry, A Survey of Temporal Data Mining. Sadhana 31: 173–198 (2006). [14] S. Miksch, W. Horn, C. Popow, F. Paky, Utilizing Temporal Data Abstrac- tion for Data Validation and Therapy Planning for Artificially Ventilated Newborn Infants. AI in Medicine 8 (1996). [15] W. Aigner, S. Miksch, W. Müller, H. Schumann, C. Tominski, Visual methods for analyzing time-oriented data. IEEE Trans Vis Comput Graph. (2008) Jan-Feb; 14(1): 47–60. [16] doi: 10.1109/TVCG.2007.70415. PMID: 17993701. [17] S. D. Santos, K. Brodlie, Gaining understanding of multivariate and mul- tidimensional data through visualization, Computers & Graphics (2004) DOI: 10.1016/j.cag2004.03.013. [18] B. Shneiderman, The Eyes Have It: A Task by Data Type Taxonomy for In- formation Visualizations. In Proc. of the IEEE Symp. on Visual Languages, 336–343: IEEE CS Press, September 3–6 (1996). [19] M. Weber, M. Alexa, W. Muller, Visualizing Time-Series on Spirals. In Proc. of the IEEE Symp. on Information Visualization 2001 InfoVis01 (2001). [20] P. Buono, A. Aris, C. Plaisant, A. Khella, B. Shneiderman, Interactive Pat- tern Search in Time Series. In Proc. of the Conf. on Visualization and Data Analysis (VDA 2005), 175–186: SPIE. [21] J. W. Tukey, Exploratory Data Analysis. Reading, Mass.: Addison-Wesley. pp. 688 (1977). [22] R. A. Becker, W. S. Cleveland and A. R. Wilks., Dynamic graphics for data analysis. In Dynamic Graphics for Statistics, edited by W. S. Cleveland and M. E. McGill, 1–49 Belmont, California: Wadsworth & Brooks (1998). 332 [23] S. K. Card, J. D. Mackinlay, B. Shneiderman., Information visualization. In Readings in Information Visualization: Using Vision to Think, 1–34. San Francisco: Morgan Kaufman (1999). [24] B. Erbas, RJ. Hyndman, “Data Visualisation for Time Series in Environ- mental Epidemiology.” Journal of Epidemiology and Biostatistics 6.6 (2001): 433–443. [25] M. Radezova Trifunovska, I. Jolevski, B. Ristevski, S. Savoska, Environ- mental Data as Еxposome and Оpportunity of Combining with Cloud- Based Personal Health Records. In: The 14-th conference on Information Systems and Grid Technologies, May 28–29, Sofia, Bulgaria (2021). [26] S. Savoska, V. Muaremi, A. Bocevska, B. Ristevski, Z. Kotevski, Model- ling of GIS Based Visual System for Local Educational Institutions’ Stake- holders. In: 10th International Conference of Information Systems & Grid Technologies ISGT 2016, 30.09-01.10, Sofia, Bulgaria (2016). [27] S. Savoska, S. Loskovska, V. Blazeski, Time Histograms With Interactive Selection Of Time Unit And Dimension, In: Conference on Data Mining and Data Warehouses (SiKDD 2008) October 17, 2008, Ljubljana, Slove- nia, 17 October 2008, Ljubljana Slovenija (2008). [28] Why Time Series Matters for Metrics, Real-Time Analytics and Sensor Data, An INFLX DATA TECHNICAL PAPER, Paul Dix, CTO and Found- er, InfluxData Revision 5, July 2021 [29] https://get.influxdata.com/rs/972-GDU-533/images/why%20time%20se- ries.pdf#page=1&zoom=auto,- 99,798, Accessed 09.2021. [30] G. Miller, The Exposome: A Primer, in 1 edition. Amsterdam ; Boston: Aca- demic Press, 2013. [31] S. Savoska, B. Ristevski, N. Blazheska-Tabakovska, I. Jolevski, Towards Integration Exposome Data and Personal Health Records in the Age of IoT. In: 11th ICT Innovations Conference 2019, 17–19 October, Ohrid, Repub- lic of Macedonia (2019). [32] M. Sathiyanarayanan, V. Varadarajan, K.V. Pradeep, Visual Analytics on Spatial Time Series for Environmental Data, Internation Journal of recent technology and engineering (IJRTE), ISSN: 2277-3878, Volume-8, Issue- 1C2, May (2019). [33] K. Sward, N. Patwari, R. Gouripeddi, J. Facelli, An Infrastructure for Gen- erating Exposomes: Initial Lessons from the Utah PRISMS Platform, Inter- national Society of Exposure Science Annual Meeting, Research, Research Triangle Park, NC, USA. 2017 (2017). [34] N. Tabakovska-Blazheska, A. Bocevska, I. Jolevski, B. Ristevski, N. Bere- dimas, V. Kilintzis, N. Maglaveras, S. Savoska, Implementation of Cloud- Based Personal Health Record Integrated with IoMT. In: The 14-th confer- 333 ence on Information Systems and Grid Technologies, May 28–29, Sofia, Bulgaria (2021). [35] B. Ristevski, S. Savoska, N. Tabakovska-Blazheska, Opportunities for Big Data Analytics in Healthcare Information Systems Development for Deci- sion Support. In: The 13-th conference on Information Systems and Grid Technologies ISGT 2020, Sofia, Bulgaria May, (2020). [36] Introduction to dashboards for Power BI designers, Microsoft Power BI, Article, https://docs.microsoft.com/en-us/power-bi/create-reports/service- dashboards#:~:text=A%20Power%20BI%20dashboard%20is,the%20 Power%20BI%20service%20only. [37] AccuWeather data website, https://corporate.accuweather.com/resources/ downloads. 334