Using Fuzzy Logic in the Research of the Data Visualization Process in Infographics Oleksandr Tymchenkoa,b, Orest Khamulab, Svitlana Vasiutab, Olha Lukovskab, Solomiya Doroshb and Olha Sosnovskab a University of Warmia and Mazury, Olsztyn, Poland b Ukrainian Academy of Printing, Lviv, Ukraine Abstract Data visualization in infographics is a rather interesting and yet little-studied process that requires comprehensive research. In the study, an analysis of the factors identified by an expert survey and divided by importance, which affect the visualization of data in infographics, is carried out. The analytic hierarchy process and the ranking method were used for the research. These methods show good calculation results when studying processes in which factors cannot be described using mathematical data. But, in the vast majority, the results obtained by these two methods give different results. Therefore, to verify the precision of the obtained results, the researchers recommend using the calculation of integral indicators of the quality of the considered process based on fuzzy logic. To calculate the integral quality indicators, it is required to establish a universal set of terms of values of isolated linguistic variables. As a result, this solution lies in the description of linguistic variables with the assignment of their designations, determination of recommended limits of values of universal fuzzy sets, and linguistic terms for the comparative process, construction, and calculation of functions belonging to linguistic variables. Keywords 1 Influence factors, infographics, data visualization, linguistic variables, fuzzy sets, fuzzy logic, membership function, integral quality indicator 1. Introduction It is a well-known fact that the key way of obtaining information about the world is through vision. On average, this number is 90%. Perceived information allows to memorize information faster and, accordingly, to increase labor productivity. Information presented in the form of pictures, diagrams, or in any other presentation of infographics, is even better perceived and memorized. This is due to the fact that the information presented in the form of infographics allows to better visualize certain details and draw attention to specific information better than text. Usually, data visualization covers large amounts of information and condenses it. Another purpose of visualization is that it makes the perception of complex information more accessible and allows you to create parities in the comparison of quantities. A crucial task of visualization is its persuasiveness. Therefore, distortion of the presented information should be avoided. Correctly presented information does not interfere with the perception of details. That is why, when creating a visualization, it is important to consider the effectiveness of the data presented, what is worth sharing, and presenting it in a way that engages the user. The next step of SCIA-2022: 1st International Workshop on Social Communication and Information Activity in Digital Humanities, October 20, 2022, Lviv, Ukraine EMAIL: olexandr.tymchenko@uwm.edu.pl (O. Tymchenko); khamula@gmail.com (O. Khamula); svitlanavasyta@gmail.com (S. Vasiuta); olhalukovska@yahoo.com (O. Lukovska); solomiadorosh@gmail.com (S. Dorosh); olhakh@gmail.com (O. Sosnovska) ORCID: 0000-0001-6315-9375 (O. Tymchenko); 0000-0003-0926-9156 (O. Khamula); 0000-0003-0078-9740 (S. Vasiuta); 0000-0002- 4074-7454 (O. Lukovska); 0000-0002-6824-5421 (S. Dorosh); 0000-0001-5413-2517 (O. Sosnovska) © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) this process is choosing the method of creating the visualization itself, which will convey specific information. Therefore, the study of various visualization techniques today is quite an essential issue and deserves the attention of researchers. 2. Related works There is a sufficient number of publications and discussions regarding the research of data visualization processes and the use of infographics in general in practice. This, in our opinion, indicates that this topic is quite relevant. Among the publications that reveal research on this topic, we would like to draw attention to the following. One of the main ways of using infographics, especially for scientists and teachers, is in the field of education. Among the publications that reveal research on this topic, we would like to draw attention to the following. One of the main ways of using infographics, especially for scientists and teachers, is in the field of education. It is necessary to provide the essential information concisely and clearly so that in a short time, it is possible to present a maximum of information and it is understandable for everyone. Thus, in study [1], the author considers the structure issue of the presentation of information visualization elements. In this work, the author notes that the study of the information visualization process is quite a significant topic for teachers, as they try to understand and help students gain new knowledge with the help of modern literature and practices. The author highlighted the following elements of visualization: text, images, data, and the interaction between them, noting and exploring their impact on the audience. In research [2], the authors reveal another field of data visualization — the military. The authors indicate how it is necessary to correctly create a visualization of the required data so that they can be quickly comprehended, perceived, and an accurate decision could be made. In this study, the most modern data transmission methods, and the main tasks of visualization are considered. As a result of the research, the authors proposed a concept demonstrator for visualizing collaborative effort modeling. Important research in the field of data visualization or information in general is the work of authors [3], which focuses on the suitability of specific color combination types. The work distinguishes seven types: harmonious colors, opposite colors, highly saturated colors, low saturated colors, high-brightness, low-brightness colors, and disharmonious colors. Evaluation of these studies focused on color visibility. Studies have shown that preference is still given to harmonious colors, contradicting previous claims that disharmonious colors can be beneficial. Another vital field of data visualization is management. The authors proposed options and technology for processing the text array and visualization of it in the study [4With the help of the proposed technology, it will be possible to highlight valuable information hidden in a dense abstract form of text and transform it into simple and intuitive information. This, in turn, will enable managers to quickly understand and make more valuable decisions. As the questionnaire regarding this technology showed, the proposed technology facilitates quick and informative access to key information, which will ultimately reduce the workload and time needed to realize the status of their projects. Cognitive forms of the human interaction process with visualization are considered in work [5]. The authors discovered when there is a cognitive understanding between the type of problem being solved and the information, the task load index is lower, compared with the case when there is no cognitive fit. The research data confirm the relevance of the study of the data visualization process and its effectiveness. Also, the research data are crucial because they provide greater validity to cognitive theory and assessment of cognitive load, as well as effort in general. Every day, the amount of data that needs to be stored and processed increases. Accordingly, the need for tools that can provide the correct visualization of data has increased, which, in turn, has led to the development of various programming software for data visualization. But the use of these software tools requires users to develop visualization from scratch. An alternative is to create interactive visualization design environments. Interesting examples of such systems are NarVis [6], TechSpectogram [7], and VisComposer [8]. These systems allow users to create a variety of visualizations with the possibility of previewing and optimizing the data in real-time. In work [9], the authors considered the issue of automated generation of visualization, tools, and documents of recommendations for network graphs usage. This study is also important in terms of the challenges and promising directions for further research in the field of automated infographics and visualization recommendations. In conclusion, it is important to note the study [10] of authors who consider the principles of effective data visualization. The paper states that researchers do not always present information correctly or use incorrect data visualization practices. The authors provide ten principles that serve to improve data visualization, including technical aspects and, for example, the combination of colors. All these principles and proposals are designed for the most effective presentation of specific data. 3. The methodology of using fuzzy logic in researching the process of data visualization in infographics With the help of previous research [11], using the analysis of expert judgments, we established a certain list of factors that affect the process of data visualization in infographics. From this list, six of the most significant in this process were chosen, namely: text (T), numerical data (ND), graphs and charts (GC), flowcharts (FC), image (IM), and icons (IC). The next question that was considered in this paper is how these factors influence each other. A dependency graph of the relationships between factors was also constructed. In work [12], we investigated which factor is more important in the process of data visualization. For this, the analytic hierarchy process was used. As a result of the research, we obtained an optimized model of influencing factors in the data visualization process in infographics. The factors were divided into 5 levels. From this model, we can conclude that the most important factor in the data visualization process was the use of icons. Our next step was a study [13], namely a comparison of different methods of determining the priority or importance of a specific factor with others. A study of the priority of influencing factors in the data visualization process was conducted using two methods analytic hierarchy process and the ranking method. As research has shown — the ranking method appeared to be the best in this case. It showed the distribution of factors in 6 priority levels. This method is more comprehensive since it takes into account indirect dependencies that prefer one factor over another. Since the data visualization process is described by factors that are difficult to represent in the form of specific mathematical units, it is advisable to use fuzzy logic to clarify the accuracy of the results. It became necessary to use a mathematical tool with the help of which ambiguous expert statements would be translated into the language of clear mathematical formulas. As research shows, the usage of fuzzy logic is a good solution as it provides an opportunity to operate the fuzzy input data. In general, fuzzy logic uses linguistic variables with the help of explicit rules. Fuzzification is the basis of the use of fuzzy logic. Its task is to transform the initial data into a set that would correspond to the terms of the linguistic variable. These data can be described by one or several terms. The degree of their correspondence to the term is given as the degree of membership on a fuzzy set. As it follows from using the theory of fuzzy logic, fuzzification is the base for evaluating the quality of the process under research. For calculations, fuzzy logical equations and a knowledge base are used. The knowledge base is built according to expert data that are based on fuzzy linguistic rules with the condition – "if-then." Accordingly, the operation that involves mapping the obtained numerical results of the calculation is called defuzzification. Moreover, these numerical results may have different units of measurement. Fuzzy sets and linguistic variables are the basis of any fuzzy logic, which, in turn, are based on membership functions constructed using a set of terms and linguistic terms of the factors under study. The main values of a linguistic variable are words and phrases in natural or formal expression. Of course, the data presented in the form of numbers is more precise than the data presented in words, regarding with which the linguistic variable describes the phenomena approximately, because they are complex and cannot be described in quantitative terms. It can be stated that the purpose of fuzzy logic is the same as words or phrases in living language. When comparing linguistics and fuzzy variables, it can be noted that linguistic variable are of a higher order. Linguistic variables include a number of rules that make up the majority of the description of its structure, namely syntactic: can be set grammatically; can specify an algorithmic procedure for calculating the meaning of the values. Conducting calculations using the theory of fuzzy logic involves performing some operations divided into certain stages and, accordingly, calculations presented below. А. The model of forming an integral indicator of process quality In order to better understand the complex process, the method of fuzzy logic is recommended to divide into certain sub-processes, which could be better described [14]. The data visualization design process in infographics, which is considered in the research, will be denoted by, for example, the function G, which in turn can be divided into two subprocesses: the argument Q, which determines the quality of the artistic presentation of information, and the argument V, which defines the technical features quality of this process. The arguments of the functions are factors. In our calculations, these are linguistic variables obtained and described in previous studies [12]. The values of the separated functions form integral quality indicators, which are determined, in turn, by partial quality indicators of linguistic variables, which are divided into groups according to their functional purpose. Thus, using the theory of fuzzy logic, some function G determines the quality of data visualization in infographics, meaning: (1) Accordingly, the arguments of this function can be described:  argument Q, which determines the quality of the first subprocess: (2) where qi – linguistic variables of the argument Q.  argument V, which determines the quality of the second subprocess: (3) where vi – linguistic variables of the argument V. For further research, we need to establish a universal set of terms of values of separated linguistic variables, so it, in turn, corresponds to the limits of the existence of linguistic variables. As a result, this solution is in the form of a table with linguistic variables, specifying their designations, and determining the recommended limits of values of universal sets of terms and linguistic terms for the comparative research process of data visualization in infographics. Based on the data obtained by this method, we can start building a multi-level model for the formation of integral process quality indicators. The use of a multi-level model of fuzzy logical derivation contributes to the consistent establishment of a forecast of the implementation quality of the publication structuring process by accumulating knowledge from the lowest to the highest of its levels. B. Construction and calculation of membership functions of linguistic variables The next stage of solving the given task according to the theory of fuzzy logic is the construction and calculation of the membership functions of linguistic variables [15]. The membership function is the main characteristic of the formation level of the process quality, which in turn is given by linguistic terms, such as N and G. In the calculations, it is assumed that the original database is a universal fuzzy set, for example, В, which should be divided into certain parts (quanta). At these points of division, we set the separated linguistic variables and ranks, with the help of which linguistic terms can be identified. Let's set our universal fuzzy set as В = {b1, b2, …, bn}, and also set the corresponding ranks rN(bi) – for the study of platforms for the implementation of distance learning and rJ(bi) – visualization of data in infographics in the ranges of bi (і = 1, …, n). Then, taking into account the above assumptions, the linguistic term for our researched process, which we will call Y, can be represented in the form of a certain fuzzy set: (4) where YF⸦B; µy(bi) – degree of mentorship to the set YF of the element bi B. The membership functions, or the membership degree, which are specified as the value µy(bi), become the basis of logical expressions for the numerical expression of the linguistic term Y. The distribution of which is given in the form: (5) where µi = µy(bi) – degree of mentorship; ri = ry(bi) – corresponding ranks for all і = 1, …, n. The normalization condition must also be fulfilled for the membership functions, namely their sum must be equal to 1: (6) To determine the ranks of factors established by experts or calculated using ranking methods or the analytic hierarchy process, their numerical values can be obtained using the following ratios: (7) For a better visual perception of linguistic terms and their high-quality graphic display, the range of values of linguistic variables specified in the relevant tables discussed above is presented in the form of three points. As a result, we will get a square inverse symmetric matrix Х = хij, where хij = ri / rj for i, j = 1, 2, 3, and the estimates of the ranks of linguistic terms should also be considered. The result of our task to obtain an integral indicator of the quality level of the process researched is presented as follows: (8) ⸦ The goal of the research, according to formula (8), is to achieve the maximum value of the functions that characterize the level of quality of the considered process. And the next steps of calculations according to this method are the construction of a matrix of pairwise comparisons for all linguistic terms, using the scale of the relative importance of objects. In this case, the components of the eigenvector of the matrix will be the ranks of the linguistic terms, which in turn are used to calculate the values of the membership functions µi for each of the terms Therefore, the final numerical values of the membership functions at the determined ranks of the linguistic terms at the three points of division of the universal set (b1, b2, …, bn) will be obtained as a result of the matrix calculation: (9) C. Designing a fuzzy knowledge base and a system of fuzzy logic equations For the next studies, namely the study of the relationships between the separated factors of influence on the relevant processes, a fuzzy knowledge base should be involved. It represents a set of fuzzy rules of the "if-then" type and makes it possible to link information between the input and output object researched. This implementation is possible when using fuzzy sets and linguistic variables [16]. This combination allows obtaining dependencies between physically separated processes, as already mentioned above. To determine the procedure for obtaining the degree of the membership of functions to a set of linguistic terms of the quality indicator of the researched processes, an established fuzzy knowledge base is used in the form of a system of fuzzy logic equations. The basis for considering a fuzzy knowledge base is expert judgment of the impact of factors on the considered processes. Also, this base implements an algorithm, with the help of which the forecasting of the quality of processes is formed, taking into account the obtained combinations of the values of the relevant linguistic terms. At this stage, there is a need to build a fuzzy knowledge base, as well as to calculate an integral indicator of the quality of the process of data visualization in infographics as a function of the highest level and defined by the linguistic variable G with its linguistic terms "low," "medium," and "high." And the functions of the lower level are sets of linguistic variables Q and V, which in turn are also represented by the linguistic terms "low," "medium," and "high." According to the fuzzy knowledge base, taking into account equality (1) and based on the obtained model, we receive the following general form: IF (Q = low) І (Q = medium) І (Q = high) І (V = low) І ( V= medium) І (V = high) THEN (G = low) І (G = medium ) І (G = high) Based on the above information, we construct the linguistic variable G table of the knowledge matrix. Our next step should be to obtain the membership function values for our sets of terms. We can solve this task by forming fuzzy logic equations based on the obtained knowledge matrices G. In this way, we will be able to obtain an integral indicator of the quality of the process researched. Accordingly, we obtain the following equalities of the membership function of the linguistic variable G:  for the term "low," the following is received µlow (G) = µlow(Q) ˄ µlow(V) ˅ µlow (Q) ˄ µmedium(V);  for the term "medium," the following is received µmedium(G) = µmedium(Q) ˄ µlow(V) ˅ µmedium(Q) ˄ µmediumV);  for the term "high," the following is received µhigh (G) = µhigh (Q) ˄ µmedium(V) ˅ µhigh (Q) ˄ µhigh(V). Using the same technique, we obtain fuzzy knowledge bases and linguistic equations for the lower level of the corresponding linguistic variables, for example, Q – Y(q1, q2, q3) and V – Y(v1, v2, v3), which belong to the process under study. Based on the obtained generalized variants of the logical expression of linguistic variables Q and V, let's build knowledge matrices. Based on the obtained knowledge matrices and using a system of logical equations, the membership functions for our linguistic variables Q and V are obtained. Accordingly, the membership functions for the linguistic variable Q can be represented as follows:  for the term "low" µlow(Q) = µsimple(q1) ˄ µcomplicated(q2) ˄ µsimple(q3) ˅ µcomplicated(q1) ˄ µsimple(q2) ˄ µsimple(q3)  for the term "medium" µmedium(Q) = µsimple(q1) ˄ µsimple(q2) ˄ µcomplicated(q3) ˅ µcomplicated(q1) ˄ µcomplicated(q2) ˄ µsimple(q3)  for the term "high" µhigh(Q) = µcomplex(q1) ˄ µcomplex(q2) ˄ µcomplicated(q3) ˅ µcomplicated(q1) ˄ µcomplicated(q2) ˄ µcomplex(q3). Accordingly, the membership functions for the linguistic variable V can be represented as follows:  for the term "low" µlow(V) = µsimple(v1) ˄ µsmall(v2) ˄ µsimple(v3) ˅ µsimple(v1) ˄ µmedium (v2) ˄ µsimple(v3)  for the term "medium" µmedium(V) = µsimple(v1) ˄ µsmall(v2) ˄ µcomplex(v3) ˅ µsimple(v1) ˄ µsmall(v2) ˄ µcomplicated(v3)  for the term "high" µhigh(V) = µcomplex(v1) ˄ µlarge(v2) ˄ µcomplicated(v3) ˅ µcomplex(v1) ˄ µmediumv2) ˄ µsimple(v3). Based on the obtained equalities, we form fuzzy sets of linguistic variables Q and V with corresponding membership functions for the accepted terms "low," "medium," and "high" in the form of some fuzzy set: (10) where n1, n2, n3 – quantitative values of the linguistic variable G regarding the defined terms. The defuzzification is performed based on the expert knowledge base and using the knowledge matrices of the linguistic variable G. In reference sources, defuzzification is understood as a certain procedure of transforming a fuzzy set into a clear number by the degree of membership. Its purpose is to obtain the usual quantitative value of each of the initial variables, external concerning the fuzzy inference system, by using the results of the accumulation of all the initial linguistic variables. For this, the pre-numbered values of the membership functions at the three points of division of the selected universal set A for all linguistic variables are presented. To obtain the real weightings of the membership functions of the linguistic variables Q and V, the values of the defined terms "low," "medium," and "high" should be substituted into fuzzy logic equations. It should be noted that the membership functions of the linguistic variables are calculated based on the initial values of the linguistic variables presented by the experts. After obtaining the values of the membership functions of the lower-level linguistic variables and taking the above formulas as a basis, the membership function of the highest-level linguistic variable G is calculated according to the accepted terms "low", "medium", and "high". In the final stage of calculations according to this method, namely, using the defuzzification of fuzzy sets and the center of the mass method, the indicators of the level of the predicted quality of process implementation are calculated. According to this, we receive: (11) where – minimum and maximum values of quality level indicators; n is the number of specified fuzzy terms (in our case, 3). The values of other variables present in expression (9) are taken as follows: µ1(G) = µlow(G); µ2(G) = µmedium(G); µ3(G) = µhigh(G); values of the lower and higher limits for the linguistic variable G and are taken as equal, respectively – = 5%, = 100%. It is recommended to calculate indicators of the predicted quality level of the process at three points of the interval from 5 to 100%. The predicted quality should be greater than 50%. Then it is considered that the conducted research is accurate. 4. Research results Taking this method as a basis, let's calculate the indicator of the predicted level of the data visualization process quality in infographics. The process of designing data visualization in infographics will be denoted by function G. The arguments of the functions are factors of linguistic variables. Integral quality indicators are formed by function values and are determined by partial quality indicators of linguistic variables, divided into groups according to their functional purpose. This process of designing data visualization in infographics will be divided into two sub-processes, namely: Q, which determines the quality of the artistic presentation of information, and V, which determines the quality of technical features in this process. Taking expressions (2) and (3) into account, we will describe the linguistic variables for these attributes.  for the Q argument: q1 – "data visualization type;" q2 – "color gamut of visualization;" q3 – "special effects;"  for the V argument: v1 – "labels in visualization;" v2 – "scale of elements in visualization;" v3 – "presenting data in visualization." We establish a universal set of terms of values of isolated linguistic variables so that it, in turn, corresponds to the limits of the existence of linguistic variables. As a result, this solution is shown in Table 1 with linguistic variables, specifying their designations and determining the recommended limits of values of a universal set of terms and linguistic terms for the comparative research process of data visualization in infographics. Table 1 Set of terms of linguistic variables values Linguistic essence of the Universal set of Variable Linguistic terms (set B) variable values (set A) type of data visualization Simple, complicated, q1 (1-9) CU (types of infographics) complex q2 color gamut of visualization (1-3) CU RGB, CMYK, HSB Simple, complicated, q3 special effects (1-9) CU complex Simple, complicated, v1 inscriptions in visualization (1-9) CU complex scale of elements in the v2 (10-95) % Small, medium, large visualization data presentation in Simple, complicated, v3 (1-9) CU visualization complex In this table, the linguistic variable "type of data visualization" is presented as a variety of infographics; the linguistic variable "data presentation in visualization" identifies the complexity of the presentation; linguistic variable "scale of elements in the visualization" – the geometric dimensions in percentage. For universal sets of values, where it is impossible to give a numerical explanation, the scale of relative importance of objects, according to which there are 9 importance ratings, is taken into account. Due to this, their conditional values for q1, q3, v1 and v3 are taken to be equal to (1-9) CU (Conventional Units), and for q2 equal to (1-3) CU. Taking the obtained data as a basis, we begin to build a multi-level model of the formation of integral indicators of the data visualization quality in infographics (Fig. 1). The use of a multi-level model of fuzzy logical derivation contributes to the consistent establishment of a forecast of the implementation of the publication structuring process quality by accumulating knowledge from the lowest to the highest of its levels. This multi-level model includes subordinate models: a model of the quality of artistic presentation of information, a model of the quality of technical features. Quality of data visualization in infographics The quality of artistic presentation The quality of technical features of information type of data Inscriptions q1 v1 visualization in visualization color gamut scale of elements in q2 v2 of visualization the visualization data presentation in q3 special effects v3 visualization Figure 1: A multi-level model of the formation of integral indicators of the data visualization quality in infographics The resulting model reflects the logic of forming the quality of the data visualization process in infographics with the linguistic variables identified in the survey process, according to expressions (1, 2, 3), which become a source for calculating integral quality indicators. Let's calculate the membership functions to the linguistic quality variables of the data visualization process in infographics. The resulting model (Fig. 1) reflects the logic of forming the quality of the data visualization process in infographics with the linguistic variables identified during the survey. Using matrix (9) as a basis, we construct matrix Y. For the linguistic variable q1 "type of visualization," the universal set of values is equal to A (q1) = [1; 9] CU, and the set of terms of values for it is equal to B (q1) = < simple, complicated, complex >. According to expert judgments and according to the data in Table 1, the universal set has the following values at the points of division: а1 = 1; а2 = 5; а3 = 9. The appearance of the matrices for the corresponding terms "simple," "complicated," and "complex" will have the following form: The last line of the term "simple" is determined by an expert method and the rank of the variable decreases. Using the values of the matrix and relation (9), as well as using the program for calculating the method of binary comparisons, we obtain the values of the membership functions for the terms "simple," "complicated," and "complex": µsimple(а1) = 0,53; µcomplicated(а1) = 0,09; µcomplex(а1) = 0,059; µsimple(а2) = 0,412 µcomplicated(а2) = 0,82; µcomplex(а2) = 0, 412; µsimple(а3) = 0,059; µcomplicated(а3) = 0,09; µcomplex(а3) = 0,53. The next step is to perform the normalization of the values of the membership functions, taking into account the established linguistic terms relative to the unit, and obtain the normalization coefficient according to the formula: (12) where n– terms of the corresponding linguistic variable; µпj(ai) = kn×µп(ai). Let's normalize the values of the membership functions of the linguistic variable "communication tools," taking into account the above expression: µsimplej(а1) = 1,00; µcomplicatedj(а1) = 0,11; µcomplexj(а1) = 0,11; µsimplej(а2) = 0,77; µcomplicatedj(а2) = 1,00; µcomplexj(а2) = 0,77; µsimplej(а3) = 0,11; µcomplicatedj(а3) = 0,11; µcomplexj(а3) = 1,00; In order to display linguistic terms in fuzzy sets, it is necessary to use the normalized values of the membership functions of the linguistic variable "type of visualization" (types of infographics) and equality (4). Accordingly, we receive: simple types of infographics = CU; complicated types of infographics = CU; complex types of infographics = CU. The data obtained in the calculation process of the membership functions of linguistic variables become the basis for constructing a fuzzy knowledge base and a system of fuzzy logical equations. On their basis, the design and calculation of the integral indicator of the quality level of research processes of platforms for the implementation of distance learning and data visualization in infographics are carried out. The next stage of the considered process in the research is the need to build a fuzzy knowledge base, as well as to calculate an integral indicator of the quality of the data visualization process in infographics as a function of the highest level and defined by the linguistic variable G – "data visualization quality in infographics" and its linguistic terms "low", "medium", and "high." As well as the functions of the lower level – the set of linguistic variables Q – "quality of artistic presentation" and V – "quality of technical presentation", that in turn are also represented by the linguistic terms "low", "medium", and "high". According to the theory of the fuzzy knowledge base, taking into account equality (1) and based on the obtained model from (Fig. 1), the following general view is obtained for data visualization in infographics "if - then" is presented on page 6. Based on the above information, let's start building the linguistic variable G Table 2 of the knowledge matrix: Table 2 Table of the knowledge matrix of the linguistic variable G The quality of artistic The quality of technical The quality of data representation representation visualization in infographics (Q) (V) (G) low low low low medium medium low medium medium medium high medium high high high Using the same technique, we obtain fuzzy knowledge bases and linguistic equations for the lower level of the corresponding linguistic variables Q – Y(q1, q2, q3), V – Y(v1, v2, v3), which refer to the process of data visualization in infographics. Based on the obtained equalities, let's form fuzzy sets of linguistic variables Q and V with corresponding membership functions for the accepted terms "low," "medium," and "high" in the form of some fuzzy set (10). The defuzzification is performed based on the expert knowledge base and using the knowledge matrices of the linguistic variable G given in table (2) and the given fuzzy sets (10). Its purpose is to obtain the usual quantitative value of each of the initial variables, external concerning the fuzzy inference system, by using the results of the accumulation of all the initial linguistic variables. For this, the pre-numbered values of the membership functions at the three points of division of the selected universal set A for all linguistic variables are presented. To obtain the real weightings of the membership functions of the linguistic variables Q "quality of artistic presentation" and V "quality of technical presentation," the values of the defined terms "low," "medium," and "high" should be substituted into fuzzy logic equations. It should be noted that the membership functions of the linguistic variables are calculated based on the initial values of the linguistic variables presented by the experts. To calculate the linguistic variables Q, the values of the corresponding terms should be taken from the earlier calculations into fuzzy logic equations, and we will receive:  for the term "low" µlow(Q) = 0,77˄1˄0,55 ˅ 1˄0,77˄0,55 = 0,55  for the term "medium" µmedium(Q) = 0,77˄0,77˄1˅1˄1˄0,55 = 0,77  for the term "high" µhigh(Q) = 0,77˄0,66˄1 ˅1˄1˄0,66 = 0,66. Accordingly, the membership functions for the linguistic variable V can be represented as follows:  for the term "low" µlow(V) = 0,77˄0,77˄0,66 ˅0,77˄1˄0,66 = 0,66  for the term "medium" µmedium(V) = 0,77˄0,77˄0,55 ˅0,77˄0,77˄1 = 0,77  for the term "high" µhigh(V) = 0,33˄0,45˄1˅0,33˄1˄0,66 = 0,45. By obtaining the values of the membership functions of the lower-level linguistic variables and using the above formulas as a basis, let's proceed to calculate the membership function of the highest-level linguistic variable G "data visualization quality in infographics" according to the accepted terms "low," "medium," "high." Thus, the following values are received:  for the term "low" µlow(G) = 0,55˄0,66˅ 0,55˄0,77 = 0,55  for the term "medium" µmedium(G) = 0,77˄0,66˅ 0,77˄0,77 = 0,77  for the term "high" µhigh(G) = 0,66˄0,77˅ 0,66˄0,45 = 0,66. In the final stage of calculations according to this method, namely, using the defuzzification of fuzzy sets (10) and the center of the mass method, we calculate the indicators of the level of the predicted quality of the implementation of the data visualization process in infographics according to expression (11). The calculation of the indicator of the predicted level of the data visualization process quality in the infographics is carried out at three points of the interval – 5, 55, and 100%, and the following numerical value is received: 5. Conclusions From the material described above, it is clear that the data visualization process is described by factors that are difficult to represent in the form of certain mathematical units. Therefore, to clarify the accuracy of the results of expert judgment, it is advisable to use such methods as the theory of fuzzy logic. When analyzing the previous results, namely using the analytic hierarchy process and the ranking method, it became necessary to use a mathematical tool with the help of which the ambiguous statements of experts would be translated into the language of clear mathematical formulas. As research shows, the usage of fuzzy logic is a convenient solution. It provides an opportunity to operate the fuzzy input data. Based on the calculations, using the theory of fuzzy logic, an indicator of the predicted level of quality of the data visualization process in infographics was established. As the results showed, it was 52.5%, which is greater than the value of 50%, which means that our assumptions and calculations were correct. Therefore, using the theory of fuzzy logic helps to provide a qualitative assessment of the process researched based on the calculation of the indicator of its level of predicted quality. 6. Acknowledgements The authors of this study are sincerely grateful to the large audience of students and teachers who contributed to the research and establishment of influencing factors on the data visualization process in infographics. Special gratitude to colleagues for help and relevant suggestions that helped to improve the submitted material. 7. References [1] M. Sorapure, Text, Image, Data, Interaction: Understanding Information Visualization. Computers and Composition, Volume 54, December 2019, 10251. [2] H. Shen, T. Bednarz, H. Nguyen, F. Feng, T. Wyeld, P. J. Hoek, E. H.S. Lo, Information visualisation methods and techniques: State-of-the-art and future directions, Journal of Industrial Information Integration, Volume 16, December 2019, 100102. [3] S. Einakian, T. S. Newman, An examination of color theories in map-based information visualization, Journal of Computer Languages, Volume 51, April 2019, рр. 143-153. [4] J. Sun, K. Lei, L. Cao, B. Zhong, Y. Wei, J. Li, Z. Yang, Text visualization for construction document information management, Automation in Construction, Volume 111, March 2020, 103048. [5] J. K. Nuamah, Y. Seong, S. Jiang, E. Park, D. Mountjoy, Evaluating effectiveness of information visualizations using cognitive fit theory: A neuroergonomics approach, Applied Ergonomics, Volume 88, October 2020, 103173. [6] C. Edmond, T. Bednarz, Three trajectories for narrative visualization, Visual Informatics, Volume 5 (2021), рр. 26–40. [7] E. Perez-Molina, F. Loizides, Novel data structure and visualization tool for studying technology evolution based on patent information: The DTFootprint and the TechSpectrogram, World Patent Information, Volume 64, March 2021, 102009. [8] H. Mei. W. Chen, Y. Ma, H. Guan, W. Hu., VisComposer: A Visual Programmable Composition Environment for Information Visualization, Visual Informatics, Volume 2, 2018, рр. 71-81. [9] S. Zhu, G. Sun, Q. Jiang, M. Zha, R. Liang, A survey on automatic infographics and visualization recommendations, Visual Informatics, Volume 4, Issue 3, September 2020, рр. 24- 40. [10] S. R. Midway, Principles of Effective Data Visualization, Patterns, Volume 1, Issue 9, 11 December 2020, 100141. [11] O. Tymchenko, N. Kunanets, O. Khamula, S. Vasiuta, O. Sosnovska, Synthesis and research of a model of factors of infographics compositional design with elements of visual communication, Proceedings of the 2nd International Workshop on Intelligent Information Technologies & Systems of Information Security with CEUR-WS Khmelnytskyi, Ukraine, March 24–26, 2021, pp. 303-322. [12] O. Tymchenko, S. Vasiuta, O. Khamula, A. Konyukhov, O. Sosnovska, M. Dudzik, Synthesis of Factors Model of Data Visualization in the Infographics, 2019 IEEE International Scientific- Practical Conference Problems of Infocommunications Science and Technology PIC S&T’2019, Volume 2. (October 8-11). Kyiv, 2019. рp. 451-454. [13] O. Tymchenko, O. Khamula, S. Vasiuta, O. Sosnovska, O. Mlynko, A Comparison of Methods for Identifying the Priority Hierarchy of Influencing Factors, Proceedings of the 3rd International Workshop on Intelligent Information Technologies & Systems of Information Security, Khmelnytskyi, Ukraine, March 23-25, 2022, CEUR Workshop Proceedings 3156, CEUR- WS.org 2022, pp. 228-237. [14] V. Repeta, O. Havrylyshyn, I. Myklushka, Modelling of the Factors Influence on the Quality of the Digitization Process of Old Books by Fuzzy Logic, Proceedings of the 16th IEEE International Conference on Computer Science and Information Technologies, CSIT 2021, Lviv, Ukraine, September 22-25, 2021, Volume 2, рр. 182-185, Code 175940. [15] V. Repeta, V. Zhydetskyy, O. Cherednichenko, I. Liakh, Modeling with fuzzy logic the migration capacity of UV-ink components as a factor of potentially harmful pollution of packaging, CEUR Workshop Proceedingsthis link is disabled, 2021, 2853, pp. 83–89. [16] V. Repeta, Y. Kukura, V Kukura, Analysis of the relationship of quality factors in the solventless lamination process, Journal of Print and Media Technology Research, Volume 7, Issue 1, 2018, рр. 27 – 34.