=Paper=
{{Paper
|id=Vol-2571/CSP2019_paper_11
|storemode=property
|title=A Fuzzy Set Tool in the Classification and Prediction Software System (CLAPSS) (short paper)
|pdfUrl=https://ceur-ws.org/Vol-2571/CSP2019_paper_11.pdf
|volume=Vol-2571
|authors=Krzysztof Pancerz,Jaromir Sarzyński
|dblpUrl=https://dblp.org/rec/conf/csp/PancerzS19
}}
==A Fuzzy Set Tool in the Classification and Prediction Software System (CLAPSS) (short paper)==
A Fuzzy Set Tool in the Classification and Prediction Software System (CLAPSS) Extended Abstract? Krzysztof Pancerz and Jaromir Sarzyński Department of Computer Science, Faculty of Mathematics and Natural Sciences University of Rzeszów Prof. S. Pigonia Str. 1, 35-310 Rzeszów, Poland kpancerz@ur.edu.pl Abstract. In the paper, we give the outline of a fuzzy set tool imple- mented in the Classification and Prediction Software System (CLAPSS). CLAPSS is being developed for solving different classification and predic- tion problems using, among others, some specialized approaches based mainly on fuzzy sets and rough sets which are not available in other machine learning software systems. Theoretical background as well as the module embedded in CLPASS, for fuzzification of attribute values in information/decision systems, are described. Moreover, possible further steps in the usage of CLAPSS (generation of fuzzy decision trees as well as fuzzy flow graphs) are mentioned. Key words: Fuzzy sets, Fuzzification, Software system, CLAPSS. 1 Theorethical Background Most of the methods implemented in the CLAPSS system are applied for infor- mation/decision tables representing information/decision systems understood as Pawlak’s knowledge representation systems (cf. [12]). A decision system is a tuple DS = (U, C, D, {Va }a∈C∪D , finf , fdec ), where U is the non-empty, finite set of objects, C is the non-empty, finite set of condition attributes, D is the non-empty, finite set of decision attributes, {Va }a∈C∪D is the family of S non-empty sets of condition and decision attribute values, finf : C×U → Vc is the information function such that finf (c, u) ∈ Vc for each c∈C S c ∈ C and u ∈ U , fdec : D × U → Vd is the decision function such that d∈D fdec (d, u) ∈ Vd for each d ∈ D and u ∈ U . An information system is a specific case of a decision system. In this case D = ∅ and the decision function is not defined. Further, only information systems, in the form IS = (U, A, {Va }a∈A , finf ), will be considered. ? Copyright c 2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). Fuzzification is the process that transforms the real value variables into lin- guistic variables whose domains contain linguistic values which can be described by fuzzy sets (their membership functions). Let IS = (U, A, {Va }a∈A , finf ) be an information system such that Va ⊆ R for each a ∈ A. For each attribute a ∈ A, we can define a linguistic variable λa . With each linguistic variable λa , a set Lλa = {l1λa , l2λa , . . . , lkλaa } of linguistic values is associated. Each linguis- tic value liλa , where i = 1, 2, . . . , ka , is described by a membership function µlλa : R → [0, 1]. In CLAPSS, the user has a broad set of membership functions i which can be used to make a fuzzification process. This set consists of: – a triangular shaped membership function, – a trapezoidal shaped membership function, – a Gaussian shaped membership function, – a generalized bell shaped membership function, – an S shaped membership function, – a π shaped membership function, – a sigmoidal shaped membership function, – a fuzzy singleton membership function, – a sinusoidal shaped membership function, – a Z shaped membership function, – a pseudo-exponential shaped membership function, – an L-R shaped membership function, – a two Gaussian shaped membership function, – a D-sigmoidal shaped membership function, – a P-sigmoidal shaped membership function. Let: – IS = (U, A, {Va }a∈A , finf ) be an information system, where card(U ) = n and card(A) = m, such that Va ⊆ R for each a ∈ A, – {Lλa }a∈A be the family of sets of linguistic values associated with linguistic variables from the family {λa }a∈A defined for attributes from A, where Lλa = {l1λa , l2λa , . . . , lkλaa } for each a ∈ A. A fuzzified information system is a tuple F(IS) = (U F , Φ, {Vφ }φ∈Φ , finf F ), where F ∗ F U is the non-empty, finite set of objects such that each u ∈ U corresponds exactly to one u ∈ U , Φ = Φa1 ∪ Φa2 ∪ · · · ∪ Φam is the non-empty, finite set of S {Vφ }φ∈Φ is the family of sets of fuzzifiedF attribute ∗values, fuzzified attributes, F finf : Φ×U F → Vφ is the information function such that finf (φ λaj , u ) ∈ Vφ φ∈Φ li ∗ F F for each φ λaj ∈ Φ and u ∈ U , finf (φ λaj , u∗ ) = µ λaj (finf (aj , u)), where µ λaj li l i l i l i λa is a membership function describing li j . 2 CLAPSS CLAPSS is our tool developed for solving different classification and prediction problems using, among others, some specialized approaches based mainly on fuzzy sets and rough sets. CLAPSS is equipped with the graphical user interface (see Figure 1). In general, our main idea is to implement in CLAPSS those specialized approaches which are not available in other machine learning software systems. Selected functionalities of CLAPSS were earlier described in [7], [8], and Fig. 1. CLAPSS (the graphical user interface). [11]. The general usage of methods, based on fuzzy sets, implemented in CLAPSS is shown in Figure 2. Information/decision systems (also those fuzzified) can be imported from/exported to other machine learning software systems, RSES [1], WEKA [4], ORANGE [2], as it was depicted in Figure 2. For methods based on fuzzy sets, CLAPSS offers, first of all, a tool for fuzzi- fication of attribute values in information/decision systems. The fuzzification process can be done in three ways: graphical, scripting, and external. For graphical fuzzification, Membership Function Creator (MFC) has been developed (see Figure 3). MFC enables the user to: – determine linguistic values and membership functions (their shapes and pa- rameters) associated with them, – manually modify membership functions created earlier (for example, char- acteristic points or slopes can be moved), – see calculated values of the fuzzified attribute (these values are automatically updated if some changes in membership functions are made). After the fuzzification process of the selected attribute, a script (a special scripting language was designed for CLAPSS) is generated. The script, consisting of membership function definitions for each attribute to be fuzzified, can also be created manually, i.e., in a scripting way (see an example below). ATTR[0]->fuzzification(lingvalues={low=(trapezoidal,0.0000,0.0000,1.0000,3.0000), medium=(triangular,1.0000,2.5000,4.0000),high=(trapezoidal,2.0000,4.0000,5.0000,5.0000)}); Fig. 2. CLAPSS (a general scheme of the usage of the fuzzy set tool). Fig. 3. CLAPSS (Membership Function Creator). ATTR[1]->fuzzification(lingvalues={low=(trapezoidal,0.0000,0.0000,1.0000,3.0000), medium=(triangular,1.0000,2.5000,4.0000),high=(trapezoidal,2.0000,4.0000,5.0000,5.0000)}); ATTR[2]->fuzzification(lingvalues={low=(trapezoidal,0.0000,0.0000,1.0000,3.0000), medium=(triangular,1.0000,2.5000,4.0000),high=(trapezoidal,2.0000,4.0000,5.0000,5.0000)}); Fuzzification of attribute values in the external tool is also possible. Then, the user can import a fuzzified information/decision system into CLAPSS. Further steps which can be performed in CLAPSS for fuzzified informa- tion/decision systems can be as follows: – Generation of fuzzy decision trees. Fuzzy decision trees are generated using the algorithm based on cumulative information estimations of initial data [5]. – Generation of fuzzy flow graphs [11]. Fuzzy flow graphs are generated using the fuzzy cardinality (power) of linguistic values (cf. [6]). The visualization of fuzzy decision trees and fuzzy flow graphs is possible due to the option for exporting them to the DOT format [3]. The practical usage of CLAPSS was presented in case of analysis and clas- sification of MMPI (Minnesota Multiphasic Personality Inventory) data. Fuzzi- fied decision systems were used among others in determining the importance of ranges of MMPI scales [10] and classification of MMPI profiles using fuzzy decision trees [9]. 3 Conclusions In the paper, we have briefly presented an important part of the CLAPSS system concerning implemented methods based on fuzzy sets. CLAPSS is constantly being developed. One of the main directions in further developing of CLAPSS, in the considered area, is to add other types of membership functions and to add graph algorithms for fuzzy flow graphs to extract rules, episodes, etc. Acknowledgments This work was partially supported by the Center for Innovation and Transfer of Natural Sciences and Engineering Knowledge at the University of Rzeszów. References 1. Bazan, J.G., Szczuka, M.S.: The Rough Set Exploration System. In: Peters, J., Skowron, A. (eds.) Transactions on Rough Sets III, Lecture Notes in Artificial Intelligence, vol. 3400, pp. 37–56. Springer-Verlag, Berlin Heidelberg (2005) 2. Demšar, J., Curk, T., Erjavec, A., Črt Gorup, Hočevar, T., Milutinovič, M., Možina, M., Polajnar, M., Toplak, M., Starič, A., Štajdohar, M., Umek, L., Žagar, L., Žbontar, J., Žitnik, M., Zupan, B.: Orange: Data mining toolbox in Python. Journal of Machine Learning Research 14, 2349–2353 (2013) 3. Ellson, J., Gansner, E.R., Koutsofios, E., North, S.C., Woodhull, G.: Graphviz and Dynagraph - static and dynamic graph drawing tools. In: Jünger, M., Mutzel, P. (eds.) Graph Drawing Software, pp. 127–148. Springer, Berlin, Heidelberg (2004) 4. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. ACM SIGKDD Explorations Newsletter 11(1), 10–18 (2009) 5. Levashenko, V., Zaitseva, E.: Usage of new information estimations for induction of fuzzy decision trees. In: Yin, H., Allinson, N., Freeman, R., Keane, J., Hubbard, S. (eds.) Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning (IDEAL 2002). LNCS, vol. 2412, pp. 493– 499. Springer Berlin Heidelberg (2002) 6. Mieszkowicz-Rolka, A., Rolka, L.: Flow graphs and decision tables with fuzzy at- tributes. In: Rutkowski, L., Tadeusiewicz, R., Zadeh, L.A., Żurada, J.M. (eds.) Artificial Intelligence and Soft Computing (ICAISC 2006). LNCS, vol. 4029, pp. 268–277. Springer, Berlin, Heidelberg (2006) 7. Pancerz, K.: On selected functionality of the classification and prediction software system (CLAPSS). In: Proceedings of the International Conference on Information and Digital Technologies (IDT’2015). pp. 278–285. Zilina, Slovakia (2015) 8. Pancerz, K., Grochowalski, P., Paja, W.: On selected data preprocessing proce- dures with the classification and prediction software system (CLAPSS). In: Pro- ceedings of the International Conference on Information and Digital Technologies (IDT’2016). pp. 219–226. Rzeszow, Poland (2016) 9. Pancerz, K., Levashenko, V., Zaitseva, E., Gomula, J.: Experiments with classifi- cation of MMPI profiles using fuzzy decision trees. In: Ganzha, M., Maciaszek, L., Paprzycki, M. (eds.) Proceedings of the 2018 Federated Conference on Computer Science and Information Systems (FedCSIS’2018), Annals of Computer Science and Information Systems, vol. 15, pp. 125–128. Poznań, Poland (2018) 10. Pancerz, K., Paja, W., Sarzyński, J., Gomula, J.: Determining importance of ranges of MMPI scales using fuzzification and relevant attribute selection. In: Howlett, R., Toro, C., Hicks, Y., Jain, L. (eds.) Proceedings of the 22nd International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES- 2018), Procedia Computer Science, vol. 126, pp. 2065–2074. Belgrade, Serbia (2018) 11. Pancerz, K., Lewicki, A., Sarzyński, J.: Discovering flow graphs from data tables using the classification and prediction software system (clapss). In: Mihálydeák, T., Min, F., Wang, G., Banerjee, M., Düntsch, I., Suraj, Z., Ciucci, D. (eds.) Rough Sets, LNCS, vol. 11499, pp. 356–368. Springer International Publishing, Cham (2019) 12. Pawlak, Z.: Rough Sets. Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, Dordrecht (1991)