<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Fuzzy Set Objects Clustering Method Using Evolution Technologies</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Hryhorii Hnatiienko</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oleh Suprun</string-name>
          <email>oleh.o.suprun@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Taras Shevchenko National University of Kyiv</institution>
          ,
          <addr-line>Kyiv</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <fpage>129</fpage>
      <lpage>138</lpage>
      <abstract>
        <p>Presented in the article method allows an expert, or person, who makes decisions, to effectively evaluate, calculate and make the best decision on the large set of opinions. The biggest problem while performing the trustful quality assessment is to build an adequate comparison matrix, based on human opinions, since reviews may vary greatly, and it's impossible to appreciate them all, when dealing with big data. The number of these reviews may be up to hundreds, when working with the group of experts, or even thousands, operating with clients' opinions. Thus, a method to rank and cluster these results, for further operation and making decisions, is required. Analyzing the fuzzy set objects, most of classic algorithms can not be used, since it's impossible or hard to design a proper target function, or it can be discontinuous or impossible to evaluate by classic methods, so the evolution technology method is used. It allows not only to build a proper cluster around the given centers, but also to find these centers, to evaluate the best radius and the most important attributes. Besides that, large amount of parameters allows an expert to choose the most important aspects or vary the time, used to solve the problem. Another benefit is the possibility of an algorithm to adapt to the external impacts and improve the result. The algorithm and required preparations are described in the article.</p>
      </abstract>
      <kwd-group>
        <kwd>Complex Information Systems</kwd>
        <kwd>Fuzzy Sets</kwd>
        <kwd>Evolution Technologies</kwd>
        <kwd>Objects Clustering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>In the modern information society, more and more information require evaluating,
investigating and ranking every day. It can be seen in every day life, when planning
the day or the nearest future, choosing different products in the shop, or looking for
the best options of summer vocation. Most of the decisions are made subconsciously,
so the person doesn’t pay big attention to this process, but it’s much harder to choose
the best option, if there are many different aspects, or the person is unfamiliar with the
required topic. In all these situations the human has to overview large amounts of
information and responds to make right decision. In most cases it is done, guided by
pure experience, by the advices or topic analysis.</p>
      <p>The same problem, but in much greater scale, appears while planning the future
steps of big companies, or even countries, choosing the best products and options to
improve, and simulating the economical and financial models. The consequences are
very important, the choice that has to be made affect the future of many people. But,
to make right decisions, it’s necessary to take into account the thoughts of many
experts, their opinions and advises, the number of such experts in big companies may
vary from dozens to hundreds.</p>
      <p>Beside that, nowadays, when the Internet makes possible to communicate and
share information among large groups of costumers from all over the world, it’s very
important for companies and selling structures to analyze the opinions and feedback
from all of the buyers, to choose the best product, to improve their services and give
answer to all questions and comments. It is necessary to maintain the company’s
authority among the opponents and costumers.</p>
      <p>Dealing with large number of opinions is a complicated task by it’s own, and even
more, very often they differ not only by their content, but also the form, and a lot of
time is required to build a normalized table of opinions. Doing this manually is
impossible for big companies, since requires a lot of time, and new opinions may appear
every minute. So, the normalization and first analysis must be automated to show an
expert an approximate table. To handle this automatically the fuzzy set principles are
used. Also, dealing with large amount of information requires the usage of big data
and data mining functions.</p>
      <p>But, at the same time, most algorithms of data mining are made to evaluate strict
statistical data that has the same form and ranged values, can be put into a table and
placed according to the current program, which is used. This can’t be done with
expert opinions that are presented as text massage and the form may be varying very
differently. Thus, the non-classic method is required to simulate the work of a team
that ranks and makes basic conclusions, based on large number of opinions. It must
include both analyzing the text and building the initial table of opinions for their
pairwise comparison.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Clustering the Fuzzy Set Objects</title>
      <p>
        Cluster analysis is a method of grouping experimental data into classes [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
Performing this method, the following condition must be fulfilled: the experimental values or
data that are gathered into one class or group are closer to each other than to values
from other classes, according to a certain parameter or attribute. The number of
clusters can be arbitrary or constant.
      </p>
      <p>The main purpose of data clustering methods is to ensure that the similarity of the
data that is combined into one cluster is maximal. As a result of solving the
clusterization problem, the values ranking in relatively similar groups is performed. The cluster
analysis application is very diverse and common in many subject areas.</p>
      <p>It is necessary to perform the initial clustering preparation, dealing with opinions
and expert reviews, first of all, to find similar ones, to analyze the overall dynamic
and make initial conclusions, for example, is the new technology, used in production,
effective or not, or are the customers satisfied with the made improvements.
Furthermore, it is much easier to deal with clusters on the next steps, analyzing their common
features instead of each opinion separately, that can require a lot of time, when the
decision must be made quickly. It is especially important in economics and financial
sphere.</p>
      <p>Of course, dealing with clusters instead of separated opinions, some detalization
will be lost, since assumptions are made and the overall opinion is analyzed. But,
choosing the right coefficients, this detalization loss can be lowered to the acceptable
level that can be different for each situation. For example, if the mistake is found, it is
more important to deal with it as fast as possible, and leave detailed analysis for the
further steps.</p>
      <p>
        Clustering methods can be used to construct a fuzzy set membership function [
        <xref ref-type="bibr" rid="ref2 ref3">2,3</xref>
        ].
By definition, membership function  A ( x) quantitatively calibrates the
membership of the fundamental set elements of the considerations space x  X to the
fuzzy set A . The value  A ( x)  0 means that the fuzzy set does not include an
element x . The value  A ( x)  1 means the full membership of an element to a
fuzzy set. The values of the membership function from the interval (0,1)
numerically characterize fuzzy elements.
      </p>
      <p>
        The designing of membership functions (belonging function, F-functions) is one of
the most important stages in solving decision-making problems in the fuzzy
statement. The uncertainty of measuring the attribute intensity of an object can consist of
the complexity of measurement, inaccurate intensity measurement, different
perception of the objects properties by experts, etc. The membership function must fulfil the
following requirements [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]:
 continuity, that is a formalization of the following intuitive statement: when two
solutions of the set X do not differ much from each other, the values of
membership functions for these solutions are also closeto each other;
 consistency with the ratio of advantages, i.e.  D (x1 )  D (x2 ) then and only
when x1  x2.
      </p>
      <p>Let a series of empirical data in a range (0,1) , obtained as a result of measuring a
certain value x is given. According to the data analysis results, it is necessary to
formulate a conclusion acceptable to the researcher, what values the indicated
variable acquires. One-dimensional analysis suggests describing the distribution of one
variable, including its central trend (including average values, median and mode), and
dispersion (including the range and quintile of the data set, and dissemination
measures such as variance and standard deviation). Since for many practical tasks the
determination of the mode, the median, or the average mean of a given series is
insufficient, a cluster analysis can be applied to the specified series. It is necessary to define
clusters that can be used to build a membership function for the value x , which best
classify the results of measuring the value x and allow to design an membership
function for the value x in the interval (0,1) .</p>
      <p>
        The membership function of a fuzzy set is a generalization of the indicator function
of the classical set. In fuzzy logic, it is a degree of truthfulness. The degrees of truth
are sometimes confused with probability, but they are fundamentally different, since
truth indicates that the value belongs to a given set that is not similar to any
phenomenon or condition. Algorithms for determining the membership function by
analyzing the frequency of values were considered in [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ].
      </p>
      <p>Fuzziness appears when the expert tries to quantify the subject area. Obtaining
fuzzy knowledge is an extremely difficult task, such as experts, as a rule, are not able
to adequately design a membership function. Therefore, so-called standard
membership functions, which have a given form and are described by well-known analytic
functions, are often used. Standard membership function features are easy to apply to
many practical tasks.</p>
      <p>Usually, there are two groups of methods for constructing a membership function:
direct and indirect. In direct methods, the expert directly sets the rules for determining
the values of the function: a table, defined by formulas or with examples. Direct
methods are used to describe the concepts that are characterized by features that can
be measured: height, weight, volume, etc.</p>
      <p>Using the direct methods is the best option when dealing with more or less
standard opinions, for example, when the experts or customers are asked the same
questions with direct answers. Of course, this is rather partial option, but at the same time
it must be used when possible, to save time. The indirect methods require more time
and more knowledge about domain area, thus, more preparation, but they allow to
make better and more precise function for current situation. Besides, such function
may be more complicated, so it is impossible to use classic analyzing methods, or
their performance will take a lot of time.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Clusterization Algorithm</title>
      <p>
        When solving multicriteria optimization problems, the problem of determining the
Pareto domain is strictly objective and solved without the use of any heuristics [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
Narrowing the area of effective objects requires the use of additional information
from experts, since effective set of parameters can not be compared formally with
each other.
      </p>
      <p>But, such additional information may be found far not always. Gathering the
information, such as opinions, takes time, and during assembling of a new expert team
and getting new opinions the situation may change. And customers far not always are
ready and willing to answer more questions, also taking into account anonymous
replies, which also must be considered for making the services better. Expanded reply
system could help with it, but bigger reply far not always means better reply. So, the
system has to work in conditions of lack of information. It is possible only making
some assumptions, that may have negative influence on the result, but their usage is
necessary in general.</p>
      <p>As a rule, three heuristics are used to determine a single solution to a multicriteria
problem:
 one of the allowed transformations is used to transform all values of object
parameters to dimensionless form in a given value range;
 the vector of the criteria relative importance is determined;
 it is assumed that the multicriteria problem solution is the point of intersection
of the normalized weight coefficients of criteria relative importance beam with
the field of effective alternatives to the problem.</p>
      <p>It is known that designing a structured table of benefits in a formalized form is a
complex task for a human, since it is rather hard to present the opinion as a set of
numbers for different categories. Research in the field of expert evaluation tasks and
the practice of building decision support systems show that far not always experts and
decision makers have a clear idea of the structure of preferences for a plurality of
objects. In most cases, a person can not adequately determine the weight factors, or
allocate in the obvious case the heuristics that are used in a decision-making situation.
This is especially important, when dealing not with experts and professionals in the
current domain, but analyzing the customers’ opinions.</p>
      <p>A common method of presenting the weight coefficients values for n objects with
index i, i  I  1, ..., n are valid numbers taking into account the condition of
normalization is the following:
where
Let the result of a calculation series the weight coefficients set is formed:
iI
 i  1,
 i  0, i  I .</p>
      <p>
         i   i1 , ...,  iL  , i  I ,
where L is the number of values’ indexes of normalized weight coefficients
obtained as a computations result. Based on the obtained values, the membership
functions of the weight coefficients values in the fuzzy set (0,1) are determined.
Approaches to the determination of membership functions and algorithms for
constructing membership functions based on the analysis of the values frequency are given in
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. That is, each weighted coefficient as a result of the described procedure
application will be characterized by its function of membership to the fuzzy set.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Evolution Technologies Implementing</title>
      <p>
        Using the elements of evolutionary technologies for clustering fuzzy sets objects is
logical and can give practically useful results, since they allow to avoid most
problems, that the classic methods meat [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. For example, in most cases the target function
is undifferentiated, it may not have a certain interpretation, since it’s built upon the
human opinions that are rather hard to normalize.
      </p>
      <p>Although evolutionary methods do not necessarily have a concrete result, the
resulting data obtained as a result can be effectively used in practice. It is especially
useful when calculating the overall customers opinion and improving the service – it
is impossible to reach the perfect state, since it depends on customers desires and
always change, and the evolution technologies allow to fend a better solution, then
implemented, at each current situation.</p>
      <p>The most common situation while evaluating the expert’s conclusions is the need
to analyse their thoughts on the set of objects according to more then one attribute.</p>
      <p>Thus, the clusterization problem can be stated as follows: to split the set  , that
consists of n objects into m clusters   S1, S2 ,..., Sm  . Every object Si has the
set of characteristics, i, i  I  1, ..., n .</p>
      <p>The objects data are presented in the "object-attribute" type table:
where, X i , i  I  1, ..., k are the characteristics or attributes of each object,
that is analyzed.</p>
      <p>Let the set C   C1 , C2 ,..., C m  be the solution of the clustering problem.
Considering that the objects are points located inside a k - dimensional
hyperparallelepiped, the following can be obtained:</p>
      <p>C   arg min F (c)  arg min</p>
      <p>C C</p>
      <p>n m
C1,mC2i,.n..Cm  i1 j1  Si  R j   d  Si , C j  ,
with restriction that all potential cluster centers lie inside the hyperparallelepiped,
C  C1, C2 ,..., Cm  .</p>
      <p>Obtaining the cluster centers, it will be possible to operate not with all the
objectsopinions, but only with their centers, that can be presented as averaged object.
Number of these objects will be much lower and easier to analyze.</p>
      <p>
        To find the problem solution, genetic algorithm can be used, since it is one of the
most flexible evolution methods. Also, the evolution strategies usage is presented as
example [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>The solution search can be presented as a sequence of the following steps:
Step 1. Generate q sets, that consist of m elements C i  C1i , C2i ,..., C mi , i  1, q
and, as usual, q is a number from the set. The values C ij are uniformly distributed in
the hypercube, i  1, q, j  1, m ,</p>
      <p>Step 2. Calculate the distance from each object to each cluster center:
 k
d ijp  d  S j , C pj    l1  x jl  cpjl 2 
1/2
,
where i  1, q, j  1, n, p  1, m .</p>
      <p>In order to determine which cluster the objects belong to, the search problem must
be solved: for i  1, q, j  1, n find
arg min d (S j , C ip ) .</p>
      <p>p
for i -th potential problem solution.</p>
      <p>n
di   d (S j , C piij ), i  1, q .</p>
      <p>j1
or</p>
      <p>Step 4. If the Genetic Algorithm is chosen to solve the problem, then, considering
the value di , the following operations are made with the set C i : crossover operation,
mutation, and elite selection into the new population of potential solutions is made.</p>
      <p>If the Evolution Strategy method is used, the new potential population is generated,
where every new solution is obtained from “parent” solution, by adding a normally
distributed random displacement X new  X parent   ( N (0, 2 )) . The amount of new
potential solutions, as usual, is 7 times bigger, than the amount of “parent” solutions.
The best solutions from the “parent” and intermediate populations are selected to the
new population.</p>
      <p>The choice of the used evolution algorithm is based on each current problem and
situations, since almost every problem is unique.</p>
      <p>Step 5. Steps 3-5 are repeated until the criteria for iterative process stop, that are
set by the expert before the algorithm starts and can be changes for result improving,
are not achieved.</p>
      <p>Such criteria may be:
– the priori number of iterations;
– for a given  :
max di(it)  max di(it1)   ,</p>
      <p>i i
avi g di(it )  avgdi(it1)   ,
max d (C i(it) , C i(it 1) )   ,</p>
      <p>i
where it is iteration number.</p>
      <p>These criteria can be used in different situations. The number of iterations allows
to get a result in a certain time period, when the solution must be found fast, and with
set accuracy it is possible to find the result, good enough for the current problem.</p>
      <p>Also, other criteria can be used. For example, the results in different iterations can
be compared between each other, it allows to control the convergence rate, and,
hence, not to waste additional time, if improving the current result will be ineffective
and take too long.</p>
      <p>Step 6. The clustering problem solutions, after the criteria for iterative process stop
are achieved, are the following:
arg min di(it) .</p>
      <p>Ci(it)</p>
      <p>The proposed clustering method is a parametric method and it’s efficiency depends
on researcher’s qualification and efficiency of parameters setting for each specific
problem.</p>
      <p>These parameters are the following:
 The iteration process stop criteria;
 The crossover and mutation type;
 The type of parents selection and population of the next generation formation, if
the Genetic Algorithm is chosen as optimization method;
 The parent-solutions and offspring-solution number;
 Constant or variable value standard deviation;
 Positive or negative dynamics of standard deviation for the Evolution Strategy.</p>
      <p>It is known that the convergence in probability holds for a genetic algorithm with
an elite selection to a new solutions population, and for (   ) – Evolution Strategy,
where  - parent-solution number, and  – offspring-solution number with iterations
number tending to infinity.</p>
      <p>Depending on each problem and desired solution accuracy, other evolution
algorithms, like differential evolution, can be used. Besides that, modifications of the
classic genetic algorithm can be made to improve the result, besides the coefficient
changing. At the same time, it is necessary to test these changes on a simple model
before the actual usage to avoid mistakes in future.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>Working with big data is one of many aspects, connected with modern information
society, and nowadays a lot of ranking and clustering methods are designed and
improved. They are required to evaluate and compute large amount of statistical data,
like economical or financial reports and other. But the main flaw of most of them is
that these methods can be used only with the same systematic data.</p>
      <p>At the same time, for large companies it is necessary to combine, or at least rank
the reviews and opinions of many experts, that is impossible using the classic
methods. Also, dealing with these opinions manually is impossible because of their large
number.</p>
      <p>To solve this problem the fuzzy sets are used. They allow to formalize and
combine different opinions and to make some conclusions, based on them. The main
principles of performing the fuzzy set objects are used, so that the proper results can be
obtained, and the main heuristics are described in the article. This allows to normalize
the results, so that more complicated algorithm can be implemented.</p>
      <p>Besides, even normalizing the opinions will allow making some conclusions and to
find weak spots of the company functioning even without the further computing.</p>
      <p>Clustering different objects can be mat in different fields of science, but it’s rather
new for the fuzzy sets, since it also requires normalized sets of data. The target
function may differ very much, it can be undifferentiated and discontinuous, thus, the
classic methods can’t be used. The evolution algorithm performance is presented, that
allows to conduct the clustering process in permissible time period.</p>
      <p>The evolution algorithm steps are presented, they are not strict and can be changed
depending on every current problem, that allows to improve the algorithm every time
instead of using the stereotyped one. The large number of coefficients also allows to
adapt the algorithm for each task. This requires more preparation time and some
initial analyzes, but allows to get a better result, using less time.</p>
      <p>As for the future investigations, the possibility of genetic algorithm improvement
must be considered. For example, the usage of evolution strategy, or implementing
the penalty method to improve the clusterization results.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>M.</given-names>
            <surname>Ester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. P.</given-names>
            <surname>Kriegel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <article-title>A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise</article-title>
          ,
          <source>Second International Conference on Knowledge Discovery and Data Mining</source>
          , pp.
          <fpage>226</fpage>
          -
          <lpage>231</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>E.</given-names>
            <surname>Nakamura</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Kehtarnavaz</surname>
          </string-name>
          ,
          <article-title>Optimization of fuzzy membership function parameters,"</article-title>
          <source>1995 IEEE International Conf on Syst., Man and Cybern</source>
          .,
          <source>Intelligent Systems for the 21st Century'</source>
          , vol.
          <volume>1</volume>
          ,
          <fpage>22</fpage>
          -
          <lpage>25</lpage>
          Oct.
          <year>1995</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>S.</given-names>
            <surname>Himavathi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Umamaheswari</surname>
          </string-name>
          ,
          <article-title>New membership functions for effective design and implementation of fuzzy systems</article-title>
          ,
          <source>IEEE Trans. Syst., Man and Cybern</source>
          ., vol.
          <volume>31</volume>
          , no.
          <issue>6</issue>
          , pp.
          <fpage>717</fpage>
          -
          <lpage>723</lpage>
          , Nov.
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. A. Kaufman, M. Gupta,
          <article-title>Introduction to fuzzy arithmetic: theory and applications</article-title>
          . NY: Van Nostrand Reinhold,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Hnatyenko H.M.</surname>
          </string-name>
          <article-title>Algorithms for determining the membership function by analyzing the values frequency, Proceedings of the III-th international school-seminar "Theory of decision-making"</article-title>
          ,
          <source>Uzhorod</source>
          ,
          <year>2006</year>
          . - P.
          <fpage>32</fpage>
          -
          <lpage>34</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Hnatyenko</surname>
            <given-names>H.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Snityuk</surname>
            <given-names>V.E.</given-names>
          </string-name>
          <string-name>
            <surname>Expert</surname>
          </string-name>
          <article-title>Decision Technology: Monograph, K: LLC "</article-title>
          <source>Maclaut"</source>
          ,
          <year>2008</year>
          . -
          <fpage>444p</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Patrick</given-names>
            <surname>Ngatchou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Anahita</given-names>
            <surname>Zarei</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.A.</given-names>
            <surname>El-Sharkawi</surname>
          </string-name>
          ,
          <article-title>Pareto Multi Objective Optimization</article-title>
          ,
          <source>Proceedings of the 13th International Conference on Intelligent Systems Application to Power Systems (ISAP</source>
          <year>2005</year>
          ), pp.
          <fpage>84</fpage>
          -
          <lpage>91</lpage>
          , IEEE Press, Washington, DC, USA,
          <fpage>6</fpage>
          -
          <issue>10</issue>
          <year>November</year>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>V. Y.</given-names>
            <surname>Snytyuk ; O. O. Suprun</surname>
          </string-name>
          :
          <article-title>Evolutionary techniques for complex objects clustering</article-title>
          ,
          <source>2017 IEEE 4th International Conference Actual Problems of Unmanned Aerial Vehicles Developments (APUAVD)</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hai-hui</surname>
            <given-names>Wang</given-names>
          </string-name>
          ;
          <article-title>Wen-jie Zhao, Data clustering based on approach of genetic algorithm, 2008 Chinese Control</article-title>
          and Decision Conference,
          <volume>2</volume>
          -
          <fpage>4</fpage>
          ,
          <year>July 2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>