<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>A.E. Bondarev</string-name>
          <email>bond@keldysh.ru</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>V.A. Galaktionov</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Keldysh Institute of Applied Mathematics RAS</institution>
          ,
          <addr-line>125047 Miusskaya sq. 4, Moscow</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The paper considers the visual analysis of multidimensional data sets of medical origin. The analysis relies on building elastic maps, which map the original data points onto embedded manifolds of lower dimension. By decreasing the elasticity parameters, one can construct a map surface that approximates the multidimensional data set much more closely. To improve the results, a number of previously developed procedures are used: preliminary data filtering and removal of separated clusters (flotation). To solve the scalability problem, which arises when the elastic map adapts both to the region where data points are concentrated and to isolated points of the data cloud, the quasi-Zoom approach is applied. Illustrations of applying elastic maps to various medical data sets are presented.</p>
      </abstract>
      <kwd-group>
        <kwd>multidimensional data</kwd>
        <kwd>visual analysis</kwd>
        <kwd>elastic maps</kwd>
        <kwd>quasi-Zoom</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In the analysis of multidimensional data, classification occupies a special place.</p>
      <p>
        When solving classification problems, the approaches of visual analytics are very useful. They combine several algorithms for reducing the dimension and for the visual presentation of multidimensional data on manifolds of lower dimension embedded in the original space. These algorithms include mapping the original multidimensional volume onto elastic maps [
        <xref ref-type="bibr" rid="ref18 ref8 ref9">8, 9, 18</xref>
        ]
with different elasticity properties. These methods give insight into the cluster structure contained in the multidimensional data volume under study.
      </p>
      <p>Our team became interested in elastic maps while implementing a project to develop computational technologies for building, processing, analyzing and visualizing multidimensional parametric solutions of CFD problems.</p>
      <p>The computational technology is implemented as a single pipeline of algorithms for the production, processing, visualization and analysis of multidimensional data. Such a pipeline can be considered a prototype of a generalized computational experiment for non-stationary problems of computational gas dynamics. A generalized computational experiment makes it possible to obtain a solution not for a single problem, but for a whole class of problems defined by the ranges over which the determining parameters vary. The approach is also universal: it can be applied to a wide range of problems in the mathematical modeling of non-stationary processes.</p>
      <p>
        The
description of the elements of the implemented computing
technology is given in [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ].
      </p>
      <p>
        In practice, elastic maps turned out to be a useful and quite versatile tool, which made it possible to apply them to multidimensional data volumes of various types. This approach was applied to the analysis of textual information, where word usage frequencies [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] were used as numerical
characteristics, as well as to the tasks of analyzing mineral
samples [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. While working on these tasks, we developed and tested a number of data processing procedures that improved the results of visual analysis. These procedures include preliminary filtering of the data, which weeds out points with indistinctly defined values, removal of separated clusters (flotation), and quasi-Zoom. The last procedure is designed to solve the scalability problem, which arises when the elastic map adapts both to the area where data points are concentrated and to isolated points of the data cloud, and this complicates visual analysis. The essence of quasi-Zoom is that, for finer adjustment, large clusters are selected in the studied data volume and elastic maps are built for the selected clusters separately, producing an effect similar to the zoom function of a modern camera. The results of applying these procedures to multidimensional data volumes of various origins are presented in [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1-4</xref>
        ].
      </p>
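      <p>The cluster-selection step of quasi-Zoom can be sketched as follows. This is only a minimal illustration under our own assumptions, not necessarily the selection rule used in the cited works: the function name quasi_zoom_select, the keep_fraction parameter and the rule "keep the points nearest to the coordinate-wise median" are all hypothetical, and the sample cloud is synthetic. The elastic map would then be rebuilt on the selected subset only.</p>
      <preformat>
```python
import math
import statistics


def quasi_zoom_select(points, keep_fraction=0.8):
    """Return sorted indices of the points nearest to the coordinate-wise median.

    Assumption: the dense region is approximated by the keep_fraction of
    points closest to the median; isolated points are left out.
    """
    center = [statistics.median(coord) for coord in zip(*points)]
    k = max(1, round(keep_fraction * len(points)))
    by_distance = sorted(
        range(len(points)), key=lambda i: math.dist(points[i], center)
    )
    return sorted(by_distance[:k])


# Synthetic cloud: nine closely spaced points and one isolated point.
cloud = [(0.01 * i, 0.01 * i) for i in range(9)] + [(100.0, 100.0)]
dense = quasi_zoom_select(cloud, keep_fraction=0.9)
zoomed = [cloud[i] for i in dense]  # the map would be rebuilt on this subset
```
      </preformat>
      <p>Applying the selection repeatedly to its own output would correspond to consecutive quasi-Zoom steps.</p>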
      <p>This approach is universal in the sense that it does not depend on the nature of the studied multidimensional data. This makes it possible to apply the approach and the developed procedures to the study of multidimensional medical data. This paper presents the results of applying elastic maps and the previously developed procedures to the visual analysis of multidimensional data volumes of medical origin.</p>
      <p>
        In most of the previous cases, we considered data sets that were specially prepared in advance. Here, for the first time, we took several publicly available medical data sets [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
Some results were previously presented in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Elastic maps approach</title>
      <p>
        The concepts and algorithms for constructing elastic maps are described in detail in [
        <xref ref-type="bibr" rid="ref18 ref8 ref9">8, 9, 18</xref>
        ].
      </p>
      <p>An elastic map is a system of elastic springs embedded in a multidimensional data space. The approach is based on an analogy with problems of mechanics: the principal manifold passing through the "middle" of the data can be represented as an elastic membrane or plate. The method of elastic maps is formulated as an optimization problem, in which a given functional of the relative position of the map and the data is minimized.</p>
      <p>
        According to [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], the basis for constructing an elastic map
is a two-dimensional rectangular grid
G embedded in a
multidimensional space that approximates the data and has
adjustable elastic properties with respect to stretching and
bending. The location of the grid nodes is sought as a result of
solving the optimization problem for finding the minimum of the
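functional:
        <disp-formula><tex-math>D = \frac{D_1}{|X|} + \lambda \frac{D_2}{m} + \mu \frac{D_3}{m} \rightarrow \min,</tex-math></disp-formula>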
where │X│ is the number of points in the multidimensional data
volume X; m is the number of grid nodes, λ and μ are the elastic
coefficients responsible for the stretching and curvature of the
mesh. Here D1, D2, D3 are the terms responsible for the properties
of the grid. The term D1 is a measure of the proximity of the grid
nodes to the data. The term D2 represents the measure of the
stretching of the grid. The term D3 represents the measure of the
curvature of the grid.
      </p>
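      <p>To make the roles of the three terms concrete, the functional can be evaluated for a given grid as in the sketch below. The explicit forms used here are an assumption, since only the meaning of the terms is given above: D1 is taken as the sum of squared distances from each data point to its nearest node, D2 as the sum of squared edge lengths, and D3 as the sum of squared second differences along grid lines. The terms are normalized by the number of points and the number of nodes as described above. The function names are hypothetical and the sample grid is synthetic.</p>
      <preformat>
```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def second_diff_sq(a, b, c):
    """Squared norm of a - 2*b + c for three consecutive nodes."""
    return sum((x - 2 * y + z) ** 2 for x, y, z in zip(a, b, c))


def elastic_map_energy(data, grid, lam, mu):
    """Evaluate the functional for data points and a rows x cols grid of nodes."""
    rows, cols = len(grid), len(grid[0])
    nodes = [node for row in grid for node in row]
    # D1: each data point contributes the squared distance to its nearest node.
    d1 = sum(min(sq_dist(x, y) for y in nodes) for x in data)
    # D2: squared lengths of all edges between neighbouring nodes (stretching).
    d2 = sum(sq_dist(grid[i][j], grid[i + 1][j])
             for i in range(rows - 1) for j in range(cols))
    d2 += sum(sq_dist(grid[i][j], grid[i][j + 1])
              for i in range(rows) for j in range(cols - 1))
    # D3: squared second differences along both grid directions (bending).
    d3 = sum(second_diff_sq(grid[i][j], grid[i + 1][j], grid[i + 2][j])
             for i in range(rows - 2) for j in range(cols))
    d3 += sum(second_diff_sq(grid[i][j], grid[i][j + 1], grid[i][j + 2])
              for i in range(rows) for j in range(cols - 2))
    return d1 / len(data) + lam * d2 / len(nodes) + mu * d3 / len(nodes)


# Synthetic check: a flat 3 x 3 unit grid in 3D with data placed on the nodes.
flat_grid = [[(float(i), float(j), 0.0) for j in range(3)] for i in range(3)]
points = [node for row in flat_grid for node in row]
energy = elastic_map_energy(points, flat_grid, lam=0.9, mu=1.0)
```
      </preformat>
      <p>For this flat, evenly spaced grid the proximity and bending terms vanish, so only the stretching term contributes. Decreasing lam and mu lowers the penalty on stretching and bending, which is what allows a softer map to follow the data more closely.</p>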
      <p>
        The author of the approach [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] has developed the software
package [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ],
which
allows the
construction
and
visual
presentation of elastic maps. The main functional features of this
software are described in detail in [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. The figures below in this
article are created by means of this software package.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Procedures for visual analysis</title>
      <p>
        Previously, a number of procedures for processing multidimensional data were developed, which made it possible to improve the results of visual analysis. These procedures include preliminary filtering of the data, which weeds out points with indistinctly defined values, removal of separated clusters (flotation), and quasi-Zoom. Below we briefly give examples of applying these procedures to multidimensional data volumes of different origin.
      </p>
      <p>
        An example of constructing elastic maps for a multidimensional data volume representing the characteristics of mineral resources, namely three types of coal from Polish deposits [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],
is given in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The multidimensional data are points in the feature space of coal sample characteristics, and the data set covers three grades of coal. The task was to classify the coal by grade. By combining the construction of elastic maps with the removal of fuzzy points and separated classes (filtering and flotation of data), it is possible to completely separate the samples in the initial volume into three classes corresponding to the three types of coal.
      </p>
      <p>
        Examples of the use of quasi-Zoom for analyzing the thematic proximity of words in the Russian language are given in [
        <xref ref-type="bibr" rid="ref1 ref2 ref4">1,
2, 4</xref>
        ]. The proposed method is based on analyzing the context in which words occur. The main hypothesis is that similar words should occur in approximately the same context. In the attribute space they will therefore lie relatively close to each other, while dissimilar words will lie farther apart. Texts from news sources (news feeds for a certain period) were used as test data. For the primary tests, about 100 verbs and the 353 nouns associated with them were selected. The resulting data were treated as a multidimensional data volume of 100 points in 353-dimensional space. The numerical values of the resulting matrix are the frequencies of joint use. The data volume contained a region of high data density and points far from this region. In studying the frequency of joint use of verbs and nouns, the practical task was to separate the "stuck together" points. Filtering followed by two consecutive quasi-Zoom procedures solved this problem completely (Fig. 1).
      </p>
      <p>Fig. 1. Extension of the elastic map after two consecutive quasi-Zoom applications.</p>
      <p>Applying a similar approach to the transposed data file allowed us to identify a number of semantic clusters among the set of nouns (Fig. 2). This opens up additional opportunities for specialists in this field to analyze and interpret semantic groups.</p>
      <p>Fig. 2. Extension of the elastic map for the transposed data set after applying quasi-Zoom.</p>
      <p>
        The construction of elastic maps was also applied to the study of multidimensional arrays of errors of different solvers relative to a reference solution [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. We considered numerical results comparing the accuracy of various solvers of the OpenFOAM software package on the well-known problem of inviscid flow around a cone at zero angle of attack. The results obtained with the OpenFOAM solvers were compared with the well-known numerical solution of this problem while varying the free-stream Mach number and the cone angle. Four solvers of the OpenFOAM software package were compared: rhoCentralFoam, pisoCentralFoam, sonicFoam and rhoPimpleFoam. All these solvers have different approximation and computational properties. Figure 3 shows the elastic map for pressure, obtained from the parametric calculations, in the space of the first principal components. The yellow circles show the results for the rhoCentralFoam solver, the red ones for pisoCentralFoam, the green ones for sonicFoam and the blue ones for rhoPimpleFoam.
      </p>
      <p>The results of the visual analysis showed that the errors for
rhoCentralFoam and for pisoCentralFoam can be roughly
approximated by a plane reflecting the dependence of the error
on the Mach number and the cone angle.</p>
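      <p>A plane approximation of this kind can be obtained by ordinary least squares. The sketch below is a generic illustration rather than the procedure used in the cited work: the function names solve_linear and fit_plane are hypothetical, and the sample values are synthetic, not actual solver errors. It fits err ~ a*mach + b*angle + c by solving the normal equations.</p>
      <preformat>
```python
def solve_linear(a, b):
    """Solve a small dense system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def fit_plane(mach, angle, err):
    """Least-squares coefficients [a, b, c] of err ~ a*mach + b*angle + c."""
    rows = [(float(u), float(v), 1.0) for u, v in zip(mach, angle)]
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    atb = [sum(r[i] * e for r, e in zip(rows, err)) for i in range(3)]
    return solve_linear(ata, atb)


# Synthetic example: errors that lie exactly on a plane.
mach = [2, 3, 4, 2, 3, 4]
angle = [10, 10, 10, 20, 20, 20]
err = [0.5 * u + 0.1 * v + 1.0 for u, v in zip(mach, angle)]
a, b, c = fit_plane(mach, angle, err)
```
      </preformat>
      <p>The fit requires at least three parameter combinations that do not lie on a single line in the (Mach number, cone angle) plane; otherwise the normal equations are singular.</p>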
    </sec>
    <sec id="sec-4">
      <title>4. Processing of medical datasets</title>
      <p>
        An attempt to apply elastic maps to medical data was made in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. For this purpose the data from [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] were used. This data set contains values of six biomechanical features used to classify orthopaedic patients into 2 classes (normal or abnormal). Each patient is represented in the data set by six biomechanical attributes derived from the shape and orientation of the pelvis and lumbar spine (in this order): pelvic incidence, pelvic tilt, lumbar lordosis angle, sacral slope, pelvic radius and grade of spondylolisthesis. The data set contains 310 points in 6-dimensional space. Unfortunately, elastic maps did not give good results from the point of view of classification.
      </p>
      <p>
        Below are the results for three other multidimensional data volumes that involve solving a classification problem. All data sets were taken from the UCI Machine Learning Repository [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
      <p>
        The first data set concerns the variability of impedivity in normal and pathological breast tissue [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and the classification of various types of disease [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. This data set contains 106 points in a 9-dimensional attribute space. Each point also has a class attribute corresponding to the type of disease: carcinoma, fibro-adenoma, mastopathy, glandular, connective, adipose. According to [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], the dataset can be used for predicting
the classification of either the original 6 classes or of 4 classes by
merging together the fibro-adenoma, mastopathy and glandular
classes whose discrimination is not important (they cannot be
accurately discriminated anyway).
      </p>
      <p>In what follows, we use the following notation and color scheme for the classes studied: car (carcinoma) - red, adi (adipose) - yellow, con (connective) - green, fad + (fibro-adenoma + mastopathy + glandular) - blue. We use the combined fad + class because, as the authors of the data set remark, these classes cannot be separated accurately.</p>
      <p>The construction of elastic maps for this data volume is illustrated below. Figure 4 shows the source data in the space of the first three principal components. Figures 5 and 6 show the elastic map and its extension for this data volume.</p>
      <p>The figures show that the (car + fad +) and (con + adi) pairs of classes are well separated. However, within each pair, data from different classes are mixed. To improve the separation, we apply flotation and remove fad +. The results of building an elastic map for this case are shown in Figure 7. In this case, the car class was fully distinguished.</p>
      <p>We now remove the car class and consider the remaining pair of classes, con and adi, separately. After constructing the elastic map and its extension, we obtain the picture presented in Figure 8. In this case, a satisfactory separation of classes was achieved.</p>
      <p>
        Next, we consider the pair of classes car and fad + together. Figure 9 presents the extension of the elastic map for these classes. Here, too, the separation is satisfactory. Using quasi-Zoom to improve the separation in the center of the picture did not succeed. The attempt to divide the mixed fad + class into the fad, mas and gla classes was also unsuccessful. The comment in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] about the inseparability of these
classes turned out to be true.
      </p>
      <p>
        The second data set also concerns the prediction of breast disease [
        <xref ref-type="bibr" rid="ref12 ref7">7, 12</xref>
        ]. The data set contains 116 points in a 10-dimensional attribute space. Each point also carries a binary variable indicating the presence or absence of the disease. The attribute space contains ten predictors. According to
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], the predictors are anthropometric data and parameters
which can be gathered in routine blood analysis.
      </p>
      <p>Prediction models based on these predictors, if accurate, can
potentially be used as a biomarker of breast cancer.</p>
      <p>An elastic map was constructed for this data volume. Points corresponding to the absence of the disease are shown in green, and points corresponding to its presence are shown in red.</p>
      <p>Figures 10 and 11 show the constructed elastic map and its extension. As one can see, the green and red dots are strongly mixed. This was puzzling, since by construction points that are close in this picture have to be close to each other in the multidimensional attribute space.</p>
      <p>
        However, in the original article [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] a picture was given from which it was possible to conclude that only 4 parameters (Glucose, Insulin, Resistin and HOMA, the homeostasis model assessment) differ significantly between patients and healthy people. Only these 4 dimensions of the data space were retained, and the elastic map was constructed again. The results are shown in Fig. 12. The separation between the green and red dots has improved significantly; however, an area where the dots are mixed remains in the center of the picture.
      </p>
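      <p>Restricting the data space to a subset of attributes is a simple projection, sketched below. The function name keep_columns, the column order and the numeric rows are hypothetical placeholders rather than values from the data set; only the four retained attribute names come from the text above.</p>
      <preformat>
```python
def keep_columns(rows, names, wanted):
    """Project each row onto the named attributes, in the order given by wanted."""
    idx = [names.index(name) for name in wanted]
    return [[row[i] for i in idx] for row in rows]


# Hypothetical column order and placeholder values, for illustration only.
names = ["Age", "BMI", "Glucose", "Insulin", "HOMA", "Resistin"]
rows = [
    [0.0, 1.0, 2.0, 3.0, 4.0, 5.0],
    [6.0, 7.0, 8.0, 9.0, 10.0, 11.0],
]
reduced = keep_columns(rows, names, ["Glucose", "Insulin", "Resistin", "HOMA"])
```
      </preformat>
      <p>The reduced rows, rather than the full attribute vectors, would then be passed to the elastic map construction.</p>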
      <p>
        The third data set concerns the early diagnosis of Autistic Spectrum Disorder (ASD) [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. The data set consists of 692 points originally defined in a 21-dimensional attribute space. The diagnostic approach is based on the analysis of a questionnaire consisting of 10 questions. About half of the attributes are patient data. Therefore, it was decided to keep 12 attributes: the 10 answers to the questionnaire, the age of the patient and the total questionnaire score. The results are presented in Figures 13 and 14 in the form of an elastic map and its extension.
      </p>
      <p>Fig. 13. Elastic map for 12-dimensional attribute space when
diagnosing ASD.</p>
      <p>Fig. 14. Extension of elastic map for 12-dimensional attribute
space when diagnosing ASD.</p>
      <p>These results show that, on the studied data set, points with and without an ASD diagnosis are separated quite satisfactorily.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>To analyze structures in multidimensional data volumes, we use elastic maps, which map points of the original multidimensional space onto embedded manifolds of lower dimension. A number of data processing techniques that can improve the results are considered: pre-filtering of data, removal of separated clusters (flotation) and quasi-Zoom. Examples of constructing elastic maps and using these procedures for multidimensional data of medical origin are given. The results show that elastic maps, together with the accompanying data processing procedures, can serve as a useful tool for visual data analysis and complement other methods for studying multidimensional data volumes.</p>
      <p>However, the results also show that processing medical data from open sources raises a new problem. The data considered are clearly overloaded with unnecessary measurements and unnecessary information. This makes the data “noisy” and hinders the separation of classes. To overcome this problem, we plan to implement an additional procedure that analyzes the contribution of each measurement to the total variance and then removes the unnecessary attributes.</p>
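      <p>One simple reading of this planned procedure is sketched below. It is an assumption about future work, not an implemented method: the function names variance_shares and drop_low_variance and the min_share threshold are hypothetical, the sample rows are synthetic, and the attributes are assumed to be on comparable scales, since otherwise the variance shares mostly reflect units of measurement.</p>
      <preformat>
```python
import statistics


def variance_shares(rows):
    """Share of the total variance carried by each attribute (column)."""
    variances = [statistics.pvariance(col) for col in zip(*rows)]
    total = sum(variances)
    return [v / total for v in variances]


def drop_low_variance(rows, min_share):
    """Keep only the attributes whose variance share is at least min_share."""
    shares = variance_shares(rows)
    keep = [i for i, share in enumerate(shares) if share >= min_share]
    return [[row[i] for i in keep] for row in rows], keep


# Synthetic rows: the second attribute barely varies compared to the first.
rows = [[0.0, 0.0], [10.0, 0.1], [20.0, 0.0], [30.0, 0.1]]
reduced, kept = drop_low_variance(rows, min_share=0.01)
```
      </preformat>
      <p>An attribute with a small variance share is not necessarily uninformative for classification, so a threshold of this kind would need to be checked against the class labels.</p>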
    </sec>
    <sec id="sec-6">
      <title>6. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Bondarev</surname>
            ,
            <given-names>A.E.</given-names>
          </string-name>
          et al,
          <year>2016</year>
          .
          <article-title>Visual analysis of clusters for a multidimensional textual dataset</article-title>
          .
          <source>Scientific Visualization</source>
          .
          <volume>8</volume>
          (
          <issue>3</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Bondarev</surname>
            ,
            <given-names>A.E.</given-names>
          </string-name>
          ,
          <year>2017</year>
          .
          <article-title>Visual analysis and processing of clusters structures in multidimensional datasets</article-title>
          .
          <source>ISPRS Archives, XLII-2/W4</source>
          ,
          <fpage>151</fpage>
          -
          <lpage>154</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Bondarev</surname>
            ,
            <given-names>A. E.</given-names>
          </string-name>
          :
          <article-title>The procedures of visual analysis for multidimensional data volumes</article-title>
          ,
          <source>Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2/W12</source>
          ,
          <fpage>17</fpage>
          -
          <lpage>21</lpage>
          , doi.org/10.5194/isprs-archives-XLII-2-W12-17-2019.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Bondarev</surname>
            ,
            <given-names>A.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bondarenko</surname>
            ,
            <given-names>A.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galaktionov</surname>
            ,
            <given-names>V.A.</given-names>
          </string-name>
          ,
          <year>2018</year>
          .
          <article-title>Visual analysis procedures for multidimensional data</article-title>
          .
          <source>Scientific Visualization</source>
          <volume>10</volume>
          (
          <issue>4</issue>
          ),
          <fpage>109</fpage>
          -
          <lpage>122</lpage>
          , doi.org/10.26583/sv.10.4.09.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Bondarev</surname>
            ,
            <given-names>A.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galaktionov</surname>
            ,
            <given-names>V.A.</given-names>
          </string-name>
          ,
          <year>2015a</year>
          .
          <article-title>Analysis of Space-Time Structures Appearance for Non-Stationary CFD Problems</article-title>
          .
          <source>Procedia Computer Science</source>
          ,
          <volume>51</volume>
          ,
          <fpage>1801</fpage>
          -
          <lpage>1810</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Bondarev</surname>
            ,
            <given-names>A.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galaktionov</surname>
            ,
            <given-names>V.A.</given-names>
          </string-name>
          ,
          <year>2015b</year>
          .
          <article-title>Multidimensional data analysis and visualization for time-dependent CFD problems</article-title>
          .
          <source>Programming and Computer Software</source>
          ,
          <volume>41</volume>
          (
          <issue>5</issue>
          ),
          <fpage>247</fpage>
          -
          <lpage>252</lpage>
          , doi.org/10.1134/S0361768815050023.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Crisóstomo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          et al.,
          <year>2016</year>
          .
          <article-title>Hyperresistinemia and metabolic dysregulation: a risky crosstalk in obese breast cancer</article-title>
          .
          <source>Endocrine</source>
          ,
          <volume>53</volume>
          (
          <issue>2</issue>
          ),
          <fpage>433</fpage>
          -
          <lpage>442</lpage>
          , doi.org/10.1007/s12020-016-0893-x.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Gorban</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          et al,
          <year>2007</year>
          .
          <article-title>Principal Manifolds for Data Visualisation and Dimension Reduction</article-title>
          . Springer, Berlin - Heidelberg - New York.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Gorban</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zinovyev</surname>
            <given-names>A.</given-names>
          </string-name>
          ,
          <year>2010</year>
          .
          <article-title>Principal manifolds and graphs in practice: from molecular biology to dynamical systems</article-title>
          .
          <source>International Journal of Neural Systems</source>
          ,
          <volume>20</volume>
          (
          <issue>3</issue>
          ),
          <fpage>219</fpage>
          -
          <lpage>232</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Jossinet</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <year>1996</year>
          .
          <article-title>Variability of impedivity in normal and pathological breast tissue</article-title>
          .
          <source>Med. &amp; Biol. Eng. &amp; Comput</source>
          ,
          <volume>34</volume>
          ,
          <fpage>346</fpage>
          -
          <lpage>350</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Niedoba</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <year>2014</year>
          .
          <article-title>Multi-parameter data visualization by means of principal component analysis (PCA) in qualitative evaluation of various coal types</article-title>
          .
          <source>Physicochemical Problems of Mineral Processing</source>
          ,
          <volume>50</volume>
          (
          <issue>2</issue>
          ),
          <fpage>575</fpage>
          -
          <lpage>589</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Patrício</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          et al.,
          <year>2018</year>
          .
          <article-title>Using Resistin, glucose, age and BMI to predict the presence of breast cancer</article-title>
          .
          <source>BMC Cancer</source>
          ,
          <volume>18</volume>
          (
          <issue>1</issue>
          ), doi.org/10.1186/s12885-017-3877-1.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Rocha Neto</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barreto</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <year>2009</year>
          .
          <article-title>On the Application of Ensembles of Classifiers to the Diagnosis of Pathologies of the Vertebral Column: A Comparative Analysis</article-title>
          ,
          <source>IEEE Latin America Transactions</source>
          ,
          <volume>7</volume>
          (
          <issue>4</issue>
          ),
          <fpage>487</fpage>
          -
          <lpage>496</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>J.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marques de Sá</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jossinet</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <year>2000</year>
          .
          <article-title>Classification of Breast Tissue by Electrical Impedance Spectroscopy</article-title>
          .
          <source>Med &amp; Bio Eng &amp; Computing</source>
          ,
          <volume>38</volume>
          ,
          <fpage>26</fpage>
          -
          <lpage>30</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Thabtah</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <year>2017</year>
          .
          <article-title>Machine learning in autistic spectrum disorder behavioral research: A review and ways forward</article-title>
          .
          <source>Informatics for Health and Social Care</source>
          , doi.org/10.1080/17538157.2017.1399132.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <source>UCI Machine Learning Repository</source>
          ,
          <year>2019</year>
          . archive.ics.uci.edu/ml/ (01 March 2019).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <source>ViDaExpert</source>
          ,
          <year>2019</year>
          . bioinfo.curie.fr/projects/vidaexpert (01 March 2019).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <surname>Zinovyev</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <year>2000</year>
          .
          <article-title>Vizualizacija mnogomernyh dannyh [Visualization of multidimensional data]</article-title>
          . Krasnoyarsk, publ. NGTU. 180 p. [In Russian].
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>