<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Application of Methods of Machine Learning for the Recognition of Mathematical Expressions</article-title>
      </title-group>
      <fpage>0000</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>The article describes the study of the peculiarities of presentation of mathematical methods, as well as methods and algorithms for their recognition. The possibility of simultaneous execution of structural analysis and character classification is investigated. The process of classification of the symbols and construction of the corresponding system, based on methods of machine learning, is described. For the initial initialization of the symbol classification process, a segmented binary image passes a "rough" classification by the Bayesian Network. Classification using contexts is processed by artificial Neural Networks. The system being developed is a multi-classifier. Five different classifiers work to get the optimal result.</p>
      </abstract>
      <kwd-group>
        <kwd>classification</kwd>
        <kwd>classifier</kwd>
        <kwd>symbol</kwd>
        <kwd>structure</kwd>
        <kwd>mathematical expression</kwd>
        <kwd>machine learning</kwd>
        <kwd>Bayesian Inference</kwd>
        <kwd>Neural Networks</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Computer vision is the theory and technology of developing systems that can find,
track, classify and identify objects by extracting data from images and analyzing
received information [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        The purpose of Computer Vision and Pattern Recognition (MVPR) is to develop
useful applications, especially through the use of processing and analysis of digital
images. Computer vision is used to recognize objects, video analytics, description of
image and video content, gesture recognition and handwriting, as well as intelligent image
processing [
        <xref ref-type="bibr" rid="ref2 ref3 ref4 ref5">2-5</xref>
        ]. Statistical data uses statistical methods and uses models that are
constructed using geometry, physics and theory of learning.
      </p>
      <p>Today, computer vision is at the peak of its development. The speed of modern
digital devices and the possibility of parallel computing provide the ability to implement
many algorithms for working with digital image libraries.</p>
    </sec>
    <sec id="sec-2">
      <title>Analysis of recent research and publications</title>
      <p>
        Optical character recognition (OCR) is a mechanical or electronic transfer of
handwritten, typewritten or printed text into sequences of codes used for presentation in a text
editor [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Recognition is widely used for converting books and documents into
electronic form, automating accounting systems in business or publishing text on the
Internet. Optical recognition of images containing text is a widely studied problem at the
interface between the field of artificial intelligence and computer vision.
      </p>
      <p>Modern hardware and software systems allow automate large volumes of data into
a computer, using, for example, a network scanner and parallel text recognition on
multiple computers simultaneously. The most popular OCR systems are ABBYY
FineReader, SimpleOCR, FreeOCR, Microsoft Office Document Imaging, and more.</p>
      <p>The most difficult problems associated with the recognition of handwritten and
printed characters are a variety of forms and ways of representing characters; distorting
character images; variations in the size and scale of symbols (Tabl. 1).</p>
      <p>
        Some methods for recognizing expressions are based only on spatial measurements
such as baselines [
        <xref ref-type="bibr" rid="ref6 ref7">6, 7</xref>
        ]. Other methods use rules-based systems and analyze the
expression for its interpretation [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. In several algorithms, knowledge of mathematical
symbols and operators and their spatial properties are taken into account [
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref14 ref15 ref16 ref9">9-16</xref>
        ].
      </p>
      <p>
        In Zanibby's works, the baselines that are in expressions [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] are analyzed. In
particular, the dominant baseline is considered, which is the line on which the expression will
be written and, for example, the embedded baselines that correspond to the indices.
During the first step, a tree is constructed based on these baselines. Knowledge about
the properties of mathematical notation for some tree transformations is also used. The
resulting tree represents the content of the equation. In further research, Zanibby
improved the recognition of indices and indexes using fuzzy regions [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. This was
motivated by the fact that most ambiguities in handwritten mathematical expressions refer
to variants of index / line and line / upper index. In addition, the use of fuzzy logic
makes it possible to return a ranked list of interpretations.
      </p>
      <p>
        Tapia and Rojas first receive baselines and recursively build a mini-tree, in which
each node is a symbol [
        <xref ref-type="bibr" rid="ref6 ref8">6, 8</xref>
        ]. In addition to baseline analysis, using a graph to represent
the expression, they construct a minimal bias tree. Then a syntactic and semantic
analysis is performed, using rules based on the features of the operator. Suzuki uses a
network of virtual links [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Ray and Kim presented a method for conducting an effective
search for structural analysis recognition [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Miller and Viola retain ambiguity during
the character recognition stage [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. They then calculate the probability that each
character belongs to a certain class (small letter, number, binary operator, etc.), as well as
the probability of being an index, upper index or linear expression, according to
character recognition and some location properties. Chen performs both recognition and
understanding of the formula [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Aval tried to simultaneously optimize segmentation
and character recognition and structure for handwritten expressions [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Wang and
Fore do not use any information about the character. According to the relative height
of the two characters, they build the distribution of probabilities for bindings (index /
string / upper index) between the symbols according to their relative vertical
arrangement [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. They also investigated the segmentation of manuscript forms based on
human visual perception of a mathematical expression. Ali for the correct recognition of
indexes and add-ins uses normalized bounding rectangles as the main feature of a
character [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ]. A virtual remote element is added before the interconnection is
recognized. They proved that with normalized restrictive rectangles, along with the special
processing of the wrong characters, they can effectively recognize the connection using
the Bayesian classifier.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Features of recognition of mathematical expressions</title>
      <p>Available OCR systems are high-quality products in their field of application.
However, the specificity of recognizing mathematical expressions requires more specialized
software.</p>
      <p>Character recognition is performed by classical methods of OCR, for example, using
methods of reference vectors, coincidence with patterns. The analysis of the structure
is mainly carried out with the help of geometric considerations, which are grounded on
implicit rules or grammatical rules. Uncertainty in mathematical expressions, especially
in manuscripts, is generally accepted. This may be uncertainty about the meaning of a
symbol or structure. Despite the fact that artificial intelligence is used in the recognition
of a structure with fuzzy logic or search algorithms, machine learning is not sufficiently
used in the analysis of the structure. The development of science generates new
mathematical notation. They may not be identified and break the structural analysis. There
is a need for the ground-based application of machine learning methods for recognizing
mathematical expressions.</p>
      <p>The purpose of the study is to develop an intelligent recognition system for
mathematical expressions based on machine learning, where character classification and
structure analysis are separate tasks. An intelligent intelligence engineer should
recognize mathematical expressions in two dimensional binary images and submit them in
Latex format.</p>
      <p>Mathematical expressions can be presented in a format that a person reads easily, or
as a two-dimensional graph. They can also be submitted for use by computers. The
presentation may be different from the reverse Polish record used in pocket calculators
of the 1980s, to tree-like structures in some symbolic computing systems.</p>
      <p>A mathematical expression is not just random symbols. They have a well-organized
structure that is subject to the rules of the system of mathematical notation. The
arrangement of two symbols relative to each other has a certain content.</p>
      <p>The usual order of writing оf reading for mathematical expressions is left-to-right.
Therefore, understanding the mathematical expression is not completely
two-dimensional. Interlinked characters are usually located next to each other. However, reading
a mathematical expression is not straightforward, since different characters are usually
read in different ways.</p>
      <p>The main differences between plain text and mathematical expressions of the field
are that mathematical formulas use many more symbols and have many types of
connections. This justifies two main problems in the recognition of mathematical
expressions, namely: the number of types of symbols and the context of their application;
types of spatial relationships (upper index (or upper right), lower index (or lower right),
in the same line, top, bottom, inside).</p>
      <p>Mathematical expressions, as a rule, can be considered as embedded structures,
especially because of the presence and properties of spatial relationships. Since the
formulas are generally written in a line called the baseline, the embedded structure implies
the presence of embedded baselines.</p>
      <p>The most intuitive way to represent mathematical expressions is graphical. It is a
printed or handwritten two-dimensional structure with symbols of various sizes and
positions. This way of presenting is user friendly for reading and understanding the
expression. There are other forms and are suitable for entering expressions into a
computer. Simple formulas can be written on one line. To fix the logical content of the
expression, it can be represented as a tree. There is also MathML - XML format. It
captures the embedded properties of the mathematical expressions: Presentation
MathML focuses on the spatial representation of the expression; Content MathML is a
text translation of a tree view.</p>
      <p>Recognition of mathematical formulas is a task where an image representing a
mathematical expression is interpreted by the computer so that it can be stored, interpreted
and reused. Recognition of mathematical formulas consists of two tasks (see Fig.1):
• character recognition: each pixel of the foreground in the image fills the character
and each symbol is in the expression and transmits some information;
• structure recognition: the two-dimensional layout of the expression is subject to
some rules and each scheme corresponds to a certain value.
Character recognition is a procedure by which each symbol is recognized and
classified. This is a difficult task because of the large number of sim-wolves. There is no
dictionary, as for the recognition of the text. One and the same sim-wave may appear
in different contexts, and it is important to distinguish, for example, symbol summation
Σ and the Greek letter Σ. Some different characters have the same form, for example, p
and P. Even more problems arise when it comes to hand-written expressions. To solve
the problem of character recognition, neural networks or the methods of reference
vectors are used.</p>
      <p>The complexity of the task of recognizing the structure depends on the level of
interpretation. Recognition of the expression structure is a collection of location
analysis and interpretation of the representations of symbols and interconnections.</p>
      <p>The structure of mathematical formulas looks quite simply to execute its recognition
without the symbol value. There are several reasons for recognizing sim-wolves before
recognizing the structure. First, the value of the characters is a huge limitation to the
possible structure. However, symbols do not completely determine the location. The
rules of their association are well structured. For example, the top index will never be
found under its parent symbol. It is always located in the upper right corner. The main
component in the coupling of the structure is not the characters, but their positions and
sizes. The range of symbols and rules used to write mathematical expressions is not
fixed. Common symbols and structural rules are just a subset, perhaps an infinite
number, since characters and their new meanings can be invented at any time.</p>
      <p>Spatial links between symbols are well defined, in limited quantities (index, upper
index, etc.), but relationships can appear in a context in which they usually do not occur.
Consequently, the result of the communication recognition should not affect the value
or class of the symbol.</p>
      <p>Character identification is not necessarily required for the recognition of the
structure. Characters can be classified using only their restrictive rectangle and context.</p>
      <p>The range of symbols and rules used to write mathematical expressions is not fixed.
The symbol context is the information about the symbol itself (for example, a restrictive
rectangle, a symbol class), as well as its parent and child symbols. Important features
are their relative size and position, as well as the relationship between them. The
expression from the arrangement of characters will be recognized. To do this, the symbols
will be reduced to their bounding rectangles.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Methods and Technologies for the Recognition of</title>
    </sec>
    <sec id="sec-5">
      <title>Mathematical Expressions</title>
      <p>Mathematical expressions with limited complexity will be concentrated on, namely:
• zero order: this is only a one-dimensional sequence of characters; it does not contain
indices, upper indices, etc., for example,  +  −  ; ∑ ×  ;
• first order: an expression that contains one level of nested structures; when the
indices and upper indices are expressions of zero order, for example,   +   +1;

∑ =0   +1 +  ;
• n-th order: expressions in which the embedded structures have order  − 1.
The expressions of the zero order look simpler, because they are one-dimensional,
therefore, it is a typical OCR task. Only spatial functions will be considered, so the
amount of information in the expression of zero order is small. First-order expressions
can be quite complex, since it is necessary, for example, to identify an expression as an
index. We define the intermediate order. Expression of the 0.5 order is an expression
in which the embedded expressions are separate characters. Example,   +   this is
an expression of 0.5 order, whereas   + - no.</p>
      <p>The input format is the image of a handwritten mathematical expression. It is
necessary to develop a method for recognizing expressions of 0.5 order, as well as simple
expressions of 1 and 1.5 orders, in order to check the reliability and ability of the system
to adapt to more complex situations.</p>
      <p>It is not only needed to recognize the structure, but also try to find a character class
using this structure. A high order means a lot of context that should simplify the
classification. However, when the expressions become more complex, the recognition of
the structure is also complicated. It is important to make a compromise between
problems that arise from the complexity and necessity of a context.</p>
      <p>During the recognition, the format of the data changes. At the input, binary image
is obtained, and the output must get the interpretation of the expression. As a system
input, a binary (binary) image is selected. The segmentation algorithm reads the image
and obtains related components. From the found components only restrictive rectangles
are stored. This is the easiest way to present a layout of the expression. Each element
in the list of bounding rectangles has the form  
right, upper and lower limits of each rectangle.</p>
      <p>The list of bounding rectangles is used to create a representation of the expression,
which is a list of characters. The essence of the "symbol" represents a symbol without
context. It is created from the coordinates of the restrictive rectangular. The essence
contains information about the size and position of the character. The essence of the
"context" is a symbol with its context. It can be associated with other characters, such
,  
,  
,  
- this is the left,
as a parent symbol or its child elements. It also contains a connection that has a
character with its parent element, and the probability value for the symbol class and
relationships. The essence of the "expression" consists of a list of characters. Creating an
expression creates the "character" object for each bounding rectangle in the list and
creates a "context" for each character. All contexts are stored in a list that is an
"expression".</p>
      <p>To find the relationship between characters and define character classes, the
classification is performed. This can be seen as bundling characters together and adding
information to an existing structure. A tree is created from the initial list. Each node
corresponds to the context. Information contained in the node: the corresponding
character; connection with the parent element; regions where child elements should be found;
the distribution of probability values over the possible classes of characters for each
classifier; the distribution of probability values for possible relationships with the
parent element. Each subsidiary node corresponds to the child element of the symbol
represented by that node. There is also a feedback link to access the parent's character.</p>
      <p>To classify characters, several technologies of machine learning are used. Bayesian
Inference carries a rough classification. Classification with the use of context is handled
by artificial Neural Networks. Different classifiers work separately but are used
together. They are tied to return the optimal result.
Classifier of symbols classifies characters in one of four classes: "small", "upper",
"under", "variable range". Only the bounding rectangles are considered and the value of
the probability of the character of the symbol for each class is returned. For
classification of characters, their context is considered (see Fig. 2).</p>
      <p>The system being developed is a multiclassifier that adapts to each character, taking
into account its context. Each classifier returns a set of probability values.</p>
      <p>Classifier based on child elements. The classifier is made up of five neural
networks, one for each child element. Four entries for each classifier: the child element
class; relative vertical position; relative horizontal position; size relative to the parent
element.</p>
      <p>The classifier based on parenting elements looks at the position and size of the
symbol relative to the parent. It also takes into account the parent element class and type of
connection (e.g., index). The classifier is also a neural network.</p>
      <p>A ratio classifier allows classifying a character regardless of its context, using only
information about the restriction rectangle. For a rough character classification, the
Bayesian system is used.
Classifier of relationships determines which is the most probable link between two
characters. The probability value for each class is returned. The components of the
classifier are the neural network, fuzzy areas and fuzzy baselines. The classifier consists of
three independent parts, each of which gives the value of the probability of
communication (see Fig. 3). The results are combined to give a final answer, which can then be
compared with the limits.</p>
      <p>The central part is the neural network (Neural Network). It is trained in data mining
to effectively identify the relationships between two given sim-wolves. Input:
• H - relative size of the child element relative to the parent: 
• D - relative vertical position of the child element: 
• V - relative horizontal position of the child element:  =
=   −</p>
      <p>;
ℎ


= 
ℎ
ℎ

;
 −
 
  , where the p
and c indices are "parent" and "child", h is height, w is width, y is the vertical center
of the restrictive rectangle, and xmin and xmax are the left and right boundaries.
The purpose of the Fuzzy Regions system is to help the neural network classify the
relationship, and also indicate when the characters have no links. Fuzzy areas are used
for all interconnections, except "embedded", where fuzzy baselines are used instead.
To evaluate the confidence that a child character is in a certain relation to a possible
parent symbol, we compute the membership value for the center of the left border of
the child element in the corresponding fuzzy field of the parent element.</p>
      <p>Fuzzy Baseline. Unlike other child elements, the built-in is not necessarily close to
its parent's character. This makes the use of regions more difficult. The built-in child
element is on the same line as its parent's character. The position of the base line of the
character mainly depends on its class.</p>
      <p>Since a flexible solution is been developed, the possible change of the line of writing
by considering fuzzy baselines is processed. For a couple of parent / child symbols first
the basic level of the parent element is computed, taking into account its class. Then,
the possible baseline lines of the child character are considered, and the probability
estimation is calculated based on the distance from the parent-baseline and the
probability of the child-class.
5</p>
    </sec>
    <sec id="sec-6">
      <title>Ingredients of the Intelligent System for the Recognition of</title>
    </sec>
    <sec id="sec-7">
      <title>Mathematical Expressions</title>
      <p>
        Various paradigms and technologies for working with information resources have been
analyzed for the project implementation [
        <xref ref-type="bibr" rid="ref17 ref18 ref19 ref20 ref21 ref22 ref23 ref24">17-24</xref>
        ]. An object-oriented approach to
designing an information system is chosen. The intelligent information system can be
divided into the following structural components:
• the main part, representing the expression and performing the classification,
• part of the input / output to avoid repeating the same things;
• graphical user interface.
      </p>
      <p>The prototype of the experimental implementation of the intellectual system uses the
frameworks (Weka and JLatexMath) and the database, which is implemented using MS
Access.
Using a graphical interface makes use of the system and visualization of the results
easier and more intuitive (see Fig. 4). Classes implementing the graphical interface built
using Web technologies.</p>
      <p>Testing of the system on a complete set of tasks was carried out. To ensure the speed
of recognition, and also given that the average number of symbols of the mathematical
expression is about 6, the variant of the algorithm with the optimal number of iterations
is selected – 7. Successfully classified 441 of 570 characters (percentage of correct
results – 77.36%).</p>
      <p>The results of the analysis showed that simple structure recognition can help classify
symbols. Characters can be properly classified in the presence of a sufficient context.
In the case of an incorrect classification of the character, it was determined that the
probability value for the correct class was also high enough. Recognition of the
structure was fast and meets the requirements. A general analysis of the performed tests
confirmed that the methods of machine learning allow recognizing the structure by
comparing the characters by two.
6</p>
    </sec>
    <sec id="sec-8">
      <title>Conclusions</title>
      <p>The paper analyzes existing methods and approaches to the recognition of mathematical
expressions. The possibility of simultaneous execution of structural analysis and
classification of characters, using little knowledge about the syntactic system of the
mathematical expression is investigated. The proposed approach is based on a mutual
limitation between symbols and structure. Knowledge of the symbol value helps to analyze
the structure, but the structure can help eliminate ambiguity in recognizing a character.</p>
      <p>Instead of defining the characters, they are classified. Classification can be
accomplished with the use of bounding rectangles of symbols and only the structure of the
expression. An iterative algorithm is developed for the use of reciprocal constraints
between the structure and type of characters. Although the classification of characters
consists in the classification of each symbol separately, the recognition of the structure
is a more complex task. Links between symbols must be found and identified. One-pass
algorithm is implemented, which contains a search with return ion, which provided a
quick recognition of the structure.</p>
      <p>Expression recognition returns the probability value for each character and link,
rather than a clear interpretation of the expression. Presenting results using probability
values makes it easy to use the system as part of the larger one that performs all
recognition. These probabilities are also used to determine estimates that give an idea of how
good the system is. A flexible, adaptive method is implemented that returns the value
of probability, and not a clear answer. A combination of neural networks is used to
classify links between two symbols and estimates based on fuzzy baselines and fuzzy
areas around the symbol.</p>
      <p>The project of the intellectual system implements an iterative algorithm based on the
methods of machine learning. A graphical user interface is created that allows using the
expression recognition system quickly and easily.
7</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Shapiro</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stockman</surname>
          </string-name>
          , G.:
          <article-title>Computer vision</article-title>
          . Washington University (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Veres</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rusyn</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sachenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rishnyak</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Choosing the method of finding similar images in the reverse search system</article-title>
          .
          <source>In: CEUR Workshop Proceedings</source>
          . vol.
          <volume>2136</volume>
          ,
          <source>Proc. of the Int. Conf. COLINS</source>
          , vol.
          <volume>1</volume>
          , pp.
          <fpage>99</fpage>
          -
          <lpage>107</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Rusyn</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lutsyk</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lysak</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lukeniuk</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pohreliuk</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Lossless Image Compression in the Remote Sensing Applications</article-title>
          .
          <source>In: Int. Conf. on Data Stream Mining &amp; Processing (DSMP)</source>
          ,
          <fpage>195</fpage>
          -
          <lpage>198</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Rashkevych</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peleshko</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vynokurova</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Izonin</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lotoshynska</surname>
          </string-name>
          , N.:
          <article-title>Single-frame image super-resolution based on singular square matrix operator</article-title>
          .
          <source>In: IEEE 1th Ukraine Conference on Electrical and Computer Engineering (UKRCON)</source>
          ,
          <fpage>944</fpage>
          -
          <lpage>948</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Veres</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rishnyak</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rishnyak</surname>
          </string-name>
          , H.:
          <article-title>Classification methods of text documents using ontology based approach</article-title>
          .
          <source>In: Advances in Intelligent Systems and Computing</source>
          , pp.
          <fpage>229</fpage>
          -
          <lpage>240</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Zanibbi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blostein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cordy</surname>
          </string-name>
          , J.:
          <article-title>Recognizing mathematical expressions using tree transformation</article-title>
          .
          <source>In: IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          <volume>24</volume>
          (
          <issue>11</issue>
          ), pp.
          <fpage>1455</fpage>
          -
          <lpage>1467</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blostein</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zanibbi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Using fuzzy logic to analyze superscript and subscript relations in handwritten mathematical expressions</article-title>
          . In: Eighth International Conference on
          <source>Document Analysis and Recognition (ICDAR'05)</source>
          , vol.
          <volume>8</volume>
          , pp.
          <fpage>972</fpage>
          -
          <lpage>976</lpage>
          . (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Tapia</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rojas</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Recognition of on-line handwritten mathematical expressions using a minimum spanning tree construction and symbol dominance</article-title>
          .
          <source>In: Graphics Recognition Algorithms and Applications (Lecture Notes in Computer Science)</source>
          . pp.
          <fpage>329</fpage>
          -
          <lpage>340</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Eto</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suzuki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Mathematical formula recognition using virtual link network</article-title>
          .
          <source>In: Proc. Sixth Int'l Conf</source>
          .
          <article-title>Document Analysis and Recognition (ICDAR</article-title>
          <year>2001</year>
          ), pp.
          <fpage>762</fpage>
          -
          <lpage>767</lpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Rhee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
          </string-name>
          , J.:
          <article-title>Efficient search strategy in structural analysis for handwritten mathematical expression recognition</article-title>
          .
          <source>Pattern Recognition</source>
          <volume>42</volume>
          (
          <issue>12</issue>
          )(
          <volume>12</volume>
          ), pp.
          <fpage>3192</fpage>
          -
          <lpage>3201</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Viola</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Ambiguity and constraint in mathematical expression recognition</article-title>
          .
          <source>In: Proc. 15th National Conf. on Artificial Intelligence (AAAI 98)</source>
          , pp.
          <fpage>784</fpage>
          -
          <lpage>791</lpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shimizu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Okada</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Fundamental study on structural understanding of mathematical expressions</article-title>
          .
          <source>Systems, Man, and Cybernetics 2</source>
          , pp.
          <fpage>910</fpage>
          -
          <lpage>914</lpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Awal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mouchere</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Viard-Gaudin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Towards handwritten mathematical expression recognition</article-title>
          .
          <source>In: 10th International Conference on Document Analysis and Recognition</source>
          ,
          <source>(ICDAR</source>
          <year>2009</year>
          ), pp.
          <fpage>1046</fpage>
          -
          <lpage>1050</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Faure</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Automatic perception of the structure of handwritten mathematical expressions</article-title>
          .
          <source>Computer Processing of Handwritting</source>
          , pp
          <fpage>337</fpage>
          -
          <lpage>361</lpage>
          (
          <year>1990</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Aly</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uchida</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suzuki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Identifying subscripts and superscripts in mathematical documents</article-title>
          .
          <source>Mathematics in Computer Science</source>
          <volume>2</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>195</fpage>
          -
          <lpage>209</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Aly</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uchida</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fujiyoshi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suzuki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Statistical classification of spatial relationships among mathematical symbols</article-title>
          .
          <source>In: 10th International Conference on Document Analysis and Recognition</source>
          , pp.
          <fpage>1350</fpage>
          -
          <lpage>1354</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Veres</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rishnyak</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rishnyak</surname>
          </string-name>
          , H.:
          <article-title>The Risk Management Modelling in Multi Project Environment</article-title>
          .
          <source>In: Computer Science and Information Technologies (CSIT2017)</source>
          , pp.
          <fpage>32</fpage>
          -
          <lpage>35</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Shakhovska</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolubash</surname>
            ,
            <given-names>Yu.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Veres</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Big Data Federated Repository Model</article-title>
          . In:
          <article-title>The Experience of Designing and Application of CAD Systems in Microelectronics (СADMS'</article-title>
          <year>2015</year>
          ), pp.
          <fpage>382</fpage>
          -
          <lpage>384</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Shakhovska</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Veres</surname>
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolubash</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bychkovska-Lipinska</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Data space architecture for Big Data managering</article-title>
          .
          <source>In: Computer Science and Information Technologies (CSIT2015)</source>
          , pp.
          <fpage>184</fpage>
          -
          <lpage>187</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Veres</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shakhovska</surname>
          </string-name>
          , N.:
          <article-title>Elements of the formal model big date</article-title>
          . In:
          <article-title>Perspective Technologies and Methods in MEMS Design (MEMSTECH'</article-title>
          <year>2015</year>
          ), pp.
          <fpage>81</fpage>
          -
          <lpage>83</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dosyn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Method for ontology content and structure optimization, provided by a weighted conceptual graph</article-title>
          ,
          <source>Webology</source>
          ,
          <volume>15</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>66</fpage>
          -
          <lpage>85</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peleshchak</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peleshchak</surname>
          </string-name>
          , R.:
          <article-title>Satellite spectral information recognition based on the synthesis of modified dynamic neural networks and holographic data processing techniques</article-title>
          ,
          <source>2018 IEEE 13th International Scientific and Technical Conference on Computer Sciences and Information Technologies</source>
          , CSIT,
          <fpage>330</fpage>
          -
          <lpage>334</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dosyn</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sachenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Smart Data Integration by Goal Driven Ontology Learning</article-title>
          .
          <source>In: Advances in Big Data. Advances in Intelligent Systems and Computing</source>
          . - Springer International Publishing AG 2017. P.
          <volume>283</volume>
          -
          <fpage>292</fpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Su</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vysotska</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sachenko</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lytvyn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burov</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Information resources processing using linguistic analysis of textual content</article-title>
          .
          <source>In: Intelligent Data Acquisition and Advanced Computing Systems Technology and Applications</source>
          , Romania,
          <fpage>573</fpage>
          -
          <lpage>578</lpage>
          , (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>