<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Method for Documents Rubrication and Analysis Based on Fuzzy Relations of Difference between Their Syntactical Characteristics</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">V</forename><surname>Borisov</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">The Branch of National Research University &quot;Moscow Power Engineering Institute&quot;</orgName>
								<address>
									<settlement>Smolensk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">M</forename><surname>Dli</surname></persName>
							<email>midli@imail.ru</email>
							<affiliation key="aff0">
								<orgName type="institution">The Branch of National Research University &quot;Moscow Power Engineering Institute&quot;</orgName>
								<address>
									<settlement>Smolensk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">P</forename><surname>Kozlov</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">The Branch of National Research University &quot;Moscow Power Engineering Institute&quot;</orgName>
								<address>
									<settlement>Smolensk</settlement>
									<country key="RU">Russia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Method for Documents Rubrication and Analysis Based on Fuzzy Relations of Difference between Their Syntactical Characteristics</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">F238E7569833B303FF0D65BFDE0D578D</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T22:04+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The paper states the formulation and proposes a method for rubrication and analysis of electronic nonstructural documents. The application of the proposed method results in forming a tree structure of a rubric field based on fuzzy relations of difference between syntactical characteristics of rubricated documents. The documents analysis is based on the determination of the fuzzy correspondence for these documents according to syntactical characteristics with the values of the centers for the detected clusters sequentially from the root to the leaves of the built fuzzy decision tree. The conducted computational experiments have shown that the proposed method allows reducing the number of erroneously rubricated documents (in comparison with probabilistic and neural network methods)</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The program "Electronic government" suggests the dynamic introduction of information and communication technologies in the activities of public authorities. The main program goal is to increase the efficiency of public administration and to develop partnerships with civil society and business.</p><p>A key task of program implementation is to develop Internet services, which provide information support and a variety of services in electronic form. Their use can improve the quality and accessibility of state and municipal services to citizens and businesses, reduce the cost of their provision and increase the labor productivity in institutions of government at various levels.</p><p>One of the ways to use information and communication technology to solve this task is to automate the process of analyzing electronic appeals (applications, complaints, suggestions) of individuals and legal entities arriving at official websites and portals of authorities and local self-government.</p><p>The text rubrication plays an important role in the process of automatic analysis of incoming electronic appeals. It consists of their distribution according to thematic rubrics that determine the areas of activity of the departments involved in their processing and preparation of the corresponding response.</p><p>Today, there are many methodological approaches to the classification of documents of various types. The choice of a specific method is directly determined by the characteristics of the rubrication objects (i.e. documents received by public authorities).</p><p>The analysis has revealed the following specific characteristics of electronic documents received on official websites and portals of public authorities, which must be taken into account when choosing a rubrication method:</p><p> relatively small size of electronic documents that impedes their statistical analysis;  absence of marking in these documents that complicates the procedures for highlighting the structure and extracting the information relevant to the analysis;  presence of grammar and syntactical errors in electronic messages that entails the necessity for additional processing;  nonstationarity of the thesaurus (the composition and relevance of the rubric words);  dynamic changes of the legislative and regulatory framework that can change the distribution of tasks between departments;  description of several problems in one message (answers can be prepared by several specialists or even several departments). These features significantly limit the possibilities of application of the methods based on the probabilistic and statistical approach to the rubrics generation and electronic text analysis <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b26">27]</ref>.</p><p>The aforementioned determines the urgency of the task of developing a new method of rubricating the electronic unstructured documents, taking into account the specific features of text messages received on official websites and portals of public authorities.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related works</head><p>At present, there are a variety of methods, models and algorithms for the classification of text documents written in natural language. However, each of them has its applicability conditions determined by the statement of the rubrication problem.</p><p>It was shown in articles <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12]</ref> that the choice of a specific classification (rubrication) method is determined by such characteristics as the size of the analyzed document, the degree of rubric thesaurus intersection and the amount of accumulated statistical information.</p><p>Machine learning is a well-known approach to classifying unstructured documents. It offers the use of artificial intelligence methods that can learn from a set of precedents.</p><p>One of the machine learning methods that have been successfully used to solve various classification problems is artificial neural networks. The classification of texts is devoted to the works of authors <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b16">17,</ref><ref type="bibr" target="#b19">20,</ref><ref type="bibr" target="#b20">21,</ref><ref type="bibr" target="#b25">26]</ref>. The main limitation of the application of this approach is the requirement for the presence of a large amount of statistical data necessary for training algorithms.</p><p>Another machine learning method that can be used to classify text documents is fuzzy decision trees. They are based on learning by examples, while the rules are presented in the form of a hierarchical sequential structure. The issues of using fuzzy decision trees are considered in the works <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b14">15,</ref><ref type="bibr" target="#b15">16,</ref><ref type="bibr" target="#b21">22,</ref><ref type="bibr" target="#b22">23,</ref><ref type="bibr" target="#b24">25,</ref><ref type="bibr" target="#b25">26]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Statement of the rubrication problem</head><p>Initial data 1. For the formalized presentation of electronic unstructured documents (EUD) "a unification" for a set of syntactical characteristics is performed in advance. These characteristics are selected by a classical analyzer (parser), for example, LinkGrammar <ref type="bibr" target="#b23">[24]</ref>:</p><formula xml:id="formula_0">{ | 1.. }, n S s n N  </formula><p>where for the typical case 5 N  ; s 1 -the root word or the predicate; s 2 -the subject; s 3 -the adverbial modifier; s 4 -the object under the action; s 5 -the predicate.</p><p>2. There is a set of EUD</p><formula xml:id="formula_1">{ | 1.. }, k V V k K  </formula><p>in which every document k V is presented by a set of its relevant words:</p><formula xml:id="formula_2">( ) 1.. { | 1.. }, k k k l k k k K V v l L     where ( ) k k l v -the relevant word of EUD, k</formula><p>L -the number of words in the k-th EUD. 3. The set of EUD V is presented as a set of SD formalized documents:</p><formula xml:id="formula_3">{ | 1.. }, k SD SD k K  </formula><p>in which the formalized document k SD corresponds to each EUD:</p><formula xml:id="formula_4">( ) 1.. { | 1.. }, k k n k K SD SD n N     where ( ) k n</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>SD</head><p>-the set of words from EUD k V , corresponding to the syntactical parameter n s <ref type="bibr" target="#b6">[7]</ref>. Required To propose a method for rubrication and EUD analysis based on the hierarchical clustering which uses fuzzy relations between syntactical characteristics of rubricating documents.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Method description</head><p>The proposed method for rubrication and analysis of EUD includes the steps discussed below.</p><p>Step 1. To give the parameters to determine the degree of correspondence for formalized documents according to the syntactical characteristics.</p><p>For each formalized document</p><formula xml:id="formula_5">k SD ( 1.. k K  ) a set of values for parameters     ( ) / | 1.. k k n n SD SD s n N  </formula><p>is given to assess the degree of its correspondence according to all syntactical characteristics.</p><p>Step 2. To determine the degree of difference between all pairs of formalized documents according to all syntactical characteristics.</p><p>Consider a pair of documents</p><formula xml:id="formula_6">k SD and l SD , , 1.. k l K  : ( ) { | 1.. } k k n SD SD n N   и ( ) { | 1.. }. l l n SD SD n N  </formula><p>To compare these documents sets of parameters values are given for all syntactical characteristics:</p><formula xml:id="formula_7">    ( ) / | 1.. k k n n SD SD s n N   и     ( ) / | 1.. l l n n SD SD s n N   .</formula><p>As a result, sets of parameter values are formed. These parameters characterize the degrees of difference for documents k SD and l SD according to all syntactical characteristics:</p><formula xml:id="formula_8">      ( ) ( ) ( , ) , / | 1.. k l k l n n n d SD SD d SD SD s n N    ,</formula><p>where, for example,  </p><formula xml:id="formula_9">( ) ( ) ( ) ( ) , k l k l n n n n d SD SD SD SD   .</formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Note. The obtained set of values ( , )</head><p>k l d SD SD  can be presented in the form of a fuzzy set and interpreted as a fuzzy difference between fuzzy sets</p><formula xml:id="formula_10">    ( ) / | 1.. k k n n SD SD s n N   and     ( ) / | 1.. l l n n SD SD s n N   , syntactical characteristics from { | 1.. } n S s n N  </formula><p>are their carriers, and the documents degrees of correspondence to these characteristics  As a result, the following set of parameters values, characterizing the degree of difference between the documents according to the syntactical characteristics, is formed: .</p><formula xml:id="formula_11">            1 2<label>3</label></formula><formula xml:id="formula_12">            1 2<label>3</label></formula><p>The calculation for the degree of differences according to all syntactical characteristics is performed for all pairs of formalized documents</p><formula xml:id="formula_13">k SD and l SD , , 1.. k l K  .</formula><p>Step 3. To form a matrix of difference between all pairs of the formalized documents.</p><p>The results of the previous step allow forming a compose matrix of difference between all pairs of documents.</p><p>Figure <ref type="figure">1</ref> shows such type of a matrix. , where</p><formula xml:id="formula_14">1 SD l SD K SD 1 SD 1 1 ( , ) d SD SD  … 1 ( , ) l d SD SD  … 1 ( , ) K d SD SD  … … … k SD 1 ( , ) k d SD SD  … ( , ) k l d SD SD  … ( , ) k K d SD SD  … … … K SD 1 ( , ) K d SD SD  … ( , ) K l d SD SD  … ( , ) K K d SD SD </formula><formula xml:id="formula_15">    ( ) / | 1.. i i n n Cl Cl s n N   . The detected clusters { | 1.. } i Cl Cl i I   correspond to the rubrics: { | 1.. }, i R R i I  </formula><p>where for all 1..</p><formula xml:id="formula_16">i I      , ( / )| 1.. | 1.. , n i j i j i n i R t w s n N j J    ji t -j-th relevant word in the rubric i R , [0, 1] n ji</formula><p>w  -the degree of correspondence for the word ji t to the syntactical characteristic n s in the rubric i R . Thus, the result of the hierarchical clustering for documents is a tree-type structure of the formed rubric field based on the fuzzy relations between syntactical characteristics of the rubricating documents.</p><p>Step 5. Documents analysis.</p><p>The proposed procedure of analysis is based on the comparison of the correspondence degrees k SD for the analyzing document k SD according to the syntactical characteristics with the values for the clusters centers k SD sequentially from the root to the leaves of the built decision tree. In this case, the analysis procedure takes into account the specificity of the detected clusters.</p><p>The analyzing document k SD is the most relevant to the rubric l R  , the degree of fuzzy correspondence to which is the maximum:</p><formula xml:id="formula_17">1.. : max ( , )</formula><p>.</p><formula xml:id="formula_18">k i l i I R SD Cl   </formula><p>To calculate a parameter characterizing the degree of fuzzy correspondence of formalized documents k SD to the rubric i R , it is reasonable to use the following <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>:</p><formula xml:id="formula_19"> <label>2 ( ) ( ) 1 1 ( , ) 1 .</label></formula><formula xml:id="formula_20">N k i i k n n n SD R SD Cl N      </formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">The results of the proposed method application</head><p>The proposed rubrication method was programmatically implemented as a component of the comprehensive information system for the automatic processing of electronic unstructured documents arriving at official websites and portals of public authorities.</p><p>This method was tested in the automated processing and analysis of appeals (applications, complaints or suggestions) of citizens and organizations receiving by Administration of Smolensk region in 2018-2019.</p><p>To carry out the classification of incoming electronic appeals, the experts have identified 17 interconnected rubrics reflecting the urgent civic problems: general issues of society and politics (R 1 ), separation of powers and functions in the Administration (R 2 ), social sphere (R 3 ), education (R 4 ), suggestions for improving the city of Smolensk (R 5 ), family (R 6 ), culture (R 7 ), physical education and sport (R 8 ), housing and communal services (R 9 ), maintenance and utilities (R 10 ), housing stock (R 11 ), non-residential fund (R 12 ), securing the right to housing (R 13 ), economy (R 14 ), business activities (R 15 ), natural resources (R 16 ) and environmental protection (R 17 ).</p><p>Two well-known methods (probabilistic and neural network) successfully used to classify unstructured text documents have been practically implemented for comparative text analysis.</p><p>The Bayes classification was chosen as the first alternative method because of its ease of implementation and minimal human and financial costs for software implementation. It uses the procedure for classifying documents based on Bayes formula for conditional probability.</p><p>The input text document is presented as a sequence of terms {w n }. Each rubric R i is characterized by the unconditional probability P(R i ) of the assignment of document V to it and the conditional probability P(w|R i ) to meet the term w in document V, subject to the choice of rubric R i . Then the probability P(V|R i ) is understood as the probability that the text document will be classified subject to the selection of rubric R i .</p><p>The procedure for document rubrication consists in calculating the probabilities P(R i |V) for all rubrics R i and choosing the rubric for which this probability is maximal. Classifier training consists of compiling a vocabulary of probabilities of various terms {w n } for each rubric.</p><p>The methods of using probabilistic algorithms for the classification of text documents are considered in more detail in <ref type="bibr" target="#b7">[8]</ref>.</p><p>Convolutional neural networks were used as the second alternative method for document rubrication.</p><p>Convolutional networks are artificial neural networks of feedforward type when a signal travels sequentially along the neurons (from the first layer to the last). These networks were originally developed for image analysis. Good results in this area have led to their application for solving other classification tasks, including unstructured documents.</p><p>This neural network is an alternation of convolutional, subsampling and fullyconnected layers. A text document arrives at the network input wherein each word is determined by the vector (e.g., may use the algorithm word2vec). The Softmax function which implements multiclassification is used for the output layer of the neural network.</p><p>Convolutional neural networks for the classification of text documents are considered in more detail in <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b18">19,</ref><ref type="bibr" target="#b27">28]</ref>.</p><p>During the preliminary analysis, the authors have identified 4 typical situations, identified depending on three indicators: the size of the received document, the degree of intersection of the headings, and the amount of accumulated statistics for training the models.</p><p>Depending on these typical situations, Table <ref type="table" target="#tab_2">1</ref> shows the results of comparative assessment for the correct rubrication and analysis based on the example of more than 10 thousand mеssages.</p><p>For the mentioned typical situations the proposed classification method has allowed reducing the number of erroneously rubricated text documents by 7% on average compared with the probabilistic method and by 6.3% compared with the neural network method. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion</head><p>As a result of the implemented method a tree structure of a rubric field is formed, this structure is based on the fuzzy relations between the syntactical characteristics of the rubricated documents. The document analysis is based on the detection of the fuzzy correspondence for these documents according to the syntactical characteristics with the values of the determined clusters sequentially from the root to the leaves of the built decision tree.</p><p>The proposed method for rubrication and analysis of electronic unstructured text documents was implemented by the software and tested during automated processing of appeals (applications, complaints or suggestions) of citizens and organizations receiving by Administration of Smolensk region. It has made possible to ensure efficient and high-quality actualization for the rubrics and document analysis under the conditions of nonstationary composition of the thesaurus and the relevance of the words in rubrics.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>into account the below-mentioned parameters:</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 .Step 4 .</head><label>14</label><figDesc>Figure 1. The compose matrix of differences between all pairs of documents Step 4. Fuzzy hierarchical clustering of documents based on the fuzzy relations of difference between all pairs of formalized documents according to all syntactical characteristics. Parameters   ( ) ( ) , k l n n d SD SD are used as the parameters for fuzzy hierarchical clustering of formalized documents, their values characterize the results of pairwise comparison</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 1 .</head><label>1</label><figDesc>The results of the comparative assessment for the correct rubrication and analysis of EUD received by Administration of Smolensk region</figDesc><table><row><cell cols="5">Typical situation for analysis</cell><cell cols="3">Results for rubrication and</cell></row><row><cell cols="5">and rubrication of EUD</cell><cell cols="2">analysis of EUD, %</cell></row><row><cell>EUD size</cell><cell cols="3">Degree of rubrics overlay</cell><cell>Sufficient statistics</cell><cell>Probabilistic method</cell><cell>Neural network method</cell><cell>Proposed method</cell></row><row><cell>up to 150</cell><cell cols="2"></cell><cell>0.4</cell><cell>not enough</cell><cell>65</cell><cell>60</cell><cell>65</cell></row><row><cell>words</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>up to 150</cell><cell></cell><cell></cell><cell>0.15</cell><cell>not enough</cell><cell>62</cell><cell>66</cell><cell>79</cell></row><row><cell>words</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>up to 50</cell><cell></cell><cell cols="2">0.15</cell><cell>enough</cell><cell>69</cell><cell>87</cell><cell>90</cell></row><row><cell>words</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>more</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>than 150</cell><cell></cell><cell cols="2">0.15</cell><cell>enough</cell><cell>89</cell><cell>85</cell><cell>89</cell></row><row><cell>words</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Acknowledgment</head><p>The reported study was funded by RFBR according to the research project No 18-01-00558.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<ptr target="https://www.adminsmolensk.ru/obrascheniya_grazhdan/obzori_obrascheniy/news_16096.html" />
		<title level="m">Analytical report on the work of Administration of Smolensk region with citizens&apos; appeals</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Acquisition of knowledge in the form of fuzzy rules for cases classification</title>
		<author>
			<persName><forename type="first">T</forename><surname>Avdeenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Makarova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Lecture Notes in Computer Science. Data Mining and Big Data</title>
		<imprint>
			<biblScope unit="volume">10387</biblScope>
			<biblScope unit="page" from="536" to="544" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">On definition and construction of association measures</title>
		<author>
			<persName><forename type="first">I</forename><surname>Batyrshin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Intelligent &amp; Fuzzy Systems</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="page" from="2319" to="2326" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Towards a general theory of similarity and association measures: Similarity, dissimilarity and correlation functions</title>
		<author>
			<persName><forename type="first">I</forename><surname>Batyrshin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Intelligent &amp; Fuzzy Systems</title>
		<imprint>
			<biblScope unit="volume">36</biblScope>
			<biblScope unit="page" from="2977" to="3004" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A Neural Probabilistic Language Model</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Bengio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ducharme</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Vincent</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Jauvin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">JMLR</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="1137" to="1155" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Analysis and monitoring of electronic text documents rubrication</title>
		<author>
			<persName><forename type="first">V</forename><surname>Borisov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kozlov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">MPIE Bulletin</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="121" to="127" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">The method of fuzzy analysis of texts and their rubrics actualization</title>
		<author>
			<persName><forename type="first">V</forename><surname>Borisov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kozlov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the II International Scientific and Practical Conference</title>
				<meeting>the II International Scientific and Practical Conference<address><addrLine>Ulyanovsk</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="259" to="263" />
		</imprint>
	</monogr>
	<note>Fuzzy Technologies in the Industry -FTI 2018</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Using optimize naïve bayes classifier in problem of sms classification</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Burlakov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Izvestia of Samara Scientific Center of the Russian Academy of Sciences</title>
		<imprint>
			<biblScope unit="volume">18</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="705" to="709" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Genetic Fuzzy Systems: Evolutionary Tuning and learning of Fuzzy Knowledge Bases</title>
		<author>
			<persName><forename type="first">O</forename><surname>Cordon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Herrera</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Hoffmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Magdalena</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
			<publisher>World Scientific Publishing</publisher>
			<biblScope unit="page">462</biblScope>
			<pubPlace>New Jersey, London, Hong Kong</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Developing the economic information system for automated analysis of unstructured text documents</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Bulygina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kozlov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Ross</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Applied Informatics</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="51" to="57" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Development of multimethod approach to rubrication of unstructed electronic text documents in various conditions</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Bulygina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kozlov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the International Russian Automation Conference (RusAutoCon)</title>
				<meeting>the International Russian Automation Conference (RusAutoCon)<address><addrLine>Sochi</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Formation of the structure of the intellectual system of analyzing and rubricating unstructured text information in different situations</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Bulygina</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kozlov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Applied Informatics</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="111" to="123" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Bottom-up Partitioning in Fuzzy Decision Trees</title>
		<author>
			<persName><forename type="first">M</forename><surname>Faifer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Janikow</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 19th International Conference of the North American Fuzzy Information Society. IEEE</title>
				<meeting>the 19th International Conference of the North American Fuzzy Information Society. IEEE</meeting>
		<imprint>
			<date type="published" when="2000">2000</date>
			<biblScope unit="page" from="326" to="330" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Jambu</surname></persName>
		</author>
		<title level="m">Hierarchical cluster analysis and correspondences</title>
				<meeting><address><addrLine>Moscow</addrLine></address></meeting>
		<imprint>
			<publisher>Finance and statistics</publisher>
			<date type="published" when="1988">1988</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Fuzzy Decision Trees: Issues and Methods</title>
		<author>
			<persName><forename type="first">C</forename><surname>Janikow</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions of Man, Systems, Cybernetics</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="14" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Decision Tree&apos;s Features of Application in Classification Problems</title>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">L</forename><surname>Kaftannikov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">V</forename><surname>Parasich</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control, Radio Electronics</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="26" to="32" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Recurrent convolutional neural networks for discourse compositionality</title>
		<author>
			<persName><forename type="first">N</forename><surname>Kalchbrenner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Blunsom</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Workshop on CVSC</title>
				<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="119" to="126" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Convolutional neural networks for sentence classification</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Kim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEMNLP</title>
		<imprint>
			<biblScope unit="page" from="1746" to="1751" />
			<date type="published" when="2014-09">September. 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Imagenet classification with deep convolutional neural networks</title>
		<author>
			<persName><forename type="first">A</forename><surname>Krizhevsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sutskever</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hinton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">NIPS</title>
		<imprint>
			<biblScope unit="page" from="1106" to="1114" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Fuzzy logic and artificial neural networks</title>
		<author>
			<persName><forename type="first">V</forename><surname>Kruglov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Golunov</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2001">2001</date>
			<publisher>Nauka, Fizmatlit</publisher>
			<pubPlace>Moscow</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">Text understanding from scratch</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Lecun</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
		<respStmt>
			<orgName>Computer Science Department</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Dependency tree-based sentiment classification using CRFs with hidden variables</title>
		<author>
			<persName><forename type="first">T</forename><surname>Nakagawa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Inui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kurohashi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of ACL</title>
				<meeting>ACL</meeting>
		<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page">2010</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<title level="m" type="main">Fuzzy Control</title>
		<author>
			<persName><forename type="first">K</forename><surname>Passino</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Yurkovich</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1998">1998</date>
			<publisher>Addison-Wesley</publisher>
			<biblScope unit="page">522</biblScope>
			<pubPlace>NJ</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><forename type="first">S</forename><surname>Protasov</surname></persName>
		</author>
		<ptr target="http://sz.ru/parser/doc/" />
		<title level="m">LinkGrammar</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Induction of decision trees</title>
		<author>
			<persName><forename type="first">J</forename><surname>Quinlan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Machine Learning</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="81" to="106" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Text classification with decision trees and feedforward neural networks</title>
		<author>
			<persName><forename type="first">O</forename><forename type="middle">G</forename><surname>Shevelyov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">V</forename><surname>Petrakov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Tomsk State University Journal</title>
		<imprint>
			<biblScope unit="volume">290</biblScope>
			<biblScope unit="page" from="300" to="307" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Classification of text information with the use of SVM</title>
		<author>
			<persName><forename type="first">N</forename><surname>Uchitelev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information technologies and system</title>
		<imprint>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="335" to="340" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Character-level convolutional networks for text classification</title>
		<author>
			<persName><forename type="first">X</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Lecun</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems</title>
				<imprint>
			<date type="published" when="2015">Febrary. 2015</date>
			<biblScope unit="page" from="649" to="657" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
