<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Quantitative Parameters of J. London&apos;s Short Stories Collection &quot;Children of the Frost&quot; and its Translation</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Mariia</forename><surname>Bekhta-Hamanchuk</surname></persName>
						</author>
						<author>
							<persName><forename type="first">Halyna</forename><surname>Oleksiv</surname></persName>
							<email>halyna.d.oleksiv@lpnu.ua</email>
						</author>
						<author>
							<persName><forename type="first">Tetiana</forename><surname>Shestakevych</surname></persName>
							<email>tetiana.v.shestakevych@lpnu.ua</email>
						</author>
						<author>
							<persName><forename type="first">Yuliia</forename><surname>Shyika</surname></persName>
							<email>yuliia.i.shyika@lpnu.ua</email>
							<affiliation key="aff0">
								<orgName type="institution">Lviv Polytechnic National University</orgName>
								<address>
									<addrLine>Stepana Bandery Street, 12</addrLine>
									<postCode>79000</postCode>
									<settlement>Lviv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department">International Conference on Computational Linguistics and Intelligent Systems</orgName>
								<address>
									<addrLine>May 12-13</addrLine>
									<postCode>2022</postCode>
									<settlement>Gliwice</settlement>
									<country key="PL">Poland</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Quantitative Parameters of J. London&apos;s Short Stories Collection &quot;Children of the Frost&quot; and its Translation</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">9622DC2BACCB5A0D017CC63555B199BF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T12:55+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Corpus, corpus annotation, corpus linguistics, source text, target text 0000-0002-3133-0948 (M. Bekhta-Hamachuk)</term>
					<term>0000-0002-8800-6217 (H. Oleksiv)</term>
					<term>0000-0002-4898-6927 (T. Shestakevych)</term>
					<term>0000-0003-2474-0479 (Ju. Shyika)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The paper presents the quantitative comparative analysis of Jack London's collection of short stories "Children of the Frost" and Ukrainian translations by V. Hladka and K. Koriakina which has been carried out on the basis of the digital marked corpus of original texts. The novelty of the research lies in the fact that the above-mentioned literary work has not been previously studied from the statistical perspective. The theoretical background of the study is outlined, particularly emphasizing the issues of the corpus, corpus annotation and corpus linguistics software. The source and target texts have been compared according to the following coefficients: text volume, number of different word forms, number of sentences, number of letters, number of content words, number of functional words, hapax legomena and number of words with a frequency of 10 or more. The most frequently used parts of speech both in source and target texts are stated. The quantitative indices of the lexical level, which have been calculated on the basis of the general characteristics of the source and target texts, have been compared. The reproduction of the nominal character of the source text in the target text has been analyzed.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>One of the key issues in modern linguistics is natural language processing. Working with large amounts of factual information enables the researcher to avoid subjective selection of facts for confirming or rejecting the hypothesis. Nowadays there is a number of information technologies enabling an automated search with the aim of forming the factual basis of the research, corpora of texts being one of them. The corpus of texts is a central concept in corpus linguistics and its object of study. The issues of corpus linguistics are widely ranged and involve studies of the general theory of corpus linguistics, correlations of corpus linguistics and other linguistic disciplines, corpus typologies and methods of corpus data interpretation, the principles of creating natural languages text corpora (D. Biber <ref type="bibr">[3; 3]</ref>, J. Sinclair <ref type="bibr" target="#b27">[28]</ref>, W. Teubert <ref type="bibr" target="#b29">[30]</ref>, G. Kennedy <ref type="bibr" target="#b15">[16]</ref>, G. Leech <ref type="bibr">[14; 20]</ref>, A. Stefanowitsch <ref type="bibr" target="#b28">[29]</ref>, T. McEnery <ref type="bibr">[10; 14]</ref>, D. Barth, S. Stefan <ref type="bibr" target="#b1">[2]</ref>, N.S. Dash, S. Arulmozi <ref type="bibr" target="#b10">[11]</ref>, G. Desagulier <ref type="bibr" target="#b11">[12]</ref>, M. Paquot, S. Th. Gries <ref type="bibr" target="#b24">[25]</ref>, V. Shyrokov <ref type="bibr" target="#b8">[9]</ref>, O. Demska-Kulchytska <ref type="bibr" target="#b12">[13]</ref>, A. <ref type="bibr">Baranov</ref>  <ref type="bibr" target="#b0">[1]</ref> etc). Since a language is not a strictly arranged system and has probabilistic and stochastic character, it is advisable to apply statistical methods in order to research it <ref type="bibr" target="#b16">[17]</ref>. Research in corpus linguistics is facilitated by special software tools -concordancers and corpus managerswhich provide various opportunities to obtain the necessary information from the corpus. Thus, corpora allow addressing the variety of research questions and have been applied in a wide range of linguistic disciplines, including lexicography, grammar, discourse analysis, sociolinguistics, language teaching, literary studies, translation studies, pragmatics, cognitive linguistics, conceptual studies, etc <ref type="bibr">[26, p.473</ref>].</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Theoretical background</head><p>Biber et al. define a corpus as "a large and principled collection of natural texts" <ref type="bibr">[3, p.4]</ref>. Generally understood as the collection of texts, the term corpus can have different meanings in various disciplines. In fiction studies, it is a collection of particular author's works. In the field of linguistics, the corpus refers to any collection of data (whether narrative texts or separate sentences) obtained for the purpose of linguistic research, often taking into account a specific research goal <ref type="bibr">[27, p.769; 29, p.22]</ref>. But the term is used in a different way in corpus linguistics -"it refers to a collection of samples of language use with the following properties:</p><p> the instances of language use contained in it are authentic;  the collection is representative of the language or language variety under investigation;  the collection is large" <ref type="bibr">[27, p.769; 29, p.22</ref>]. Additionally, texts in such collections are often commented on in order to enhance their potential for linguistic analysis. In particular, they may contain information about the paralinguistic aspects of the source data (intonation, font style, etc.), linguistic properties of utterances (parts of speech, syntactic structure) and demographic information about speakers / writers <ref type="bibr">[29; p.22]</ref>. The volume and content of the corpus may change, but these changes must neither influence its representativeness nor change it reasonably. Search in the data corpus allows a researcher to build a concordance for any word, i.e. to build a list of all usages of the word in the context and with the references to the source. Corpora can be used to obtain a variety of data and statistics on language and language units.</p><p>As a rule, the research process within a corpus involves the following stages: 1. Selection of sources of linguistic material. 2. Data entry. Texts in electronic form with the extension .txt were included in the corpus.</p><p>3. Philological verification and texts editings. 4. Converting and graphematic analysis which includes recoding of nontextual elements or their removal and division of the text into structural parts. 5. Providing texts and their components with additional information, i.e. text markup. 6. Converting of marked texts into the corpus and providing access to it. To serve as a basis of the scientific research, a corpus should not only have a significant volume or contain data of various types but also it should possess the following features:</p><p> Representativeness. The corpus must represent all the features of a particular area. It can be very large (national corpus) or very small (author corpus). T. McEnery argues that the representativeness of the corpora is caused by two factors: the set of genres that are in the corpus and the selection of texts <ref type="bibr" target="#b9">[10]</ref>. Selection is characterized by the limit of real material, selecting certain parts of speech from the language array. However, the largest language corpus can display only a small part of oral and written texts. Representativeness is closely related to volume of the corpus. However, volume of the corpus is determined by two factors: representativeness (sufficiency of texts (words) for accurate representation of the language material) and practicality (accessibility and labour-intensiveness). For example, it is necessary to cover all works of a certain author, or historical texts of a certain period, or texts of a certain subject (for example, radio or TV series, political speeches). In other cases, full representation of language cannot be achieved.  Balance. Corpus representativeness largely depends on how balanced a corpus is. The acceptable balance of a corpus is determined by its intended uses. A balanced corpus usually covers a wide range of text categories which are supposed to be representative of the language or language variety under consideration. <ref type="bibr" target="#b9">[10]</ref> Although balance is indispensable in corpus design, there is no scientific method of measuring it. Nonetheless, text typology is of high relevance if one attempts at corpus balance. To achieve balance, a corpus requires certain characteristics of text selection, which include differences between the book and newspaper, different genres of literature and authors.</p><p> Machine readability is the main criterion for electronic text corpus. Machine readability also requires encoding of corpus data. Corpus computerization has many advantages. It speeds up processing and makes working with data sets much easier. After computer processing of data, the objective and accurate results are obtained. Machine readability enables further automatic processing of data of a particular corpus, and allows the researcher to improve the corpus with all sorts of markup. It is the use of computerized corpora, together with computer programs which facilitate linguistic analysis, that distinguishes modern machine-readable corpora from early corpora <ref type="bibr" target="#b9">[10]</ref>. The purpose of the language corpus is to show the functioning of linguistic units in their natural contextual environment. The following prerequisites form the basis for further creation and usage of corpora:</p><p>1. substantial (representative) and balanced volume of the corpus guarantees the typicality of the data and provides the whole spectrum of linguistic phenomena; 2. various data, which are included in the corpus, are in their natural contextual form, which creates the possibility of their comprehensive and objective study; 3. once created and prepared data set can be used repeatedly, by different researchers and for different purposes. In the process of creating a corpus, the certain procedures should be followed, regardless of whether the corpus includes spoken or written language material. Some of the issues that are optimal in building the corpus include: typology of texts, file names and their format, etc. The next important step in building a corpus is marking and annotation. Document markup refers to labeling, similar to HTML code used to indicate features of a document: paragraphs, fonts, sentences, including sentence numbers, author identification, and end-of-text markings. At the basic level, the title can be considered as a type of markup as it provides additional information about the text.</p><p>Apart from corpus, another key term in corpus linguistics is corpus annotation, which is defined by G. Leech as the process of "adding interpretative, linguistic information to an electronic corpus of spoken and / or written language data <ref type="bibr" target="#b19">[20]</ref>. The main issue in corpus linguistics is the creating of means of automatic / automated text annotation based on different criteria -morphological, orthoepic, semantic, syntactic, etc. V Shyrokov states that automated division of an electronic literary text into 'microcontexts' is the main idea of linguistic corpus engineering, with microcontexts being text fragments grouped around the object under interpretation [9; p.99].</p><p>Corpus annotation can take many forms that can be implemented at different levels: 1. at the phonological level: the corpora can be commented on the constituent boundaries (phonetic / phonemic annotation) or prosodic features (prosodic annotation); 2. at the morphological level: the corpora can be annotated as prefixes, stems and suffixes (morphological annotation); 3. at the lexical level: the corpora can be annotated by parts of speech, lemmas (lemmatization) and semantic fields (semantic annotation); 4. at the syntactic level: the corpora can be annotated to reflect anaphoric connections, pragmatic information such as language acts (pragmatic annotation), or stylistic features such as speech and thought representation (stylistic annotation). The most common form of corpus annotation includes tags of the parts of speech (POS tagging or grammatical tagging), which mark each word in the corpus as a grammatical category (e.g. noun, adjective, adverb etc.). When corpora began to be annotated, the levels of annotation applied were simple. However, as the tools evolved, more levels of linguistic knowledge started to be incorporated into the texts and corpora <ref type="bibr">[15, p.47</ref>]. These tags facilitate settling a number of issues about a simple search for a particular keyword. Many words are ambiguous, but when a word is marked with a part of speech, it eliminates ambiguity and helps focus the search results clearly. Therefore, annotated corpora can be widely applied. Many linguistic analyses depend heavily on POS tagging <ref type="bibr" target="#b9">[10]</ref>.</p><p>To sum up, annotation aims at the addition of extralinguistic, structural, and linguistic special markers to texts. Different types of linguistic markup are distinguished: morphological, syntactic, semantic, anaphoric and prosodic. Also, the following procedures are carried out: tokenization, lemmatization, stemming and parsing. Most corpora belong to the morphological or syntactic type. It should be noted that the latter explicitly or implicitly contain morphological characteristics of lexical units.</p><p>Since corpus linguistics uses large and representative samples of natural language texts for the research, there are several types of software that can be used in the study. They are: concordancers (LEXA, MonoConc, MicroConcord, TACT, WordSmith, WordCruncher, Manatee (Bonito), IMS Corpus Workbench (CQP), XAIRA, LEXA, Virtual Corpus Manager (VMC), EXMARaLDA Corpus-Manager (Co-Ma)) and a specific software for comprehensive analysis.</p><p>Concordancers are used to make lists of examples (occurrences) of the required token (tokens, lemmas, morphemes etc) in the minimum context. Usually such a context is a fragment of several linguistic units on the left (L) and on the right (R).</p><p>The corpus manager refers to the system for managing textual and linguistic data. It is a special search system that uses software to search for data in the corpus, obtain statistical information and provide results to the user in a convenient form. The results of this procedure are presented in the form of horizontal lines with a search word in the middle. This process is called KWIC (Key Word In Context) <ref type="bibr" target="#b17">[18]</ref>.</p><p>Corpus analysis software tools vary in functionalities, but all of them facilitate to search the corpus for a specific set of linguistic units. Most of these software packages have the following features:</p><p>1. they create KWIC (keyword in context) concordants, i.e. they display the query in their immediate context, defined in terms of a certain number of words or symbols on the left and right; 2. they identify the collocations of this expression, i.e. the forms of words that occur in a particular position in relation to another word; these words are usually listed in the order in which they occur in the appropriate position; 3. they form lists of frequencies, i.e. lists of all lines of symbols in the corpus, listed in the order of their frequency. Generally, modern software tools used in corpus linguistics research are fast and rich in features. On the other hand, most of the tools are English-centric in that they only allow access to English corpora. In addition, they all offer a different user-experience, because each tool is created in isolation and thus offers a different user interface, control flow, and functionality <ref type="bibr">[19, p.154</ref>]. Nevertheless, corpus software tools are indispensable in corpus-based research projects.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results and discussion</head><p>Text corpus, being the main issue of corpus linguistics, is widely applicable in translation studies. This study focuses on the contrastive analysis of the quantitative parameters of the source (English) text and its translation (Ukrainian). Jack London's short stories collection "Children of the Frost" is in the centre of attention. The choice has been made due to the fact that the literary work in question has not been studied from statistical perspective before. In the process of analysis quantitative and qualitative analytical methods have been used.</p><p>In this research the analysis of Jack London's collection of short stories "Children of the Frost" has been conducted on the basis of the digitally processed and marked up corpus of original texts and Ukrainian translations by V. Hladka and K. Koriakina <ref type="bibr">[22; 23]</ref>. It covers a number of characteristics which are compared in Tables <ref type="table" target="#tab_3">1-4</ref>. Here and after, we propose some denotations, the text volume is N, the number of different word forms is V, the number of sentences is S, the number of letters is C, the number of content words is C1, the number of functional words is F1, the number of Hapax legomena is V1, the number of words in the text with a frequency of 10 or more is N10. The visualization (Fig. <ref type="figure">1</ref>) of the data from the Table <ref type="table" target="#tab_0">1</ref> is performed to show the ratio between the quantitative characteristics of the Source and Target texts. Here each quantitative characteristics of the Source text has been divided by the appropriate number that characterizes the Target text. When the result of such division is above 1, it means the appropriate characteristic of the Source text exceeds the Target text.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 1. The ratio of Source text and Target text characteristics</head><p>As is seen from the Figure <ref type="figure">1</ref>, in the process of translation, the number of functional words decreased, as well as text volume and Number of words in the text with a frequency of 10 or more. The number of different word forms is higher in the Target text, which is predictable at least because the Ukrainian language has seven cases, as opposed to two cases in English. </p><formula xml:id="formula_0">,9 C 1,0 1,2 1,3 1,1 1,3 1,1 1,3 1,2 0,9 1,1 C1 1,<label>6 1,9 1,5 1,5 1,6 1,7 1,8 2,0 1,5 1,7 F1 0,9 0,8 0,8 0,9 0,7 0,7 0,8 0,7 1,0 0,8 V1 1,1 1,2 1,6 1,2 1,7 1,2 1,8 1,2 1,0 1,1</label></formula><p>The analysis of general characteristics has shown that the number of word usages in the source text exceeds the number of word usages in the target text both in the whole corpus and in separate stories. Altogether, the volume of the source text is 20.69% larger than the volume of the target text. It should be noted that this contradicts the theory of translation S-universals and T-universals, which was put forward by A. Chesterman <ref type="bibr" target="#b6">[7]</ref>, and involves an increase in the volume of the target text compared to the source text.</p><p>The visualization (Fig. <ref type="figure" target="#fig_0">2</ref>) of the data from the Table <ref type="table" target="#tab_1">2</ref> and 3 is performed to show the ratio between the quantitative characteristics of each Source and Target texts. Here each quantitative characteristics of the Source text has been divided by the appropriate number that characterizes the Target text. When the result of such division is above 1, it means the appropriate characteristic of the Source text exceeds the Target text. The following S-universals can be distinguished: increasing the volume of the translated text compared to the original; simplification at the syntactic level; simplification at the lexical levelreduction of lexical diversity and the tendency to use more frequent words in the target language; reduction or avoidance of recurrences in the target language; avoiding the ethnospecific units in translation; standardization (use of typical target language structures); convergence (translated texts show greater linguistic similarities with each other than with the original texts).</p><p>As for T-universals, their taxonomy includes:  simplification (reduction of lexical diversity and density);  conventionalization (standardization);  atypical (unstable) lexical patterns <ref type="bibr" target="#b6">[7]</ref>. The frequency of each part of speech in the text and the vocabulary of the author (translators) has been compared since the ratio of parts of speech is an important statistical parameter of the individual style of both the author and a particular work (Table <ref type="table" target="#tab_4">5</ref>).</p><p>The most frequent in the source and target texts are the functional words (5% of the vocabulary in the source text and 6.37% in the target text). These words function most actively and cover almost a quarter (29.81% in the original text and 23.31% in the translated text) of the text. Pronouns have similar high activity in the text (3.18% of the vocabulary in the source text and 3.24% in the target text). Pronouns cover about 13% of the text. Approximately the same share in the text and the vocabulary is covered by adverbs (7.20% and 8.91% in the source text and 10.13% and 12.16% in the target text) and numerals (0.91% and 1.07 in source text and 1.26% and 1.06% in the target text) (see Table <ref type="table" target="#tab_5">6</ref>). In Figure <ref type="figure">3</ref>, each quantitative characteristics of the Source text was divided by the appropriate number that characterizes the Target text, as it is calculated in Table <ref type="table" target="#tab_6">7</ref>. When the result of such division is above 1, it means the appropriate characteristic of the Source text exceeds the Target text. </p><p>Nouns, verbs and adjectives are the most frequent; their relative number in the vocabulary, on the contrary, exceeds the relative number in the text both source and target. These parts of speech represent the vocabulary richness of the source and target texts and their ratio confirms that the nominal character of the individual style of the original text has been preserved in the translation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 3. The ratio of each Source text and Target text characteristics</head><p>Linguistic and statistical analysis of the corpus under research has been carried out according to the formula developed by S. Buk <ref type="bibr" target="#b4">[5]</ref>. The following characteristics of the corpus have been calculated:</p><p> The average word length in source and target texts which is calculated as the total number of letters divided to the total number of words; </p><p>The average frequency of the word in the text (A), which is calculated as the volume of the text (N) divided to the volume of the dictionary of tokens (V). This value is inverse to the index of diversity and is calculated according to the formula (1). In our case, each word of the source texts is repeated at least thrice, and in the target texts -at least twice.</p><formula xml:id="formula_2">𝐴 = 𝑁 / 𝑉<label>(1)</label></formula><p> Exclusivity index of (Eт) is calculated as a number of words with a frequency of 1 (such words are referred to as hapax legomena) (V1) to the total volume of text (N). The formula is the following: </p><p> Exclusivity index of the vocabulary (Ec), i.e. the total number of separate words reduced to the original form (V) is calculated according to the formula:</p><formula xml:id="formula_4">Ec = V1 / V<label>(3)</label></formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head></head><p>The richness of the vocabulary (B) or in other words the index of diversity is calculated as the volume of dictionary of tokens (V) to the volume of text (N). the formula is the following:</p><formula xml:id="formula_5">B = V / N<label>(4)</label></formula><p>The higher the index of diversity is, the bigger amount of diverse words the author or the translator used in a particular text. In our case, the index equals 0,264 in source text and 0,443 in the target text. These indices are high enough, since according to S. Buk, the average index fiction equals 0.067. <ref type="bibr" target="#b5">[6]</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head></head><p>Concentration index is a value opposite to the index of exclusivity and indicates what share of the text (N) or vocabulary (V) is taken by highly frequency vocabulary (with absolute frequency of 10 or more). Concentration index is calculated according to the formulas: V10т / N is the text concentration index and V10 / V is the vocabulary concentration index.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head></head><p>Index of lexical density (L) is calculated as the ratio of the number of different words to the total number of words in the text. The algorithm for calculating the index of lexical density includes the following steps: defining an input set of words (either a meaningful text or a part of it, or a random set of words); conversion of each word into its vocabulary form (stemming); deleting all duplicates. The formula for calculating lexical index is</p><formula xml:id="formula_6">L = K / N<label>(5)</label></formula><p>where N stands for the number of words after stemming and K stands for a number of words after deleting the duplicates.</p><p> The automated readability index (ARI) is a measure of the complexity of a reader's perception of a text. ARI index is calculated according to the formula:</p><formula xml:id="formula_7">ARI = 4.71 × 𝐶 𝑊 + 0.5 × 𝑊 𝐶 − 21.43,<label>(6)</label></formula><p>where C is the number of letters and numbers in the text, W is the number of words in the text and S is the number of sentences in the text. The degree of aggression is the same in the source and target texts and equals 0.19. This confirms the fact that the nominal character of the original text is accurately reproduced in the translation.</p><p> The index of epithetization (Inat), as follows from its definition, indicates the ratio between the total number of nouns in the text (Vn) ant the total number of adjectives (Vadj). The index of epithetization is calculated according to the formula:</p><formula xml:id="formula_8">Inat = Vn / Vadj<label>(7)</label></formula><p>The higher the index of epithetization is, the fewer adjectives per noun are present. It can be concluded that this index in source and target texts does not differ significantly: 2.86 / 3.51, and therefore the translator was able to maintain the saturation of the text with figurative phrases.</p><p> The index of verb phrases shows the ratio between adverbs and verbs in the text. The original texts have a slightly bigger percentage: 0.47 adverbs per 1 verb, while in translation -0.51 per 1.  Nominality degree shows the ratio between nouns and verbs in the text. In the original texts, there are 1.32 nouns per verb, in translation -1.22 per 1.  The average sentence size indicates the peculiarities of verbal intelligence or a radical change of emotional state. There is a negative correlation between the increase of emotionality of speech and the amount of In other words, the more emotional the speaker is, the shorter their statements are.  The coefficient of aggression represents the ratio between the number of verbs (and participles) and the total number of the words in the text. The coefficient is calculated according the formula:</p><formula xml:id="formula_9">Aggression coefficient = N verbs N of all words × 100%,<label>(8)</label></formula><p>where N -number of appropriate words. High coefficient of aggression indicates considerable emotional tension of the text, dynamics of events, poor emotional state of the author during text synthesis.</p><p> The coefficient of logical coherence represents the ratio between the total number of function words (prepositions and conjunctions) and the total number of sentences in the text. Values within 1 show a fairly harmonious ratio between function words and syntactic constructions in the text.</p><p>The coefficient of logical coherence = N service words / N sentences,</p><p>where N -number of appropriate words.</p><p> The coefficient of embolism means pragmatic tagging or clogging of speech and represents the ratio between the total number of emboli (words that do not have semantic meaning) and the total number of words in the text. Such words include interjections, vulgarisms, repetitions, etc. The coefficient of embolism negatively correlates with the indicators of verbal intelligence and the degree of emotional excitement of the speaker / author of the text. The coefficient of embolism is calculated according to the formula:</p><p>Embolism ratio = Nembol / All words × 100%,</p><p>where N -number of appropriate words.</p><p>The quantitative indices, which have been calculated on the basis of the general characteristics of the source and target texts, have been compared (Table <ref type="table" target="#tab_7">8</ref>). Coefficient of embolism 0,0037 0,0415 0,1</p><p>As presented in table <ref type="table" target="#tab_7">8</ref>, the main indicators that characterize the individual style in the source and target texts, do not differ significantly (Figure <ref type="figure">4</ref>), except of average word frequency, which in source text is almost twice higher, and the coefficient of embolism is ten times higher in a target text, then it is in source text.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 4. Indicators characterizing the individual style in source and target texts</head><p>To determine the significance / insignificance of the statistical difference between the values of the indices, t-criterion has been calculated, using the appropriate functions in Excel. For the given data on our samples, the t-criterion equals 0,69.</p><p>To decide whether the t-criterion indicates a significant difference, it is necessary to evaluate it according to the table of critical values of t. This evaluation is carried out by determining the number of degrees of freedom, which in our case f = 15-2 = 13 (the number of indicators subtract the number of samples under comparison). The difference is considered significant if the calculated value of t is greater than the tabular value for a given level of significance. In our case, 0,69 is less than the smallest number in rows. This means that the difference in the statistical indicators of the source and target texts is insignificant and statistically acceptable. </p><p>1,8</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion</head><p>All in all the paper presents the quantitative comparative study of the collection "Children of the Frost" by J. London and its Ukrainian translation by V. Hladka and K. Koriakina, which have not been analyzed from the statistical viewpoint before. Concluding the research it can be noted that:</p><p> the number of word usages in the source text exceeds the number of word usages in the target text both in the whole corpus and in separate stories. In general, the volume of the source text is bigger than the volume of the target text by 20.69%;  indices of vocabulary richness, exclusivity for the text and the vocabulary, the concentration of the vocabulary do not differ significantly;  the mainly used parts of speech in English and Ukrainian texts are nouns (22.22% and 25.21%), verbs (19.27% and 18.89%), adjectives (7.7% and 7.64%) and adverbs (7.2% and 7.64%);  the translation preserves the ratio of different parts of speech. The number of pronouns, adverbs and functional words in the vocabulary of the target text has slightly decreased;  the index of epithetization which indicates the number of nouns per adjective in the text, does not differ significantly in source and target texts -2.86 / 3.51;  the index of verb phrases shows the number of adverbs per verb in the text. The index is higher in the source text 0.47 adverbs per 1 verb, while in target text 0.51 per 1;  degree of nominality shows the number of nouns per verb. In the source text, there are 1.32 nouns per verb, in the target text -1.22 per 1. Therefore, the degree of aggression, which is calculated as the ratio of the number of verbs and verb forms (particles) to the total number of the words, is identical in source and target text and equals 0.19. This confirms the fact that the nominal character of the source text has been accurately reproduced in the target text. Various linguistic disciplines will benefit from the research findings. These findings can be applicable in the analysis conducted within the scope of corpus linguistics, translation studies, literary studies, discourse analysis, lexicography etc.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 2 .</head><label>2</label><figDesc>Figure 2. The ratio of each Source text and Target text characteristics</figDesc><graphic coords="7,89.76,71.76,438.24,334.56" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>-sts of the North The Law of Life Nam-bok the Unve-racious The Master of Mystery The Sunlan-ders The Sick-ness of Lon Chief Keesh, Son of Keesh The Death of Ligoun Li Wan, the Fair The League of the Old Men Eт = V1 / N</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Quantitative parameters of source and target texts</figDesc><table><row><cell cols="3">Coefficient Source text Target text</cell><cell>Ratio</cell></row><row><cell>N</cell><cell>45678</cell><cell>32192</cell><cell>1,42</cell></row><row><cell>V</cell><cell>11790</cell><cell>16263</cell><cell>0,72</cell></row><row><cell>S</cell><cell>3185</cell><cell>3527</cell><cell>0,90</cell></row><row><cell>C</cell><cell>210423</cell><cell>199852</cell><cell>1,05</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Quantitative parameters of original stories</figDesc><table><row><cell cols="2">Coefficient In the</cell><cell>The</cell><cell>Nam-</cell><cell>The</cell><cell>The</cell><cell>The</cell><cell>Keesh,</cell><cell>The</cell><cell>Li</cell><cell>The</cell></row><row><cell></cell><cell>Fore-</cell><cell>Law</cell><cell>bok</cell><cell>Master</cell><cell>Sun-</cell><cell>Sick-</cell><cell>Son of</cell><cell>Death</cell><cell>Wan,</cell><cell>League</cell></row><row><cell></cell><cell>sts of</cell><cell>of</cell><cell>the</cell><cell>of</cell><cell>lan-</cell><cell>ness of</cell><cell>Keesh</cell><cell>of</cell><cell>the</cell><cell>of the</cell></row><row><cell></cell><cell>the</cell><cell>Life</cell><cell>Unve-</cell><cell>Mystery</cell><cell>ders</cell><cell>Lon</cell><cell></cell><cell>Ligoun</cell><cell>Fair</cell><cell>Old</cell></row><row><cell></cell><cell>North</cell><cell></cell><cell>racious</cell><cell></cell><cell></cell><cell>Chief</cell><cell></cell><cell></cell><cell></cell><cell>Men</cell></row><row><cell>N</cell><cell cols="3">5970 2836 4500</cell><cell>4085</cell><cell cols="2">6368 3632</cell><cell>3135</cell><cell>3610</cell><cell>5249</cell><cell>6293</cell></row><row><cell>V</cell><cell cols="2">1485 916</cell><cell>1059</cell><cell>1275</cell><cell>1369</cell><cell>906</cell><cell>898</cell><cell>903</cell><cell>1472</cell><cell>1507</cell></row><row><cell>S</cell><cell>477</cell><cell>193</cell><cell>379</cell><cell>295</cell><cell>463</cell><cell>180</cell><cell>254</cell><cell>186</cell><cell>413</cell><cell>345</cell></row><row><cell>C</cell><cell cols="2">28372 11673</cell><cell>22882</cell><cell>17933</cell><cell>32653</cell><cell>14505</cell><cell>15675</cell><cell>14421</cell><cell>26397</cell><cell>25912</cell></row><row><cell>C1</cell><cell cols="3">4200 1992 3049</cell><cell>2968</cell><cell cols="2">4202 2533</cell><cell>2121</cell><cell>2423</cell><cell>3548</cell><cell>4382</cell></row><row><cell>F1</cell><cell cols="2">1770 844</cell><cell>1451</cell><cell>1117</cell><cell cols="2">2166 1099</cell><cell>1014</cell><cell>1187</cell><cell>1701</cell><cell>1911</cell></row><row><cell>V1</cell><cell>890</cell><cell>444</cell><cell>654</cell><cell>678</cell><cell>806</cell><cell>372</cell><cell>557</cell><cell>372</cell><cell>981</cell><cell>699</cell></row><row><cell>N10</cell><cell>95</cell><cell>41</cell><cell>78</cell><cell>70</cell><cell>102</cell><cell>66</cell><cell>51</cell><cell>54</cell><cell>74</cell><cell>94</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Quantitative parameters of translated stories</figDesc><table><row><cell cols="2">Coefficient In the</cell><cell>The</cell><cell>Nam-</cell><cell>The</cell><cell>The</cell><cell>The Sick-</cell><cell>Keesh,</cell><cell>The</cell><cell>Li</cell><cell>The</cell></row><row><cell></cell><cell>Fore-</cell><cell>Law</cell><cell>bok the</cell><cell>Master</cell><cell>Sun-</cell><cell>ness of</cell><cell>Son of</cell><cell>Death</cell><cell>Wan,</cell><cell>League</cell></row><row><cell></cell><cell>sts of</cell><cell>of</cell><cell>Unve-</cell><cell>of</cell><cell>lan-</cell><cell>Lon Chief</cell><cell>Keesh</cell><cell>of</cell><cell>the</cell><cell>of the</cell></row><row><cell></cell><cell>the</cell><cell>Life</cell><cell>racious</cell><cell>Mystery</cell><cell>ders</cell><cell></cell><cell></cell><cell>Ligoun</cell><cell>Fair</cell><cell>Old</cell></row><row><cell></cell><cell>North</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>Men</cell></row><row><cell>N</cell><cell cols="3">5512 2155 3271</cell><cell>3487</cell><cell>4627</cell><cell>2950</cell><cell>2221</cell><cell cols="3">2713 5264 5256</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4</head><label>4</label><figDesc>Ratio of characteristics of the source text and target text</figDesc><table><row><cell cols="2">Coefficient In the</cell><cell>The</cell><cell>Nambok the</cell><cell>The</cell><cell>The</cell><cell>The</cell><cell>Keesh,</cell><cell>The</cell><cell>Li</cell><cell>The</cell></row><row><cell></cell><cell>Forests</cell><cell>Law</cell><cell>Unveracious</cell><cell>Master</cell><cell>Sunlan-</cell><cell>Sick-</cell><cell>Son of</cell><cell>Death</cell><cell>Wan,</cell><cell>League</cell></row><row><cell></cell><cell>of the</cell><cell>of</cell><cell></cell><cell>of</cell><cell>ders</cell><cell>ness</cell><cell>Keesh</cell><cell>of</cell><cell>the</cell><cell>of the</cell></row><row><cell></cell><cell>North</cell><cell>Life</cell><cell></cell><cell>Mystery</cell><cell></cell><cell>of</cell><cell></cell><cell>Ligoun</cell><cell>Fair</cell><cell>Old</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>Lon</cell><cell></cell><cell></cell><cell></cell><cell>Men</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>Chief</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>N</cell><cell>1,1</cell><cell>1,3</cell><cell>1,4</cell><cell>1,2</cell><cell>1,4</cell><cell>1,2</cell><cell>1,4</cell><cell>1,3</cell><cell>1,0</cell><cell>1,2</cell></row><row><cell>V</cell><cell>0,6</cell><cell>0,8</cell><cell>0,9</cell><cell>0,8</cell><cell>0,8</cell><cell>0,7</cell><cell>0,9</cell><cell>0,7</cell><cell>0,6</cell><cell>0,7</cell></row><row><cell>S</cell><cell>0,9</cell><cell>0,9</cell><cell>0,9</cell><cell>0,9</cell><cell>0,9</cell><cell>1,0</cell><cell>0,9</cell><cell>0,9</cell><cell>0,9</cell><cell>0</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 5</head><label>5</label><figDesc>Part of speech frequency in the source text</figDesc><table><row><cell>Part of</cell><cell>In the</cell><cell>The</cell><cell>Nam-</cell><cell>The</cell><cell>The</cell><cell>The</cell><cell>Keesh,</cell><cell>The</cell><cell>Li Wan,</cell><cell>The</cell></row><row><cell>speech</cell><cell>Fore-</cell><cell>Law</cell><cell>bok</cell><cell>Master</cell><cell>Sunlan-</cell><cell>Sick-</cell><cell>Son of</cell><cell>Death</cell><cell>the Fair</cell><cell>League</cell></row><row><cell></cell><cell>sts of</cell><cell>of</cell><cell>the</cell><cell>of</cell><cell>ders</cell><cell>ness</cell><cell>Keesh</cell><cell>of</cell><cell></cell><cell>of the</cell></row><row><cell></cell><cell>the</cell><cell>Life</cell><cell>Unve-</cell><cell>Mystery</cell><cell></cell><cell>of</cell><cell></cell><cell>Ligoun</cell><cell></cell><cell>Old</cell></row><row><cell></cell><cell>North</cell><cell></cell><cell>racious</cell><cell></cell><cell></cell><cell>Lon</cell><cell></cell><cell></cell><cell></cell><cell>Men</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>Chief</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Noun</cell><cell cols="3">1533 570 1070</cell><cell>897</cell><cell>1494</cell><cell>760</cell><cell>767</cell><cell>754</cell><cell>1243</cell><cell>1369</cell></row><row><cell>Adjective</cell><cell cols="2">556 243</cell><cell>293</cell><cell>325</cell><cell>404</cell><cell>336</cell><cell>258</cell><cell>274</cell><cell>423</cell><cell>557</cell></row><row><cell>Pronoun</cell><cell cols="2">602 331</cell><cell>524</cell><cell>508</cell><cell>597</cell><cell>476</cell><cell>334</cell><cell>489</cell><cell>574</cell><cell>793</cell></row><row><cell>Adverb</cell><cell cols="2">565 221</cell><cell>408</cell><cell>352</cell><cell>646</cell><cell>257</cell><cell>255</cell><cell>181</cell><cell>424</cell><cell>457</cell></row><row><cell>Verb</cell><cell cols="2">887 592</cell><cell>731</cell><cell>844</cell><cell>985</cell><cell>676</cell><cell>478</cell><cell>664</cell><cell>863</cell><cell>1148</cell></row><row><cell>Numeral</cell><cell>57</cell><cell>35</cell><cell>23</cell><cell>42</cell><cell>76</cell><cell>28</cell><cell>29</cell><cell>61</cell><cell>21</cell><cell>58</cell></row><row><cell>Preposition</cell><cell cols="2">819 317</cell><cell>618</cell><cell>471</cell><cell>877</cell><cell>420</cell><cell>454</cell><cell>474</cell><cell>748</cell><cell>734</cell></row><row><cell>Conjunction</cell><cell cols="2">429 180</cell><cell>325</cell><cell>282</cell><cell>511</cell><cell>326</cell><cell>202</cell><cell>378</cell><cell>414</cell><cell>575</cell></row><row><cell>Particle</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell></row><row><cell>Interjection</cell><cell>19</cell><cell>11</cell><cell>16</cell><cell>17</cell><cell>9</cell><cell>32</cell><cell>24</cell><cell>16</cell><cell>10</cell><cell>14</cell></row><row><cell>Article</cell><cell cols="2">503 336</cell><cell>492</cell><cell>347</cell><cell>769</cell><cell>321</cell><cell>334</cell><cell>319</cell><cell>529</cell><cell>588</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table 6</head><label>6</label><figDesc>Part of speech frequency in the target text Part of speech</figDesc><table><row><cell></cell><cell>In the</cell><cell>The</cell><cell>Nam-</cell><cell>The</cell><cell>The</cell><cell>The</cell><cell>Keesh,</cell><cell>The</cell><cell>Li Wan,</cell><cell>The</cell></row><row><cell></cell><cell>Fore-</cell><cell>Law</cell><cell>bok</cell><cell>Master</cell><cell>Sunlan-</cell><cell>Sick-</cell><cell>Son of</cell><cell>Death</cell><cell>the Fair</cell><cell>League</cell></row><row><cell></cell><cell>sts of</cell><cell>of</cell><cell>the</cell><cell>of</cell><cell>ders</cell><cell>ness</cell><cell>Keesh</cell><cell>of</cell><cell></cell><cell>of the</cell></row><row><cell></cell><cell>the</cell><cell>Life</cell><cell>Unve-</cell><cell>Mystery</cell><cell></cell><cell>of</cell><cell></cell><cell>Ligoun</cell><cell></cell><cell>Old</cell></row><row><cell></cell><cell>North</cell><cell></cell><cell>racious</cell><cell></cell><cell></cell><cell>Lon</cell><cell></cell><cell></cell><cell></cell><cell>Men</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>Chief</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Noun</cell><cell cols="2">1341 529</cell><cell>697</cell><cell>808</cell><cell>1087</cell><cell>673</cell><cell>551</cell><cell>717</cell><cell>122</cell><cell>1251</cell></row><row><cell>Adjective</cell><cell cols="2">411 192</cell><cell>155</cell><cell>229</cell><cell>248</cell><cell>232</cell><cell>154</cell><cell>175</cell><cell>373</cell><cell>440</cell></row><row><cell>Pronoun</cell><cell cols="2">977 316</cell><cell>389</cell><cell>605</cell><cell>431</cell><cell>532</cell><cell>268</cell><cell>426</cell><cell>967</cell><cell>875</cell></row><row><cell>Adverb</cell><cell cols="2">542 225</cell><cell>370</cell><cell>353</cell><cell>514</cell><cell>283</cell><cell>214</cell><cell>205</cell><cell>521</cell><cell>505</cell></row><row><cell>Verb</cell><cell cols="2">1051 408</cell><cell>674</cell><cell>707</cell><cell>921</cell><cell>558</cell><cell>433</cell><cell>533</cell><cell>1044</cell><cell>980</cell></row><row><cell>Numeral</cell><cell>60</cell><cell>36</cell><cell>26</cell><cell>34</cell><cell>86</cell><cell>32</cell><cell>22</cell><cell>49</cell><cell>33</cell><cell>67</cell></row><row><cell>Preposition</cell><cell cols="2">476 176</cell><cell>339</cell><cell>323</cell><cell>457</cell><cell>267</cell><cell>227</cell><cell>292</cell><cell>468</cell><cell>510</cell></row><row><cell>Conjunction</cell><cell cols="2">429 177</cell><cell>424</cell><cell>259</cell><cell>693</cell><cell>261</cell><cell>251</cell><cell>237</cell><cell>436</cell><cell>473</cell></row><row><cell>Particle</cell><cell>210</cell><cell>94</cell><cell>189</cell><cell>153</cell><cell>177</cell><cell>104</cell><cell>95</cell><cell>78</cell><cell>193</cell><cell>149</cell></row><row><cell>Interjection</cell><cell>15</cell><cell>3</cell><cell>8</cell><cell>16</cell><cell>13</cell><cell>8</cell><cell>6</cell><cell>1</cell><cell>6</cell><cell>6</cell></row><row><cell>Article</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell><cell>-</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 7</head><label>7</label><figDesc>Ratio of part of speech frequency of source and target texts</figDesc><table><row><cell>Part of speech</cell><cell>In the Fore-sts of the North</cell><cell>The Law of Life</cell><cell>Nam-bok the Unve-racious</cell><cell>The Master of Mystery</cell><cell>The Sunlan-ders</cell><cell>The Sick-ness of Lon Chief</cell><cell>Keesh, Son of Keesh</cell><cell>The Death of Ligoun</cell><cell>Li Wan, the Fair</cell><cell>The League of the Old Men</cell></row><row><cell>Noun</cell><cell>1,1</cell><cell>1,1</cell><cell>1,5</cell><cell>1,1</cell><cell>1,4</cell><cell>1,1</cell><cell>1,4</cell><cell>1,1</cell><cell>10,2</cell><cell>1,1</cell></row><row><cell>Adjective</cell><cell>1,4</cell><cell>1,3</cell><cell>1,9</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_7"><head>Table 8</head><label>8</label><figDesc>Quantitative indices in the source and target texts</figDesc><table><row><cell>Coefficient</cell><cell>The average value in</cell><cell>The average value in</cell><cell>Ratio of average</cell></row><row><cell></cell><cell>the corpus of the</cell><cell>the corpus of the</cell><cell>values of source</cell></row><row><cell></cell><cell>source text</cell><cell>target text</cell><cell>text and target text</cell></row><row><cell>Average word length</cell><cell>4,56</cell><cell>5,381</cell><cell>0,8</cell></row><row><cell>Average word frequency</cell><cell>3,846</cell><cell>2,294</cell><cell>1,7</cell></row><row><cell>Vocabulary exclusivity</cell><cell>0,538</cell><cell>0,503</cell><cell>1,1</cell></row><row><cell>index</cell><cell></cell><cell></cell><cell></cell></row><row><cell>Diversity index</cell><cell>0,264</cell><cell>0,443</cell><cell>0,6</cell></row><row><cell>Exclusivity index for text</cell><cell>0,144</cell><cell>0,217</cell><cell>0,7</cell></row><row><cell>Vocabulary</cell><cell>0,058</cell><cell>0,0731</cell><cell>0,8</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Baranov</surname></persName>
		</author>
		<title level="m">Introduction to Applied Linguistics</title>
				<meeting><address><addrLine>Moscow</addrLine></address></meeting>
		<imprint>
			<publisher>Nauka</publisher>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Understanding Corpus Linguistics</title>
		<author>
			<persName><forename type="first">D</forename><surname>Barth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Stefan</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2022">2022</date>
			<publisher>Routledge</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><surname>Biber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Conrad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Reppen</surname></persName>
		</author>
		<title level="m">Corpus Linguistics. Investigating Language Structure and Use</title>
				<meeting><address><addrLine>Cambridge</addrLine></address></meeting>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Representatives in Corpus Design</title>
		<author>
			<persName><forename type="first">D</forename><surname>Biber</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Literary and Linguistic Computing</title>
				<imprint>
			<date type="published" when="1993">1993</date>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="243" to="257" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Buk Texts&apos; Quantitative Comparison (based on the 1884 and 1907 editions of Ivan Franko&apos;s novel &quot;Boa Constrictor</title>
		<author>
			<persName><forename type="first">S</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Ukrainian Literary Studies</title>
		<imprint>
			<biblScope unit="volume">76</biblScope>
			<biblScope unit="page" from="179" to="192" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Statistical Characteristics of the Lexis of Main Functional Styles of the Ukrainian Language</title>
		<author>
			<persName><surname>Buk</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Lexicographic Bulletin</title>
				<imprint>
			<biblScope unit="page" from="166" to="170" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Hypotheses about translation universals</title>
		<author>
			<persName><surname>Chesterman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Сlaims, Changes and Challenges in Translation Studies</title>
				<editor>
			<persName><forename type="first">G</forename><surname>Hansen</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">K</forename><surname>Mlmkjaer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Gile</surname></persName>
		</editor>
		<meeting><address><addrLine>Amsterdam</addrLine></address></meeting>
		<imprint>
			<publisher>John Benjamins Publishing Company</publisher>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="1" to="13" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Translation Universals: Do they exist? A corpus-based and NLP approach to convergence</title>
		<author>
			<persName><forename type="first">G</forename><surname>Corpas Pastor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Mitkov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Afzal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Garcia</surname></persName>
		</author>
		<author>
			<persName><surname>Moya</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the LREC (2008) Workshop on &quot;Comparable Corpora</title>
				<meeting>the LREC (2008) Workshop on &quot;Comparable Corpora<address><addrLine>Marrakesh, Morocco</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
	<note>LREC-08</note>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m">Corpus Linguistics</title>
				<editor>
			<persName><forename type="first">V</forename><surname>Shyrokov</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">O</forename><surname>Bugakov</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">T</forename><surname>Hriaznukhina</surname></persName>
		</editor>
		<meeting><address><addrLine>Kyiv</addrLine></address></meeting>
		<imprint>
			<publisher>Dovira</publisher>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">Corpus-based Language Studies: An Advanced Resource Book</title>
		<editor>T. McEnery, R. Xiao, Y. Tono</editor>
		<imprint>
			<date type="published" when="2006">2006</date>
			<publisher>Routledge</publisher>
			<pubPlace>London</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">S</forename><surname>Dash</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Arulmozi</surname></persName>
		</author>
		<title level="m">History, Features, and Typology of Language Corpora</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Desagulier</surname></persName>
		</author>
		<title level="m">Corpus Linguistics and Statistics with R</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">O</forename><surname>Demska-Kulchytska</surname></persName>
		</author>
		<title level="m">The Bases of the National Corpus of the Ukrainian Language</title>
				<meeting><address><addrLine>Kyiv</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">Introducing Corpus Annotation</title>
		<author>
			<persName><forename type="first">R</forename><surname>Garside</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Leech</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Mcenery</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1997">1997</date>
			<publisher>Longman</publisher>
			<pubPlace>London</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">P</forename><surname>Gomide</surname></persName>
		</author>
		<title level="m">Corpus Linguistics Software: Understanding Their Usages and Delivering Two New Tools</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
		<respStmt>
			<orgName>Lancaster University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">The corpus as a research domain</title>
		<author>
			<persName><forename type="first">G</forename><surname>Kennedy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Comparing English worldwide: The International Corpus of English</title>
				<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Clarendon Press</publisher>
			<date type="published" when="1996">1996</date>
			<biblScope unit="page" from="217" to="226" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Software-Based Approach Towards Automated Authorship Acknowledgement -Chi-Square Test on One Consonant Group</title>
		<author>
			<persName><forename type="first">I</forename><surname>Khomytska</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Teslyuk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Kryvinska</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Bazylevych</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Electronics</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">1138</biblScope>
			<date type="published" when="2020-07">July 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Technical Aspects of Natural Language Information Processing</title>
		<author>
			<persName><forename type="first">I</forename><surname>Kulchytskyi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Lviv Polytechnic National University, Informational Systems and Networks</title>
		<imprint>
			<biblScope unit="volume">783</biblScope>
			<biblScope unit="page" from="344" to="353" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">A Critical Look at Software Tools in Corpus Linguistics</title>
		<author>
			<persName><forename type="first">A</forename><surname>Laurence</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Linguistic Research</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="141" to="161" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Introducing Corpus Annotation</title>
		<author>
			<persName><forename type="first">G</forename><surname>Leech</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Corpus Annotation</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Garside</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><surname>Leech</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Mcenery</surname></persName>
		</editor>
		<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Longman</publisher>
			<date type="published" when="1997">1997</date>
			<biblScope unit="page" from="1" to="18" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Automated Identification of Metaphors in Annotated Corpus (Based on Substance Terms)</title>
		<author>
			<persName><forename type="first">O</forename><surname>Levchenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Tyshchenko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dilai</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th International conference on computational linguistics and intelligent systems (COLINS 2021)</title>
				<meeting>the 5th International conference on computational linguistics and intelligent systems (COLINS 2021)<address><addrLine>Kharkiv, Ukraine</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021-04-22">2021. April 22-23, 2021</date>
			<biblScope unit="volume">2870</biblScope>
			<biblScope unit="page" from="16" to="31" />
		</imprint>
	</monogr>
	<note>main conference</note>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>London</surname></persName>
		</author>
		<ptr target="https://library.um.edu.mo/ebooks/b28284872.pdf" />
		<title level="m">Children of the Frost</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>London</surname></persName>
		</author>
		<title level="m">Tvory v 12-kh tomakh, Tom 1</title>
				<editor>
			<persName><forename type="first">К</forename><surname>Гладка</surname></persName>
		</editor>
		<editor>
			<persName><surname>Корякіна</surname></persName>
		</editor>
		<editor>
			<persName><surname>Дніпро</surname></persName>
		</editor>
		<meeting><address><addrLine>Kyiv; Київ</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1968">1968. 1968</date>
		</imprint>
	</monogr>
	<note>Твори в 12-х томах, Том 1. з англ. В</note>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Lüdeling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kytö</surname></persName>
		</author>
		<title level="m">Corpus Linguistics, An International Handbook</title>
				<meeting><address><addrLine>Berlin, New York</addrLine></address></meeting>
		<imprint>
			<publisher>Walter de Gruyter</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">A Practical Handbook of Corpus Linguistics</title>
		<author>
			<persName><forename type="first">M</forename><surname>Paquot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Th</surname></persName>
		</author>
		<author>
			<persName><surname>Gries</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2020">2020</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Application of Corpus Technologies in Conceptual Studies (based on the Concept Ukraine Actualization in English and Ukrainian Political Media Discourse</title>
		<author>
			<persName><forename type="first">N</forename><surname>Romanyshyn</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">4 th International Conference on Computational Linguistics and Intelligent Systems</title>
				<meeting><address><addrLine>Colins</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="472" to="488" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<title level="m" type="main">Corpora</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sebba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">D</forename><surname>Fligelstone</surname></persName>
		</author>
		<editor>Ronald E. Asher &amp; James M.Y. Simpson</editor>
		<imprint>
			<date type="published" when="1994">1994</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="769" to="773" />
			<pubPlace>Pergamon, Oxford</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Sinclair</surname></persName>
		</author>
		<title level="m">Corpus, Concordance, Collocation</title>
				<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Oxford University Press</publisher>
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<monogr>
		<title level="m" type="main">Corpus Linguistics: A Guide to the Methodology</title>
		<author>
			<persName><forename type="first">A</forename><surname>Stefanowitsch</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2020">2020</date>
			<publisher>Language Science Press</publisher>
			<pubPlace>Berlin</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Corpus Linguistics and Lexicography</title>
		<author>
			<persName><forename type="first">W</forename><surname>Teubert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Corpus Linguistics</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="125" to="153" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
	<note>Special issue</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
