<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Linguistic and Stylistic Parameters for the Study of Literary Language in the Corpus of Russian Short Stories of the First Third of the 20th Century</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>1 National Research University Higher School of Economics,</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>2Saint Petersburg State University</institution>
          ,
          <addr-line>Saint Petersburg, Russian Federation</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>One of the important tasks of creating the Corpus of Russian Short Stories of the first third of the 20th century is to identify and describe the changes that took place in the Russian language and in stylistics of Russian literature in the chain of dramatic events of the World War I, the February and October Revolutions, and the Civil War. The essential principle for creating the corpus is an attempt to include in the database literary texts of the maximum number of authors who wrote stories in 1900-1930. The article describes the principles of writers and text selection for the annotated subcorpus containing stories of 300 Russian prose writers and considers the list of linguistic and stylistic parameters proposed for studying the language of literary texts in synchrony and diachrony.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>period, and by diachronic systems the sequence of successive synchronic ones [Tynyanov,
1929]. If to consider texts as a statistical population [ Cˇebanov, Martynenko, 1998], each
literary system can be characterized through a set of statistical variables, which are used for two
purposes: to solve taxonomic tasks (diagnostics, clustering, typology, etc.) and to build
theoretical models describing a distribution of texts and their units through a system of stylistic
variables [Martynenko, Sherstinova, 2000].</p>
      <p>Tynyanov’s approach implies the necessity to study literary texts of the maximum
number of writers who wrote in a given historical period. Compiling the Corpus of Russian short
stories, we try to adhere to this principle to the extent possible [Martynenko, Sherstinova,
2018]. This method makes the research more objective, as apart from well-known and
outstanding writers, the large number of second-rate authors are also involved into consideration.
Thereby we achieve better literary representation of different aspects of social and cultural
life, as well as of language and stylistics diversity [Martynenko, 2019a].</p>
      <p>Besides implementation of Tynyanov’s idea, our other important goal is to study the
changes that occur in the language at the crucial historical moments. Thus, this research is
made in the framework of the project aimed at studying the complex of linguistic and stylistic
variables in dynamics during the first three decades of the 20th century in order to identify
and describe the changes that occurred in Russian in the chain of dramatic events of the
World War I, the February and October Revolutions, and the Civil War. To fulfill this task
we consider Russian literature in the period of 1990–1930.</p>
      <p>Obviously, any revolutionary period cause changes in the usual way of life and existing
social relations; it leads to transformations both of behavior stereotypes and of common
system of values. Any significant social turbulence cannot but affect language changes. The
revolutionary events that took place in Russia over 100 years ago dramatically influenced the
Russian language. Thus, a huge amount of out-of-date vocabulary was replaced by new words
reflecting new concepts and ideas, many words from past acquired either new meanings or new
connotations. Significant changes took place in stylistics too, revealing themselves by
transformation of generally accepted speech structures the functional frequencies of many lexical
units were changed, the set of frequency collocations were revised, new syntactic patterns
appeared, etc.</p>
      <p>Some of the linguistic shifts that occurred in the revolutionary period are noticeable
by the unaided eye, other changes are more latent in nature. However, in order to become
aware of the magnitude of language transformations of different kinds, it is necessary to apply
strict quantitative methods, to process the representative volume of language data at several
linguistic levels, and to compare the adjacent chronological periods in dynamics. Only in
the result of such an analysis will it be possible to say with certainty to what extent one of
the most dramatic periods of Russian history influenced the transformation of Russian, what
language levels were affected in the first place, and what is the real share of the changes that
took place [Martynenko et al., 2018a].</p>
      <p>In this paper we briefly discuss our approach for text selection for the annotated subcorpus
and consider the list of appropriate linguistic and stylistic parameters which will be used for
text analysis.</p>
    </sec>
    <sec id="sec-2">
      <title>The Corpus of Russian Short Stories: the Principles of</title>
    </sec>
    <sec id="sec-3">
      <title>Writers and Texts Selection</title>
      <p>In recent years, more and more studies of literary texts using computer or corpus methods have
appeared and increasing numbers of literary corpora have become available [Fischer-Starcke,
2010; Balossi, 2014; Mahlberg et al., 2016; etc.]. Literary texts are necessarily included in
all major national corpora, as fiction prose is traditionally used for linguistic studies [Sinclair,
2004]. However, it should be mentioned that in the most of computational studies of literature,
long literary forms (novels) prevail; dramatic pieces are frequently considered as well [Zyngier,
2008; Archer et al., 2009; Fischer et al., 2018; Skorinkin, Fischer, 2018], while research on
small literary forms is less common.</p>
      <p>The Corpus of Russian Short Stories as the name suggests is designed to be
homogeneous in genre. We decided to choose this short literary form, because it is the story is the
most common genre of fiction. Being very popular among prose writers, it covers almost all
literary schools and involves almost all writers. Besides, short stories go through the
publishing cycle much faster than the larger prosaic works. Therefore, we can say that the story, as a
genre, performs a prospecting function and even is working in a proactive manner, sensitively
capturing the changes in public consciousness and responding to them [Martynenko et al.,
2018b].</p>
      <p>For creating the corpus, the first important task was to obtain the most complete list
of authors who wrote stories in 1900–1930. It was decided to include in the list the writers
whose literary heritage is represented by at least one story. At the same time, the goal was to
involve in the corpus not only capital writers living in Moscow and St. Petersburg (Petrograd,
Leningrad), but also regional writers who wrote in Russian and lived anywhere on the territory
of the Russian Empire (until 1917), and later on the territory of the Russian SFSR and the
USSR. However, taking into account the project’s aim of studying language changes, it was
decided that writers who emigrated after Revolution should not be included in the systems
analysis after their emigration. Also, at this stage it is not planned to involve in research
literary works by children’s writers [ibid.].</p>
      <p>Firstly, the list of writers was being formed on the basis of literary encyclopedias and
bibliographic dictionaries [KLE; Muratova, 1963; Russian Writers; etc.]. Then it was decided
to catalogue the works presented in electronic catalogues of Russian National Library, which
are tagged as a story. As a result, a database of all stories published as separate brochures
in 1900–1930 was formed. In addition, the famous literary magazines and anthologies of this
historical period (Apollon, Shipovnik, Niva, Ogonyok, Novy Mir, and many others) as well as
some other Internet resources were selectively catalogued [Sovlit; 3500 Texts; etc.]. As a result,
a preliminary list of writers’ names for the given time period was obtained, which currently
comprises an impressive number of 2800 personalities. However, this list is still not complete,
and we proceed the cataloging process. In parallel with compilation of authors’ list, we search
for electronic versions of texts in open sources (if available) and we have to digitize texts for
less-known and forgotten authors. Currently, the digital collection of short stories comprises
more than 4500 texts.</p>
      <p>Since the number of Russian writers in the first third of the 20th century turned out to
be much bigger than it was expected, in the first place we decided to annotate the subcorpus
for 300 writers (one story for each writer). The selection of 300 stories was made to present
evenly three historical periods:</p>
    </sec>
    <sec id="sec-4">
      <title>The Approaches for Creating the List of Parameters</title>
      <p>In linguistics, there are two main approaches to the selection of essential variable used for
solving various tasks associated with processing of large text data. The first one is a technocratic
approach, when an extended list of variables (or attributes) is input into the system. Then,
one of statistical algorithms for reducing the number of variables is used with the aim to reveal
the most essential features of the sample. The second approach may be called a thoughtful
one and it is based on the knowledge of the basic mechanisms of language and text formation.
The second method allows to identify structurally significant features on the base of expert
estimates. However, it does require a high level of philological competence. Data processing
of the Corpus of Russian Short Stories assume the implementation of both approaches with
an assessment of the effectiveness of each of them.</p>
      <p>Taking into account the important goal to track the changes of the language during 30
years under study, it seems obvious to start with the parameters, which may refer to the
feature list of the language of revolutionary prose, as it is the revolution, that caused the
most of language dramatic changes. Therefore, first, on the basis of numerous publications
devoted to a qualitative analysis of the features of the “language of revolutionary era” and the
influence of revolutionary events on the language [Jacobson, 1921; Selishchev, 1928; Polivanov,
1931; etc.], a list of linguistic and stylistic parameters was formed, according to which the most
significant transformations of the Russian language in the revolutionary and post-revolutionary
period were noted.</p>
      <p>
        Then, based on the current achievements of computational and mathematical
linguistics and on that of the corpus developers
        <xref ref-type="bibr" rid="ref14 ref19 ref20 ref21 ref34">(e. g., [Martynenko, 1988; 2019a; Martynenko,
Fomin, 1989; Martynenko, Sherstinova, 2000; Martynenko, Martinovich, 2003; Grebennikov,
1998; 2007; Kuznetsov, Skrebtsova et al., 2019])</xref>
        , a wider list of linguistic and stylistic
parameters aimed at describing the fiction language in synchrony and diachrony was created,
and a list of working statistics in nominal, quantitative and ordinal scales was determined.
The list presented in this article would be considered as preliminary. Some of the variable
are already being calculated, the others are still in a planning state. At the initial stage of
project implementation, it is supposed to use first those algorithms that have been tested in
previous studies. These are primarily the methods of statistical lexicography (associated with
building frequency dictionaries and studying their properties) and syntactic studies, which are
considered to be the most effective in the study of fiction [Martynenko, 2019a].
4
      </p>
    </sec>
    <sec id="sec-5">
      <title>Traditional Conceptual Approach: Some Features of</title>
    </sec>
    <sec id="sec-6">
      <title>Revolutionary and Early Soviet Prose Language</title>
      <p>Numerous publications concerning peculiarities of the language in the revolutionary historical
period [Barannikov, 1921; Jacobson, 1921; Rempel, 1921; Kartsevsky, 1923; Vinokur, 1923;
Selishchev, 1928; Polivanov, 1931; Granovskaya, 2005; etc.] note primarily lexical changes.
The other linguistic levels phonetic, morphological and syntactic are usually considered to
be more conservative and not amenable to the influence of organized management [Polivanov,
1931], therefore, they commonly remain without any detailed analysis. On the lexical level,
the following lexical features are normally noted: 1) abbreviations, 2) toponymic changes, 3)
neologisms, 4) bureaucratese, formulaicity, and phraseology, 5) vulgar tongue, popular speech
and slang words. Most of these phenomena can be studied by means of frequency list analyses
(see below). Let us make some comments in concern:</p>
      <p>1. Abbreviations. It is planned to obtain a list of the most frequent abbreviations, to
determine the share of abbreviations in texts, as well as the number of them, which are still
understandable and relevant so far.</p>
      <p>2.Toponymic changes. Renaming geographical names is aimed at creating a new
reality. From the point of view of language development, it is interesting to see how quickly the
language responds to changes initiated by the authorities. Knowing the date of the official
toponymic change, it is worth seeing how fast the new name takes root in literary works.</p>
      <p>3. Neologisms and new word meanings. During periods of significant social changes,
the emergence of a huge mass of neologisms is quite understandable, as new words are required
to label new realities. For the same reason, in such time periods one can observe the
development of new word meanings. Both of these phenomena can be analyzed with the use of
frequency lists.</p>
      <p>As for neologisms, it seems relevant to trace the most common word-forming patterns.
Currently, no continuous morphemic annotation of texts is planned. Nevertheless, a selective
analysis of individual word formation constructions and their shares can be calculated basing
on word frequency lists, and the identification of the most frequent models of neologisms
formation in the revolutionary and early Soviet era will be done.</p>
      <p>4. Bureaucratese, Formulaicity, and Phraseology. Set phrases consisting of two or
more words usually fall outside the scope of traditional frequency lists analysis. However, these
typical features of revolutionary and post-revolutionary language can be studied by means of
n-grams calculation [Sherstinova, 2018].</p>
      <p>5. Vulgar Tongue, Popular Speech and Slang Words. Due to the activation of
broad masses of people and the cardinal change of the role, which people from the lowest social
classes started to play in society, vulgarisms, slang words and vernacular expressions began to
penetrate the language. Certainly, these are reflected in literature. Thus, the evident feature
of the literature of this period is its ever-growing democracy, caused by involving huge human
masses to literary work. In stories and novels, multivarious folk speech can be heard albeit
being literary, and albeit being stylized it was very unusual for the reader brought up on
classical literature [Martynenko, 2019b, p. 396]. As in a number of cases described above,
frequency word lists will be used for their analyses.
5</p>
    </sec>
    <sec id="sec-7">
      <title>Lexical Analysis and Frequency Word Lists</title>
      <p>
        In the study of individual writers’ lexical systems as well as of the totality of works belonging
to a particular historical period the theory of distributions, mainly the rank ones, will be
used. Then, these distribution are subject to their subsequent parameterization using
rankfrequency
        <xref ref-type="bibr" rid="ref19">(for example, rank averages [Martynenko, Fomin, 1989])</xref>
        and ordinal statistics (e.
g., quantiles), which have shown good effectiveness in earlier studies [Martynenko 2019a]. In
the previous section, the importance of lexical analysis for solving the given task was already
noted. The main approach for lexical analysis in this research is the method of compiling and
measuring frequency dictionaries [Tuldava, 1986; Alekseev, 2001; Grebennikov, 2007; Popescu,
2009; Shaykevich, 2015; etc.].
      </p>
      <p>Statistical measures of frequency word lists are the following:</p>
      <p>Number of units (tokens )
Number of units (types)
Number of single-used words (hapax, lexical richness )
Lexical density
Lexical diversity coefficient [Voronchak, 1972; Tuldava, 1977]</p>
      <p>Standardized diversity index (TTRSt: Type / Token Ratio) [Baker et al., 2006].</p>
      <p>In addition to this traditional set of variables we plan to introduce an extended list of
statistics. Thus, in [Martynenko, Martinovich, 2003], a set of parameters suitable for studying
communicative-thematic fields displayed in the form of frequency word lists is discussed and
a complex of statistical parameters is formed that would reflect the system properties of
these frequency lists in a concise form, for example: diversity vs. limitation of diversity,
concentration vs. scattering, stability vs. instability, homogeneity vs. heterogeneity, etc. For
detailed description of each parameter see [ibid]:</p>
      <p>Variables in the nominal scale:</p>
      <p>Mode (Mo)
Dictionary Volume (n)
Maximum Frequency (Fmax)
Entropy (E )
Maximum Entropy (Emax)</p>
      <p>Degree of order (O = E / Emax)
Variables in the quantitative scale:</p>
      <p>Arithmetic mean (Fave)
Geometric mean (Fgeom)
Standard Deviation ( )
Mean linear deviation (Df )
Variation coefficient of standard deviation (V)
Variation coefficient of the mean linear deviation (VDf )</p>
      <p>Diversity coefficient (K ).</p>
      <p>Variables in the ordinal (rank) scale [Martynenko, 2017]:</p>
      <p>Rank mean (Rave) [Martynenko, Fomin, 1989]
Standard Deviation ( )
Coefficient of variation (Vr)
Median (Mer)
Golden Ratio (Gr)
Mean Deviation (Dr)
Coefficient of variation for Dr (VDr)</p>
      <p>Concentration Index ( ).</p>
      <p>Besides word frequency lists it is worth compiling that of POS (see Fig. 1) and grammar
word forms, that may be also processed in the similar way.
6</p>
    </sec>
    <sec id="sec-8">
      <title>Other Linguistic and Stylistic Variables and Statistics</title>
      <p>In this section we list the variables and statistics which are planned to be used in the Corpus
of Russian Short Stories.</p>
      <p>General Quantitative Text Variables</p>
      <p>The number of characters.</p>
      <p>The number of words.</p>
      <p>The number of sentences.</p>
      <p>The number of paragraphs.</p>
      <p>The number and the share of punctuation marks. POS Distribution.</p>
      <p>Frequency list of POS and their shares (an example is shown on Fig. 1).</p>
      <p>The number and the share of prepositions.</p>
      <p>The number and the share of prepositions in the initial position of the sentence.
The number and the share of co-ordinate conjunctions.</p>
      <p>The number and the share of co-ordinate conjunctions in the initial position of the
sentence.</p>
      <p>The number and the share of subordinative conjunctions.</p>
      <p>The number and the share of subordinative conjunction in the initial position of the
sentence.</p>
      <p>The number and the share of modal words (mozhno can, nuzhno need, dolzhen
should/must, etc.).</p>
      <p>The number and the share of particles.</p>
      <sec id="sec-8-1">
        <title>Word sizes and positions</title>
        <p>The average size of words in text.</p>
        <p>The average word size in the first, second, third position, etc. in the sentence
The average word size in the last position in one-word, two-word, etc. sentences.</p>
        <p>The dynamic rank average of word size in a sentence (here, the independent variable
is the number of words in a sentence, and the dependent variable is the average word size in
sentences of different length).</p>
        <p>Sentences and paragraphes</p>
        <p>The average number of words in a sentence.</p>
        <p>The average number of sentences in a paragraph.</p>
        <p>The average length of the sentence in the first, second, third, etc. position in the
paragraph consisting of one, two, three, etc. sentences.</p>
        <p>The dynamic rank average length of sentences in paragraphs of different sizes.
The dynamic rank average size of paragraphs. Measures of syntactic complexity.
The width of the tree in a root node (i. e., the number of subordinate members).
The number of left and right subordinate members in a root node.</p>
        <p>Tree symmetry index the ratio of the left subordinate members to the right ones
(Symmetry I).</p>
        <p>The ratio of the left subordinate members to the right ones relatively to the root node
measured in word numbers (Symmetry II).</p>
        <p>The height of the tree (i. e., the maximum number of sequentially subordinate nodes).
The maximum length of left-branching branches in a sentence.</p>
        <p>The maximum length of right-branching branches in a sentence.</p>
        <p>The ratio of left-branching subordinates’ length to the right one.</p>
        <p>The distance degree the maximum number of nodes between two syntactically related
words (Distance I).</p>
        <p>The distance degree the maximum number of embedded words between two
syntactically related words (Distance II).</p>
        <p>The number of homogeneous parts (groups) in a sentence.</p>
        <p>The number of elements in an enumeration sequence.</p>
        <p>Unprojectivity.</p>
        <p>Measures of syntactic complexity is to be calculated on syntax annotation, which is made
by ETAP-4 [ETAP-4]. An example of tree graph is shown on Fig. 2. (visualization utility
was developed by Alexey Melnik).</p>
      </sec>
      <sec id="sec-8-2">
        <title>Other stylistic features</title>
        <p>The number and the share of the first person pronouns.</p>
        <p>The number and the share of the third person pronouns.</p>
        <p>The number and the share of indefinite pronouns.</p>
        <p>The number and the share of participles, ending in –vshij, –shchij.</p>
        <p>The number and the share of nouns, ending in –ost, –stvo, –anie, –enie [Martynenko,
1988, p. 90]. Phonetic Variables</p>
        <p>Consonant coefficient (i. e., the ratio of the number of consonants to the number of
vowels);</p>
        <p>Stress index (characterizes the accent structure);
Rhythmic structure of the word;</p>
        <p>The rhythmic dictionary and the language model [Kazartsev, 2015; 2017].</p>
        <p>Each statistic listed in this and previous section will be calculated for individual texts and
for four historical periods in the whole: 1) the early 20th century (1900–1913), 2) the World
War I and prerevolutionary years (1914–1916), 3) the revolutionary years the February and
October Revolution and the Civil War (1917–1922), and 4) the postwar years and the early
Soviet period (from the end of the Civil War until 1930). Thereby individual and generalized
story profiles for each analyzed parameter will be built.</p>
        <p>This will allow to analyze these linguistic and stylistic variables in diachrony using the
time series method, special attention being paid to the behavior of variables during the
revolutionary years. As a result, the variables that have undergone the maximum transformation
will be revealed, and a quantitative description of these changes will be obtained.
7</p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>Conclusion</title>
      <p>The paper discussed the principles of text selection for the annotated subcorpus containing
stories of 300 Russian prose writers and considers the list of linguistic and stylistic parameters
proposed for studying the language of literary texts in synchrony and diachrony. Currently we
are working on the development of methods, which should allow automatic text processing,
and make text annotations required for obtaining the proposed diagnostic parameters, using
both automatic natural language processing methods (POS markup, syntactic structures, etc.)
and expert methods of manual annotation.</p>
      <p>The proposed list of variables should not be considered final. Thus, it turned out to
be efficient to use some additional statistical parameters. For example, the effectiveness of
specificity as a measure of subcorpus lexical distinction from the whole corpus is currently
being evaluated [Lafon, 1980; Lavrentiev et al., 2018], and it seems interesting to compare the
results to be obtained with those that provide the measures of determining keywordness, as
well as those that are used for dynamic thematic modeling (e. g., see the article by Ekaterina
Zamirailova in this volume).</p>
      <p>Moreover, already in the process of creating the corpus, the need for additional annotation,
in particular thematic text tagging and some elements of literary annotation [Skrebtsova, 2019;
Rogova, 2018] became evident, so it is planned to include these types of tagging too.</p>
      <p>The approbation of the proposed variables on corpus data will allow to evaluate their
effectiveness and deficiencies. It is expected that the technique developed in the framework
of this study can be successfully applied in research of diachronic and evolutionary changes
in texts of other prose genres publicistic and scientific literature, special texts, transcripts
of oral speech, etc. and can be applied to study linguistic changes traced in texts of any
historical period, including the modern linguistic trends. Moreover, in the future it can be
adapted to be used for texts in any language.</p>
    </sec>
    <sec id="sec-10">
      <title>Acknowledgements</title>
      <p>The research is supported by the Russian Foundation for Basic Research, project 17-29-09173
“The Russian language on the edge of radical historical changes: the study of language and
style in prerevolutionary, revolutionary and post-revolutionary artistic prose by the methods
of mathematical and computer linguistics (a corpus-based research on Russian short stories)”.
[Barannikov, 1921] Barannikov A.P. (1921) Iz nablyudenij nad razvitiem russkogo yazyka v
poslednie gody [From observations of the development of the Russian language in recent
years].
[ETAP-4] http://cl.iitp.ru/ru/etap4download (last accessed on 31.08.2019).
[Fischer et al., 2018] Fischer F., Trilcke P., Kittel C., Milling C., Skorinkin D. (2018) To
Catch a Protagonist: Quantitative Dominance Relations in German Language Drama
(1730–1930), in: Digital Humanities 2018: Book of Abstracts / Libro de resu´menes.
Mexico: Red de Humanidades Digitales A. C., Pp. 193–201.
[Fischer-Starcke, 2010] Fischer-Starcke B. (2010) Corpus Linguistics in Literary Analysis:</p>
      <p>Jane Austen and Her Contemporaries. London; New York: Continuum.
[Granovskaya, 2005] Granovskaya L.M. (2005) Russkij literaturnyj yazyk v konce XIX i XX
veke [Russian literary language at the end of the 19th and 20th centuries]. Moscow: Elpis.
[Grebennikov, 1998] Grebennikov A.O. (1998) O sostoyatel’nosti statistik chastotnogo
slovarya khudozhestvennoy prozy [On the consistency of statistics of the frequency dictionary
of fiction], Structural and Applied Linguistics. Vol. 5. St. Petersburg.
[Grebennikov, 2007] Grebennikov, A.O. (2017) K voprosu o merakh leksicheskogo skhodstva
chastotnykh slovarey [On the measures of lexical similarity between frequency dictionaries],
Advances in Social Science Education and Humanities Research, 75019, Paris: Atlantis
Press, Том 122. Pp. 256–259.
[Martynenko et al., 2018a] Martynenko, G.Ya., Sherstinova, T.Yu., Melnik, A.G., Popova,
T.I. (2018) Metodolo-gicheskie problemy sozdaniya Komp’yuternoj antologii russkogo
rasskaza kak yazykovogo resursa dlya issledovaniya yazyka i stilya russkoj
khudozhestvennoj prozy v ehpokhu revolyucionnykh peremen (pervoj treti XX veka) [Methodological
problems of creating a Computer Anthology of the Russian story as a language resource
for the study of the language and style of Russian artistic prose in the era revolutionary
changes (first third of the 20th century)]. In: Computational linguistics and
computational ontologies. Issue 2 (Proceedings of the XXI International United Conference The
Internet and Modern Society, IMS-2018, St. Petersburg, May 30 - June 2, 2018 Collection
of scientific articles), ITMO University, St. Petersburg. Pp. 99–104.
[Tuldava, 1977] Tuldava Yu.A. (1977) O kvantitativnykh kharakteristikakh bogatstva
leksicheskogo sostava khudozhestvennykh tekstov [On the quantitative characteristics of the
richness of the lexical composition of literary texts] Linguistica. Tartu. IX. Pp. 159–175.
[Tuldava, 1986] Tuldava Yu.A. (1986) O chastotnom spektre leksiki teksta [On the frequency
spectrum of text vocabulary], Scientific Notes of Tartu University, Vol. 745. Quantitative
linguistics and automatic text analysis. Tartu. Pp. 139–162.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>[Alekseev</source>
          , 2001]
          <string-name>
            <surname>Alekseev P. M.</surname>
          </string-name>
          (
          <year>2001</year>
          )
          <article-title>Chastotnyye slovari: Uchebnoye posobiye [Frequency Dictionaries: Textbook]</article-title>
          .
          <source>St. Petersburg: Publishing House of St</source>
          . Petersburg University,
          <fpage>2001</fpage>
          - 156 p.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Archer et al.,
          <year>2009</year>
          ] Archer,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Culpeper</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            , and
            <surname>Rayson</surname>
          </string-name>
          ,
          <string-name>
            <surname>P.</surname>
          </string-name>
          (
          <year>2009</year>
          )
          <article-title>Love 'a familiar or a devil' ? An Exploration of Key Domains in Shakespeare's Comedies and Tragedies, Word frequency and keyword extraction</article-title>
          ,
          <source>AHRC ICT Methods Network Expert Seminar on Linguistics, 8 September</source>
          <year>2006</year>
          , Lancaster University. Available at .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [Baker et al.,
          <year>2006</year>
          ]
          <string-name>
            <surname>Baker P.</surname>
          </string-name>
          et al. (
          <year>2006</year>
          )
          <article-title>Glossary of Corpus Linguistics</article-title>
          , Edinburgh University Press.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>[Balossi</source>
          , 2014]
          <string-name>
            <surname>Balossi G.</surname>
          </string-name>
          (
          <year>2014</year>
          )
          <article-title>A Corpus Linguistic Approach to Literary Language and Characterization: Virginia Woolf's The Waves</article-title>
          . Amsterdam; Philadelphia: John Benjamins Publishing Company.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [ Cˇebanov, Martynenko 1998]
          <string-name>
            <surname>Chebanov</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martynenko</surname>
            <given-names>G.</given-names>
          </string-name>
          (
          <year>1998</year>
          )
          <article-title>Text as Real Population in A.A. Cˇuprov's Sense</article-title>
          ,
          <source>Journal of Quantitative Linguistics</source>
          , Volume
          <volume>5</volume>
          , Number 3.
          <source>December</source>
          <year>1998</year>
          . Pp.
          <volume>163</volume>
          -
          <fpage>166</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <source>[Jacobson</source>
          , 1921]
          <string-name>
            <surname>Jacobson</surname>
            <given-names>R.</given-names>
          </string-name>
          (
          <year>1921</year>
          )
          <article-title>Vliyanie revolyucii na russkij yazyk [The influence of the revolution on the Russian language]</article-title>
          . Prague.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>[Kartsevsky</source>
          , 1923]
          <string-name>
            <surname>Kartsevsky</surname>
            <given-names>S.O.</given-names>
          </string-name>
          (
          <year>1923</year>
          )
          <article-title>Yazyk, vojna i revolyuciya [Language, war</article-title>
          and revolution]. Berlin.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <source>[Kazartsev</source>
          , 2015]
          <string-name>
            <surname>Kazartsev E.V.</surname>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>The Rhythmic Structure of Tales of Belkin and the Peculiarities of a Poet's Prose, A Convenient Territory</article-title>
          .
          <article-title>Russian Literature at the Edge of Modernity. Essays in Honor of Barry Scherr</article-title>
          . Edit. J.
          <string-name>
            <surname>Kopper</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Wachtel</surname>
          </string-name>
          . Columbus: Slavica. Pp.
          <volume>55</volume>
          -
          <fpage>65</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <source>[Kazartsev</source>
          , 2017]
          <string-name>
            <surname>Kazartsev E.V.</surname>
          </string-name>
          (
          <year>2017</year>
          )
          <article-title>Stikhopodobnyye fragmenty prozy</article-title>
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Pushkina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Tolstogo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. K.</given-names>
            <surname>Sologuba</surname>
          </string-name>
          i
          <string-name>
            <surname>B. L. Pasternaka</surname>
          </string-name>
          <article-title>v kontekste evolyutsii russkogo stikha [The Verse-Similar Fragments in the Prose by A</article-title>
          . S. Pushkin,
          <string-name>
            <given-names>A. K.</given-names>
            <surname>Tolstoy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sologub</surname>
          </string-name>
          , and
          <string-name>
            <surname>B. L.</surname>
          </string-name>
          <article-title>Parternak], Trudy instituta russkogo yazyka im</article-title>
          . V.V.
          <article-title>Vinogradova [Proceedings of the Russian language Institute]</article-title>
          .
          <string-name>
            <given-names>V.V.</given-names>
            <surname>Vinogradova</surname>
          </string-name>
          . XI. Pp.
          <volume>235</volume>
          -
          <fpage>244</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [KLE]
          <article-title>Concise literary encyclopedia, in 9 volumes (1962-</article-title>
          <string-name>
            <surname>1978) M.: Soviet</surname>
            <given-names>Encyclopedia</given-names>
          </string-name>
          , (last accessed:
          <volume>19</volume>
          .
          <fpage>11</fpage>
          .
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>[Lafon</source>
          , 1980] Lafon P.
          <article-title>Sur la variabilit´e de la fr´equence des formes dans un corpus</article-title>
          .
          <source>Mots</source>
          ,
          <year>1980</year>
          , No. 1. Pp.
          <volume>127</volume>
          -
          <fpage>165</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [Lavrentiev et al.,
          <year>2018</year>
          ]
          <string-name>
            <surname>Lavrentiev</surname>
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solovyev</surname>
            <given-names>F.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suvorova (Ananyeva) M.I.</surname>
          </string-name>
          ,
          <string-name>
            <surname>Fokina</surname>
            <given-names>A.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chepovskiy</surname>
            <given-names>A.M.</given-names>
          </string-name>
          (
          <year>2018</year>
          )
          <article-title>Novyy kompleks instrumentov avtomaticheskoy obrabotki teksta dlya platformy TXM i yego aprobatsiya na korpuse dlya analiza ekstremistskikh tekstov [A New Toolkit for Natural Text Processing with the TXM Platform and its Appliсation to a Corpus for Analysis of Texts Propagating Extremist Views]</article-title>
          .
          <source>Vestnik NSU. Series: Linguistics and Intercultural Communication</source>
          ,
          <volume>16</volume>
          (
          <issue>3</issue>
          ). Pp.
          <volume>19</volume>
          -
          <fpage>31</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [Mahlberg et al.,
          <year>2016</year>
          ] Mahlberg,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Stockwell</surname>
          </string-name>
          , P., de Joode, J.,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>O'Donnell</surname>
            ,
            <given-names>M. B.</given-names>
          </string-name>
          (
          <year>2016</year>
          )
          <article-title>CLiC Dickens: Novel uses of concordances for the integration of corpus stylistics and cognitive poetics</article-title>
          .
          <source>Corpora</source>
          ,
          <volume>11</volume>
          (
          <issue>3</issue>
          ). Pp.
          <volume>433</volume>
          -
          <fpage>463</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>[Martynenko</source>
          , 1988] Martynenko,
          <string-name>
            <surname>G.Ya.</surname>
          </string-name>
          (
          <year>1988</year>
          )
          <article-title>Osnovy stilemetrii [The Foundation of Stylometics]</article-title>
          .
          <source>St</source>
          . Petersburg State University, St. Petersburg.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [Martynenko, 2019a] Martynenko,
          <string-name>
            <surname>G. Ya.</surname>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>Metody matematicheskoj lingvistiki v stilisticheskikh issledovanijakh [Methods of mathematical linguistics in stylistic studies], St</article-title>
          . Petersburg: Nestor-Istoriya.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [Martynenko, 2019b]
          <string-name>
            <surname>Martynenko</surname>
            <given-names>G.Ya.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>Stilizovannyye sintaksicheskiye triady v russkom rasskaze pervoy treti XX veka [Stylized syntactic triads in Russian short story of the first third of the 20th century]</article-title>
          ,
          <source>Proc. of the Int. Conf. Corpus Linguistics - 2019</source>
          . St. Petersburg State University. Pp.
          <volume>395</volume>
          -
          <fpage>404</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [Martynenko et al., 2018b]
          <string-name>
            <surname>Martynenko</surname>
            <given-names>G.Ya.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sherstinova</surname>
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
          </string-name>
          .,
          <string-name>
            <surname>Popova</surname>
            <given-names>T.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Melnik</surname>
            <given-names>А.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zamirajlova</surname>
            <given-names>E.V.</given-names>
          </string-name>
          (
          <year>2018</year>
          )
          <article-title>O printsipakh sozdaniya korpusa russkogo rasskaza pervoy treti XX veka [On the principles of creation of the Russian short stories corpus of the first third of the 20th century]</article-title>
          .
          <source>Proceedings of the XV International Conference on Computer and Cognitive Linguistics TEL</source>
          <year>2018</year>
          ,
          <article-title>Kazan</article-title>
          . Pp.
          <volume>180</volume>
          -
          <fpage>197</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <source>[Martynenko</source>
          , 2017] Martynenko,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Order Statistics as a Tool for Comparing Frequency Dictionaries (</article-title>
          <year>2017</year>
          ) ACM International Conference Proceeding Series,
          <source>Volume Part F133135</source>
          ,
          <source>2017 International Conference on Internet and Modern Society</source>
          , IMS 2017;
          <article-title>St</article-title>
          . Petersburg. Pp.
          <volume>116</volume>
          -
          <fpage>119</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [Martynenko, Fomin, 1989] Martynenko G.Y.,
          <string-name>
            <surname>Fomin</surname>
            <given-names>S.V.</given-names>
          </string-name>
          (
          <year>1989</year>
          )
          <article-title>Ranking moments, Nauchno-tekhnicheskaya informatsiya, Seriya 2 - informatsionnye protsessy i sistemy, Issue: 8</article-title>
          . Pp.
          <volume>9</volume>
          -
          <fpage>14</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [Martynenko, Martinovich, 2003]
          <string-name>
            <given-names>Martynenko G.Ya.</given-names>
            ,
            <surname>Martinovich</surname>
          </string-name>
          <string-name>
            <surname>G.A.</surname>
          </string-name>
          (
          <year>2003</year>
          )
          <article-title>Mnogoparametricheskiy statisticheskiy analiz rezul'tatov assotsiativnogo eksperimenta [Multiparametric statistical analysis of the results of an associative experiment] St</article-title>
          . Petersburg State University. Scientific reports. St. Petersburg State University. 28 pp.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [Martynenko, Sherstinova, 2000] Martynenko,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Sherstinova</surname>
          </string-name>
          ,
          <string-name>
            <surname>T.</surname>
          </string-name>
          (
          <year>2000</year>
          )
          <article-title>Statistical Parametrization of Text Corpora, P. Sojka, I. Kopeˇcek, and</article-title>
          K. Pala (eds.)
          <source>TSD</source>
          <year>2000</year>
          ,
          <article-title>LNAI</article-title>
          1902, Springer: Berlin-Heidelberg. Pp.
          <volume>99</volume>
          -
          <fpage>102</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [Martynenko, Sherstinova, 2018]
          <string-name>
            <given-names>Martynenko G.</given-names>
            ,
            <surname>Sherstinova</surname>
          </string-name>
          <string-name>
            <surname>T.</surname>
          </string-name>
          (
          <year>2018</year>
          )
          <article-title>Emotional Waves of a Plot in Literary Texts: New Approaches for Investigation of the Dynamics in Digital Culture</article-title>
          . In: Alexandrov D.et al. (eds.)
          <source>Digital Transformation and Global Society. DTGS 2018. Communications in Computer and Information Science</source>
          , vol
          <volume>859</volume>
          . Springer, Cham. Pp.
          <volume>299</volume>
          -
          <fpage>309</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <source>[Muratova</source>
          , 1963]
          <string-name>
            <given-names>Muratova</given-names>
            <surname>K.D.</surname>
          </string-name>
          (
          <year>1963</year>
          )
          <article-title>Istoriya russkoy literatury kontsa XIX - nachala XX veka, Bibliograficheskiy ukazatel' [The History of Russian literature of the late XIX - early XX century</article-title>
          , Bibliographic index], Moscow: Academy of Science of SSSR.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <source>[Polivanov</source>
          , 1931]
          <string-name>
            <surname>Polivanov E.D.</surname>
          </string-name>
          (
          <year>1931</year>
          )
          <article-title>Revolyuciya i literaturnye yazyki soyuza SSR [The revolution and the literary languages of the USSR]</article-title>
          ,
          <string-name>
            <surname>Polivanov</surname>
            <given-names>E.D.</given-names>
          </string-name>
          <article-title>Za marksistskoye yazykoznaniye</article-title>
          . Moscow: Federatsiya. Pp.
          <volume>73</volume>
          -
          <fpage>94</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <source>[Popescu</source>
          , 2009] Popescu,
          <string-name>
            <surname>I.-I.</surname>
          </string-name>
          (
          <year>2009</year>
          )
          <article-title>Quantitative Linguistics: Word Frequency Studies</article-title>
          . Berlin-New-York: Mouton de Gruyter.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <source>[Rempel</source>
          , 1921]
          <string-name>
            <surname>Rempel E.</surname>
          </string-name>
          (
          <year>1921</year>
          )
          <article-title>Yazyk revolyucii i revolyuciya yazyka [The language of revolution and the revolution of language] Riga.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <source>[Rogova</source>
          , 2018]
          <article-title>Rogova K. A</article-title>
          ., ed. (
          <year>2018</year>
          )
          <article-title>Analiz khudozhestvennogo teksta. Russkaya literatura XX veka: 20-ye gody: uchebnoye posobiye [Analysis of the literary text. Russian literature of the XX century: the 20s: a textbook], St</article-title>
          . Petersburg: Publishing House of St. Petersburg University. 286 p.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <source>[Russian Writers] Russian Writers of 1800-1917</source>
          ,
          <string-name>
            <given-names>Biographical</given-names>
            <surname>Dictionary</surname>
          </string-name>
          . In 7 voluems. Ed. by
          <string-name>
            <surname>Nikolaev</surname>
            <given-names>P.A.</given-names>
          </string-name>
          <year>1992</year>
          -
          <fpage>2000</fpage>
          . Moscow: Scientific publishing house Big Russian Encyclopedia.
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <source>[Selishchev</source>
          , 1928]
          <string-name>
            <surname>Selishchev</surname>
            <given-names>A.M.</given-names>
          </string-name>
          (
          <year>1928</year>
          )
          <article-title>Yazyk revolyutsionnoy epokhi: Iz nablyudeniy nad russkim yazykom poslednikh let (1917-1926) [The language of the revolutionary era: From observations of the Russian language of recent years</article-title>
          .
          <source>(</source>
          <year>1917</year>
          -1926)], Moscow: Rabotnik prosveshcheniya.
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <source>[Shaykevich</source>
          , 2015]
          <string-name>
            <surname>Shaykevich</surname>
            <given-names>A.Ya.</given-names>
          </string-name>
          (
          <year>2015</year>
          )
          <article-title>Mery leksicheskogo skhodstva chastotnyh slovarej [Measures of lexical similarity between frequency dictionaries]</article-title>
          ,
          <source>Proc. of the Int. Conference Corpus linguistics-2015</source>
          [
          <article-title>Trudy mezhd</article-title>
          . konf. Korpusnaya linguistica-2015]. Pp.
          <volume>422</volume>
          -
          <fpage>429</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <source>[Sherstinova</source>
          , 2018] Sherstinova,
          <string-name>
            <surname>T.</surname>
          </string-name>
          (
          <year>2018</year>
          )
          <article-title>Quantitative Data on POS Distribution in the Beginnings and the Ends of Utterances in Everyday Russian Speech</article-title>
          . In: Potapova R.,
          <string-name>
            <surname>Jokisch</surname>
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karpov</surname>
            <given-names>A</given-names>
          </string-name>
          . (eds.)
          <article-title>Speech and Computer (SPECOM</article-title>
          <year>2018</year>
          ),
          <source>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</source>
          ,
          <source>LNAI 11096</source>
          , Springer Verlag. Pp.
          <volume>596</volume>
          -
          <fpage>605</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <source>[Sinclair</source>
          ,
          <year>2004</year>
          ] inclair,
          <string-name>
            <surname>J.</surname>
          </string-name>
          (
          <year>2004</year>
          )
          <article-title>Trust the Text: Language, Corpus</article-title>
          and Discourse, Routledge,
          <volume>224</volume>
          p.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [Skorinkin, Fischer 2018]
          <string-name>
            <surname>Skorinkin</surname>
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fischer</surname>
            <given-names>F.</given-names>
          </string-name>
          (
          <year>2018</year>
          )
          <article-title>Measuring the 'Epification' of Drama</article-title>
          , in: Applications in Cultural Evolution: Arts, Languages, Technologies. University of Tartu. Pp.
          <volume>46</volume>
          -
          <fpage>48</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <source>[Skrebtsova</source>
          , 2019]
          <string-name>
            <surname>Skrebtsova</surname>
            <given-names>T. G.</given-names>
          </string-name>
          (
          <year>2019</year>
          )
          <article-title>Struktura narrativa v russkom rasskaze nachala XX veka [Narrative structure of the Russian short story in the early XX century]</article-title>
          ,
          <source>Proceedings of the International Conference Corpus Linguistics-2019. St. Petersburg: Publishing House of St</source>
          . Petersburg University. Pp.
          <volume>426</volume>
          -
          <fpage>431</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          <source>[Tynyanov</source>
          , 1929] Tynyanov,
          <string-name>
            <surname>Yu.N.</surname>
          </string-name>
          (
          <year>1929</year>
          )
          <article-title>Arkhaisty i novatory [Archaists and Innovators]</article-title>
          .
          <source>Leningrad: Priboj.</source>
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          <source>[Vinokur</source>
          , 1923]
          <string-name>
            <surname>Vinokur G.O.</surname>
          </string-name>
          (
          <year>1923</year>
          )
          <article-title>O revolyucionnoj frazeologii (Odin iz voprosov yazykovoj politiki) [On revolutionary phraseology (One of the issues of language policy)]</article-title>
          .
          <source>LEF. No 2</source>
          . Moscow-Petrograd.
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <source>[Voronchak</source>
          , 1972]
          <string-name>
            <surname>Voronchak E.</surname>
          </string-name>
          (
          <year>1972</year>
          )
          <article-title>Metody vychisleniya pokazateley leksicheskogo bogatstva tekstov [Methods of calculating indicators of the lexical richness of texts], Semiotics and artmetry</article-title>
          . Moscow. Pp.
          <volume>232</volume>
          -
          <fpage>250</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          <source>[Zyngier</source>
          , 2008] Zyngier,
          <string-name>
            <surname>S.</surname>
          </string-name>
          (
          <year>2008</year>
          )
          <article-title>Macbeth through the computer: Literary evaluation and pedagogical implications, The Quality of Literature: Linguistic studies in literary evaluation, edited by Willie van Peer, Linguistic Approaches to Literature, 4</article-title>
          . John Benjamins Publishing Company. Pp.
          <volume>169</volume>
          -
          <fpage>190</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>