<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Statistical Analysis of the Short Stories by Roman Ivanychuk</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ihor Kul</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>hytskyi[</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Lviv Polytechnic National University</institution>
          ,
          <addr-line>12 Bandera street, Lviv, Ukraine, 79013</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>The notion of idiostyle has been of a high interest of the modern linguistics so far. The personal style of an author possesses a significant value when it comes to the historical heritage of any language. The reason behind this is the fact that an author is the person living in a certain area, society and historical period that influence the way they speak and therefore write. When dealing with the idiolect, the best tools for researching and measuring it are the linguistic and statistics methods, as they show the quantitative characteristics representing the specific features of the writer's language in terms of numbers. The scope of statistics, though a mathematical science, is extremely broad, ranging from mathematics and social sciences to business and politics. It is known that statistics is increasingly used for linguistic research, in particular to study the features of the author's language.</p>
      </abstract>
      <kwd-group>
        <kwd>statistical analysis</kwd>
        <kwd>short stories</kwd>
        <kwd>idiostyle</kwd>
        <kwd>euphony</kwd>
        <kwd>length of word</kwd>
        <kwd>length of sentences</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The paper presents the results of the statistical analysis of the short stories by the
Ukrainian writer of the 20th-21st centuries Roman Ivanychuk. He has got a completely
individual and unique manner of writing. This is an artist who modernized Ukrainian
history in a very professional and profound manner.</p>
      <p>
        A range of different researches, articles, and dissertations have been dedicated to the
specific features of the literature legacy of Roman Ivanychuk. They in details analyzed
the works of the writer and compared them with the works of the other Ukrianian
writers [
        <xref ref-type="bibr" rid="ref1 ref2 ref3 ref4">1, 2, 3, 4, 8, 9, 15</xref>
        ]. However, the specific features of Ivanychuk's historical prose,
the individual use of lexical means, his speech and the peculiarity of his style remained
unstudied what caused the topicality of the research.
      </p>
      <p>The object is represented by R. Ivanychuk’s discourse that is studied for individual
specificity. The subject is the statistical indicators of the short stories by R. Ivanychuk.</p>
      <p>In this paper the notion of idiostyle is interpreted as the set of those features of the
author's language, which distinguish it from other authors. More narrowly, it is realized
by means of certain stylistic, grammatical and lexical means of the author’s writing that
distinguishes one person from others [16, 17].</p>
      <p>The artistic language reflects not only the linguistic competence of the author, and
the advantages of using one or another language constructs and words, but also features
of the national language [10].</p>
      <p>The author's idiostyle should be investigated in the context of his creative
personality, including extra-linguistic and cultural factors and the literary era when the
particular writer lived and created. A general idea of the idiostyle of the writer can only be
obtained by studying a large number of his texts [11].
2</p>
    </sec>
    <sec id="sec-2">
      <title>Statistics in the language research</title>
      <p>In his work, the Ukrainian scholar V. Perebyinis claims that all complex systems are
subject to the laws of statistics. Such systems are living organisms, economics, as well
as science and human language. The scientist points out that it is necessary to use
statistical methods in linguistics, since language is a complex system, and therefore it is a
subject to the laws of statistics [14].</p>
      <p>V. Perebyinis argues that any language is a system of discrete units, and any discrete
unit has quantitative characteristics [14, 15]. Clearly or implicitly, but linguists
recognize the fact that language has quantitative characteristics. V. Perebyinis gives
examples of words such as ‘more’, ‘less’, ‘many’ others. Such estimates have no accuracy
[10].</p>
      <p>Statistical methods are widely used in linguistics. For example, in the Ukrainian
lexicography statistics is used to compile frequency dictionaries, which help to find the
most commonly used vocabulary. Also, statistical methods are related to the creation
of teaching methods of a foreign language and learning optimization [12]. Statistical
methods play an important role in applied linguistics (for the automatic processing of
natural language and speech, for the creation of systems of machine translation and
electronic dictionaries), psycholinguistics, sociolinguistics, comparative-historical
linguistics, pragmalinguistics, and others [12].</p>
      <p>The quantitative indicators provide some information about the text itself, the fact
that there are certain differences between the language styles and genres that are of a
statistical nature and are the basis of the so-called statistical stylistics.</p>
      <p>One of the regularities of linguistic units in functional styles, from the standpoint of
statistics, is a strict quantitative relationship between the abstract textual content
prevailing in it and the type of author's mindset and peculiarities of the speech, in other
words between the frequency use of words and grammatical forms with their
lexicalgrammatical meaning.
3</p>
    </sec>
    <sec id="sec-3">
      <title>The research corpus</title>
      <p>
        The research corpus consists of the forty-three short stories from the following
collections: ‘The Rod Brings Ice’ [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] that includes twenty-five short stories in general,
‘Autumn Patterns’ [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] with eleven short stories and ‘One Loaf for Two’ [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] consisting of
seven short stories. The material of the research is presented in the table 1 with the
number of tokens for each collection. Overall, the total number of tokens taken as a
basis of the research, makes up 97375 units. It is also to be mentioned that to receive
the clean material for further processing, the texts from each collection were converted
into electronic form, proofread and checked if the encoding was made correctly.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>The most frequent tokens</title>
      <p>The first stage of the research is to identify the most frequent tokens in general. The 10
most frequent tokens are listed in the following table.
Obviously, among the most frequent tokens are the ones that belong to the auxiliary
parts of speech. However, the first thing that attracts attention is a large number of
personal pronouns.</p>
      <p>So, the first 20 such word forms contain personal pronouns, while there are 11
personal pronouns in the set of 50 the most frequent tokens. This is so, despite the fact that,
the first 20 most frequent words is the 19% of all words of the text, and the first 50
words make up the 29% of the whole selection.</p>
      <p>The list of pronouns is presented in the table below:</p>
      <p>Word
Не (not)
І (and)
На (on/at)
Й (and)
В (in)
З (from/with)
Що (that)
До (to)
Я (I)
А (but/and)</p>
      <p>Worform
Я (I)
Він (he)
Мене (me)
Його (him)
Її (her)
The most frequent nouns were placed in the range of ranks from 93 to 111. It has to be
noted that the notion of rank is the position of the token in the selection according to its
frequency. The word with the highest rank is the lest frequent, while the one with the
rank = 1 is the most frequent. Thus, the frequency of nouns in the s stories of R.
Ivanychuk is much smaller than the frequency of pronouns. It statistically proves that
most of his short stores were written from the first person singular.</p>
      <p>The next table contains the most frequent nouns and pronouns (only lemmas are
given).</p>
    </sec>
    <sec id="sec-5">
      <title>Other statistical characteristics of the short stories by R.</title>
    </sec>
    <sec id="sec-6">
      <title>Ivanychuk</title>
      <p>The next stage of the research of the legacy of R. Ivanychuk was to determine such
statistical characteristics as the degree of euphony of his texts, the length of words and
the length of sentence and compare these indicators with the similar indicators of other
Ukrainian writers and Ukrainian prose in general.</p>
      <p>The degree of euphony was counted as the percentage of vowels, sonorant and
voiced consonant phonemes taken comparing to all phonemes in the text. For this
purpose, for each short story, the frequency of each phoneme was calculated on the basis
of which the degree of the euphony of each short story is determined. The tool for such
analysis was the author’s program that transformed the sequence of letters into the
sequence of phonemes according to the rules of theoretical phonology of the Ukrainian
language. The results of this processing are given in a table.</p>
      <p>The obtained results were compared to the degree of euphony of the Ukrainian
fiction in general [14]. As one would expect, the value of the euphony index for the works
of Roman Ivanychuk is the closest to the values of the mean euphony index of
Ukrainian prose in general.
As it can be seen, the most frequent are words consisting of 4 phonemes – 12001 units
(13.41%), 5 phonemes – 11199 units (12.51%) and 2 phonemes – 11058 units
(12.35%). The middle group consists of words with the length of 6 phonemes – 10015
units (11,19%), 3 phonemes – 9449 units (10,56%), 1 phoneme – 9047 units (10,11%),
7 phonemes – 8667 units (9,68%), 8 phonemes – 6670 units (7.45%) and 9 phonemes
– 4895 units (5.47%).</p>
      <p>Words with the length of more than 10 phonemes make up only 3.8% of the whole
selection: 10 phonemes – 3072 units (3.43%), 11 phonemes – 1861 units (2.08%), 12
phonemes – 906 units (1.01%), 13 phonemes – 369 units (0, 41%), 14 phonemes – 155
units (0.17%), 15 phonemes – 71 (0.08%), 16 phonemes – 28 units (0.03%), 17
phonemes – 21 (0.02%), 18 phonemes – 12 units (0, 01%), 19 phonemes – 5 units (0.01%),
20 phonemes – 3 units (0.0%), 22 phonemes – 2 units (0.0%), 25 phonemes – 1 units
(0.0%) of all words. Words, the length of which is greater than 19 phonemes are
compounds, with a hyphen, which were considered as one word. The result of the statistical
processing of the distribution of the lengths of words in presented in the table below:
The comparison of the obtained results with the average results of the styles of the
Ukrainian language shows that they are completely consistent. The average length of
words in the novels by R. Ivanychuk is the closest to the works by another Ukrainian
writer Yu Yanovsky.
R. Ivanychuk is known for using long sentences in his works. This is a distinguishing
feature of his short stores, so it was naturally to count and statistically prove this
statement. The results are shown on the figure below:</p>
      <p>Length interval
16,5%</p>
      <p>15,6%
3 6 9 2 5 8 1 4 7 0 3 6 9 2 5 8 1 4 7 0 0
– – – 1 1 1 2 2 2 3 3 3 3 4 4 4 5 5 5 6 6
1 4 7 –0 –3 –6 –9 –2 –5 –8 –1 –4 –7 –0 –3 –6 –9 –2 –5 –8 &gt;
1 1 1 1 2 2 2 3 3 3 4 4 4 4 5 5 5</p>
      <sec id="sec-6-1">
        <title>Length interval</title>
        <p>From the obtained results, it is seen that R. Ivanychuk indeed used very large sentences
that has more than 60 words. They make up almost 5% of all sentences he used. The
highest percentage was given to the length of 4-5 words in the sentence. It is interesting
to mention that sentences with the length from 16 words to 30 words are also pretty
frequent.
The comparison of the obtained data to the similar data from works of different styles
of the Ukrainian language (namely drama, prose, and poetry) is presented below:
60%
50%
40%
30%
20%
10%
0%</p>
      </sec>
      <sec id="sec-6-2">
        <title>Drama</title>
      </sec>
      <sec id="sec-6-3">
        <title>Poetry</title>
      </sec>
      <sec id="sec-6-4">
        <title>Prose</title>
      </sec>
      <sec id="sec-6-5">
        <title>Short Stories of R. Ivanychuk</title>
        <p>The only thing that can be noted after comparing the data, is that another Ukrainian
writer Yu. Yanovsky also liked to use such long sentences as Ivanychuk. The results
obtained in the study correspond to the data for the Ukrainian prose. The only and
significant difference is the large percentage of long sentences, which makes it possible to
distinguish this feature, as distinctive characteristic of the novels of R. Ivanychuk.
6</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Conclusions</title>
      <p>Roman Ivanychuk is undoubtedly a talented Ukrainian writer who left a large legacy of
interesting and worth reading novels, short stories and novelets. From the statistical
standpoint, his works have not been researched enough to the date and the analysis
presented in the paper already shows that Ivanychuk had a specific, perhaps unique,
manner of writing.</p>
      <p>The short stories taken as a basis of the research have some specific features that
distinguish them from other works. The most frequent words used by the author are
pronouns and nouns. That allows to conclude that in the most of his short stories the
person of narrator is the main character.</p>
      <p>Furthermore, the degree of euphony of R. Ivanychuk’s works is similar to the
Ukrainian prose in general. The most frequently used words are 4-, 5-, 6-, 2-phoneme
words. This is also within the norm for the Ukrainian prose and other Ukrainian authors.
What stands out is the length of sentences used by the author. He tended to use lager
sentences comparing to other Ukrainian writers and this is a representative feature for
text attribution. Like in no other Ukrainian writers’ texts, the percentage of sentences
longer than 60 words in the texts of R. Ivanychuk makes up almost 5%.</p>
      <p>Other differences or specific features of short stories of R. Ivanychuk, are yet to be
defined. The results of the research presented in the paper will be applied a further
research in this direction. Also they already have a practical value for the text attribution
of the Ukrainian literature.
7
8. Kapliuk, K. M.: Khudoznii khronotop istorychnoi romanistyky R. Ivanychuka (na materiali
romaniv “Malvy”, “Zhuravlynyi kryk”, “Orda”). In: avtorepherat dysertatsii na zdobuttia
naukovoho stupenia kand. philol. nauk: spestialnist 10.01.01 “Ukrainska literatura”, 18
(2010)
9. Koliadych, Iu. V.: Osoblyvosti poetyky maloi prozy Romana Ivanychuka. In: avtorepherat
dysertatsii na zdobuttia naukovoho stupenia kand. philol. nauk: spestialnist 10.01.01
“Ukrainska literatura”, 20. (2008)
10. Kulchystkyii, I. M.: Doslidzhennia dovzhyky rechennia ta slova u tvorakh Romana
Ivanychuka. In: Inphormatsiini systemy ta merezhi, Visnyk Natsionalnoho universytetu
“Lvivska Politechnika”, Lviv, vol. 872, 139-149 (2017)
11. Kulchystkyii, I. M.: Idiolekt, idiostyl, indyvidualnyi styl. Totozne chy rizne?. In: Naukovyi
visnyk Volynskoho natsionalnoho universytety imeni L. Ukrainky, Lutsk, vol. 5 (282),
2262291 (2014)
12. Levytskyi, V.: Kvantytatyvnoe metody v linhvistike. Nova Knyha, Vinnystia, 234 (2007)
13. Perebyinis, V. I.: Statystychni metody dlia linhvistiv. Nova Knyha, Vinnystia, 176 (2013)
14. Perebyinis, V. S: Statystychni paramentry styliv, Naukova dumka, Kyiv (1967)
15. Pidopryhora, S. V.: Romannyi tryptykh “Vohnenni stovpy” u zhanrovii systemi istorychnoi
prozy Romana Ivanychuka. In: avtorepherat dysertatsii na zdobuttia naukovoho stupenia
kand. philol. nauk: spestialnist 10.01.01 “Ukrainska literatura”, 18 (2008)
16. Stavytska, L. O.: Pro termin idiolect. In: Ukrainska mova, vol. 4, 3-17 (2009)
Uirina, Ui. М.: Idiostyl Oleny Telihy. In: dyssertatsia kandydata philosophskykh nauk
10.02.01, Herson, 194 (2016)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bezlichenko</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          :
          <article-title>Poetyka istorychnoi prosy Romana Ivanychuka (osoblyvosti chasoprostorovykh kontseptsii)</article-title>
          .
          <article-title>In: avtorepherat dysertatsii na zdobuttia naukovoho stupenia kand</article-title>
          .
          <source>philol. nauk: spestialnist 10.01</source>
          .01 “Ukrainska literatura”,
          <volume>20</volume>
          . (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Burchenia</surname>
            ,
            <given-names>V. V.</given-names>
          </string-name>
          :
          <article-title>Katehoriia istoryzmu ta stylovi poshuky v prozi Romana Ivanychuka</article-title>
          . In:
          <article-title>avtorepherat dysertatsii na zdobuttia naukovoho stupenia kand</article-title>
          .
          <source>philol. nauk: spestialnist 10.01</source>
          .01 “Ukrainska literatura”,
          <volume>20</volume>
          . (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Hadzii</surname>
            .
            <given-names>O. A.</given-names>
          </string-name>
          :
          <article-title>Publitsystyka Romana Ivanychuka: problematyka I poetyka. In: avtorepherat dysertatsii na zdobuttia naukovoho stupenia kand</article-title>
          .
          <source>philol. nauk: spestialnist 10.01</source>
          .01 “Ukrainska literatura”,
          <volume>20</volume>
          . (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Horodylovska</surname>
            ,
            <given-names>H. P.</given-names>
          </string-name>
          :
          <article-title>Elementy naukovoho ta publistystychnoho slyliv v istorychnukh tvorakh Romana Ivanychuka</article-title>
          . In:
          <article-title>avtorepherat dysertatsii na zdobuttia naukovoho stupenia kand</article-title>
          .
          <source>philol. nauk: spestialnist 10.01</source>
          .01 “Ukrainska literatura”,
          <volume>18</volume>
          . (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ivanuchyk</surname>
          </string-name>
          , Roman.:
          <article-title>Odna khlibyna na dvokh</article-title>
          ,
          <source>Sribne Slovo, Lviv</source>
          ,
          <volume>160</volume>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ivanuchyk</surname>
          </string-name>
          , Roman:
          <article-title>Osinni uzory: novely tа opovidannia</article-title>
          , Pholio, Kharkiv,
          <volume>299</volume>
          . (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ivanuchyk</surname>
          </string-name>
          , Roman: Prut nese krygu,
          <source>Kn</source>
          .- zhurnl. vydavnytsvo, Lviv,
          <volume>83</volume>
          (
          <year>1958</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>