<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Distribution of attributes as a feature of individual style</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sergey Andreev</string-name>
          <email>smol.an@mail.com</email>
          <aff>Smolensk State University, Smolensk, Russian Federation</aff>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>The distribution of two types of attributes (adjectives and nouns in genitive constructions) is studied. Busemann's coefficient reveals different types of relationship between adjectival and nominal attributes in the texts of 6 Russian female authors. At the same time, a power function was found to fit the data well irrespective of the peculiarities of the authors' individual styles, revealing a general order in the distribution of attributes.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>To analyze individual styles, a rather long list of characteristics is used to adequately reveal the peculiarities
of an author's speech and to establish a reliable basis for differentiating styles. This list includes a substantial
number of speech properties, both formal and semantic [Juola, 2006; Holmes, 1994; Rudman, 1998].
One such property whose prognostic value in this respect should be tested is the frequency of
attributes in the texts of different authors, both in general and for certain attributive types in particular
[Köhler, Altmann, 2014].</p>
      <p>The syntactic position of an attribute (adnominal) has at least one important peculiarity: it is
not obligatory in the verbal syntactic structure and is thus highly optional, depending on the author's
inclinations and literary taste. On the other hand, attributives play a highly important role in
elaborating topics.</p>
      <p>As a result, one can suppose that the frequency and the distribution patterns of different
types of attributes can serve as an important feature of an author's style. In other words, they can
serve as an explicit feature for comparing and/or discriminating between the styles of different authors.</p>
      <p>According to the part of speech of the word used as an attribute, different types of attributes can be
distinguished: adjectives (green leaves), pronouns (my friend, this book, and other types of pronouns),
participles (dancing people), infinitives (a wish to win), adverbs (a room upstairs) and some others.
One of the most frequent and semantically important is the genitive construction, which in Russian
is formed by a noun in the genitive case and corresponds to the English genitive of-construction (the book
of Peter). Such genitive constructions (N) reflect the nominal strategy of description, as opposed
to the more standard strategy of using adjectives (A).</p>
      <p>Here some questions may arise: are the relations between the frequencies of both types (N
and A) constant for all authors and, if not, to what extent do they differ in different texts? These
questions are examined on the material of a database which includes 6 female Russian authors
(Tokareva, Ulitskaya, Tolstaya, Marinina, Ustinova, Polyakova). The choice was motivated by the
following reasons:
<list list-type="bullet">
<list-item><p>all the authors are of the same gender;</p></list-item>
<list-item><p>they are very popular in their genres;</p></list-item>
<list-item><p>the genres of their novels are rather different: the first three authors write belles-lettres,
the last three are detective writers.</p></list-item>
</list></p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>Samples from 3 books by each author were chosen for the analysis. The samples
were 1000 words long and were taken from the beginning of each book. The list of the novels is
given in the appendix. Adjectival and genitive attributes were marked in the samples.</p>
      <p>To find out the proportions between these two types of attributes, Busemann's coefficient was
used [Altmann, 2015]:</p>
      <disp-formula><label>(1)</label><tex-math>C = \frac{G}{A + G},</tex-math></disp-formula>
      <p>where C is Busemann's coefficient, A stands for all the adjectival attributes, and G stands for all
the genitive attributes.</p>
      <p>The coefficient values can vary between 0 (genitive attributes are absent completely) and 1 (no
adjectival attributes were registered). High values of C (C &gt; 0.5) show that G-constructions play
a more important role in description; low values of the coefficient (C &lt; 0.5) indicate the predominance
of A in the style of the author. To test the results, the chi-square statistic was used [Andreev, Místecký,
Altmann, 2018]:</p>
      <disp-formula><label>(2)</label><tex-math>\chi^{2} = \frac{(A - G)^{2}}{A + G}.</tex-math></disp-formula>
      <p>Busemann's coefficient is statistically significant with 1 degree of freedom and p &lt; 0.05 if
<inline-formula><tex-math>\chi^{2} &gt; 3.84</tex-math></inline-formula>.</p>
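      <p>The two statistics can be sketched in a few lines of Python. This is only an illustration: it assumes that equation (1) is C = G / (A + G) and that equation (2) is the usual one-degree-of-freedom chi-square for an equal split, (A − G)² / (A + G). The function names and the counts are hypothetical, and 3.84 is the standard tabulated critical value for 1 degree of freedom at p = 0.05.</p>
      <preformat preformat-type="code"><![CDATA[
```python
def busemann_coefficient(adjectival, genitive):
    """Share of genitive attributes among all counted attributes: C = G / (A + G)."""
    total = adjectival + genitive
    if total == 0:
        raise ValueError("at least one attribute is required")
    return genitive / total


def chi_square_equal_split(adjectival, genitive):
    """Chi-square (1 d.f.) for the hypothesis that A and G are equally frequent."""
    total = adjectival + genitive
    if total == 0:
        raise ValueError("at least one attribute is required")
    return (adjectival - genitive) ** 2 / total


# Hypothetical counts for one 1000-word sample (not taken from the paper).
A, G = 80, 40
C = busemann_coefficient(A, G)
chi2 = chi_square_equal_split(A, G)
significant = chi2 > 3.84  # tabulated critical value, 1 d.f., p = 0.05
print(round(C, 3), round(chi2, 2), significant)
```
]]></preformat>
      <p>With these invented counts C is below 0.5, so adjectival attributes predominate, and the difference between A and G exceeds the critical value.</p>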
    </sec>
    <sec id="sec-3">
      <title>Results &amp; Discussion</title>
      <p>The results of the analysis are shown in Table 1. In all cases the results proved to be statistically
significant. Ranking the values of C in increasing order yields the graph in Figure 1.
As seen from Figure 1, the texts form a gently rising curve. It is noteworthy that in many cases texts
by the same author are positioned close to one another. Thus the three novels by Ustinova (T13–T15) are
placed next to each other and, besides, are characterized by nearly the same values of the coefficient.
Also close to one another are T1 and T3 (Tokareva), T5 and T6 (Ulitskaya), T7 and T9 (Tolstaya), and T11
and T12 (Marinina). This demonstrates comparatively similar relations between the two types of attributes
in the works of the same author.
It should also be noted that the authors of detective novels have somewhat lower values of the coefficient.</p>
      <p>The next step was to analyze how the relationship between genitive and adjectival attributes
develops from the beginning of the samples to the end. For this purpose, the number of all adjectival
attributes found before each genitive in the text was counted on a cumulative basis.</p>
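      <p>The cumulative count described above can be sketched as follows. The tag scheme ("A" for an adjectival attribute, "G" for a genitive attribute, "-" for any other word) and the sample sequence are assumptions made for illustration, not the annotation format used in the study.</p>
      <preformat preformat-type="code"><![CDATA[
```python
def adjectives_before_each_genitive(tags):
    """For every genitive attribute ("G"), return the number of adjectival
    attributes ("A") that occurred earlier in the sample (cumulative count)."""
    counts = []
    seen_adjectives = 0
    for tag in tags:
        if tag == "A":
            seen_adjectives += 1
        elif tag == "G":
            counts.append(seen_adjectives)
    return counts


# Hypothetical annotation of a short sample; "-" marks any other word.
sample = ["A", "-", "G", "A", "A", "-", "G", "G", "A", "G"]
series = adjectives_before_each_genitive(sample)
for ordinal, cumulative in enumerate(series, start=1):
    print(ordinal, cumulative)
```
]]></preformat>
      <p>Each printed pair gives the ordinal number of a genitive construction and the number of adjectival attributes that precede it.</p>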
      <p>As an example, let us consider the development of the relations between these two attribute types over
the text in two novels: “Skazat'-ne skazat'” by Tokareva (T1) and “Moye vtoroye ya” by Polyakova
(T16). The counts for these two texts are given in Table 2. The first column
gives the ordinal number of each genitive construction in the text. The second column gives the
cumulative number of adjectival attributes that occur in the text before the given genitive construction.
The third column contains the theoretically expected (according to the formula) frequencies
of adjectival attributes.</p>
      <p>The formula is as follows [Naumann et al., 2012]: <inline-formula><tex-math>y = a x^{b}</tex-math></inline-formula>, where a and b are parameters.</p>
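      <p>One simple way to estimate a and b for a power function is ordinary least squares on log-transformed values. The sketch below is an assumption for illustration only: the source does not say which fitting procedure was used, the function name is hypothetical, and the data are generated artificially. The log transform requires all x and y to be positive, so a leading cumulative count of 0 would have to be handled separately.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def fit_power_function(xs, ys):
    """Estimate a and b in y = a * x**b by least squares on log-transformed data."""
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(xs)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    sxx = sum((lx - mean_x) ** 2 for lx in log_x)
    sxy = sum((lx - mean_x) * (ly - mean_y) for lx, ly in zip(log_x, log_y))
    b = sxy / sxx                          # slope in log-log space
    a = math.exp(mean_y - b * mean_x)      # intercept transformed back
    return a, b


# Artificial data generated from y = 2 * x**1.5, so the fit should recover these values.
xs = [1, 2, 3, 4, 5]
ys = [2 * x ** 1.5 for x in xs]
a, b = fit_power_function(xs, ys)
print(round(a, 3), round(b, 3))
```
]]></preformat>
      <p>A nonlinear least-squares fit on the untransformed counts would weight the points differently and may give slightly different parameter values.</p>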
      <p>The results are shown in graphical form in Figures 2 and 3. As shown in the figures, the observed
frequencies of adjectival attributes (dots) are very close to the theoretically expected ones, shown as a
solid curve.</p>
      <p>The values of the parameters a and b are as follows. For T1, a = 4.274 and b = 0.812; for T16,
a = 1.638 and b = 1.275. If b &lt; 1 the curve is concave (Figure 2); if b &gt; 1 the curve is convex (Figure
3) [Naumann et al., 2012: 26–27].</p>
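      <p>The link between b and the shape of the curve can be checked with a short derivation, sketched here under the assumption that a &gt; 0, b &gt; 0 and x &gt; 0 (the source states only the resulting rule):</p>
      <disp-formula><tex-math>y = a x^{b}, \qquad \frac{dy}{dx} = a b\, x^{b-1}, \qquad \frac{d^{2}y}{dx^{2}} = a b (b - 1)\, x^{b-2}</tex-math></disp-formula>
      <p>Under these assumptions the sign of the second derivative equals the sign of b − 1: it is negative for b &lt; 1 (concave curve, the growth of adjectival attributes slows down), positive for b &gt; 1 (convex curve, the growth speeds up), and zero for b = 1 (a straight line).</p>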
      <p>Table 3 contains the values of the parameters a and b of the power function and the coefficient
of determination R<sup>2</sup> for all 18 texts.</p>
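      <p>The coefficient of determination compares the residual variation around the fitted curve with the total variation of the observed counts. The sketch below uses the common definition 1 − SS<sub>res</sub> / SS<sub>tot</sub>, which is an assumption since the source does not spell out its formula. The parameters a and b are those reported for T1, while the observed counts are invented for illustration.</p>
      <preformat preformat-type="code"><![CDATA[
```python
def coefficient_of_determination(observed, expected):
    """R^2 = 1 - SS_res / SS_tot for observed values and model predictions."""
    mean_observed = sum(observed) / len(observed)
    ss_res = sum((o - e) ** 2 for o, e in zip(observed, expected))
    ss_tot = sum((o - mean_observed) ** 2 for o in observed)
    return 1 - ss_res / ss_tot


a, b = 4.274, 0.812               # parameters reported for T1
observed = [4, 8, 10, 13, 16]     # hypothetical cumulative counts, not from Table 2
expected = [a * x ** b for x in range(1, len(observed) + 1)]
r2 = coefficient_of_determination(observed, expected)
print(round(r2, 3))
```
]]></preformat>
      <p>Values close to 1 mean that the power function reproduces the observed cumulative counts almost exactly.</p>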
      <p>R<sup>2</sup> is very high for all the novels, which indicates a good fit. Parameter b, which shows the increase
or decrease of adjectival attributes towards the end, is rather different even in the novels of the same
writer. Only in one case (Ulitskaya) do all the novels of the same author show the same tendency of
a gradually decreasing number of adjectives over the text, as in all her novels (T4–T6) b &lt; 1. The
biggest increase in the number of adjectival attributes and, correspondingly, decrease in genitives
is seen in T2 and T3 (Tokareva). Conversely, the largest increase in the number of genitives takes
place in the novel by Ulitskaya (T6). Marinina (T10–T12) demonstrates a highly balanced relationship between
adjectival attributes and genitives from the beginning to the end of her novels.</p>
      <p>On the whole, the analysis revealed the existence of general tendencies as well as certain differences
in style. Different aspects of the relations between the two main types of attributes in a text make it possible
to estimate the role of different kinds of descriptiveness in an author's style and can be used as
objective criteria for the differentiation and classification of styles. It should be noted that further steps are
needed to get a more complete picture of the relationship between different types of attributes.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>[Juola, 2006] <string-name><surname>Juola</surname> <given-names>P.</given-names></string-name> (<year>2006</year>) <article-title>Authorship attribution</article-title> // <source>Foundations and Trends in Information Retrieval</source>. December 2006. Vol. <volume>1</volume>. Is. <issue>3</issue>. Hanover, MA, USA: <publisher-name>Now Publishers Inc.</publisher-name> P. <fpage>233</fpage>–<lpage>334</lpage>.</mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>[Holmes, 1994] <string-name><surname>Holmes</surname> <given-names>D.I.</given-names></string-name> (<year>1994</year>) <article-title>Authorship attribution</article-title> // <source>Computers and the Humanities</source>. Vol. <volume>28</volume>, No. <issue>2</issue>. P. <fpage>87</fpage>–<lpage>106</lpage>.</mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>[Rudman, 1998] <string-name><surname>Rudman</surname> <given-names>J.</given-names></string-name> (<year>1998</year>) <article-title>Non-traditional authorship attribution studies in the Historia Augusta: Some caveats</article-title> // <source>Literary and Linguistic Computing</source>. Vol. <volume>13</volume>, No. <issue>3</issue>. P. <fpage>151</fpage>–<lpage>157</lpage>.</mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>[Köhler, Altmann, 2014] <string-name><surname>Köhler</surname> <given-names>R.</given-names></string-name>, <string-name><surname>Altmann</surname> <given-names>G.</given-names></string-name> (<year>2014</year>) <source>Problems in Quantitative Linguistics</source>. Lüdenscheid: <publisher-name>RAM-Verlag</publisher-name>. 148 p.</mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>[Altmann, 2015] <string-name><surname>Altmann</surname> <given-names>G.</given-names></string-name> (<year>2015</year>) <source>Problems in Quantitative Linguistics</source>. Vol. <volume>5</volume>. Lüdenscheid: <publisher-name>RAM-Verlag</publisher-name>.</mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>[Andreev, Místecký, Altmann, 2018] <string-name><surname>Andreev</surname> <given-names>S.</given-names></string-name>, <string-name><surname>Místecký</surname> <given-names>M.</given-names></string-name>, <string-name><surname>Altmann</surname> <given-names>G.</given-names></string-name> (<year>2018</year>) <source>Sonnets: Quantitative Inquiries</source>. Studies in Quantitative Linguistics, <volume>29</volume>. Lüdenscheid: <publisher-name>RAM-Verlag</publisher-name>. 130 p.</mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>[Naumann et al., 2012] <string-name><surname>Naumann</surname> <given-names>S.</given-names></string-name>, <string-name><surname>Popescu</surname> <given-names>I.-I.</given-names></string-name>, <string-name><surname>Altmann</surname> <given-names>G.</given-names></string-name> (<year>2012</year>) <article-title>Aspects of nominal style</article-title> // <source>Glottometrics</source>. Vol. <volume>23</volume>. P. <fpage>23</fpage>–<lpage>55</lpage>.</mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>