<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Vocabulary in British</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nataliya Bondarchuk</string-name>
          <email>nataliia.i.bondarchuk@lpnu.ua</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ivan Bekhta</string-name>
          <email>ivan.bekhta@lnu.edu.ua</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oksana Melnychuk</string-name>
          <email>melnychuk_oksanadm@ukr.net</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Olha Matviienkiv</string-name>
          <email>olha.matviyenkiv@lnu.edu.ua</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ivan Franko National University of Lviv</institution>
          ,
          <addr-line>Lviv, 7900</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Lviv Polytechnic National University, Ivan Franko National University of Lviv</institution>
          ,
          <addr-line>Lviv 79013</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Lviv Polytechnic National University</institution>
          ,
          <addr-line>Lviv 79013</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Rivne Medical Academy</institution>
          ,
          <addr-line>Rivne, Ukraine, 33017</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The survey centers on the examination of keywords and related themes representing weather in four daily British newspapers (The Times, The Guardian, The Daily Mail, The Sun) between 2014 and 2017. The articles in this period mentioning weather news represent the corpus of our research. The goals of the research are the following: expose frequently occurring words (keywords) in the corpus, categorize them into groups according to relevant themes in the text, identify the quantitative content of each lexicalthematic group, as well as determine dominant themes of weather news. The computer software that was used to establish keywords is WordSmith Tools 7.0 with the British National Corpus as a reference corpus. A method for automatic cataloging of keywords is described. The corpus contains 746 324 words taken from 180 newspaper articles under research. Despite the necessity of keyword study, thematic and quantitative analyses provide deeper insight into text-specific weather-related vocabulary and its textualizing role. The analysis of quantitative data helps to select two dominant lexical-thematic groups ‒ “Weather extremes” and “Weather and people”, giving evidence of central themes discussed in weather news. Hence, the resulting major themes are the depiction of adverse weather conditions affecting people's daily life; the representation of the effects of weather disasters on people and their environment. The obtained results highlight the link between a theme/themes and lexical level of the text proving the efficiency of keyword analysis in the research.</p>
      </abstract>
      <kwd-group>
        <kwd>1 keyword analysis</kwd>
        <kwd>lexical-thematic groups</kwd>
        <kwd>quantitative analysis</kwd>
        <kwd>weather</kwd>
        <kwd>WordSmith Tools</kwd>
        <kwd>corpus</kwd>
        <kwd>vocabulary</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Keyword analysis is a widely used method in various sciences and fields, in particular corpus
linguistics. Egbert and Biber suggest that it is used “to identify the words that are especially
characteristic of the texts in a target discourse domain” [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Keyword extraction is an optimal way while
clustering, classifying, indexing and visualizing texts of different discourses, genres or text types.
However, the application of keyword analysis to the text or corpus requires further interpretation of the
results since keywords which are blindly extracted on the basis of their frequencies do not convey
relationship with other words/keywords or texts. Considering the key words at the linguistic level, the
main idea, as stated by M.Scott, is that keyness is not language dependent, but text dependent [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Drawing from this, the advantage of the use of keyword analysis lies in the extraction of text-specific
vocabulary. Therefore, in our research generation of keywords is a starting point of the analysis for
further categorization of keywords into thematic groups representing weather. We understand thematic
groups as groups of lexical units used within the text interchangeably to convey certain semantic
meaning [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Prior surveys of such nature concentrated more on sentiment and quantitative analysis of weather
vocabulary [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4, 5, 6</xref>
        ], corpus-based analysis of weather metaphors [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and climate representation [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
The absence of relevant computer-based studies on the topic constitute the topicality of the survey. This
paper presents an interdisciplinary approach that incorporates linguistic and computer-based (statistical)
techniques to the analysis of weather-related vocabulary to define dominant themes that are specific to
weather news of British press, thus offering prospects for a better understanding of its contextualizing
and textualizing role in newspaper discourse.
      </p>
      <p>The primary objective of this paper is to present an argument for the definition of keywords
according to different approaches. The secondary aim is to automatically extract keywords form the
research corpus and organize them into lexical-thematic groups, as well as to find out their quantitative
composition. As a result, dominant themes circulating in the texts of weather news may be defined.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related works</title>
      <p>
        The study of keywords is associated with the works of G. Matore [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], R. Williams [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], M. Scott
[
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ], Tribble [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], P. Baker [14; 15], T. Berber Sardinga [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], M. Bondi [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], A. Wierzbicka [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], L.
Jeffriesand, B. Walker [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], J. Sinclair [
        <xref ref-type="bibr" rid="ref19 ref20">19, 20</xref>
        ], J. Firth [21], N. Fairclough [
        <xref ref-type="bibr" rid="ref21">22</xref>
        ], Gries [
        <xref ref-type="bibr" rid="ref22">23</xref>
        ], A.Wilson
[
        <xref ref-type="bibr" rid="ref23">24</xref>
        ], M. Stubbs [
        <xref ref-type="bibr" rid="ref24 ref25">25, 26</xref>
        ], M. Nelson [
        <xref ref-type="bibr" rid="ref26">27</xref>
        ], G. Leech [
        <xref ref-type="bibr" rid="ref27">28</xref>
        ], M. Mahlberg [
        <xref ref-type="bibr" rid="ref28">29</xref>
        ], J. Culpeper [
        <xref ref-type="bibr" rid="ref29">30</xref>
        ], T.
McEnery [
        <xref ref-type="bibr" rid="ref30">31</xref>
        ], G. Wilcock [
        <xref ref-type="bibr" rid="ref31">32</xref>
        ]. The notion of “key word” is multi-faceted and understood in different
senses in various disciplines. From a sociological point of view, key words are part of the vocabulary
of culture and society [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. These are the words that have a special status, express an important social
meaning and play a special role. From a linguistic point of view, they contribute to the long-lasting
search for meaning [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] and are the most important units of the semantic and stylistic structure of the
text. In corpus linguistics, keyword is defined as a word which occurs with significantly high frequency
in one corpus when compared to some appropriate normative corpus (Scott, 1997; Scott &amp; Tribble,
2006).
      </p>
      <p>
        Paying attention to the importance of keywords in creating textual content and meaning, it is
believed that keywords are lexical units with the greatest semantic content contributing to the structure
and semantic framework of the text [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. This makes keywords an effective method for identifying
lexical characteristics of texts [
        <xref ref-type="bibr" rid="ref33">34</xref>
        ]. The new research shows a tendency for ambiguity in the
terminological definition of keywords, which can be seen in three approaches:
      </p>
      <p>
        Cultural (Matore, J. Firth, R. Williams, A. Wierzbicka). The first researchers (J. Firth, R. Williams),
who discussed key words, were intuitively focused on words which, in their opinion, contain important
notions reflecting social or cultural problems. Already in the 1930s, J. Firth proposed to study socially
important words that could be called "focal" or "pivotal", and advocated an analysis of the distribution
of words, the meanings of which characterize the society in specific contexts, with specific associations
and values [21]. R. Williams tried to analyse modern culture by studying key words and established a
close link between key words and discursive society [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. However, while performing this analysis, he
focused on historical and social macro contextual factors without paying special attention to text and
genre and leaving the methodological tools of text analysis completely out of consideration.
      </p>
      <p>
        Quantitative (M. Scott). Based on the concept of corpus linguistics, M. Scott differentiated key
words by means of statistical characteristics. A word is deemed key if it is used in the text at least as
many times as the minimum frequency of occurrence is estimated by the user [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], or key words are
words whose frequency of occurrence in the text is exceptionally high, if we compare them with other
words [
        <xref ref-type="bibr" rid="ref32">33</xref>
        ]. Identification of elements that are repeated with statistically significant frequency is not an
analysis or interpretation of the text or corpus, but indicates the elements that need to be investigated
and explained. M. Scott distinguishes three types of key words: proper names, words that people
themselves consider to be key words and are indicators of the “aboutness” of a particular text, and
especially high-frequency words that are more indicative of style than of subject matter [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. When
talking about the topic and style of the text, as well as the role of key words in their identification,
attention is paid to what semantic structures are indicated by key words and in what way the author's
view influences them in the process of text creation [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. M. Scott compares the theme ("aboutness")
with the mental meta functions of M. A. K. Halliday [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The words gain meaning not from the link
between the word and the meaning, but from the intrinsic interaction with other words. Later, P. Baker,
using M. Scott's classification, described lexical key words (nouns, adjectives,) as subject words, i.e.
words that can be used to identify the topic of the text [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. However, the key words are not only
elements of the conceptual, but also of the grammatical structure of the text. Apart from informational
conjugation, they are indicators of communicative intention and micro- or macrostructure of the text.
The text is stored in the memory in a set of key words, which are then revealed during its retrieval.
Therefore, the notions of key and subject words are not identical.
      </p>
      <p>
        Lexical-thematic groups include words that constitute components of one main thematic line, the
elements of which realise a certain idea, while the key words either serve individual thematic blocks of
the text (local thematic words) or implement, together with other text elements, the ideological idea of
the whole work (universal key words) [
        <xref ref-type="bibr" rid="ref21">22</xref>
        ]. Consequently, key words of the lexical-thematic group are
frequently varied units of the lexical organization of the text. They play an important role in the lexical
structure of the text, they take part in shaping the content and creating the meaning for an adequate
comprehension. The key word as a stimulus word, a source of textual associations, based on linguistic
(paradigmatic, syntagmatic) and extralinguistic (thematic) links of lexical units, performs the function
of a core, which directs the process of text comprehension. This approach appears to meet the tasks of
our research the most.
      </p>
      <p>
        Phraseological (M. Stubbs). Key words are defined as phraseological units and phrases that are
constructed according to similar word models [
        <xref ref-type="bibr" rid="ref24">25</xref>
        ]. Assumption of key meaning through frequency can
also be seen in word forms, lemmas, and word sequences. This definition can easily be applied to more
complex units than words, pointing to current trends in descriptive and theoretical linguistics, in
particular phraseology. In essence, key words are not necessarily individual words, they can be clusters
or even phrases [
        <xref ref-type="bibr" rid="ref20 ref28">20, 29</xref>
        ]. A quite different approach was taken by M. Hoey, who, taking the category
of text as a basis, showed that lexical links in a speech can be considered as indicators of text structure
or potential acceleration or lexical models can reveal textual (as opposed to grammatical) models [
        <xref ref-type="bibr" rid="ref35">36</xref>
        ].
Key words are not necessarily the main ones in the textual sense, but they can help to understand the
idea of the text by repetition. M.Toolan mentioned repetition as one of the key word figures, which has
"a very rich semantic meaning" [
        <xref ref-type="bibr" rid="ref36">37</xref>
        ]. The key words are intended to focus the reader's attention on the
necessary state of speech in the production of a coherent text. They can act as markers of coherence of
this or that text and at the same time give the texts a unique author’s style.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Methodology</title>
      <p>The procedure of our analysis involves the choice of relevant material and methods of the research
which combine a computer-based model of keyword analysis with traditional qualitative (in particular,
thematic analysis) and quantitative analysis. Thus, in our framework we use three procedural steps:
corpus compilation, keyword analysis, thematic and quantitative analyses.</p>
      <p>
        The first step of the investigation was to compile the corpus of the research. The data used in the
study is the corpus of weather news selected from British online daily newspapers between 2014 and
2017 (The Times, The Guardian, The Daily Mail, The Sun) which consists of 746 324 words taken
from 180 newspaper articles. While compiling a corpus the following criteria were considered: firstly,
the timeframe of four years (2014‒2017) to represent recent use of the related themes, secondly, open
access articles to be easily downloaded. Finally, the texts from theguardian.com/uk [
        <xref ref-type="bibr" rid="ref37">38</xref>
        ],
thedailymail.co.uk [
        <xref ref-type="bibr" rid="ref38">39</xref>
        ], thesun.co.uk [
        <xref ref-type="bibr" rid="ref39">40</xref>
        ], thetimes.co.uk [
        <xref ref-type="bibr" rid="ref40">41</xref>
        ] were selected by using the search term
“weather”. The timeframe and the amount of words testify the representativeness and validity of the
results. The dataset was made into a text file (.txt) and later imported to the WordSmith software.
      </p>
      <p>
        Our next step was to extract keywords using this software. For this reason, the British National
Corpus (hereinafter ‒ BNC) was chosen as a suitable reference corpus since all data are specific to
British English. In addition, BNC is one of the largest corpora which contains 100 million words of text
from a wide range of genres (e.g. spoken, fiction, magazines, newspapers, and academic) [
        <xref ref-type="bibr" rid="ref41">42</xref>
        ]. The aim
of a keyword analysis was to retrieve the words which are statistically relevant for the investigation.
Consequently, we constructed two word frequency lists with the help of WordList tool: of a target
corpus of weather texts and of a reference corpus (BNC) and generated a keyword list.
      </p>
      <p>
        According to P. Baker, keyword extraction requires “a way that combines the strength of key
keywords with those of keywords but is neither too general or exaggerates the importance of a word
based on the eccentricities of individual files [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Therefore, we have taken into account both
statistically significant (positive, high-frequency) and negative (unusually low frequency) keyword
items using log likelihood ratio (Dunning 1993) [
        <xref ref-type="bibr" rid="ref42">43</xref>
        ]. The cells in the generated keyword list with
negative keywords were shaded in red and had a negative log likelihood value. The reason for taking
into consideration words with low frequencies is that our reference corpus consists of a collection of
rather small texts. Consequently, the distribution of some words in the text may be uneven and some of
the thematic lines might be lost. As stated by Gries, “corpora are inherently variable internally”[
        <xref ref-type="bibr" rid="ref22">23</xref>
        ] and
low frequency keywords may help us find additional „local‟ themes of weather news. In this case, the
issue may be solved by generating a wordlist for each single text in the research corpus, but it would be
very time-consuming.
      </p>
      <p>
        It is important to note here that the analysis is also restricted to content words only, which, being the
units of meaning, we define to be directly related to the identification of the theme. Function words
cannot demonstrate the link of lexical units and themes [
        <xref ref-type="bibr" rid="ref44">45</xref>
        ]. The extraction of keywords provides
insights into their further grouping by themes since the potential of the key lexical units is realized
within the whole text: the semantic influence of a single word (sign) is determined only by the whole
text [
        <xref ref-type="bibr" rid="ref43">44</xref>
        ]. This idea is also supported by Morris and Hirst, who explain that “when a unit of text is about
the same thing there is a strong tendency for semantically related words to be used within that unit”
[
        <xref ref-type="bibr" rid="ref45">46</xref>
        ].
      </p>
      <p>Thus, our last step was to group the keywords into lexical-thematic groups to define dominant
themes related to the representation of weather in the news of online press. To this end, we worked out
the following procedure: the context of each word from the keyword list is checked using the
concordancer and the word is put into the appropriate group manually.</p>
      <p>
        Two problematic issues that arose during this step were that of how to group 1) the words that could
fit in multiple thematic groups and 2) the words that do not fit into any one of them. We applied a
systematic hierarchical decision-making procedure and critical analysis to solve this issue: if a word
could fit into several thematic groups, it was categorized into each of them; and if a word did not have
an appropriate thematic group to be categorized, it was left out. In this step quantitative analysis was
also used to further explore and identify dominant/prevalent themes of weather news by finding
quantitative content of each thematic group. According to G. P. Cantos, in quantitative research,
linguistic features are classified and counted [
        <xref ref-type="bibr" rid="ref46">47</xref>
        ]. In recent years commensurate attention has been paid
to mixed-method studies of a text which use both quantitative and qualitative data [
        <xref ref-type="bibr" rid="ref43">44</xref>
        ]. As a result, the
combination of keyword, lexical-thematic and quantitative approach in our research opened new
opportunities for in-depth analysis of British weather news.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Experiment</title>
      <p>Having compared two word frequency lists, we have obtained a computed list of keywords (Table
1). The list of keywords was limited to 300 words. The presented list in Table 1 provides only the first
45 keywords and their frequencies, as they will be further investigated during their thematic grouping.
Table 1</p>
      <sec id="sec-4-1">
        <title>The list of keywords and their frequencies</title>
      </sec>
      <sec id="sec-4-2">
        <title>Keyword/Frequency</title>
      </sec>
      <sec id="sec-4-3">
        <title>Keyword/Frequency</title>
      </sec>
      <sec id="sec-4-4">
        <title>Keyword/Frequency</title>
      </sec>
      <sec id="sec-4-5">
        <title>CLIMATE 32</title>
      </sec>
      <sec id="sec-4-6">
        <title>COUNTRY 57</title>
      </sec>
      <sec id="sec-4-7">
        <title>SHOWERS 46</title>
      </sec>
      <sec id="sec-4-8">
        <title>HEAVY 56</title>
      </sec>
      <sec id="sec-4-9">
        <title>WINDS 46</title>
      </sec>
      <sec id="sec-4-10">
        <title>FORECASTERS 38</title>
      </sec>
      <sec id="sec-4-11">
        <title>CONDITIONS 57</title>
      </sec>
      <sec id="sec-4-12">
        <title>WARNING 37</title>
      </sec>
      <sec id="sec-4-13">
        <title>UPDATED 17</title>
      </sec>
      <sec id="sec-4-14">
        <title>ARCTIC 14</title>
      </sec>
      <sec id="sec-4-15">
        <title>WEEKEND 50</title>
      </sec>
      <sec id="sec-4-16">
        <title>HIGHS 13</title>
      </sec>
      <sec id="sec-4-17">
        <title>TODAY 40</title>
      </sec>
      <sec id="sec-4-18">
        <title>HEAT 66</title>
      </sec>
      <sec id="sec-4-19">
        <title>LOCALS 21</title>
      </sec>
      <sec id="sec-4-20">
        <title>SPELLS 13</title>
      </sec>
      <sec id="sec-4-21">
        <title>MORNING 39</title>
      </sec>
      <sec id="sec-4-22">
        <title>DOWNPOURS 15</title>
      </sec>
      <sec id="sec-4-23">
        <title>POLICE 25 CAR 24</title>
      </sec>
      <sec id="sec-4-24">
        <title>CHILD 8 PM 8</title>
      </sec>
      <sec id="sec-4-25">
        <title>DORIS 8</title>
      </sec>
      <sec id="sec-4-26">
        <title>BRITS 18</title>
      </sec>
      <sec id="sec-4-27">
        <title>COMMUTERS 10</title>
      </sec>
      <sec id="sec-4-28">
        <title>CHRISTMAS 9</title>
      </sec>
      <sec id="sec-4-29">
        <title>FLOODS 34</title>
      </sec>
      <sec id="sec-4-30">
        <title>TRANSPORT 14</title>
      </sec>
      <sec id="sec-4-31">
        <title>DISASTER 19</title>
      </sec>
      <sec id="sec-4-32">
        <title>TRAFFIC 11</title>
      </sec>
      <sec id="sec-4-33">
        <title>WARNED 27</title>
      </sec>
      <sec id="sec-4-34">
        <title>GALES 15</title>
      </sec>
      <sec id="sec-4-35">
        <title>DELAYS 17</title>
      </sec>
      <sec id="sec-4-36">
        <title>COAST 17 ICY 11</title>
      </sec>
      <sec id="sec-4-37">
        <title>FLOODING 23</title>
      </sec>
      <sec id="sec-4-38">
        <title>SUNSHINE 28</title>
      </sec>
      <sec id="sec-4-39">
        <title>STORM 34</title>
      </sec>
      <sec id="sec-4-40">
        <title>RAIN 68</title>
      </sec>
      <sec id="sec-4-41">
        <title>HEATWAVE 33</title>
      </sec>
      <sec id="sec-4-42">
        <title>EXTREME 39</title>
      </sec>
      <sec id="sec-4-43">
        <title>DAMAGE 28</title>
      </sec>
      <sec id="sec-4-44">
        <title>BRITAIN 19</title>
        <p>
          Scotts’ classification of keywords into proper nouns, aboutness keywords and high-frequency words
[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] is relevant for our study. The first type of keywords is usually represented in the corpus of our
research by place names (Britain), names of nationalities (Brits) and names of storms (Doris).
Aboutness keywords, the words that have semantic correlations with the main ideas and central themes
of the text, are the most numerous. The third type constitutes words with high frequencies that are
considered to be more indicators of style than theme. However, the objective of our study consists in
grouping keywords into thematic groups rather than classifying them by types.
        </p>
        <p>As defined earlier, our next step was to group the keywords by thematic categories representing
weather, which consists in classifying the lexical units according to the thematic groups and quantifying
them. Thus, the computed list of keywords was divided into five groups, each of which was classified
thematically. As a result, the following groups were organized: “Weather extremes”, “Climate”,
“Weather and people”, “Weather and nature”, “Weather phenomena”.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>The first thematic group “Weather extremes” consists of key lexical units that denote weather
catastrophes that cause destruction of material objects, casualties and even death of people. In the corpus
of the studied texts we find weather cataclysms of hydro- (floods, tsunami) and atmospheric
(blizzards/snowstorm, drought, hailstorm, heatwave, snow avalanche, showers, downpours,
thundersnow, storm, duststorm) origin, which accordingly constitute two subgroups of the group. This
group also collects words denoting: protective equipment (shelter, sandbag); locations (nomenclatures:
homes, businesses, region, village, area, country, town, adjectives: local, tropical, central, coastal);
means of transport and infrastructure (road, bridge, building, traffic, speed, travel, highway, train, boat,
van); general vocabulary (extreme, cancellation, delay, condition, incident, recovery, collapse, highs);
meteorological terms (icy, Doris, cyclone, warning); size/description (adjectives: thick, heavy, massive,
large, arctic), victims of the disaster (locals, mountaineer, immigrant, son, eyewitness, people, refugees,
children, driver, traveller, residents, civilians, kayakers, victims); organisations or political actors,
officials (minister, police); other actors involved in the disaster (commuters, coastguard, ambulance,
volunteers, paramedics, army, evacuees) and observation/analysis (expert, forecasters, meteorologist);
the consequences of the disaster (disaster, chaos, damage, mud, debris); emotional perception of the
disaster (alarm, alert, threat, fear, risk, danger, alarmed, terrified, fearful), actions (drop, force, leave,
move, block, trigger, halt, batter, collapse, damage, destroy, devastate, disrupt, kill, ruin, strike, warn).</p>
      <p>The second thematic group “Climate” is composed of the words denoting results and consequences
of climate problems, their interrelation and impact on weather conditions. The next Table 2 presents
the words that form this thematic group.
warming, flooding, greenhouse, ozone, hole, drought, deficit,
fire, glacier, thawing</p>
      <p>urbanization
ElNino, research, study, science, scientist, emissions, fossil,</p>
      <p>climate
impact, adaptation, conclusion, character, effects,</p>
      <p>assessment, committee
disastrous, climatic, man-made, annual, fossil,
human</p>
      <p>induced, global, vulnerable
exacerbate, pollute, emit, alter, modify, affect, devastate</p>
      <sec id="sec-5-1">
        <title>Total quantity: 41</title>
        <p>The keywords, the meaning of which thematically reflects the impact of weather conditions on
people’s comfortable living environment, their safety and health, we refer to the third thematic group
“Weather and people”. The keywords which are included into this group are shown in Table 3.
anger, grim, laugh, like, distressed, deranged, glee, happy,</p>
        <p>worried, devastated, shocked, hopeless, mad
traffic train, car, boat, vehicles, lights, tailback, sign, driving, speed,
delays, diversions, cancellation
roads, motorway, parks, station
passenger, the elderly, commuter, children, driver,
sun</p>
        <p>lovers, sunbathers, sunseekers, children, adults
bookies, holidays, Christmas, Easter, football, vacations,
weekend, match, holidaymakers, camper, traveler,
festival</p>
        <p>goer, beach, barbeque
inhaler, death, dehydration, cardiovascular, illness,
respiratory, diabetes, discomfort, faint, coughing,
hypothermia, cramp, dizziness, chronic, irritation, sensitive
country, homes, businesses, region, village, area, town</p>
      </sec>
      <sec id="sec-5-2">
        <title>Total quantity: 77</title>
        <p>Thematic group “Weather and nature” is characterized by the use of nominations of different species
of plants and animals, as well as natural processes which are directly influenced by the weather. Table
4 shows the keywords that refer to this group.
eucalupts, pines, sycamore, oak, ash, beech
chiffchaffs, dog, frigate, seabirds, fish, sparrow, starling,
chaffinches, greenfinches, blackbird, woodpigeon, dove, tit,</p>
        <p>puffin
apples, strawberry, blackberry, fruit</p>
        <p>crops, harvest, agriculture
breeding, flowering, leafing, ripening, budburst, growing,</p>
        <p>fruiting
wildlife, habitat, species, population, birds</p>
      </sec>
      <sec id="sec-5-3">
        <title>Total quantity: 40</title>
        <p>The fifth thematic group “Weather phenomena” includes key lexical units denoting different weather
phenomena and their manifestations. The lexical composition of this group is given in Table 5.
rain, flurry, thunder, snow, winds, heat, temperature, humid,</p>
        <p>fog, tempest, gale, sunshine</p>
        <p>ElNino, Met, mercury, updated
heavy, strong, clear, scorching, cold, conditions, humidity
unseasonable, awful, terrible, glorious, topsy-turvy, freak,</p>
        <p>crazy, driving, ropy, cooler
rise, blow, batter, hit, smite, move, soar, bask, plummet, grip,
batter, swoop, slam, loom, pummel, stop, finish, end, begin,
start</p>
      </sec>
      <sec id="sec-5-4">
        <title>Total quantity: 54</title>
        <p>Some of the keywords (morning, today, spells, pm) do not fit any of the groups and their number is
not enough to group them separately, that is why there were not classified. Quantitative composition
of each group is presented in the pie-chart diagram (Figure 1).</p>
        <p>Quantitative composition of thematic groups
77
41</p>
        <p>111
40</p>
        <p>54
“Weather extremes”</p>
        <p>“Weather phenomena" “Weather and nature”
"Weather and people"</p>
        <p>"Climate"</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion</title>
      <p>Keywords appear as effective tools for the analysis of thematic focus/foci of the text or corpus.
While the analysis of individual keywords does not provide a consistent and exhaustive text analysis,
their grouping into categories according to the themes they represent establishes a link between the
lexical level of the text and its themes.</p>
      <p>
        A quantitative analysis of the keywords representing the weather allowed us to identify dominant
thematic groups and, therefore, to recognize the themes that prevail in weather news of British online
press. The identification of lexical-semantic groups [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and related themes enables a more in-depth
understanding of the texts of weather news. This survey paves the way for the development of more
rigorous methodology for the analysis of relationship between keywords and other words in contexts.
It would also be interesting to closer examine weather vocabulary through keywords extracted using
other statistical tests (e.g. t-test, the Wilcoxon-Mann-Whitney test) or software packages and compare
their results, as well as to find out whether dominant themes related to the representation of weather
differ from newspaper to newspaper or particular season.
      </p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusions</title>
      <p>Having analysed the obtained data, we may conclude that weather news has anthropocentric
character, since dominant thematic groups are “Weather extremes” and “Weather and people”. Thus,
within the corpus of the research two thematic lines can be outlined: (1) the depiction of adverse weather
conditions affecting people’s daily life; (2) depicting the effects of weather disasters on people and their
environment. The overall study reveals the following conclusion: everything that happens in the sphere
of weather, in any case influences people, their physical, moral, psychological and emotional state.</p>
      <p>The method presented in this research provides more options for further collocational analysis of
weather-related vocabulary in weather news of British online press on the basis of concordances and
might be used as a cornerstone for the study of other vocabulary through keywords in different
discourses. Further scrutiny can be combined with a more interpretative approach to the analysis of
weather news in line with stylistic research.</p>
    </sec>
    <sec id="sec-8">
      <title>8. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Egbert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Biber</surname>
          </string-name>
          ,
          <article-title>Incorporating text dispersion into keyword analyses</article-title>
          .
          <source>Corpora</source>
          <volume>14</volume>
          /1:
          <fpage>77</fpage>
          -
          <lpage>104</lpage>
          ,
          <year>2019</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bondi</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          Scott (eds.), Keyness in Texts. Amsterdam: John Benjamins,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Dedrick</surname>
          </string-name>
          ,
          <string-name>
            <surname>R. E.</surname>
          </string-name>
          , MacLaury,
          <string-name>
            <given-names>G. V.</given-names>
            <surname>Paramei</surname>
          </string-name>
          , Anthropology of Color:
          <article-title>Interdisciplinary multilevel modeling. (OAPEN (Open Access Publishing in European Networks)</article-title>
          .) John Benjamins Publishing Company,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Thelwall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Buckley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Paltoglou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kappas</surname>
          </string-name>
          ,
          <article-title>Sentiment strength detection in short informal text</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          ,
          <volume>61</volume>
          (
          <issue>12</issue>
          ),
          <fpage>2544</fpage>
          -
          <lpage>2558</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>N.</given-names>
            <surname>Bondarchuk</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Bekhta</surname>
          </string-name>
          ,
          <article-title>Quantitative characteristics of lexical-semantic groups representing weather in weather news stories (based on British online press) In: Computational Linguistics and Intelligent Systems</article-title>
          , COLINS,
          <source>CEUR workshop proceedings</source>
          , Vol
          <volume>2870</volume>
          ,
          <fpage>799</fpage>
          -
          <lpage>810</lpage>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Jun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Zhendong</surname>
          </string-name>
          , Sh. Chongyang,
          <article-title>Sentiment Analysis Model on Weather Related Tweets with Deep Neural Network</article-title>
          .
          <source>In Proceedings of the 2018 10th International Conference on Machine Learning and Computing (ICMLC</source>
          <year>2018</year>
          ).
          <article-title>Association for Computing Machinery</article-title>
          , New York, NY, USA, (
          <year>2018</year>
          )
          <fpage>31</fpage>
          -
          <lpage>35</lpage>
          . DOI:https://doi.org/10.1145/3195106.3195111
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>I. Żołnowska</surname>
          </string-name>
          ,
          <article-title>Weather as the source domain for metaphorical expressions</article-title>
          .
          <source>AVANT. Pismo Awangardy Filozoficzno-Naukowej, (1)</source>
          ,
          <fpage>165</fpage>
          -
          <lpage>179</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Stewart</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. K.</given-names>
            <surname>Lazo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. E.</given-names>
            <surname>Morss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Demuth</surname>
          </string-name>
          ,
          <article-title>The Relationship of Weather Salience with the Perceptions and Uses of Weather Information in a Nationwide Sample of the United States</article-title>
          . Weather,
          <source>Climate &amp; Society</source>
          ,
          <volume>4</volume>
          (
          <issue>3</issue>
          ),
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>G.</given-names>
            <surname>Matoré</surname>
          </string-name>
          ,
          <article-title>La méthode en lexicologie</article-title>
          .
          <source>Domaine français</source>
          . Paris: Marcel Didier,
          <year>1953</year>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Williams</surname>
          </string-name>
          , Keywords. London: Fontana, [
          <year>1976</year>
          ]
          <fpage>1983</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>M.</given-names>
            <surname>Scott</surname>
          </string-name>
          , WordSmith Tools,
          <article-title>Version 5</article-title>
          .
          <source>Liverpool: Lexical Analysis Software</source>
          ,
          <year>2008</year>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Scott</surname>
          </string-name>
          ,
          <article-title>PC analysis of key words - and key key words</article-title>
          .
          <source>System</source>
          ,
          <volume>25</volume>
          (
          <issue>2</issue>
          ).
          <fpage>233</fpage>
          -
          <lpage>245</lpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Scott</surname>
          </string-name>
          , C. Tribble,
          <article-title>Textual patterns: Key words and corpus analysis in language education</article-title>
          . Amsterdam: Benjamins,
          <year>2006</year>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>P.</given-names>
            <surname>Baker</surname>
          </string-name>
          ,
          <article-title>Querying keywords: questions of difference, frequency and sense in keyword analysis</article-title>
          .
          <source>Journal of English Linguistics</source>
          <volume>32</volume>
          (
          <issue>4</issue>
          ):
          <fpage>346</fpage>
          -
          <lpage>359</lpage>
          ,
          <year>2004</year>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>P.</given-names>
            <surname>Baker</surname>
          </string-name>
          ,
          <article-title>The question is, how cruel is it? Keywords, fox hunting and the house of commons. What‟s in a word-list? Investigating word frequency</article-title>
          and keyword extraction / ed. by
          <string-name>
            <given-names>D.</given-names>
            <surname>Archer</surname>
          </string-name>
          . Farnham: Ashgate. P.
          <volume>125</volume>
          -
          <fpage>136</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>T.</given-names>
            <surname>Berber</surname>
          </string-name>
          <string-name>
            <surname>Sardinha</surname>
          </string-name>
          ,
          <article-title>Using key words in text analysis: Practical aspects</article-title>
          .
          <source>DIRECT Papers 42</source>
          ,
          <string-name>
            <surname>LAEL</surname>
          </string-name>
          , Catholic University of Sao Paulo,
          <year>1999</year>
          . 9 p
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A.</given-names>
            <surname>Wierzbicka</surname>
          </string-name>
          ,
          <source>Understanding Cultures Through Their Key Words: English</source>
          , Russian, Polish, German, and
          <string-name>
            <surname>Japanese</surname>
          </string-name>
          . New York: Oxford University Press, 317 p,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>L.</given-names>
            <surname>Jeffries</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Walker</surname>
          </string-name>
          , Key words in the press,
          <source>English Text Construction</source>
          <volume>5</volume>
          (
          <issue>2</issue>
          ):
          <fpage>208</fpage>
          -
          <lpage>29</lpage>
          ,
          <year>2012</year>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>J. M. Sinclair</surname>
            ,
            <given-names>Document</given-names>
          </string-name>
          <string-name>
            <surname>Relativity</surname>
          </string-name>
          . Tuscany, Italy: Tuscan Word Centre,
          <year>2005</year>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>J.</given-names>
            <surname>Sinclair</surname>
          </string-name>
          ,
          <article-title>The search for units of meaning</article-title>
          . Textus. No.
          <volume>9</volume>
          (
          <issue>1</issue>
          ). P.
          <volume>75</volume>
          -
          <fpage>106</fpage>
          ,
          <year>1996</year>
          . [21]
          <string-name>
            <given-names>J.</given-names>
            <surname>Firth</surname>
          </string-name>
          , Papers in Linguistics 1934-1951. London: Oxford University Press, 233 p.,
          <year>1957</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>N.</given-names>
            <surname>Fairclough</surname>
          </string-name>
          , New Labour. New Language? London: Routledge,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S.</given-names>
            <surname>Th</surname>
          </string-name>
          . Gries.„
          <article-title>Dispersions and adjusted frequencies in corpora‟</article-title>
          ,
          <source>International Journal of Corpus Linguistics</source>
          <volume>13</volume>
          :
          <fpage>403</fpage>
          -
          <lpage>37</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>A.</given-names>
            <surname>Wilson</surname>
          </string-name>
          , In press. „
          <article-title>Embracing Bayes Factors for key item analysis in corpus linguistics‟, in</article-title>
          <string-name>
            <given-names>A</given-names>
            .
            <surname>Koll-Stobbe</surname>
          </string-name>
          and
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>Bieswanger (eds</article-title>
          .),
          <article-title>New approaches to the study of linguistic variability</article-title>
          .
          <source>Frankfurt: Peter Lang</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>M.</given-names>
            <surname>Stubbs</surname>
          </string-name>
          , „
          <article-title>Three concepts of keywords‟</article-title>
          , in M. Bondi and
          <string-name>
            <given-names>M.</given-names>
            <surname>Scott</surname>
          </string-name>
          (eds.), pp.
          <fpage>21</fpage>
          -
          <lpage>42</lpage>
          ,
          <year>2010</year>
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M.</given-names>
            <surname>Stubbs</surname>
          </string-name>
          , Text and
          <string-name>
            <given-names>Corpus</given-names>
            <surname>Analysis</surname>
          </string-name>
          .
          <source>Blackwell</source>
          , Oxford, 288 p,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>M.</given-names>
            <surname>Nelson</surname>
          </string-name>
          ,
          <article-title>A corpus based study of business English teaching materials</article-title>
          . Unpublished
          <source>PhD Thesis</source>
          . University of Manchester,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>G.</given-names>
            <surname>Leech</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rayson</surname>
          </string-name>
          ,
          <string-name>
            <surname>A</surname>
          </string-name>
          . Wilson,
          <article-title>Word frequencies in written and spoken English: Based on the British National Corpus</article-title>
          . London: Longman,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mahlberg</surname>
          </string-name>
          , „
          <article-title>Clusters, key clusters and local textual functions in Dickens‟</article-title>
          ,
          <source>Corpora</source>
          <volume>2</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          . Republished 2012 in D. Biber and R. Reppen (eds.),
          <source>Corpus Linguistics, Part</source>
          <volume>3</volume>
          : Phraseology. London: Sage,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>J.</given-names>
            <surname>Culpeper</surname>
          </string-name>
          , Keyness: Words,
          <article-title>parts-of-speech and semantic categories in the character-talk of Shakespeare‟s Romeo and Juliet</article-title>
          ,
          <source>International Journal of Corpus Linguistics</source>
          <volume>14</volume>
          (
          <issue>1</issue>
          ):
          <fpage>29</fpage>
          -
          <lpage>59</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [31]
          <string-name>
            <surname>T. McEnery</surname>
          </string-name>
          ,
          <article-title>Keywords and moral panics: Mary Whitehouse and media censorship</article-title>
          , in D. Archer (ed.), pp.
          <fpage>93</fpage>
          -
          <lpage>124</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>G.</given-names>
            <surname>Wilcock</surname>
          </string-name>
          ,
          <article-title>Introduction to linguistic annotation and text analytics. (Introduction to linguistic annotation and text analytics</article-title>
          .) San Rafael, Calif.: Morgan &amp; Claypool,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [33] . Dilai,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dilai</surname>
          </string-name>
          ,
          <source>Automatic Extraction of Keywords in Political Speeches</source>
          ,
          <source>2020 IEEE 15th International Scientific and Technical Conference on Computer Sciences and Information Technologies, CSIT 2020 - Proceedings</source>
          ,
          <year>2020</year>
          , 1, art. no.
          <issue>9322011</issue>
          , pp.
          <fpage>291</fpage>
          -
          <lpage>294</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>D.</given-names>
            <surname>Biber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Reppen</surname>
          </string-name>
          , The Cambridge handbook of English corpus linguistics,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [35]
          <string-name>
            <surname>M. A. K. Halliday</surname>
          </string-name>
          , An Introduction to Functional
          <source>Grammar. 2nd edn. London: Edward Arnold</source>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>M.</given-names>
            <surname>Hoey</surname>
          </string-name>
          ,
          <article-title>The discourse colony; a preliminary study of a neglected discourse type</article-title>
          .
          <source>Talking about Text. Discourse Analysis Monograph</source>
          <volume>13</volume>
          , English Language Research / University of Birmingham.
          <year>1986</year>
          . P. 1-
          <fpage>26</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [37]
          <string-name>
            <surname>M. J. Toolan</surname>
          </string-name>
          ,
          <article-title>Narrative progression in the short story: A corpus stylistic approach</article-title>
          . Amsterdam: John Benjamins Pub. Co,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>The</given-names>
            <surname>Guardian</surname>
          </string-name>
          ,
          <fpage>2014</fpage>
          -
          <lpage>2017</lpage>
          [
          <article-title>Electronic resource]</article-title>
          . URL: https://www.theguardian.com/uk.
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [39]
          <string-name>
            <given-names>The</given-names>
            <surname>Daily Mail</surname>
          </string-name>
          ,
          <fpage>2014</fpage>
          -
          <lpage>2017</lpage>
          [
          <article-title>Electronic resource]</article-title>
          . URL: https://www.thedailymail.co.uk.
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>The</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <fpage>2014</fpage>
          -
          <lpage>2017</lpage>
          [
          <article-title>Electronic resource]</article-title>
          . URL: https://www.thesun.co.uk.
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>The</given-names>
            <surname>Times</surname>
          </string-name>
          ,
          <fpage>2014</fpage>
          -
          <lpage>2017</lpage>
          [
          <article-title>Electronic resource]</article-title>
          . URL: https://www.thetimes.co.uk.
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [42]
          <article-title>BNC</article-title>
          . URL: https://www.english-corpora.org/bnc/
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          [43]
          <string-name>
            <given-names>T.</given-names>
            <surname>Dunning</surname>
          </string-name>
          ,
          <article-title>Accurate methods for the statistics of surprise and coincidence</article-title>
          ,
          <source>Computational Linguistics</source>
          <volume>19</volume>
          (
          <issue>1</issue>
          ):
          <fpage>61</fpage>
          -
          <lpage>74</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          [44]
          <string-name>
            <given-names>B.</given-names>
            <surname>Fischer-Starcke</surname>
          </string-name>
          ,
          <article-title>Corpus linguistics in literary analysis: Jane Austen and her contemporaries</article-title>
          .
          <source>London: Continuum</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          [45]
          <string-name>
            <given-names>L.</given-names>
            <surname>Mastropiero</surname>
          </string-name>
          ,
          <article-title>Corpus stylistics in heart of darkness and its Italian translations</article-title>
          : Bloomsburry Publishing,
          <volume>248</volume>
          p.,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          [46]
          <string-name>
            <given-names>J.</given-names>
            <surname>Morris</surname>
          </string-name>
          , G. Hirst,
          <article-title>Lexical cohesion computed by thesaural relations as an indicator of the structure of text</article-title>
          .
          <source>Computational Linguistics</source>
          ,
          <volume>17</volume>
          (
          <issue>1</issue>
          ),
          <fpage>21</fpage>
          -
          <lpage>48</lpage>
          ,
          <year>1991</year>
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          [47]
          <string-name>
            <given-names>G. P.</given-names>
            <surname>Cantos</surname>
          </string-name>
          ,
          <article-title>Statistical methods in language and linguistic research</article-title>
          . Oakville, CT: Equinox Pub. Ltd.,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          [48]
          <string-name>
            <given-names>N.</given-names>
            <surname>Hrytsiv</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Shestakevych</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Shyyka</surname>
          </string-name>
          ,
          <article-title>Quantitative Parameters of Lucy Montgomery‟s Literary Style</article-title>
          ,
          <source>In Computational Linguistics and Intelligent Systems. Proceedings of the 5th International Conference on COLINS 2021</source>
          . Volume I: Workshop. Kharkiv, Ukraine,
          <source>April 22-23</source>
          ,
          <year>2021</year>
          , CEURWS.org, pp.
          <fpage>670</fpage>
          -
          <lpage>684</lpage>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          [49 ]
          <string-name>
            <given-names>L.</given-names>
            <surname>Jeffries</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Walker</surname>
          </string-name>
          ,
          <article-title>Choice is the Word of the Hour</article-title>
          .
          <source>In Keywords in the Press: The New Labour Years</source>
          (pp.
          <fpage>67</fpage>
          -
          <lpage>92</lpage>
          ). London: Bloomsbury Academic,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>