<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Keyword-based Study of Thematic Vocabulary in British Weather News</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Nataliya</forename><surname>Bondarchuk</surname></persName>
							<email>nataliia.i.bondarchuk@lpnu.ua</email>
							<affiliation key="aff0">
								<orgName type="institution">Lviv Polytechnic National University</orgName>
								<address>
									<postCode>79013</postCode>
									<settlement>Lviv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ivan</forename><surname>Bekhta</surname></persName>
							<email>ivan.bekhta@lnu.edu.ua</email>
							<affiliation key="aff1">
								<orgName type="institution" key="instit1">Lviv Polytechnic National University</orgName>
								<orgName type="institution" key="instit2">Ivan Franko National University of Lviv</orgName>
								<address>
									<postCode>79013</postCode>
									<settlement>Lviv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Oksana</forename><surname>Melnychuk</surname></persName>
							<email>melnychuk_oksanadm@ukr.net</email>
							<affiliation key="aff2">
								<orgName type="institution">Rivne Medical Academy</orgName>
								<address>
									<postCode>33017</postCode>
									<settlement>Rivne</settlement>
									<country>Ukraine, Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Olha</forename><surname>Matviienkiv</surname></persName>
							<email>olha.matviyenkiv@lnu.edu.ua</email>
							<affiliation key="aff3">
								<orgName type="institution">Ivan Franko National University of Lviv</orgName>
								<address>
									<postCode>7900</postCode>
									<settlement>Lviv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff4">
								<orgName type="department">International Conference on Computational Linguistics and Intelligent Systems</orgName>
								<address>
									<addrLine>May 12-13</addrLine>
									<postCode>2022</postCode>
									<settlement>Gliwice</settlement>
									<country key="PL">Poland</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Keyword-based Study of Thematic Vocabulary in British Weather News</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">554CCF2123026D4281C2D5E738D2FF97</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T12:58+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>keyword analysis, lexical-thematic groups, quantitative analysis, weather, WordSmith Tools, corpus, vocabulary (O. Matviienkiv)</term>
					<term>ORCID: 0000-0002-5772-8532 (N. Bondarchuk)</term>
					<term>0000-0002-9848-1505 (I. Bekhta)</term>
					<term>0000-0003-4619-363x (O. Melnychuk)</term>
					<term>0000-0002-4791-0277 (O. Matviienkiv)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The survey centers on the examination of keywords and related themes representing weather in four daily British newspapers (The Times, The Guardian, The Daily Mail, The Sun) between 2014 and 2017. The articles in this period mentioning weather news represent the corpus of our research. The goals of the research are the following: expose frequently occurring words (keywords) in the corpus, categorize them into groups according to relevant themes in the text, identify the quantitative content of each lexicalthematic group, as well as determine dominant themes of weather news. The computer software that was used to establish keywords is WordSmith Tools 7.0 with the British National Corpus as a reference corpus. A method for automatic cataloging of keywords is described. The corpus contains 746 324 words taken from 180 newspaper articles under research. Despite the necessity of keyword study, thematic and quantitative analyses provide deeper insight into text-specific weather-related vocabulary and its textualizing role. The analysis of quantitative data helps to select two dominant lexical-thematic groups -"Weather extremes" and "Weather and people", giving evidence of central themes discussed in weather news. Hence, the resulting major themes are the depiction of adverse weather conditions affecting people's daily life; the representation of the effects of weather disasters on people and their environment. The obtained results highlight the link between a theme/themes and lexical level of the text proving the efficiency of keyword analysis in the research.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Keyword analysis is a widely used method in various sciences and fields, in particular corpus linguistics. Egbert and Biber suggest that it is used "to identify the words that are especially characteristic of the texts in a target discourse domain" <ref type="bibr" target="#b0">[1]</ref>. Keyword extraction is an optimal way while clustering, classifying, indexing and visualizing texts of different discourses, genres or text types. However, the application of keyword analysis to the text or corpus requires further interpretation of the results since keywords which are blindly extracted on the basis of their frequencies do not convey relationship with other words/keywords or texts. Considering the key words at the linguistic level, the main idea, as stated by M.Scott, is that keyness is not language dependent, but text dependent <ref type="bibr" target="#b1">[2]</ref>. Drawing from this, the advantage of the use of keyword analysis lies in the extraction of text-specific vocabulary. Therefore, in our research generation of keywords is a starting point of the analysis for further categorization of keywords into thematic groups representing weather. We understand thematic groups as groups of lexical units used within the text interchangeably to convey certain semantic meaning <ref type="bibr" target="#b2">[3]</ref>.</p><p>Prior surveys of such nature concentrated more on sentiment and quantitative analysis of weather vocabulary <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5,</ref><ref type="bibr" target="#b5">6]</ref>, corpus-based analysis of weather metaphors <ref type="bibr" target="#b6">[7]</ref> and climate representation <ref type="bibr" target="#b7">[8]</ref>. The absence of relevant computer-based studies on the topic constitute the topicality of the survey. This paper presents an interdisciplinary approach that incorporates linguistic and computer-based (statistical) techniques to the analysis of weather-related vocabulary to define dominant themes that are specific to weather news of British press, thus offering prospects for a better understanding of its contextualizing and textualizing role in newspaper discourse.</p><p>The primary objective of this paper is to present an argument for the definition of keywords according to different approaches. The secondary aim is to automatically extract keywords form the research corpus and organize them into lexical-thematic groups, as well as to find out their quantitative composition. As a result, dominant themes circulating in the texts of weather news may be defined.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related works</head><p>The study of keywords is associated with the works of G. Matore <ref type="bibr" target="#b8">[9]</ref>, R. Williams <ref type="bibr" target="#b9">[10]</ref>, M. Scott <ref type="bibr" target="#b10">[11,</ref><ref type="bibr" target="#b11">12]</ref>, Tribble <ref type="bibr" target="#b12">[13]</ref>, P. Baker <ref type="bibr">[14; 15]</ref>, T. Berber Sardinga <ref type="bibr" target="#b15">[16]</ref>, M. Bondi <ref type="bibr" target="#b1">[2]</ref>, A. Wierzbicka <ref type="bibr" target="#b16">[17]</ref>, L. Jeffriesand, B. Walker <ref type="bibr" target="#b17">[18]</ref>, J. Sinclair <ref type="bibr" target="#b18">[19,</ref><ref type="bibr" target="#b19">20]</ref>, J. Firth <ref type="bibr" target="#b20">[21]</ref>, N. Fairclough <ref type="bibr" target="#b21">[22]</ref>, Gries <ref type="bibr" target="#b22">[23]</ref>, A.Wilson <ref type="bibr" target="#b23">[24]</ref>, M. Stubbs <ref type="bibr" target="#b24">[25,</ref><ref type="bibr" target="#b25">26]</ref>, M. Nelson <ref type="bibr" target="#b26">[27]</ref>, G. Leech <ref type="bibr" target="#b27">[28]</ref>, M. Mahlberg <ref type="bibr" target="#b28">[29]</ref>, J. Culpeper <ref type="bibr" target="#b29">[30]</ref>, T.</p><p>McEnery <ref type="bibr" target="#b30">[31]</ref>, G. Wilcock <ref type="bibr" target="#b31">[32]</ref>. The notion of "key word" is multi-faceted and understood in different senses in various disciplines. From a sociological point of view, key words are part of the vocabulary of culture and society <ref type="bibr" target="#b9">[10]</ref>. These are the words that have a special status, express an important social meaning and play a special role. From a linguistic point of view, they contribute to the long-lasting search for meaning <ref type="bibr" target="#b18">[19]</ref> and are the most important units of the semantic and stylistic structure of the text. In corpus linguistics, keyword is defined as a word which occurs with significantly high frequency in one corpus when compared to some appropriate normative corpus <ref type="bibr" target="#b11">(Scott, 1997;</ref><ref type="bibr" target="#b12">Scott &amp; Tribble, 2006)</ref>.</p><p>Paying attention to the importance of keywords in creating textual content and meaning, it is believed that keywords are lexical units with the greatest semantic content contributing to the structure and semantic framework of the text <ref type="bibr" target="#b18">[19]</ref>. This makes keywords an effective method for identifying lexical characteristics of texts <ref type="bibr" target="#b33">[34]</ref>. The new research shows a tendency for ambiguity in the terminological definition of keywords, which can be seen in three approaches:</p><p>Cultural (Matore, J. Firth, R. Williams, A. Wierzbicka). The first researchers (J. Firth, R. Williams), who discussed key words, were intuitively focused on words which, in their opinion, contain important notions reflecting social or cultural problems. Already in the 1930s, J. Firth proposed to study socially important words that could be called "focal" or "pivotal", and advocated an analysis of the distribution of words, the meanings of which characterize the society in specific contexts, with specific associations and values <ref type="bibr" target="#b20">[21]</ref>. R. Williams tried to analyse modern culture by studying key words and established a close link between key words and discursive society <ref type="bibr" target="#b9">[10]</ref>. However, while performing this analysis, he focused on historical and social macro contextual factors without paying special attention to text and genre and leaving the methodological tools of text analysis completely out of consideration.</p><p>Quantitative (M. Scott). Based on the concept of corpus linguistics, M. Scott differentiated key words by means of statistical characteristics. A word is deemed key if it is used in the text at least as many times as the minimum frequency of occurrence is estimated by the user <ref type="bibr" target="#b10">[11]</ref>, or key words are words whose frequency of occurrence in the text is exceptionally high, if we compare them with other words <ref type="bibr">[33]</ref>. Identification of elements that are repeated with statistically significant frequency is not an analysis or interpretation of the text or corpus, but indicates the elements that need to be investigated and explained. M. Scott distinguishes three types of key words: proper names, words that people themselves consider to be key words and are indicators of the "aboutness" of a particular text, and especially high-frequency words that are more indicative of style than of subject matter <ref type="bibr" target="#b11">[12]</ref>. When talking about the topic and style of the text, as well as the role of key words in their identification, attention is paid to what semantic structures are indicated by key words and in what way the author's view influences them in the process of text creation <ref type="bibr" target="#b1">[2]</ref>. M. Scott compares the theme ("aboutness") with the mental meta functions of M. A. K. Halliday <ref type="bibr" target="#b10">[11]</ref>. The words gain meaning not from the link between the word and the meaning, but from the intrinsic interaction with other words. Later, P. Baker, using M. Scott's classification, described lexical key words (nouns, adjectives,) as subject words, i.e. words that can be used to identify the topic of the text <ref type="bibr" target="#b14">[15]</ref>. However, the key words are not only elements of the conceptual, but also of the grammatical structure of the text. Apart from informational conjugation, they are indicators of communicative intention and micro-or macrostructure of the text. The text is stored in the memory in a set of key words, which are then revealed during its retrieval. Therefore, the notions of key and subject words are not identical.</p><p>Lexical-thematic groups include words that constitute components of one main thematic line, the elements of which realise a certain idea, while the key words either serve individual thematic blocks of the text (local thematic words) or implement, together with other text elements, the ideological idea of the whole work (universal key words) <ref type="bibr" target="#b21">[22]</ref>. Consequently, key words of the lexical-thematic group are frequently varied units of the lexical organization of the text. They play an important role in the lexical structure of the text, they take part in shaping the content and creating the meaning for an adequate comprehension. The key word as a stimulus word, a source of textual associations, based on linguistic (paradigmatic, syntagmatic) and extralinguistic (thematic) links of lexical units, performs the function of a core, which directs the process of text comprehension. This approach appears to meet the tasks of our research the most.</p><p>Phraseological (M. Stubbs). Key words are defined as phraseological units and phrases that are constructed according to similar word models <ref type="bibr" target="#b24">[25]</ref>. Assumption of key meaning through frequency can also be seen in word forms, lemmas, and word sequences. This definition can easily be applied to more complex units than words, pointing to current trends in descriptive and theoretical linguistics, in particular phraseology. In essence, key words are not necessarily individual words, they can be clusters or even phrases <ref type="bibr" target="#b19">[20,</ref><ref type="bibr" target="#b28">29]</ref>. A quite different approach was taken by M. Hoey, who, taking the category of text as a basis, showed that lexical links in a speech can be considered as indicators of text structure or potential acceleration or lexical models can reveal textual (as opposed to grammatical) models <ref type="bibr" target="#b35">[36]</ref>. Key words are not necessarily the main ones in the textual sense, but they can help to understand the idea of the text by repetition. M.Toolan mentioned repetition as one of the key word figures, which has "a very rich semantic meaning" <ref type="bibr" target="#b36">[37]</ref>. The key words are intended to focus the reader's attention on the necessary state of speech in the production of a coherent text. They can act as markers of coherence of this or that text and at the same time give the texts a unique author's style.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Methodology</head><p>The procedure of our analysis involves the choice of relevant material and methods of the research which combine a computer-based model of keyword analysis with traditional qualitative (in particular, thematic analysis) and quantitative analysis. Thus, in our framework we use three procedural steps: corpus compilation, keyword analysis, thematic and quantitative analyses.</p><p>The first step of the investigation was to compile the corpus of the research. The data used in the study is the corpus of weather news selected from British online daily newspapers between 2014 and 2017 (The Times, The Guardian, The Daily Mail, The Sun) which consists of 746 324 words taken from 180 newspaper articles. While compiling a corpus the following criteria were considered: firstly, the timeframe of four years (2014-2017) to represent recent use of the related themes, secondly, open access articles to be easily downloaded. Finally, the texts from theguardian.com/uk [38], thedailymail.co.uk <ref type="bibr" target="#b37">[39]</ref>, thesun.co.uk <ref type="bibr" target="#b38">[40]</ref>, thetimes.co.uk <ref type="bibr" target="#b39">[41]</ref> were selected by using the search term "weather". The timeframe and the amount of words testify the representativeness and validity of the results. The dataset was made into a text file (.txt) and later imported to the WordSmith software. Our next step was to extract keywords using this software. For this reason, the British National Corpus (hereinafter -BNC) was chosen as a suitable reference corpus since all data are specific to British English. In addition, BNC is one of the largest corpora which contains 100 million words of text from a wide range of genres (e.g. spoken, fiction, magazines, newspapers, and academic) <ref type="bibr" target="#b40">[42]</ref>. The aim of a keyword analysis was to retrieve the words which are statistically relevant for the investigation. Consequently, we constructed two word frequency lists with the help of WordList tool: of a target corpus of weather texts and of a reference corpus (BNC) and generated a keyword list.</p><p>According to P. Baker, keyword extraction requires "a way that combines the strength of key keywords with those of keywords but is neither too general or exaggerates the importance of a word based on the eccentricities of individual files <ref type="bibr" target="#b13">[14]</ref>. Therefore, we have taken into account both statistically significant (positive, high-frequency) and negative (unusually low frequency) keyword items using log likelihood ratio <ref type="bibr" target="#b41">(Dunning 1993</ref>) <ref type="bibr" target="#b41">[43]</ref>. The cells in the generated keyword list with negative keywords were shaded in red and had a negative log likelihood value. The reason for taking into consideration words with low frequencies is that our reference corpus consists of a collection of rather small texts. Consequently, the distribution of some words in the text may be uneven and some of the thematic lines might be lost. As stated by Gries, "corpora are inherently variable internally" <ref type="bibr" target="#b22">[23]</ref> and low frequency keywords may help us find additional "local" themes of weather news. In this case, the issue may be solved by generating a wordlist for each single text in the research corpus, but it would be very time-consuming.</p><p>It is important to note here that the analysis is also restricted to content words only, which, being the units of meaning, we define to be directly related to the identification of the theme. Function words cannot demonstrate the link of lexical units and themes <ref type="bibr" target="#b43">[45]</ref>. The extraction of keywords provides insights into their further grouping by themes since the potential of the key lexical units is realized within the whole text: the semantic influence of a single word (sign) is determined only by the whole text <ref type="bibr" target="#b42">[44]</ref>. This idea is also supported by Morris and Hirst, who explain that "when a unit of text is about the same thing there is a strong tendency for semantically related words to be used within that unit" <ref type="bibr" target="#b44">[46]</ref>. Thus, our last step was to group the keywords into lexical-thematic groups to define dominant themes related to the representation of weather in the news of online press. To this end, we worked out the following procedure: the context of each word from the keyword list is checked using the concordancer and the word is put into the appropriate group manually.</p><p>Two problematic issues that arose during this step were that of how to group 1) the words that could fit in multiple thematic groups and 2) the words that do not fit into any one of them. We applied a systematic hierarchical decision-making procedure and critical analysis to solve this issue: if a word could fit into several thematic groups, it was categorized into each of them; and if a word did not have an appropriate thematic group to be categorized, it was left out. In this step quantitative analysis was also used to further explore and identify dominant/prevalent themes of weather news by finding quantitative content of each thematic group. According to G. P. Cantos, in quantitative research, linguistic features are classified and counted <ref type="bibr" target="#b45">[47]</ref>. In recent years commensurate attention has been paid to mixed-method studies of a text which use both quantitative and qualitative data <ref type="bibr" target="#b42">[44]</ref>. As a result, the combination of keyword, lexical-thematic and quantitative approach in our research opened new opportunities for in-depth analysis of British weather news.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Experiment</head><p>Having compared two word frequency lists, we have obtained a computed list of keywords (Table <ref type="table" target="#tab_0">1</ref>). The list of keywords was limited to 300 words. The presented list in Table <ref type="table" target="#tab_0">1</ref> provides only the first 45 keywords and their frequencies, as they will be further investigated during their thematic grouping. Scotts' classification of keywords into proper nouns, aboutness keywords and high-frequency words <ref type="bibr" target="#b11">[12]</ref> is relevant for our study. The first type of keywords is usually represented in the corpus of our research by place names (Britain), names of nationalities (Brits) and names of storms (Doris). Aboutness keywords, the words that have semantic correlations with the main ideas and central themes of the text, are the most numerous. The third type constitutes words with high frequencies that are considered to be more indicators of style than theme. However, the objective of our study consists in grouping keywords into thematic groups rather than classifying them by types.</p><p>As defined earlier, our next step was to group the keywords by thematic categories representing weather, which consists in classifying the lexical units according to the thematic groups and quantifying them. Thus, the computed list of keywords was divided into five groups, each of which was classified thematically. As a result, the following groups were organized: "Weather extremes", "Climate", "Weather and people", "Weather and nature", "Weather phenomena".</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Results</head><p>The first thematic group "Weather extremes" consists of key lexical units that denote weather catastrophes that cause destruction of material objects, casualties and even death of people. In the corpus of the studied texts we find weather cataclysms of hydro-(floods, tsunami) and atmospheric (blizzards/snowstorm, drought, hailstorm, heatwave, snow avalanche, showers, downpours, thundersnow, storm, duststorm) origin, which accordingly constitute two subgroups of the group. This group also collects words denoting: protective equipment (shelter, sandbag); locations (nomenclatures: homes, businesses, region, village, area, country, town, adjectives: local, tropical, central, coastal); means of transport and infrastructure (road, bridge, building, traffic, speed, travel, highway, train, boat, van); general vocabulary (extreme, cancellation, delay, condition, incident, recovery, collapse, highs); meteorological terms (icy, Doris, cyclone, warning); size/description (adjectives: thick, heavy, massive, large, arctic), victims of the disaster (locals, mountaineer, immigrant, son, eyewitness, people, refugees, children, driver, traveller, residents, civilians, kayakers, victims); organisations or political actors, officials (minister, police); other actors involved in the disaster (commuters, coastguard, ambulance, volunteers, paramedics, army, evacuees) and observation/analysis (expert, forecasters, meteorologist); the consequences of the disaster (disaster, chaos, damage, mud, debris); emotional perception of the disaster (alarm, alert, threat, fear, risk, danger, alarmed, terrified, fearful), actions (drop, force, leave, move, block, trigger, halt, batter, collapse, damage, destroy, devastate, disrupt, kill, ruin, strike, warn).</p><p>The second thematic group "Climate" is composed of the words denoting results and consequences of climate problems, their interrelation and impact on weather conditions. The next Table <ref type="table" target="#tab_1">2</ref> presents the words that form this thematic group. The keywords, the meaning of which thematically reflects the impact of weather conditions on people's comfortable living environment, their safety and health, we refer to the third thematic group "Weather and people". The keywords which are included into this group are shown in Table <ref type="table" target="#tab_2">3</ref>. Thematic group "Weather and nature" is characterized by the use of nominations of different species of plants and animals, as well as natural processes which are directly influenced by the weather. Table <ref type="table" target="#tab_3">4</ref> shows the keywords that refer to this group. The fifth thematic group "Weather phenomena" includes key lexical units denoting different weather phenomena and their manifestations. The lexical composition of this group is given in Table <ref type="table" target="#tab_4">5</ref>. Some of the keywords (morning, today, spells, pm) do not fit any of the groups and their number is not enough to group them separately, that is why there were not classified. Quantitative composition of each group is presented in the pie-chart diagram (Figure <ref type="figure" target="#fig_0">1</ref>). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Discussion</head><p>Keywords appear as effective tools for the analysis of thematic focus/foci of the text or corpus. While the analysis of individual keywords does not provide a consistent and exhaustive text analysis, their grouping into categories according to the themes they represent establishes a link between the lexical level of the text and its themes.</p><p>A quantitative analysis of the keywords representing the weather allowed us to identify dominant thematic groups and, therefore, to recognize the themes that prevail in weather news of British online press. The identification of lexical-semantic groups <ref type="bibr" target="#b4">[5]</ref> and related themes enables a more in-depth understanding of the texts of weather news. This survey paves the way for the development of more rigorous methodology for the analysis of relationship between keywords and other words in contexts. It would also be interesting to closer examine weather vocabulary through keywords extracted using other statistical tests (e.g. t-test, the Wilcoxon-Mann-Whitney test) or software packages and compare their results, as well as to find out whether dominant themes related to the representation of weather differ from newspaper to newspaper or particular season.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Conclusions</head><p>Having analysed the obtained data, we may conclude that weather news has anthropocentric character, since dominant thematic groups are "Weather extremes" and "Weather and people". Thus, within the corpus of the research two thematic lines can be outlined: (1) the depiction of adverse weather conditions affecting people's daily life; (2) depicting the effects of weather disasters on people and their environment. The overall study reveals the following conclusion: everything that happens in the sphere of weather, in any case influences people, their physical, moral, psychological and emotional state.</p><p>The method presented in this research provides more options for further collocational analysis of weather-related vocabulary in weather news of British online press on the basis of concordances and might be used as a cornerstone for the study of other vocabulary through keywords in different "Weather phenomena" "Weather and nature" "Weather and people" "Climate"</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Quantitative composition of thematic groups</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>The list of keywords and their frequencies</figDesc><table><row><cell>COUNTRY 57</cell><cell>SPELLS 13</cell><cell>TRAFFIC 11</cell></row><row><cell>SHOWERS 46</cell><cell>MORNING 39</cell><cell>WARNED 27</cell></row><row><cell>HEAVY 56</cell><cell>DOWNPOURS 15</cell><cell>GALES 15</cell></row><row><cell>WINDS 46</cell><cell>POLICE 25</cell><cell>DELAYS 17</cell></row><row><cell>FORECASTERS 38</cell><cell>CAR 24</cell><cell>COAST 17</cell></row><row><cell>CONDITIONS 57</cell><cell>CHILD 8</cell><cell>ICY 11</cell></row><row><cell>WARNING 37</cell><cell>PM 8</cell><cell>FLOODING 23</cell></row><row><cell>UPDATED 17</cell><cell>DORIS 8</cell><cell>SUNSHINE 28</cell></row><row><cell>ARCTIC 14</cell><cell>BRITS 18</cell><cell>STORM 34</cell></row><row><cell>WEEKEND 50</cell><cell>COMMUTERS 10</cell><cell>RAIN 68</cell></row><row><cell>HIGHS 13</cell><cell>CHRISTMAS 9</cell><cell>HEATWAVE 33</cell></row><row><cell>TODAY 40</cell><cell>FLOODS 34</cell><cell>EXTREME 39</cell></row><row><cell>HEAT 66</cell><cell>TRANSPORT 14</cell><cell>DAMAGE 28</cell></row><row><cell>LOCALS 21</cell><cell>DISASTER 19</cell><cell>BRITAIN 19</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2</head><label>2</label><figDesc>Thematic group "Climate"</figDesc><table><row><cell>Subgroups</cell><cell>Keywords</cell></row><row><cell>1) the effects of climate</cell><cell>warming, flooding, greenhouse, ozone, hole, drought, deficit,</cell></row><row><cell>change</cell><cell>fire, glacier, thawing</cell></row><row><cell>2) the causes of climate</cell><cell>urbanization</cell></row><row><cell>change</cell><cell></cell></row><row><cell>3) scientific terminology</cell><cell>ElNino, research, study, science, scientist, emissions, fossil,</cell></row><row><cell></cell><cell>climate</cell></row><row><cell>4) general vocabulary</cell><cell>impact, adaptation, conclusion, character, effects,</cell></row><row><cell></cell><cell>assessment, committee</cell></row><row><cell>5) descriptive characteristics</cell><cell>disastrous, climatic, man-made, annual, fossil, human-</cell></row><row><cell>6) cause</cell><cell>induced, global, vulnerable</cell></row><row><cell></cell><cell>exacerbate, pollute, emit, alter, modify, affect, devastate</cell></row><row><cell></cell><cell>Total quantity: 41</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3</head><label>3</label><figDesc>Thematic group "Weather and people"</figDesc><table><row><cell>Subgroups</cell><cell>Keywords</cell></row><row><cell>1) emotions</cell><cell>anger, grim, laugh, like, distressed, deranged, glee, happy,</cell></row><row><cell></cell><cell>worried, devastated, shocked, hopeless, mad</cell></row><row><cell>2) transport and traffic</cell><cell>train, car, boat, vehicles, lights, tailback, sign, driving, speed,</cell></row><row><cell>situation</cell><cell>delays, diversions, cancellation</cell></row><row><cell>3) places</cell><cell>roads, motorway, parks, station</cell></row><row><cell>4) people</cell><cell>passenger, the elderly, commuter, children, driver, sun-</cell></row><row><cell></cell><cell>lovers, sunbathers, sunseekers, children, adults</cell></row><row><cell>5) tourism/holidays</cell><cell>bookies, holidays, Christmas, Easter, football, vacations,</cell></row><row><cell></cell><cell>weekend, match, holidaymakers, camper, traveler, festival-</cell></row><row><cell></cell><cell>goer, beach, barbeque</cell></row><row><cell>6) health/safety</cell><cell>inhaler, death, dehydration, cardiovascular, illness,</cell></row><row><cell></cell><cell>respiratory, diabetes, discomfort, faint, coughing,</cell></row><row><cell></cell><cell>hypothermia, cramp, dizziness, chronic, irritation, sensitive</cell></row><row><cell>7) location</cell><cell>country, homes, businesses, region, village, area, town</cell></row><row><cell></cell><cell>Total quantity: 77</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4</head><label>4</label><figDesc>Thematic group "Weather and nature"</figDesc><table><row><cell>Subgroups</cell><cell>Keywords</cell></row><row><cell>1) plants/flora</cell><cell>eucalupts, pines, sycamore, oak, ash, beech</cell></row><row><cell>2) animals/fauna</cell><cell>chiffchaffs, dog, frigate, seabirds, fish, sparrow, starling,</cell></row><row><cell></cell><cell>chaffinches, greenfinches, blackbird, woodpigeon, dove, tit,</cell></row><row><cell></cell><cell>puffin</cell></row><row><cell>3) vegetable products</cell><cell>apples, strawberry, blackberry, fruit</cell></row><row><cell>4) agriculture</cell><cell>crops, harvest, agriculture</cell></row><row><cell>5) natural processes</cell><cell>breeding, flowering, leafing, ripening, budburst, growing,</cell></row><row><cell></cell><cell>fruiting</cell></row><row><cell>6) other words related to the</cell><cell>wildlife, habitat, species, population, birds</cell></row><row><cell>field of nature</cell><cell></cell></row><row><cell></cell><cell>Total quantity: 40</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 5</head><label>5</label><figDesc>Thematic group "Weather phenomena"</figDesc><table><row><cell>Subgroups</cell><cell>Keywords</cell></row><row><cell>1) atmosphere phenomena</cell><cell>rain, flurry, thunder, snow, winds, heat, temperature, humid,</cell></row><row><cell></cell><cell>fog, tempest, gale, sunshine</cell></row><row><cell>2) meteorological terms</cell><cell>ElNino, Met, mercury, updated</cell></row><row><cell>3) description of weather</cell><cell>heavy, strong, clear, scorching, cold, conditions, humidity</cell></row><row><cell>phenomena</cell><cell></cell></row><row><cell>4) evaluation of weather</cell><cell>unseasonable, awful, terrible, glorious, topsy-turvy, freak,</cell></row><row><cell>phenomena</cell><cell>crazy, driving, ropy, cooler</cell></row><row><cell>5) action and movement</cell><cell>rise, blow, batter, hit, smite, move, soar, bask, plummet, grip,</cell></row><row><cell></cell><cell>batter, swoop, slam, loom, pummel, stop, finish, end, begin,</cell></row><row><cell></cell><cell>start</cell></row><row><cell></cell><cell>Total quantity: 54</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>discourses. Further scrutiny can be combined with a more interpretative approach to the analysis of weather news in line with stylistic research.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Incorporating text dispersion into keyword analyses</title>
		<author>
			<persName><forename type="first">J</forename><surname>Egbert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Biber</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Corpora</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="77" to="104" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m">Keyness in Texts</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Bondi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Scott</surname></persName>
		</editor>
		<meeting><address><addrLine>Amsterdam</addrLine></address></meeting>
		<imprint>
			<publisher>John Benjamins</publisher>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Anthropology of Color: Interdisciplinary multilevel modeling</title>
		<author>
			<persName><forename type="first">D</forename><surname>Dedrick</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">E</forename><surname>Maclaury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">V</forename><surname>Paramei</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">OAPEN (Open Access Publishing in European Networks</title>
				<imprint>
			<publisher>John Benjamins Publishing Company</publisher>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Sentiment strength detection in short informal text</title>
		<author>
			<persName><forename type="first">M</forename><surname>Thelwall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Buckley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Paltoglou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Cai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kappas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the American Society for Information Science and Technology</title>
		<imprint>
			<biblScope unit="volume">61</biblScope>
			<biblScope unit="issue">12</biblScope>
			<biblScope unit="page" from="2544" to="2558" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Quantitative characteristics of lexical-semantic groups representing weather in weather news stories (based on British online press)</title>
		<author>
			<persName><forename type="first">N</forename><surname>Bondarchuk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Bekhta</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Computational Linguistics and Intelligent Systems, COLINS, CEUR workshop proceedings</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">2870</biblScope>
			<biblScope unit="page" from="799" to="810" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Sentiment Analysis Model on Weather Related Tweets with Deep Neural Network</title>
		<author>
			<persName><forename type="first">Q</forename><surname>Jun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Zhendong</surname></persName>
		</author>
		<author>
			<persName><surname>Sh</surname></persName>
		</author>
		<author>
			<persName><surname>Chongyang</surname></persName>
		</author>
		<idno type="DOI">10.1145/3195106.3195111</idno>
		<idno>DOI:</idno>
		<ptr target="https://doi.org/10.1145/3195106.3195111" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 10th International Conference on Machine Learning and Computing (ICMLC 2018)</title>
				<meeting>the 2018 10th International Conference on Machine Learning and Computing (ICMLC 2018)<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computing Machinery</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="31" to="35" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Weather as the source domain for metaphorical expressions</title>
		<author>
			<persName><forename type="first">I</forename><surname>Żołnowska</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">AVANT. Pismo Awangardy Filozoficzno-Naukowej</title>
		<imprint>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="165" to="179" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The Relationship of Weather Salience with the Perceptions and Uses of Weather Information in a Nationwide Sample of the United States</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">E</forename><surname>Stewart</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">K</forename><surname>Lazo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">E</forename><surname>Morss</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Demuth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Weather, Climate &amp; Society</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">3</biblScope>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Matoré</surname></persName>
		</author>
		<title level="m">La méthode en lexicologie. Domaine français</title>
				<meeting><address><addrLine>Paris</addrLine></address></meeting>
		<imprint>
			<publisher>Marcel Didier</publisher>
			<date type="published" when="1953">1953</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">Keywords</title>
		<author>
			<persName><forename type="first">R</forename><surname>Williams</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1976">1976. 1983</date>
			<pubPlace>London; Fontana</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Scott</surname></persName>
		</author>
		<title level="m">WordSmith Tools, Version 5. Liverpool: Lexical Analysis Software</title>
				<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">PC analysis of key words -and key key words</title>
		<author>
			<persName><forename type="first">M</forename><surname>Scott</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">System</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="233" to="245" />
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Textual patterns: Key words and corpus analysis in language education</title>
		<author>
			<persName><forename type="first">M</forename><surname>Scott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Tribble</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2006">2006</date>
			<publisher>Benjamins</publisher>
			<pubPlace>Amsterdam</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Querying keywords: questions of difference, frequency and sense in keyword analysis</title>
		<author>
			<persName><forename type="first">P</forename><surname>Baker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of English Linguistics</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="346" to="359" />
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">The question is, how cruel is it? Keywords, fox hunting and the house of commons. What&quot;s in a word-list? Investigating word frequency and keyword extraction</title>
		<author>
			<persName><forename type="first">P</forename><surname>Baker</surname></persName>
		</author>
		<editor>D. Archer. Farnham</editor>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>Ashgate</publisher>
			<biblScope unit="page" from="125" to="136" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Using key words in text analysis: Practical aspects</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">Berber</forename><surname>Sardinha</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1999">1999</date>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="page">9</biblScope>
		</imprint>
		<respStmt>
			<orgName>LAEL, Catholic University of Sao Paulo</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">DIRECT Papers</note>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Wierzbicka</surname></persName>
		</author>
		<title level="m">Understanding Cultures Through Their Key Words: English, Russian, Polish, German, and Japanese</title>
				<meeting><address><addrLine>New York</addrLine></address></meeting>
		<imprint>
			<publisher>Oxford University Press</publisher>
			<date type="published" when="1997">1997</date>
			<biblScope unit="page">317</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Key words in the press</title>
		<author>
			<persName><forename type="first">L</forename><surname>Jeffries</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Walker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">English Text Construction</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="208" to="229" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">Document Relativity</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">M</forename><surname>Sinclair</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2005">2005</date>
			<publisher>Tuscan Word Centre</publisher>
			<pubPlace>Tuscany, Italy</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">The search for units of meaning</title>
		<author>
			<persName><forename type="first">J</forename><surname>Sinclair</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Textus. No</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="75" to="106" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<author>
			<persName><forename type="first">J</forename><surname>Firth</surname></persName>
		</author>
		<title level="m">Papers in Linguistics 1934-1951</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Oxford University Press</publisher>
			<date type="published" when="1957">1957</date>
			<biblScope unit="page">233</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><surname>Fairclough</surname></persName>
		</author>
		<title level="m">New Labour. New Language?</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Routledge</publisher>
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Dispersions and adjusted frequencies in corpora</title>
		<author>
			<persName><forename type="first">S</forename><surname>Th</surname></persName>
		</author>
		<author>
			<persName><surname>Gries</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Corpus Linguistics</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="403" to="437" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Embracing Bayes Factors for key item analysis in corpus linguistics</title>
		<author>
			<persName><forename type="first">A</forename><surname>Wilson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">New approaches to the study of linguistic variability</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Koll-Stobbe</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Bieswanger</surname></persName>
		</editor>
		<meeting><address><addrLine>Frankfurt</addrLine></address></meeting>
		<imprint>
			<publisher>Peter Lang</publisher>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Three concepts of keywords</title>
		<author>
			<persName><forename type="first">M</forename><surname>Stubbs</surname></persName>
		</author>
		<editor>M. Bondi and M. Scott</editor>
		<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="21" to="42" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<monogr>
		<title level="m" type="main">Text and Corpus Analysis</title>
		<author>
			<persName><forename type="first">M</forename><surname>Stubbs</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1996">1996</date>
			<publisher>Blackwell</publisher>
			<biblScope unit="page">288</biblScope>
			<pubPlace>Oxford</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<title level="m" type="main">A corpus based study of business English teaching materials</title>
		<author>
			<persName><forename type="first">M</forename><surname>Nelson</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2000">2000</date>
		</imprint>
		<respStmt>
			<orgName>University of Manchester</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Unpublished PhD Thesis</note>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Leech</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rayson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Wilson</surname></persName>
		</author>
		<title level="m">Word frequencies in written and spoken English: Based on the British National Corpus</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Longman</publisher>
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Clusters, key clusters and local textual functions in Dickens</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mahlberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Corpus Linguistics, Part 3: Phraseology</title>
				<editor>
			<persName><forename type="first">D</forename><surname>Biber</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Reppen</surname></persName>
		</editor>
		<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Sage</publisher>
			<date type="published" when="2007">2007</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="1" to="31" />
		</imprint>
	</monogr>
	<note>Republished 2012</note>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Keyness: Words, parts-of-speech and semantic categories in the character-talk of Shakespeare&quot;s Romeo and Juliet</title>
		<author>
			<persName><forename type="first">J</forename><surname>Culpeper</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Corpus Linguistics</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="29" to="59" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<monogr>
		<title level="m" type="main">Keywords and moral panics: Mary Whitehouse and media censorship</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mcenery</surname></persName>
		</author>
		<editor>D. Archer</editor>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="93" to="124" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Wilcock</surname></persName>
		</author>
		<title level="m">Introduction to linguistic annotation and text analytics</title>
				<meeting><address><addrLine>San Rafael, Calif</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan &amp; Claypool</publisher>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
	<note>Introduction to linguistic annotation and text analytics</note>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Automatic Extraction of Keywords in Political Speeches</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dilai</surname></persName>
		</author>
		<author>
			<persName><surname>Dilai</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE 15th International Scientific and Technical Conference on Computer Sciences and Information Technologies, CSIT 2020 -Proceedings</title>
				<imprint>
			<date type="published" when="2020">2020. 2020</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="291" to="294" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b33">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><surname>Biber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Reppen</surname></persName>
		</author>
		<title level="m">The Cambridge handbook of English corpus linguistics</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<monogr>
		<title level="m" type="main">An Introduction to Functional Grammar</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A K</forename><surname>Halliday</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1994">1994</date>
			<publisher>Edward Arnold</publisher>
			<pubPlace>London</pubPlace>
		</imprint>
	</monogr>
	<note>2nd edn</note>
</biblStruct>

<biblStruct xml:id="b35">
	<analytic>
		<title level="a" type="main">The discourse colony; a preliminary study of a neglected discourse type. Talking about Text</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hoey</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Discourse Analysis Monograph</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="page" from="1" to="26" />
			<date type="published" when="1986">1986</date>
		</imprint>
		<respStmt>
			<orgName>Research / University of Birmingham</orgName>
		</respStmt>
	</monogr>
	<note>English Language</note>
</biblStruct>

<biblStruct xml:id="b36">
	<monogr>
		<title level="m" type="main">Narrative progression in the short story: A corpus stylistic approach</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Toolan</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2009">2009</date>
			<publisher>John Benjamins Pub. Co</publisher>
			<pubPlace>Amsterdam</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b37">
	<monogr>
		<ptr target="https://www.thedailymail.co.uk" />
		<title level="m">The Daily Mail</title>
				<imprint>
			<date type="published" when="2014">2014-2017</date>
		</imprint>
	</monogr>
	<note>Electronic resource</note>
</biblStruct>

<biblStruct xml:id="b38">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">The</forename><surname>Sun</surname></persName>
		</author>
		<ptr target="https://www.thesun.co.uk" />
		<imprint>
			<date type="published" when="2014">2014-2017</date>
		</imprint>
	</monogr>
	<note>Electronic resource</note>
</biblStruct>

<biblStruct xml:id="b39">
	<monogr>
		<ptr target="https://www.thetimes.co.uk" />
		<title level="m">The Times</title>
				<imprint>
			<biblScope unit="page" from="2014" to="2017" />
		</imprint>
	</monogr>
	<note>Electronic resource</note>
</biblStruct>

<biblStruct xml:id="b40">
	<monogr>
		<title/>
		<author>
			<persName><surname>Bnc</surname></persName>
		</author>
		<ptr target="https://www.english-corpora.org/bnc/" />
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b41">
	<analytic>
		<title level="a" type="main">Accurate methods for the statistics of surprise and coincidence</title>
		<author>
			<persName><forename type="first">T</forename><surname>Dunning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="61" to="74" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b42">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><surname>Fischer-Starcke</surname></persName>
		</author>
		<title level="m">Corpus linguistics in literary analysis: Jane Austen and her contemporaries</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Continuum</publisher>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b43">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Mastropiero</surname></persName>
		</author>
		<title level="m">Corpus stylistics in heart of darkness and its Italian translations</title>
				<imprint>
			<publisher>Bloomsburry Publishing</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page">248</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b44">
	<analytic>
		<title level="a" type="main">Lexical cohesion computed by thesaural relations as an indicator of the structure of text</title>
		<author>
			<persName><forename type="first">J</forename><surname>Morris</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Hirst</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="21" to="48" />
			<date type="published" when="1991">1991</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b45">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">P</forename><surname>Cantos</surname></persName>
		</author>
		<title level="m">Statistical methods in language and linguistic research</title>
				<meeting><address><addrLine>Oakville, CT</addrLine></address></meeting>
		<imprint>
			<publisher>Equinox Pub. Ltd</publisher>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b46">
	<analytic>
		<title level="a" type="main">Quantitative Parameters of Lucy Montgomery&quot;s Literary Style</title>
		<author>
			<persName><forename type="first">N</forename><surname>Hrytsiv</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Shestakevych</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Shyyka</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th International Conference on COLINS 2021. Volume I: Workshop</title>
				<meeting>the 5th International Conference on COLINS 2021. Volume I: Workshop<address><addrLine>Kharkiv, Ukraine</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">April 22-23, 2021. 2021</date>
			<biblScope unit="page" from="670" to="684" />
		</imprint>
	</monogr>
	<note>Computational Linguistics and Intelligent Systems. CEUR-WS.org</note>
</biblStruct>

<biblStruct xml:id="b47">
	<analytic>
		<title level="a" type="main">Choice is the Word of the Hour</title>
		<author>
			<persName><forename type="first">L</forename><surname>Jeffries</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Walker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Keywords in the Press: The New Labour Years</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Bloomsbury Academic</publisher>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="67" to="92" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
