<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Author's Choice for Keyword List: Research Aspect ∗</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Olga Kamshilova</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>onkamshilova@gmail.com</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lyubov Geikhman</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>lyubageykhman</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>@gmail.com</string-name>
          <email>lauranbel@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Herzen State Pedagogical University of Russia</institution>
          ,
          <addr-line>Saint Petersburg</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Larisa Beliaeva</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Perm National Research Polytechnical University (PNRPU)</institution>
          ,
          <addr-line>Perm, Russian Federation</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>The paper addresses the problem of creating a relevant keyword list prefixing a research article. It discusses the issue through the lenses of informational, psycholinguistic and editorial concepts of keywords. It considers keywords as a text form within the modeled text of a research paper and deals with their information value. Based on a quantitative text analysis of current publications presented by Russian authors (8 case studies), the research shows at least three strategies of compiling a keyword list of which none is absolutely successful or meets the demands of publication promotion. Candidates for key words are multidimensionally treated in respect of a) their relation to other compositional parts of research text, b) their morphological and syntactic character, c) their information value according to query statistics.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Keyword is an interdisciplinary term with more or less common meaning of “a word that
tells you about the main idea or subject of something” [Oxford Dictionary]. More common
nowadays is the meaning of “a word or phrase that you type on a computer keyboard to give
an instruction or to search for information about something” [ibid.]. The problem of keywords
is that of their almost routine presence in general discourse: we, at once, seem to know a
lot about keywords, enough to understand their prominent text value, but have very little
practical knowledge when it comes to defining a set of keywords for our own text, e.g. for a
paper or research article. Unlike other parts of a research paper that are prescribed by journal
formats, keywords are only regulated by their number limit, which definitely backs up our
claim that authors must be well aware about keywords choice proper.</p>
      <p>Academic use of the term has a long history (hence, its different spelling interpretations:
key word, key-word, keyword) and is associated with at least four research paradigms:
crosslinguistic studies [Wierzbicka 1997, Williams 1983, Stubbs 1996, Stubbs 2001, Zemskaya 1996,
Stepanov 1997, Shmelev 2002], style and text interpretation [Arnold 1999; Lukin 1999;
Bolotnova 2004], psycholinguistic investigations of child speech and speech impairment [Sakharny
et al. 1984; Sakharny, Stern 1988, Murzin, Stern 1991], and computer and information studies
(information retrieval, automatic text processing and abstracting, etc.) [Busa 1980; Ripp,
Falke 2018]. We claim to trace these interpretations in authors’ actual choice for the keywords
list they prefix their conference papers or journal articles.</p>
      <p>The linguo-cultural and cross-cultural approach develops the idea that languages are
sensitive indices to the cultures they belong and every language has key concepts, which
reflect the culture core values. These key concepts are expressed in key words (sic!). Thus,
different cultures possess a "natural semantic metalanguage" [Wierzbicka 1997], that helps
to study, compare and explain cultural identity and cultural differences. With appeal to
anthropologists, psychologists, and philosophers, as well as linguists, a coherent theoretical
framework based on multilanguage empirical evidence was introduced for cultural patterns
studies.</p>
      <p>
        Similar studies were done as early as 1935 with sociolinguistic focus on “sociologically
important words, what one might call focal of pivotal words” [Firth 1957, 10]
        <xref ref-type="bibr" rid="ref21">(see also [Williams,
1983])</xref>
        . Key words in stylistic and interpretative text studies (I.V. Arnold and others) are
treated as topical lexemes, often grouping around one or several lexemes that mark the
author’s artistic message and help to understand the text informational structure and appreciate
its artistic value.
      </p>
      <p>Since we here focus on keywords informational function, it is reasonable to refer to their
computational (informational) and linguistic characteristics. The linguistic (originally
psycholinguistic) understanding of keywords by L.V. Sakharny and his colleagues was much
inspired by the 1970s-1980s progress in information retrieval studies [Baeza-Yates R.,
RibeiroNeto 2011, Manning, Raghavan, Schutze 2008]. The idea of keywords (taken as a set) shaping
the so called “document search image” implies that keywords bare the essential information
about the text and it is taken as basic for coordinate indexing method [Lindemann, Kliche,
Heid 2018; Zeni et al. 2007]. This method claims that a text content may be presented
by a list of keywords reflecting its topic with a guaranteed sufficient degree of accuracy and
completeness.</p>
      <p>In psycholinguistic studies the idea of a keyword set representing the whole text was first
transformed into the idea of “a primitive text” or, rather “text-primitive” [Sakharny et al.,
1984], characteristic of early child speech. Later in [Sakharny, Stern 1988] it was resumed as
a text category in linguistic description of “synonymous texts” or texts-synonyms that present
a dynamic paradigm “title – keywords – abstract – text body”. It was then that the authors
pinpointed keyword set characteristics that we find relevant for the present study: it was
proved that a set of keywords has essential text characteristics that provide it with integrity
and cohesion, namely: it has a syntagmatic character (non-random, linear, a sort of “chain”
structure) and thematic progression (the keywords mark the text topic (theme) and their order
– through associative connections – rheme).</p>
      <p>However, understanding keywords as document search image is prevailing in today’s
practice: “A keyword is a word in a text that, with other keywords, can represent the text &lt;...&gt;
A set of document keywords is called a document search image. The set of keywords is close
to annotation, plan and abstract, which also represent a document with less detail, but
lacking a syntactic structure” 1. To assist information retrieval from big data keywords may be
obtained by linguistic and computational methods (e.g., by analyzing word frequency in the
text). In personal practices keywords are simply a query that a person prints into a search
engine window when they want to find something, or special HTML tags that we may add to
our texts (blogs, sites, chats, MS Word texts, etc.), which search engines, generally focusing
on keywords highlighted as a result of their analysis, shall take as additional information.</p>
      <p>The above-mentioned approaches reveal keywords essential characteristics that they
demonstrate by their discourse function, so it may be reasonably suggested that a conscious
author must have a most general idea of what a keyword is. But, evidently, neither everyday
search practice, nor journal or conference paper regulations are of much help when authors
decide on their choice of candidates for the keyword list to prefix their texts meant for
publication. In attempt to find out whether there are some patterns behind this choice we try to
answer the following questions.</p>
      <sec id="sec-1-1">
        <title>1. What candidates do authors choose for the keyword list?</title>
        <p>2. How do the keywords show in the text and what is their proportion in the text body?</p>
      </sec>
      <sec id="sec-1-2">
        <title>3. How do the keywords promote the publication informationally?</title>
      </sec>
      <sec id="sec-1-3">
        <title>4. What’s the actual need for keywords if any at all?</title>
        <p>2</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Data and Methods</title>
      <p>Data used in this study are 8 sample Russian texts of research articles on different subject areas
in humanities submitted to a reviewed university journal. The rationale for concentrating on
a comparatively small number of texts in this paper is that the suggested analysis implies
taking text as a major analysis unit, including minor ones, such as title, keyword list, abstract
and keywords proper. Thus, every text in the selection provides a basis for a case study.</p>
      <p>We consider a research paper to be a modeled text, written according to a conventional
structure, though conventions in Russian academic tradition are not as developed as AIMRaD
pattern, and following [Sakharny, Stern 1988] we take title, abstract, keyword list and text body
as texts-synonyms representing a dynamic paradigm.</p>
      <p>The first step was to provide a quantitative description of keywords presence in the text.
For this purpose, we attempted to state the number of keyword occurrences in the text body,
as well as in all synonymous texts within the whole text of the research paper. The search of
texts and word lists (frequency), concordance plot analysis for every text and its units under
discussion were performed with AntConc toolkit that provides reliable results for Cyrillic texts.</p>
      <p>On the second step we analyzed the keyword list as a set, a primitive text with its possible
linguistic characteristics – syntagmatic and thematic relations – in order to trace patterns or
authors’ preferences in compiling the keyword list.</p>
      <p>Thirdly, we defined keyword density for some of the keywords in every text so as to
judge their informative value [Passonneau 2006]. Keyword density is the percentage of times a
keyword appears in the text compared to the total number of words in the text. In the context
of search engine optimization, keyword density is used to determine whether a web page is
relevant to a specified keyword or keyword phrase. In our context it is a way to establish
the information value of authors’ keywords, their ability to assist the text promotion on the
Internet.
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <sec id="sec-3-1">
        <title>Keywords presence in the text</title>
        <p>Treating the keyword set as a text within a dynamic paradigm of 4 synonymous texts one
can expect repetition of the key words in every part of this paradigm. Searching authors’
keywords in the texts and their parts soon gave evidence enough that keywords in the 8
texts demonstrate various degree of their presence in the texts, as well as a quite unspecific
distribution among the text parts (synonymous texts). In some texts keywords are missing in
the abstract, e.g. in Text 3 and 4, or in the title and abstract, as in Text 8, Texts 5 and 8
demonstrate minimum use of keywords in the whole paradigm (Table 1).
• derivation (stemming): иероглифика → иероглифический, иероглифы (hieroglyphics
→ hieroglyphic, hieroglyphs);
• word composition / collocation: власть → властеотношения (power → power
relations);
• generalization / specification: кредитные институты/учреждения –
банковскокредитные учреждения (banking credit institutions → credit institutions).
Sometimes the keyword from the list is used in the text only in its derivative form (Table 2):</p>
      </sec>
      <sec id="sec-3-2">
        <title>Keyword sets as a pattern</title>
        <p>The composition of elements within the keyword set demonstrates what presumably may be
taken for the authors’ different strategies in compiling the keyword list. We dare speak of at
least three patterns underlying the analyzed sample texts.</p>
        <p>One may be traced back to the ideas expressed in [Sakharny, Stern 1988], since the sets
of the kind are characterized by linear syntagmatic relations (a sort of “chain” structure) of
terms (usually of different but crossing subject areas), very close to text-primitives, moving
from more general to specific notions or vice versa, as in Text 5. The effect of “telling a
story” is achieved by the keywords order that through associative connections reflects the text
thematic progression:
• язык, концепт, эмоция, поле, доминанта, лезгинский язык, английский язык /
language, concept, emotion, field, dominant, Lezgi, English (Text 1);
• власть, эстетика, красота, возвышенное, трагическое, комическое / power,
aesthetics, beauty, sublime, tragic, comic) (Text 2);
• китайский язык, машинный перевод, иероглифика, лексическая структура, статистический
анализ / Chinese, machine translation, hieroglyphic, lexical structure, statistic analysis
(Text 3);
• рациональность, наука, знание / rationality, science, knowledge (Text 5);
• власть, коммуникация, гражданское общество / power, communication, civil society
(Text 6).</p>
        <p>The second is nothing but classification indexing / subject indexing within a definite subject
area, which is common for Dewey Decimal Classification (DDC) or consequent Universal
Decimal Classification (УДК), library indexing, library and information science [Bates, Maack
2010]:
• банковско-кредитные учреждения, кредитный кооператив, кредитное товарищество,
ссудо-сберегательная касса, ростовщичество bank-credit institutions / banking and
credit institutions), credit cooperatives, credit partnerships, loan and savings bank, usury
(Text 4);
• профессиональное обучение, профессиональные знания, профессиональные и личностные
качества / professional education, professional knowledges, professional and personal
qualities (Text 8).</p>
        <p>The third pattern was found only in Text 7, its different composition may be regarded as
individual or rather inexperienced, since the position of keywords is presented by the key questions
of the article, with morphological (number, case) and syntactic (connection) marking ad hoc:
соотношение глобализации и национализма, проблемы международной миграции, роли
национальных государств / correlation of globalization and nationalism, problems of
international migration, the role of nation states (Text 7).</p>
        <p>As our observation shows, the linear composition of a text-primitive seems to be preferable
for the sample texts authors. As long as Russian academic tradition does not prescribe any
regulations for keyword list, we can only suggest that this preference may be explained by
natural speech habits, that is by psycholinguistic basis described in [Sakharny, Stern 1988].
The subject-oriented pattern found in Texts 4 and 8 demonstrate a rational strategy that will
be addressed to later in Discussion section. The pattern found in text 7 is, presumably, very
individual and hardly possible to be published unedited.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Keyword list as a text information image</title>
        <p>Editorial demand for a keyword list to prefix a research publication has an unquestionable
informational purpose: it is a guide for the reader to decide on whether the text is of any
professional or other interest to him, a reason for the editor to accept and publish the text
under the appropriate subject heading, and, in case they match a reader’s query, a good
chance for the search engine to place the text in the search engine output. In this last case,
similar to web sites or other internet issues, to be placed on the first few pages of search
result the keywords must be repeated in the whole text often enough, so that their density
is not less than 1.5%. According to SEO Keyword Density Analyzer “The optimal density of
keywords (and phrases) is from 1.5 to 7%, preferably not more than 3.5%. And at least 2
exact occurrences of the search phrase on the text page” 2.</p>
        <p>Obviously, no author of a research article is either conscious of it or previously instructed.
The keyword ratio (keyword density) of the most frequent keyword (in case of Text 1 there
are two keywords that have almost equal high frequency) for each of the 8 sample texts show
that only 3 of them pretend to be hopefully placed on the first pages of search results be the
query including them (Table 3).</p>
        <p>Even with this relatively high density the “happy” keywords will not bring the text to
the top of the search results because the authors’ choice of the keyword itself is unhappy,
ineffectual: язык (language), эмоция (emotion), власть (power) are all very frequent terms
of general use that alone (other keywords are of insignificant density) do not specify the text
topic, as well as рациональность (rationality) does not specify its status of a philosophy
category in Text 6. So, none of the 8 texts under study have a keyword list that can be
treated as a text search image.</p>
        <p>There is one more factor that might improve the choice of keyword candidates, namely
accounting for the candidate search statistics, that is, how popular queries with the candidate
are. Thus, the choice of иероглифы / hieroglyphs as a candidate for the keywords in text 3
might have been more effective than the authors preference for иероглифика / hieroglyphics
(Table 4). Search engine statistics provides the number of queries with the keyword either
realized or supposed within a definite period, so it is feasible to rank on it.</p>
        <p>To informationally promote their text in digital space research writers shall be properly
instructed, provided the demand for keyword list is actually meant as that. Unfortunately, they
receive, as mentioned above, no other regulation except the limit set for their number. The
Internet advice for choosing keyword candidates is site-centric and caters for the interests of a
very broad target audience3. Objective editorial recommendations are found yet in [Abramov
2011], who suggests that candidates for a keyword list shall:
• be chosen from terms found in the title, abstract, the opening and closing paragraphs of
the text body;
• account for search engine statistics;
3See, for instancehttp://seomans.ru/nashel-info-about-plotnost-keywords.html (addressed 02.11.19)</p>
        <sec id="sec-3-3-1">
          <title>Most Frequent Key</title>
          <p>word
*less frequent keyword
**possible candidate
for keyword
Text 3 китайский язык 0,46%
(Chinese language)
*иероглифика
(hieroglyphics) 0,03%
**иероглифы 0,52%
(hieroglyphs)
• include terms and term collocations (e.g. бухгалтерский учет основных средств,
бухгалтерский учет, основные средства / fixed assets accounting, accounting, fixed
assets), cf.: “Longer search queries are narrower search queries, and narrower search
queries are less competitive” 4;
• be not limited to 3-5 units, but include 10-15 ones.</p>
          <p>These recommendations are definitely taking the job of promoting a research text as an
informational product to be the author’ personal issue. Submitting their text to a journal, authors
have to “do as Romans do”.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Discussion</title>
      <p>The study of authors’ keywords in 8 sample Russian research articles is based on our strong
belief that compiling a keyword list is to a great degree unconventional writing practice within
a highly conventional text format. The findings confirmed that the authors’ choice of keyword
candidates and their functioning in the text is rather arbitrary.</p>
      <p>The quantitative description of keywords, which was meant to answer the question of how
keywords show in the text and in what proportion, demonstrated that:
• keywords show unevenly in the text and demonstrate various degree of their presence,
as well as a quite unspecific distribution among the text parts (synonymous texts); in
some texts keywords are missing in the abstract, or both in the title and the abstract;
• the idea of “keyness” is maintained through the text by derivation (stemming), word
composition / collocation and generalization / specification.</p>
      <p>In the absence of editorial prescriptions keyword lists reveal a number of patterns underlying
the actual composition of the keywords in the list, two of which seem rather significant:
• confirming the status of a minor text within the title – abstract – keyword list - text
body paradigm, keyword sets show specific linguistic characteristics, namely integrity and
cohesion, realized in their syntagmatic character (non-random, linear, a sort of “chain”
structure) and thematic progression; this pattern is preferable in our sample texts;
• a keyword set may be composed as a logical sequence of classification / subject indexes
of a definite subject area.</p>
      <p>Keyword lists in the sample texts are rather a “linguistic” than informational image of
a text (cf. a document search image), representing its content either according to subject
classification headings, or suggesting a linear (chain) expansion of terms or article key issues.</p>
      <p>The keywords informational value may be justified by the keyword ratio (keyword density)
in the text, of which no author of the selected texts is presumably aware. Useful
recommendations for promoting a research text as an informational product suggest picking up keyword
candidates from terms used in the title, abstract, opening and closing paragraphs (so called
“strong text positions”), accounting for search engine statistics, including terms and term
collocations, increasing keywords number.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>None of the 8 texts under study have a keyword list that can be treated as a text search
image. To promote the text informationally in digital space the author shall be accordingly
instructed, provided the demand for keyword list is actually meant as that. The last pointed
research question in Introduction on whether there is any actual need for a keyword list is far
from being an idle one.</p>
      <p>In recent years authoritative publishers of research journals and peer-reviewed literature
databases like Elsevier and Scopus ensure the policy of subject indexing which is performed for
every text manually by a group of specially trained professionals. Subject indexing (see 3.2) is a
process of indexing a text by human experts with keywords derived from the accepted system
of controlled (authorized) terms (controlled vocabulary). Controlled vocabularies provide
a way to organize knowledge for subsequent retrieval. “They are used in subject indexing
schemes, subject headings, thesauri, taxonomies and other knowledge organization systems.
Controlled vocabulary schemes mandate the use of predefined, authorized terms that have
been preselected by the designers of the schemes, in contrast to natural language vocabularies,
which have no such restriction”. Elsevier admits, that authorized terms were manually added
to 80% of Scopus publications. The result of subject indexing in Scopus is seen in figures 1
and 2. Highlighted are the matches of the authors’ and added keywords.</p>
      <p>Compared to automatic indexing, the use of a controlled vocabulary and human expert
work can dramatically increase information retrieval progress Concerning the research
question, one may notice that added controlled terms in both cases (figures 1 and 2) find no match
in the authors’ keyword lists, the latter matching the added uncontrolled terms. Whether it
demonstrates the authors’ incompetence or the need to update the controlled vocabulary in
question, it is difficult to determine at present. One more thing worth noticing is that the
choice of main heading in figure 1 seems correct, while the main heading defined in figure 2
seems erroneous. Anyhow, with subject indexing realized by human experts in such a way
that it controls key terminology for different subject areas and promises dramatic increase in
information retrieval, is the demand for author’s keyword list still actual? Probably it isn’t,
since there is a tendency not to include keywords into conference paper templates.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [Abramov 2011]
          <string-name>
            <surname>Abramov</surname>
            ,
            <given-names>E.G.</given-names>
          </string-name>
          (
          <year>2011</year>
          )
          <article-title>Selection of Keywords for</article-title>
          a Research Article // Research Periodicals: Problems and Decisions. № 2. Pp.
          <volume>35</volume>
          -
          <fpage>40</fpage>
          . (In Rus.)
          <string-name>
            <surname>= Abramov</surname>
            <given-names>E.G.</given-names>
          </string-name>
          <article-title>Podbor klyuchevy'kh slov dlya nauchnoj stat'i // Nauchnaya periodika: problemy' i resheniya</article-title>
          .
          <year>2011</year>
          . № 2. S.
          <volume>35</volume>
          -
          <fpage>40</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [Arnold 1999]
          <string-name>
            <surname>Arnold</surname>
            ,
            <given-names>I.V.</given-names>
          </string-name>
          (
          <year>1999</year>
          )
          <article-title>Semantics</article-title>
          . Stylistics. Intertextuality / Bukharkin, P.E. - Ed. SPb.
          <year>1999</year>
          . - 444 p.
          <article-title>(In Rus</article-title>
          .)
          <string-name>
            <surname>Arnol'd</surname>
            <given-names>I.V.</given-names>
          </string-name>
          <string-name>
            <surname>Semantika</surname>
            . Stilistika. Intertekstual'nost. : sb. St. / nauch. Red. Bukharkin P. E. SPb. : Izd-vo
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Peterb.</surname>
          </string-name>
          Un-ta,
          <year>1999</year>
          . -
          <fpage>444</fpage>
          s.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [
          <string-name>
            <surname>Baeza-Yates</surname>
          </string-name>
          , Ribeiro-Neto 2011]
          <string-name>
            <surname>Baeza-Yates</surname>
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ribeiro-Neto</surname>
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2011</year>
          )
          <article-title>Modern Information Retrieval. The Concepts and Technology behind Search. Second edi¬tion. Pearson Education Ltd</article-title>
          .,
          <string-name>
            <surname>Harlow</surname>
          </string-name>
          , England,
          <year>2011</year>
          . -
          <fpage>766</fpage>
          s.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [Bates, Maack 2010] Bates,
          <string-name>
            <given-names>M.J.</given-names>
            and
            <surname>Maack</surname>
          </string-name>
          , M.N. (eds.). (
          <year>2010</year>
          ).
          <source>Encyclopedia of Library and Information Sciences</source>
          . Vol.
          <volume>1</volume>
          -
          <fpage>7</fpage>
          . CRC Press, Boca Raton, USA.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [Bolotnova 2009 ] Bolotnova,
          <string-name>
            <surname>N.S.</surname>
          </string-name>
          (
          <year>2009</year>
          )
          <article-title>Philological Analysis of Text</article-title>
          . 4th ed. M. Flinta : Nauka. -
          <volume>520</volume>
          p.
          <article-title>(In Rus.) = Bolotnova N.S. Filologicheskij analiz teksta 4 e izd</article-title>
          . M. : Flinta : Nauka,
          <year>2009</year>
          . - 520 s [Firth 1957]
          <string-name>
            <surname>Firth</surname>
            <given-names>J.R.</given-names>
          </string-name>
          (
          <year>1957</year>
          )
          <article-title>The Technique</article-title>
          of Semantics // Papers in Linguistics 1934 - 1957. London. Oxford University Press. Pp.
          <volume>7</volume>
          -
          <fpage>33</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [Busa 1980] Busa,
          <string-name>
            <surname>R.</surname>
          </string-name>
          (
          <year>1980</year>
          ).
          <article-title>The Annals of Humanities Computing: The Index Thomisticus</article-title>
          .
          <source>In Computers and the Humanities</source>
          .
          <volume>14</volume>
          (
          <issue>2</issue>
          ). Pp.
          <volume>83</volume>
          -
          <fpage>90</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <source>[Iagounova</source>
          <year>2008</year>
          ] agounova,
          <string-name>
            <surname>E.V.</surname>
          </string-name>
          (
          <year>2008</year>
          )
          <article-title>Set of Recognizable Words as Compression Texts (With Comparison of Key-Word Set</article-title>
          ) // Computational Linguistics and
          <string-name>
            <given-names>Intellectual</given-names>
            <surname>Technologies</surname>
          </string-name>
          .
          <source>Papers from the Annual International Conference “Dialogue”. Issue</source>
          <volume>7</volume>
          (
          <issue>14</issue>
          ). Pp.
          <volume>588</volume>
          -
          <fpage>594</fpage>
          . (In Rus.) = Yagunova,
          <string-name>
            <surname>E.V.</surname>
          </string-name>
          <article-title>Nabor oporny'kh slov kak vid svertki teksta (v sopostavlenii s naborom klyuchevy'kh slov) // Komp'yuternaya lingvistika i intellektual'ny'e tekhnologii</article-title>
          .
          <source>Materialy' mezhdunarodnoj konferenczii ¾Dialog - 2008¿. Vy'pusk 7 (14)</source>
          .
          <year>2008</year>
          . Pp.
          <volume>588</volume>
          -
          <fpage>594</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <source>[Lukin</source>
          <year>1999</year>
          ] ukin,
          <string-name>
            <surname>V. A.</surname>
          </string-name>
          (
          <year>1999</year>
          )
          <article-title>Linguistic Theory basics for Text Analysis</article-title>
          . M. Os-
          <volume>89</volume>
          . - 192 p.
          <article-title>(In Rus</article-title>
          .) = Lukin,
          <string-name>
            <given-names>V. A.</given-names>
            <surname>Khudozhestvenny</surname>
          </string-name>
          <article-title>'j tekst</article-title>
          .
          <source>Osnovy' lingvisticheskoj teorii. Analiticheskij minimum. M. Os'-89</source>
          .
          <year>1999</year>
          .
          <article-title>- 192 s.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [Lindemann, Kliche, Heid 2018]
          <string-name>
            <surname>Lindemann</surname>
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kliche</surname>
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heid</surname>
            <given-names>U.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>LexBib: A Corpus and Bibliography of Metalexicographical Publications</article-title>
          . // Lexicography in Global Contexts.
          <source>Proceedings of the XVIII EURALEX International Congress</source>
          . - Ljubljana: University of Ljubljana,
          <article-title>Centre for language resources</article-title>
          and technologies,
          <year>2018</year>
          . Pp.
          <volume>764</volume>
          -
          <fpage>777</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [Manning, Raghavan, Schutze 2008]
          <string-name>
            <surname>Manning C. D.</surname>
          </string-name>
          ,
          <string-name>
            <surname>Raghavan</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schutze</surname>
            <given-names>H.</given-names>
          </string-name>
          (
          <year>2008</year>
          ) Introduction to Information Retrieval Cambridge University Press New York, NY, USA. [Murzin, Stern 1991]
          <string-name>
            <surname>Murzin</surname>
            <given-names>L.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stern</surname>
            <given-names>A.S.</given-names>
          </string-name>
          <string-name>
            <surname>Text</surname>
            and
            <given-names>Text</given-names>
          </string-name>
          <string-name>
            <surname>Perception. Sverdlovsk</surname>
          </string-name>
          . -
          <volume>171</volume>
          p.
          <article-title>(In Rus</article-title>
          .) = Murzin
          <string-name>
            <given-names>L.N.</given-names>
            ,
            <surname>Shtern</surname>
          </string-name>
          <string-name>
            <surname>A.S. Tekst</surname>
          </string-name>
          <article-title>i ego vospriyatie. Sverdlovsk, Izd-vo Ural</article-title>
          . un-ta,
          <year>1991</year>
          -
          <fpage>171</fpage>
          s.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>[Oxford Dictionary 2010] The Oxford Dictionary of English. 3rd edition</source>
          . (
          <year>2010</year>
          ). Ed.
          <article-title>By A</article-title>
          .
          <string-name>
            <surname>Stevenson</surname>
          </string-name>
          . Oxford University Press.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>[Passonneau</source>
          <year>2006</year>
          ] assonneau
          <string-name>
            <surname>R. J.</surname>
          </string-name>
          (
          <year>2006</year>
          ).
          <article-title>Measuring agreement on set-valued items (MASI) for semantic and pragmatic annotation</article-title>
          .
          <source>In Proceedings of the 5th International Conference on Language Resources and Evaluations (LREC</source>
          <year>2006</year>
          ). Pp.
          <volume>831</volume>
          -
          <fpage>836</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [Ripp, Falke 2018] ipp,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Falke</surname>
          </string-name>
          <string-name>
            <surname>S.</surname>
          </string-name>
          (
          <year>2018</year>
          )
          <article-title>Analyzing User Behavior with Matomo in the Online Information System Grammis</article-title>
          .
          <source>Lexicography in Global Contexts. Proceedings of the XVIII EURALEX International Congress</source>
          . - Ljubljana: University of Ljubljana,
          <article-title>Centre for language resources and technologies</article-title>
          . Pp.
          <volume>94</volume>
          -
          <fpage>108</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [Sakharny et al. 1984]
          <string-name>
            <surname>Sakharny</surname>
            ,
            <given-names>L.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sibirsky</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stern</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          (
          <year>1984</year>
          )
          <article-title>A Set of Key Words as a Text // Psycho-pedagogical and linguistic problems of text study</article-title>
          .
          <source>Perm</source>
          . Pp.
          <volume>81</volume>
          -
          <fpage>83</fpage>
          . (In Rus.) =
          <string-name>
            <surname>Sakharny'j L</surname>
          </string-name>
          .V.,
          <string-name>
            <surname>Sibirskij</surname>
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shtern</surname>
            <given-names>A.S.</given-names>
          </string-name>
          <article-title>Nabor klyuchevy'kh slov kak tekst // Psikhologo-pedagogicheskie i lingvisticheskie problemy' issledovaniya teksta -</article-title>
          <source>Perm'</source>
          ,
          <year>1984</year>
          . S.
          <volume>81</volume>
          -
          <fpage>83</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [Sakharny, Stern 1988]
          <string-name>
            <surname>Sakharny</surname>
            ,
            <given-names>L.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stern</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          (
          <year>1988</year>
          )
          <article-title>A set of key words as a Text Type // Lexical aspects in professionally oriented teaching of foreign speech activity</article-title>
          .
          <source>Perm</source>
          . Pp.
          <volume>34</volume>
          -
          <fpage>51</fpage>
          . (In Rus.) =
          <string-name>
            <surname>Sakharny'j L</surname>
          </string-name>
          . V.,
          <string-name>
            <surname>Shtern</surname>
            <given-names>A. S.</given-names>
          </string-name>
          <article-title>Nabor klyuchevy'kh slov kak tip teksta // Leksicheskie aspekty' v sisteme professional'no-orientirovannogo obucheniya inoyazy'chnoj rechevoj deyatel'nosti</article-title>
          . - Perm',
          <year>1988</year>
          . S.
          <volume>34</volume>
          -
          <fpage>51</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [Shmelev 2002]
          <string-name>
            <surname>Shmelev</surname>
            ,
            <given-names>A.D. A Russian</given-names>
          </string-name>
          <string-name>
            <surname>Language World Model</surname>
          </string-name>
          .
          <source>Materials for a Dictionary</source>
          . -
          <volume>224</volume>
          p.
          <article-title>(in Rus</article-title>
          .)
          <article-title>- Russkaya yazy'kovaya model' mira. Materialy' k slovaryu. Moskva: Yazy'ki slavyanskoj kul'tury</article-title>
          .
          <year>2002</year>
          .
          <article-title>- 224 s.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [Stepanov 1997] Stepanov,
          <string-name>
            <surname>Y.S.</surname>
          </string-name>
          (
          <year>1997</year>
          )
          <article-title>Constants. A Dictionary of Russian Culture</article-title>
          .
          <string-name>
            <given-names>A</given-names>
            <surname>Study. M</surname>
          </string-name>
          . -
          <volume>824</volume>
          p.
          <article-title>(In Rus</article-title>
          .) = Konstanty'. Slovar'
          <article-title>russkoj kul'tury'. Opy't issledovaniya. M. Yazy'ki russkoj kul'tury</article-title>
          .
          <year>1997</year>
          .
          <article-title>- 824 s.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [Stubbs 1996] Stubbs,
          <string-name>
            <surname>M.</surname>
          </string-name>
          (
          <year>1996</year>
          )
          <article-title>Text and Corpus Analysis: Computer-assisted Studies of Language and Culture</article-title>
          . Oxford, UK Cambridge, MA: Blackwell
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [Stubbs 2001] Stubbs,
          <string-name>
            <surname>M.</surname>
          </string-name>
          (
          <year>2001</year>
          )
          <article-title>Words and Phrases</article-title>
          .
          <source>Corpus Studies of Lexical Semantics</source>
          . Oxford, UK: Blackwell
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [Wierzbicka 1997]
          <string-name>
            <surname>Wierzbicka</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>1997</year>
          )
          <article-title>Understanding Cultures through Their Key Words: English, Russian, Polish, German, and Japanese (Oxford Studies in Anthropological Linguistics</article-title>
          . No 8.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <source>[Williams</source>
          <year>1983</year>
          ] illiams
          <string-name>
            <surname>R.</surname>
          </string-name>
          (
          <year>1983</year>
          )
          <article-title>Keywords: A Vocabulary of Culture and Society</article-title>
          . New York. Oxford University Press. - 341 p.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [Zemskaya 1996] Zemskaya,
          <string-name>
            <surname>T.A.</surname>
          </string-name>
          (
          <year>1996</year>
          )
          <article-title>Active Processes in Modern Wordbuilding // Russian at the End of XXth century</article-title>
          . M.
          <year>1996</year>
          . Pp.
          <volume>90</volume>
          -
          <fpage>141</fpage>
          . (In Rus.) = Zemskaya T.A.
          <string-name>
            <surname>Aktivny</surname>
          </string-name>
          <article-title>'e proczessy' sovremennogo slovoproizvodstva // russkij yazy'k koncza XX stoletiya. M. Yazy'ki russkoj kul'tury'</article-title>
          .
          <year>1996</year>
          . S.
          <volume>90</volume>
          -
          <fpage>141</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [Zeni et al. 2007]
          <string-name>
            <surname>Zeni</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiyavitskaya</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mich</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mylopoulos</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cordy</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          (
          <year>2007</year>
          ). A Lightweight Approach to Semantic
          <source>Annotation of Research Papers. // Natural Language Processing and Information Systems, Lecture Notes in Computer Science</source>
          . Berlin, Heidelberg: Springer. Pp.
          <volume>61</volume>
          -
          <fpage>72</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>