<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Marking up Dramatic Text: a Case Study of “7 stories” by Morris Panych</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ivan Bekhta</string-name>
          <email>ivan.bekhta@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nataliia Hrytsiv</string-name>
          <email>nataliia.m.hrytsiv@lpnu.ua</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anastasiia Matviychuk</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Lviv Franko National University</institution>
          ,
          <addr-line>Universytetska Street, 1, Lviv, 79000</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Lviv Polytechnic National University</institution>
          ,
          <addr-line>Stepana Bandery Street, 12, Lviv, 79000</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The paper elucidates the process, challenges and results of using computational linguistics tools (NLP) and pre-computer technique (TEI for personage utterance tagging) in processing dramatic text. As the material for analysis we have chosen the modern play ―7 stories‖ of t Canadian playwright Morris Panych, researched from the viewpoint of statistical indicator's and textual coefficients. Special attention is paid to statistical parameters of main personages in the play. Results obtained show numeric characteristics of such data: number of meanings (N); maximal meaning (max); minimal meaning (min); range (R); mode (Mo); median (Md); mean (Ẋ); standard deviation (Ϭ); coefficient of varνi)a;tionstan(dard error (Sẋ); measurement error (ε).</p>
      </abstract>
      <kwd-group>
        <kwd>1 Translation</kwd>
        <kwd>NLP</kwd>
        <kwd>quantitative analysis</kwd>
        <kwd>text mark-up</kwd>
        <kwd>applied linguistics</kwd>
        <kwd>drama text</kwd>
        <kwd>tagging</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>In addition, an approach towards and detailed study dramatic texts, as a unique literary genre, is a
separate challenge in current studies, which has special requirements within NLP tools application
and text mark-up. Therefore, the study of Morris Panych's playwork "7 Stories" is relevant.</p>
      <p>The idea is that modern Canadian drama is the aspect, little studied from numerous viewpoints, i.e.
philological, translatological, rhethorical; however, least studied from the angle of mathematical
linguistics and statistics.</p>
      <p>In order to understand the specifics of dramatic works, the concept of author's style, postmodern
literature, to which the work under study belongs, the life path of the author and translator were
additionally considered.</p>
      <p>The play "7 Stories" by Morris Panych and translated by Ivan Krychfalushiy is an example of
postmodern literature that has become a challenge and opposition to the laws of modernism.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Method and preparation characteristics</title>
      <p>
        Considering the vast quantities of ST and TT data available today for analysis, as discussed in [
        <xref ref-type="bibr" rid="ref3 ref4 ref5 ref6">3,
4, 5, 6</xref>
        ], Natural Language Processing is among most interesting and promising aspects of data science
[
        <xref ref-type="bibr" rid="ref10 ref11 ref12 ref13 ref7 ref8 ref9">7, 8, 9, 10, 11, 12, 13</xref>
        ].
      </p>
      <p>
        By default, text data of the original text is difficult to process [
        <xref ref-type="bibr" rid="ref14 ref15 ref16 ref17 ref18">14, 15, 16, 17, 18</xref>
        ] given the
challenge of comparing/contrasting it to the translated drama text [
        <xref ref-type="bibr" rid="ref19 ref20 ref21 ref22 ref23 ref24">19, 20, 21, 22, 23, 24</xref>
        ], the task can
be complicated [
        <xref ref-type="bibr" rid="ref25 ref26 ref27 ref28">25, 26, 27, 28</xref>
        ], though, incredibly appealing [
        <xref ref-type="bibr" rid="ref29">29, 30, 31, 32, 33</xref>
        ].
      </p>
      <p>Within this study project, we opted for exploring the way NLP techniques, especially mark-up
possibilities, can advance processing performing/drama text for statistical profiling of ST and TT.</p>
      <p>The project outlined in the current paper explores the ddistribution of the number of words in a
sentence as well as other numeric characteristics being analyzed collectively and for all the characters
of drama under analysis in their contrast with the Ukrainian translation.
2.1.</p>
    </sec>
    <sec id="sec-3">
      <title>Stages of working with the text document “7 stories” by Morris Panych</title>
      <p>A number of actions were performed for statistical analysis. Therefore, the analysis took place in
the following stages:
 The books of the original text and the translation were pre-scanned for further
manipulations using ABBYY Fine Reader software;
 Afterwards, it was converted from pdf to .docx to make it possible to work with text in
terms of mark-up;
 The correct formatting of text was checked and discrepancies between scanned pdf file and
text documents were detected; it was normalized in the MS Word editor;</p>
      <p>Next, the focus was on:
 Selection of text marking up system according to its features;
 Implementation of proper tags for the original work
 Implementation of proper tags for the translated version;
 Calculated texts results were processed using the Python programming language;
 Afterwards, the results of the statistical parameters, such as N, max, min, R, Mo, Md, Ẋ, Ϭ,
ν, Sẋ, ε were analyzed and described.</p>
      <p>The original text and its translation was marked up using the same marking rules.</p>
      <p>To recall, the use was made of the XML (eXtensible Markup Language) – a text markup language.
It was used to conduct research and implement on the structural level.</p>
      <p>The XML language was preffered since it fully determines the logical structure of a document.</p>
      <p>The task of the XML language is to ensure certain data: images, texts, and other parts of a Web
document; it can be defined and structured regardless of the platform used to recreate them.</p>
      <p>Since in the current paper we deal with a dramatic work, text mark up and tag patterns were
selected and adjusted for the appropriate analysis of this type of work. Thus, let us now turn our sights
to text mark-up system, peculiar to drama text.</p>
    </sec>
    <sec id="sec-4">
      <title>2.2. Mark-up pattern 2.2.1. Pattern</title>
      <p>Thus, the following text markings were chosen according to the features of the dramatic work:
&lt;chtr&gt;...&lt;/chtr&gt; — paired marking, which is used to indicate a solid whole part of the text related
to a particular character;</p>
      <p>&lt;cnm&gt;...&lt;/cnm&gt; — paired marking, which is used to indicate the name of the character with a
colon;
&lt;s&gt;...&lt;/s&gt; —— paired marking, which is used to denote a sentence in the speech of the character;
&lt;mtr&gt;...&lt;/mtr&gt; — paired marking, which is used to mark all author's remarks throughout the text
2.2.2. Example</p>
      <p>&lt;mtr&gt;The action of the play takes place outside an apartment building-on the ledge, outside
various windows of the seventh storey. As the play progresses, the lights emphasize the time elapsed
between early evening and late night. As the play opens, we hear a party in progress from one of the
windows, MAN stands on the ledge, in a state of perplexity, contemplating the depths below. He
seems disturbed, confused. Then he comes to what seems to be a resolution. He prepares to jump.
When he is about to leap, the window next to him flies open. CHARLOTTE appears. She holds a
MAN wAllet, which she attempts to throw out the window, RODNEY,charging up from behind,
grabs her hand. A window-ledge struggle ensues.&lt;/mtr&gt;
&lt;chtr&gt;&lt;cnm&gt;CHARLOTTE&lt;/cnm&gt;
&lt;s&gt;Let GO of me!!!&lt;/s&gt;&lt;s&gt; Let GO!!&lt;/s&gt;&lt;/chtr&gt;
&lt;chtr&gt;&lt;cnm&gt;RODNEY&lt;/cnm&gt;
&lt;mtr&gt;(threatening)&lt;/mtr&gt;&lt;s&gt; So-help-me-GOD, CHARLOTTE. &lt;/s&gt;&lt;/chtr&gt;
&lt;chtr&gt;&lt;cnm&gt;CHARLOTTE&lt;/cnm&gt;
&lt;mtr&gt;(daring him)&lt;/mtr&gt;&lt;s&gt; What??&lt;/s&gt;&lt;s&gt; WHAT??!! &lt;/s&gt;&lt;/chtr&gt;
&lt;chtr&gt;&lt;cnm&gt;RODNEY&lt;/cnm&gt;
&lt;s&gt;Give me back my wallet! &lt;/s&gt;&lt;/chtr&gt;
&lt;mtr&gt;She tries to throw it again. They struggle. &lt;/mtr&gt;
&lt;chtr&gt;&lt;cnm&gt;RODNEY&lt;/cnm&gt;
&lt;s&gt;What’s WRONGwith you?&lt;/s&gt;&lt;s&gt; Are you CRAZY?! &lt;/s&gt;&lt;/chtr&gt;
&lt;chtr&gt;&lt;cnm&gt;CHARLOTTE&lt;/cnm&gt;
&lt;s&gt;YES! &lt;/s&gt;&lt;s&gt;YES, I AM!!! &lt;/s&gt;&lt;/chtr&gt;
&lt;chtr&gt;&lt;cnm&gt;RODNEY&lt;/cnm&gt;
&lt;s&gt;MY GOLD CARD is in there!! &lt;/s&gt;&lt;/chtr&gt;</p>
    </sec>
    <sec id="sec-5">
      <title>3. Results</title>
      <p>This section of the study presents statistics taken from the calculation of data based on the number
of words in a sentence. That is, the unit of measurement in this statistical calculation is the word. The
findings illustrate the contrast of ST and TT results of statistical parameters, i.e. N, max, min, R, Mo,
Md, Ẋ, Ϭ, ν, .STẋ,heεschematic representation follows the data of each drama character one by one.</p>
    </sec>
    <sec id="sec-6">
      <title>3.1. Analysis of the part of the text that belongs to the drama character of "Charlotte"</title>
      <p>Having analysed the distribution of the number of words in a sentence by absolute and relevant
frequency, we have obtained such numeric characteristics:</p>
      <p>Charlotte: the whole ST data: 1 — 58 (90,62%); 2 — 4 (6,25%); 3 — 1 (1,56%); 4 — 1 (1,56%);.</p>
      <p>The data for «Charlotte» presupposes that bthseoluate frequency of sentence lengths with word
number 1 equals to 58; consequently, with word number of 2 equals to 4; with word number 3 equals
to 1; with word number of 4 equals to 1.</p>
      <p>Talking about translation, the most frequent are sentences with the number of words that equals to
1.</p>
      <p>Charlotte: the whole TT data: 1 — 35 (30,97%); 4 — 17 (15,04%); 5 — 13 (11,50%); 2 — 12
(10,62%); 6 — 12 (10,62%); 3 — 10 (8,85%); 7 — 5 (4,42%); 11 — 3 (2,65%); 9 — 2 (1,77%); 10
— 2 (1,77%); 8 — 1 (0,88%); 12 — 1 (0,88%). The last two are the least frequent.</p>
      <p>On the basis of the data above the following calculations are made of number of meanings,
maximal meaning, minimal meaning, range, mode, median, mean, standard deviation, coefficient of
variation, standard error, measurement error.</p>
      <p>Results are presented in Table 1.</p>
    </sec>
    <sec id="sec-7">
      <title>3.2. Analysis of the part of the text that belongs to the drama character of "Rodney"</title>
      <p>Having analysed the distribution of the number of words in a sentence by absolute and relevant
frequency, we have obtained such numeric characteristics:</p>
      <p>Rodney: the whole ST data: 1 — 37 (90,24%); 2 — 3 (7,32%); 3 — 1 (2,44%).</p>
      <p>The data for «Rod»neypresupposes that tahbesolute frequency of sentence lengths with word
number 1 equals to 37; consequently, with word number of 2 equals to 3; with word number 3 equals
to 1.</p>
      <p>Rodney: the whole TT data: 1 — 16 (21,05%); 2 — 13 (17,11%); 3 — 11 (14,47%); 4 — 9
(11,84%); 5 — 9 (11,84%); 6 — 7 (9,21%); 7 — 5 (6,58%); 9 — 3 (3,95%); 8 — 2 (2,63%); 10 — 1
(1,32%).</p>
      <p>Based on the data above, the following calculations are made and presented in Table 2.
Table 2 shows the following results:</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 41; maximal meaning (max) — 3;
minimal meaning (min) — 1; range (R) — 2; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,12; standard deviation (Ϭ) — 0,39; coefficient of variation (ν) — 0,3519; standard error (Sẋ) —
0,0617; measurement error (ε) — 0,1077.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 76; maximal meaning (max) — 10;
minimal meaning (min) — 1; range (R) — 9; mode (Mo) — 1; median (Md) — 5,5; mean (Ẋ) —
3,76; standard deviation (Ϭ) — 2,37; coefficient of variation (ν) — 0,6304; standard error (Sẋ) —
0,2721; measurement error (ε) — 0,1417.</p>
    </sec>
    <sec id="sec-8">
      <title>3.3. Analysis of the part of the text that belongs to the drama character of "Man"</title>
      <p>By analogue to the previous characters (Charlotte and Rondey) we obtain the results for other
characters; here – Man.</p>
      <p>Man: the whole ST data: 1 — 228 (87,36%); 2 — 27 (10,34%); 3 — 6 (2,30%).</p>
      <p>Thus, the data for «Man» states thbastoltuhte fraequency of sentence lengths with word number
1 equals to 228; consequently, with word number of 2 equals to 27; with word number 3 equals to 6.</p>
      <p>Man: the whole TT data: 1 — 99 (18,50%); 3 — 90 (16,82%); 4 — 78 (14,58%); 2 — 61
(11,40%); 5 — 48 (8,97%); 6 — 47 (8,79%); 7 — 37 (6,92%); 8 — 18 (3,36%); 9 — 16 (2,99%); 10
— 9 (1,68%); 11 — 8 (1,50%); 12 — 7 (1,31%); 15 — 4 (0,75%); 13 — 3 (0,56%); 16 — 3 (0,56%);
18 — 2 (0,37%); 14 — 1 (0,19%); 17 — 1 (0,19%); 19 — 1 (0,19%); 23 — 1 (0,19%); 27 — 1
(0,19%).</p>
      <p>Next, we have calculated number of meanings, maximal meaning, minimal meaning, range, mode,
median, mean, standard deviation, coefficient of variation, standard error, measurement error. The
results are demonstrated in Table 3.</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 261; maximal meaning (max) — 3;
minimal meaning (min) — 1; range (R) — 2; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,15; standard deviation (Ϭ) — 0,42; coefficient of variation (ν) — 0,3619; standard error (Sẋ) —
0,0258; measurement error (ε) — 0,0439.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 535; maximal meaning (max) — 27;
minimal meaning (min) — 1; range (R) — 26; mode (Mo) — 1; median (Md) — 11,0; mean (Ẋ) —
4,52; standard deviation (Ϭ) — 3,47; coefficient of variation (ν) — 0,7678; standard error (Sẋ) —
0,1500; measurement error (ε) — 0,0651.</p>
    </sec>
    <sec id="sec-9">
      <title>3.4. Analysis of the part of the text that belongs to the drama character of "Leonard"</title>
      <p>By analogue to the previous characters we obtain the results for the character – Leonard.
Leonard: the whole ST data: 1 — 92 (86,79%); 2 — 12 (11,32%); 3 — 2 (1,89%).</p>
      <p>Leonard: the whole TT data: 1 — 30 (14,49%); 5 — 28 (13,53%); 2 — 27 (13,04%); 3 — 27
(13,04%); 4 — 24 (11,59%); 6 — 23 (11,11%); 8 — 15 (7,25%); 7 — 8 (3,86%); 9 — 5 (2,42%); 10
— 5 (2,42%); 12 — 4 (1,93%); 14 — 3 (1,45%); 13 — 2 (0,97%); 16 — 2 (0,97%); 17 — 2 (0,97%);
11 — 1 (0,48%); 19 — 1 (0,48%).
ST data numeric characteristic:</p>
      <p>Number of meanings (N) — 106; maximal meaning (max) — 3; minimal meaning (min) — 1;
range (R) — 2; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) — 1,15; standard deviation (Ϭ) —
0,41; coefficient of variation (ν) — 0,3539; standard error (Sẋ) — 0,0396; measurement error (ε) —
0,0674.</p>
      <p>TT data numeric characteristic:</p>
      <p>Number of meanings (N) — 207; maximal meaning (max) — 19; minimal meaning (min) — 1;
range (R) — 18; mode (Mo) — 1; median (Md) — 9,0; mean (Ẋ) — 4,94; standard deviation (Ϭ) —
3,53; coefficient of variation (ν) — 0,7148; standard error (Sẋ) — 0,2453; measurement error (ε) —
0,0974.</p>
    </sec>
    <sec id="sec-10">
      <title>3.5. Analysis of the part of the text that belongs to the drama character of "Jennifer"</title>
      <p>Jennifer: the whole ST data: 1 — 21 (84,00%); 2 — 3 (12,00%); 6 — 1 (4,00%);.</p>
      <p>Jennifer: the whole TT data: 6 — 5 (19,23%); 4 — 4 (15,38%); 2 — 3 (11,54%); 3 — 3
(11,54%); 5 — 2 (7,69%); 9 — 2 (7,69%); 1 — 1 (3,85%); 7 — 1 (3,85%); 8 — 1 (3,85%); 10 — 1
(3,85%); 11 — 1 (3,85%); 14 — 1 (3,85%); 15 — 1 (3,85%).</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 25; maximal meaning (max) — 6;
minimal meaning (min) — 1; range (R) — 5; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,32; standard deviation (Ϭ) — 1,01; coefficient of variation (ν) — 0,7642; standard error (Sẋ) —
0,2018; measurement error (ε) — 0,2996.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 26; maximal meaning (max) — 15;
minimal meaning (min) — 1; range (R) — 14; mode (Mo) — 6; median (Md) — 7,0; mean (Ẋ) —
5,96; standard deviation (Ϭ) — 3,55; coefficient of variation (ν) — 0,5948; standard error (Sẋ) —
0,6955; measurement error (ε) — 0,2287.</p>
    </sec>
    <sec id="sec-11">
      <title>3.6. Analysis of the part of the text that belongs to the drama character of "Marshall"</title>
      <p>Marshall: the whole ST data: 1 — 94 (85,45%); 2 — 15 (13,64%); 4 — 1 (0,91%).</p>
      <p>Marshall: the whole TT data: 2 — 31 (15,74%); 4 — 27 (13,71%); 3 — 26 (13,20%); 5 — 25
(12,69%); 6 — 21 (10,66%); 8 — 16 (8,12%); 7 — 11 (5,58%); 9 — 11 (5,58%); 1 — 9 (4,57%); 10
— 7 (3,55%); 11 — 6 (3,05%); 12 — 2 (1,02%); 16 — 2 (1,02%); 17 — 2 (1,02%); 23 — 1 (0,51%).</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 110; maximal meaning (max) — 4;
minimal meaning (min) — 1; range (R) — 3; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,16; standard deviation (Ϭ) — 0,44; coefficient of variation (ν) — 0,3760; standard error (Sẋ) —
0,0417; measurement error (ε) — 0,0703.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 197; maximal meaning (max) — 23;
minimal meaning (min) — 1; range (R) — 22; mode (Mo) — 2; median (Md) — 8,0; mean (Ẋ) —
5,39; standard deviation (Ϭ) — 3,38; coefficient of variation (ν) — 0,6279; standard error (Sẋ) —
0,2409; measurement error (ε) — 0,0877.</p>
    </sec>
    <sec id="sec-12">
      <title>3.7. Analysis of the part of the text that belongs to the drama character of "Joan"</title>
      <p>Joan: the whole ST data: 1 — 43 (84,31%); 2 — 7 (13,73%); 3 — 1 (1,96%);.</p>
      <p>Joan: the whole TT data: 3 — 16 (16,49%); 4 — 16 (16,49%); 1 — 13 (13,40%); 5 — 12
(12,37%); 2 — 10 (10,31%); 7 — 10 (10,31%); 6 — 6 (6,19%); 9 — 4 (4,12%); 8 — 3 (3,09%); 12
— 3 (3,09%); 11 — 1 (1,03%); 14 — 1 (1,03%); 17 — 1 (1,03%); 18 — 1 (1,03%).</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 51; maximal meaning (max) — 3;
minimal meaning (min) — 1; range (R) — 2; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,18; standard deviation (Ϭ) — 0,43; coefficient of variation (ν) — 0,3651; standard error (Sẋ) —
0,0602; measurement error (ε) — 0,1002.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 97; maximal meaning (max) — 18;
minimal meaning (min) — 1; range (R) — 17; mode (Mo) — 3; median (Md) — 7,5; mean (Ẋ) —
4,81; standard deviation (Ϭ) — 3,35; coefficient of variation (ν) — 0,6958; standard error (Sẋ) —
0,3402; measurement error (ε) — 0,1385.</p>
      <p>Unit</p>
    </sec>
    <sec id="sec-13">
      <title>3.8. Analysis of the part of the text that belongs to the drama character of "Michael"</title>
      <p>ST data numeric characteristic: Number of meanings (N) — 37; maximal meaning (max) — 2;
minimal meaning (min) — 1; range (R) — 1; mode (Mo) — 1; median (Md) — 1,5; mean (Ẋ) —
1,08; standard deviation (Ϭ) — 0,27; coefficient of variation (ν) — 0,2525; standard error (Sẋ) —
0,0449; measurement error (ε) — 0,0814.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 55; maximal meaning (max) — 12;
minimal meaning (min) — 1; range (R) — 11; mode (Mo) — 4; median (Md) — 5,5; mean (Ẋ) —
5,16; standard deviation (Ϭ) — 2,57; coefficient of variation (ν) — 0,4979; standard error (Sẋ) —
0,3467; measurement error (ε) — 0,1316.</p>
    </sec>
    <sec id="sec-14">
      <title>3.9. Analysis of the part of the text that belongs to the drama character of "Rachel"</title>
      <p>Rachel: the whole ST data: 1 — 53 (91,38%); 2 — 5 (8,62%).</p>
      <p>Rachel: the whole TT data: 4 — 18 (15,00%); 5 — 14 (11,67%); 7 — 14 (11,67%); 3 — 12
(10,00%); 2 — 11 (9,17%); 6 — 11 (9,17%); 1 — 10 (8,33%); 8 — 5 (4,17%); 9 — 4 (3,33%); 11 —
4 (3,33%); 10 — 3 (2,50%); 12 — 3 (2,50%); 13 — 3 (2,50%); 14 — 3 (2,50%); 16 — 3 (2,50%); 15
— 1 (0,83%); 20 — 1 (0,83%).</p>
      <p>N
max
min</p>
      <p>R
Mo
Md
Ẋ
Ϭ
ν
Sẋ
ε</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 58; maximal meaning (max) — 2;
minimal meaning (min) — 1; range (R) — 1; mode (Mo) — 1; median (Md) — 1,5; mean (Ẋ) —
1,09; standard deviation (Ϭ) — 0,28; coefficient of variation (ν) — 0,2584; standard error (Sẋ) —
0,0369; measurement error (ε) — 0,0665.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 120; maximal meaning (max) — 20;
minimal meaning (min) — 1; range (R) — 19; mode (Mo) — 4; median (Md) — 9,0; mean (Ẋ) —
6,03; standard deviation (Ϭ) — 3,94; coefficient of variation (ν) — 0,6529; standard error (Sẋ) —
0,3596; measurement error (ε) — 0,1168.</p>
    </sec>
    <sec id="sec-15">
      <title>3.10. Analysis of the part of the text that belongs to the drama character of "Percy"</title>
      <p>Percy: the whole ST data: 1 — 34 (80,95%); 2 — 7 (16,67%); 3 — 1 (2,38%).</p>
      <p>Percy: the whole TT data: 6 — 12 (16,44%); 3 — 11 (15,07%); 4 — 10 (13,70%); 5 — 7 (9,59%);
1 — 6 (8,22%); 2 — 5 (6,85%); 7 — 4 (5,48%); 8 — 4 (5,48%); 9 — 3 (4,11%); 11 — 3 (4,11%); 14
— 3 (4,11%); 10 — 2 (2,74%); 12 — 1 (1,37%); 18 — 1 (1,37%); 23 — 1 (1,37%).</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 42; maximal meaning (max) — 3;
minimal meaning (min) — 1; range (R) — 2; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,21; standard deviation (Ϭ) — 0,46; coefficient of variation (ν) — 0,3827; standard error (Sẋ) —
0,0717; measurement error (ε) — 0,1158.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 73; maximal meaning (max) — 23;
minimal meaning (min) — 1; range (R) — 22; mode (Mo) — 6; median (Md) — 8,0; mean (Ẋ) —
5,90; standard deviation (Ϭ) — 4,04; coefficient of variation (ν) — 0,6839; standard error (Sẋ) —
0,4726; measurement error (ε) — 0,1569.</p>
    </sec>
    <sec id="sec-16">
      <title>3.11. Analysis of the part of the text that belongs to the drama character of "Al"</title>
      <p>ST data numeric characteristic: Number of meanings (N) — 31; maximal meaning (max) — 3;
minimal meaning (min) — 1; range (R) — 2; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,32; standard deviation (Ϭ) — 0,59; coefficient of variation (ν) — 0,4457; standard error (Sẋ) —
0,1059; measurement error (ε) — 0,1569.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 58; maximal meaning (max) — 16;
minimal meaning (min) — 1; range (R) — 15; mode (Mo) — 6; median (Md) — 7,5; mean (Ẋ) —
5,40; standard deviation (Ϭ) — 3,41; coefficient of variation (ν) — 0,6316; standard error (Sẋ) —
0,4476; measurement error (ε) — 0,1626.</p>
    </sec>
    <sec id="sec-17">
      <title>3.12. Analysis of the part of the text that belongs to the drama character of "Nurse Wilson"</title>
      <p>Nurse Wilson: the whole ST data: 1 — 42 (87,50%); 2 — 5 (10,42%); 3 — 1 (2,08%);.</p>
      <p>Nurse Wilson: the whole TT data: 3 — 10 (13,16%); 4 — 10 (13,16%); 1 — 9 (11,84%); 5 — 9
(11,84%); 2 — 7 (9,21%); 6 — 6 (7,89%); 7 — 6 (7,89%); 12 — 4 (5,26%); 8 — 3 (3,95%); 9 — 3
(3,95%); 11 — 2 (2,63%); 13 — 2 (2,63%); 18 — 2 (2,63%); 10 — 1 (1,32%); 17 — 1 (1,32%); 23
— 1 (1,32%).</p>
      <p>N
max
min</p>
      <p>R
Mo
Md
Ẋ
Ϭ
ν
Sẋ
ε</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 48; maximal meaning (max) — 3;
minimal meaning (min) — 1; range (R) — 2; mode (Mo) — 1; median (Md) — 2,0; mean (Ẋ) —
1,15; standard deviation (Ϭ) — 0,41; coefficient of variation (ν) — 0,3558; standard error (Sẋ) —
0,0588; measurement error (ε) — 0,1007.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 76; maximal meaning (max) — 23;
minimal meaning (min) — 1; range (R) — 22; mode (Mo) — 3; median (Md) — 8,5; mean (Ẋ) —
5,91; standard deviation (Ϭ) — 4,48; coefficient of variation (ν) — 0,7586; standard error (Sẋ) —
0,5141; measurement error (ε) — 0,1705.</p>
    </sec>
    <sec id="sec-18">
      <title>3.13. Analysis of the part of the text that belongs to the drama character of "Lilian"</title>
      <p>Lilian: the whole ST data: 1 — 68 (91,89%); 2 — 6 (8,11%);.</p>
      <p>Lilian: the whole TT data: 2 — 23 (14,94%); 4 — 19 (12,34%); 3 — 18 (11,69%); 5 — 17
(11,04%); 1 — 14 (9,09%); 6 — 13 (8,44%); 10 — 12 (7,79%); 8 — 10 (6,49%); 7 — 9 (5,84%); 9
— 7 (4,55%); 11 — 3 (1,95%); 13 — 2 (1,30%); 16 — 2 (1,30%); 17 — 2 (1,30%); 12 — 1 (0,65%);
14 — 1 (0,65%); 18 — 1 (0,65%).</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 74; maximal meaning (max) — 2;
minimal meaning (min) — 1; range (R) — 1; mode (Mo) — 1; median (Md) — 1,5; mean (Ẋ) —
1,08; standard deviation (Ϭ) — 0,27; coefficient of variation (ν) — 0,2525; standard error (Sẋ) —
0,0317; measurement error (ε) — 0,0575.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 154; maximal meaning (max) — 18;
minimal meaning (min) — 1; range (R) — 17; mode (Mo) — 2; median (Md) — 9,0; mean (Ẋ) —
5,51; standard deviation (Ϭ) — 3,69; coefficient of variation (ν) — 0,6704; standard error (Sẋ) —
0,2975; measurement error (ε) — 0,1059.</p>
    </sec>
    <sec id="sec-19">
      <title>3.14. Analysis of the part of the text that belongs to the secondary drama</title>
      <p>characters</p>
    </sec>
    <sec id="sec-20">
      <title>3.14.1. Character "One"</title>
      <p>One: the whole ST data: 1 — 1 (100,00%). One: the whole TT data: 4 — 2 (40,00%); 3 — 1
(20,00%); 5 — 1 (20,00%); 6 — 1 (20,00%).</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 1; maximal meaning (max) — 1;
minimal meaning (min) — 1; range (R) — 0; mode (Mo) — 1; median (Md) — 1,0; mean (Ẋ) —
1,00; standard deviation (Ϭ) — 0,00; coefficient of variation (ν) — 0,0000; standard error (Sẋ) —
0,0000; measurement error (ε) — 0,0000.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 5; maximal meaning (max) — 6;
minimal meaning (min) — 3; range (R) — 3; mode (Mo) — 4; median (Md) — 4,5; mean (Ẋ) —
4,40; standard deviation (Ϭ) — 1,02; coefficient of variation (ν) — 0,2318; standard error (Sẋ) —
0,4561; measurement error (ε) — 0,2032.</p>
    </sec>
    <sec id="sec-21">
      <title>3.14.2. Character "Two"</title>
      <p>Two: the whole ST data: 1 — 2 (66,67%); 2 — 1 (33,33%). Two: the whole TT data: 4 — 2
(33,33%); 8 — 2 (33,33%); 5 — 1 (16,67%); 10 — 1 (16,67%).</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 3; maximal meaning (max) — 2;
minimal meaning (min) — 1; range (R) — 1; mode (Mo) — 1; median (Md) — 1,5; mean (Ẋ) —
1,33; standard deviation (Ϭ) — 0,47; coefficient of variation (ν) — 0,3536; standard error (Sẋ) —
0,2722; measurement error (ε) — 0,4001.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 6; maximal meaning (max) — 10;
minimal meaning (min) — 4; range (R) — 6; mode (Mo) — 4; median (Md) — 6,5; mean (Ẋ) —
6,50; standard deviation (Ϭ) — 2,29; coefficient of variation (ν) — 0,3525; standard error (Sẋ) —
0,9354; measurement error (ε) — 0,2821.</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 5; maximal meaning (max) — 2;
minimal meaning (min) — 1; range (R) — 1; mode (Mo) — 1; median (Md) — 1,5; mean (Ẋ) —
1,20; standard deviation (Ϭ) — 0,40; coefficient of variation (ν) — 0,3333; standard error (Sẋ) —
0,1789; measurement error (ε) — 0,2922.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 5; maximal meaning (max) — 8;
minimal meaning (min) — 3; range (R) — 5; mode (Mo) — 4; median (Md) — 5,0; mean (Ẋ) —
5,00; standard deviation (Ϭ) — 1,79; coefficient of variation (ν) — 0,3578; standard error (Sẋ) —
0,8000; measurement error (ε) — 0,3136.</p>
    </sec>
    <sec id="sec-22">
      <title>3.14.4. Character "Four"</title>
      <p>Four: the whole ST data: 1 — 2 (100,00%).</p>
      <p>Four: the whole TT data: 4 — 2 (50,00%); 1 — 1 (25,00%); 2 — 1 (25,00%).
Character’s name</p>
      <p>Charlotte
Rodney</p>
      <p>Man
Leonard
Jennifer
Marshal</p>
      <p>Joan
Michael
Rachel
Percy</p>
      <p>Al
Nurse Wilson</p>
      <p>Lilian
One
Two
Three</p>
      <p>Four</p>
      <p>ST data numeric characteristic: Number of meanings (N) — 2; maximal meaning (max) — 1;
minimal meaning (min) — 1; range (R) — 0; mode (Mo) — 1; median (Md) — 1,0; mean (Ẋ) —
1,00; standard deviation (Ϭ) — 0,00; coefficient of variation (ν) — 0,0000; standard error (Sẋ) —
0,0000; measurement error (ε) — 0,0000.</p>
      <p>TT data numeric characteristic: Number of meanings (N) — 4; maximal meaning (max) — 4;
minimal meaning (min) — 1; range (R) — 3; mode (Mo) — 4; median (Md) — 2,0; mean (Ẋ) —
2,75; standard deviation (Ϭ) — 1,30; coefficient of variation (ν) — 0,4724; standard error (Sẋ) —
0,6495; measurement error (ε) — 0,4629.</p>
    </sec>
    <sec id="sec-23">
      <title>4. Comparative analysis of word distribution in sentences</title>
    </sec>
    <sec id="sec-24">
      <title>4.1. Difference in the Number of meanings (N) in ST and TT</title>
      <p>Given form the results above that the translated variant statistical parameters data exceeds the
original drama in the majority of cases, we now turn our sights to one parameter – Number of
meanings (N). We tend to compare the data and find the difference (if present). Our assumption 1 is
that the TT is much longer in terms of word usage within the sentence.</p>
      <p>ST
64
41
261
106
25
110
51
37
58
42
31
48
74
1
3
5
2</p>
      <p>TT
113
76
535
207
26
197
97
55
120
73
58
76
154
5
6
5
4</p>
      <p>Difference
+49
+35
+274
+101
+1
+87
+46
+18
+62
+31
+27
+28
+80
+4
+3
0
+2</p>
      <p>To recall, character "Man" is the protagonist and the main character of the play. He is a
welldressed gentleman who is willing to jump off the seventh story.</p>
      <p>He has a number of conversations with the residents of the building. He feels lost and compelled to
stand on the seventh story of the building. Taking into account the results of Figure 1 we hold
assumption 2 that the translator adds a considerable number of words (274), or, he rather, doubles the
ST quantity, due to a number of reasons:
 to explain the original;
 to compensate literary imagery losses;
 to add something from the translator himself, to recreate, so to say, the original;
 due to structural and lexico-gramatical allomorphic features of a language pair.</p>
      <p>Whatever reason stands behind this translator’s decis-imonaking, it is a prosperous ground for
further Translation Studies analysis.</p>
    </sec>
    <sec id="sec-25">
      <title>4.2. Analysis of the whole text</title>
      <p>Here we focus on statistical parameters with the defined unit of measurement – a word. The
number of words in a drama text utterunces is important due to a couple of reasons:
 the length of lines of the written script;
 chronometry and metrics of the whole drama act;
 pithiness and iconicity of each phrase.</p>
      <p>Below are the results on the distribution of the number of words in a TT sentence by absolute and
relevant frequency.</p>
      <p>The most frequent are sentences in the translated text with the number of words 4 – 259 (14,2%),
1 – 255(13,98%), 3– 255(13,98%), 2 – 219(12,01%) 5 – 205(11,24%), 6 -182 (9,98), 7-121 (6,63%),
8 – 84(4,61), 9- 62 (3,4%), 10 – 49 (2,69%), 11 – 32 (1,75), 12 – 31 (1,7%), 14 – 14 (0,77%), 13 – 13
(0,71%), 16 – 13(0,71%), 17 – 9 (0,49%), 18 – 7 (0,38%), 15 – 6 (0,33%), 23 – 4 (0,22%), 19 – 2
(0,11%), 20 – 1 (0,05%), 27 – 1 (0,05%). The last two results are the least frequent.</p>
      <p>In the following Figure 2 we can see a comparison of the number of words in the sentences of the
whole TT drama work.</p>
      <p>The x-axis is the number of sentences, and the y-axis is the number of words in a sentence.</p>
    </sec>
    <sec id="sec-26">
      <title>5. Conclusions</title>
      <p>The main advances of statistical linguistics have been retrieved in the article. The original
Canadian play has been compared with the corresponding translated text in terms of statistical
parameters, which has never been done before.</p>
      <p>The paper is of practical and applied value; however, the scientific value of the paper is seen as
such that the suggested approach and methods will eventually allow formulating and substantiating a
plausible scientific hypothesis in the realm of statistical linguistics and translation studies. At this
point it is proven that bilingual drama texts are well adoptable for NLP and reveal promising
outcomes.</p>
      <p>We have verified absolute and relevant distribution, probability measurement, also: N, max, min,
R, Mo, Md, Ẋ, Ϭ, ν, Sẋ, ε in the sentences of both texts.</p>
      <p>Specifically designed software, which is represented as a combination of XML markup language,
Microsoft Excel spreadsheet, and Python programming language, has been used. Results of statistical
calculations of the drama ―7 stories‖ by Morris Panych by unitwoorfd amreeapsruerseented in the
corresponding Tables 1 – 17.</p>
      <p>Structural recognition provides useful information about the characters of the play, original and
translation, namely the length of the sentence in word units that will help with further comparisons of
ST and TT. The quantitative characteristics of the original play and its Ukrainian translation on the
lexical level relying on the linguistic statistical analysis have been clarified: the amount of translated
text Numbers of meaning (N) exceeds considerably and demands further analysis. The discrepancy
becomes obvious with number of characters (Man, Leonard, Marshal, Lilian)</p>
      <p>The correlation of coefficients has been presented in tables and figures to illustrate the material
under research.</p>
      <p>The prospect of the study is to further explore the problems of translator’s
which resulted in the declared above data.
meaningful choices</p>
    </sec>
    <sec id="sec-27">
      <title>6. Acknowledgement</title>
      <p>The project has been carried out within the complex academAipcplitcoaptiocn ―of modern
technologies for optimization of information processes in natural langLuavgive‖Poalyttechnic
National University. At the initial stage the project underwent the consultancy of Ihor Kulchytskyy, to
whom we express our gratitude.
7. References
International</p>
      <p>S. Shaheen, and M. Spruit. Full-Text or Abstract? Examining Topic Coherence
Scores Using Latent Dirichlet Allocation. 2017 IEEE International Conference on Data
Science and Advanced Analytics (DSAA), 2017, doi:10.1109/dsaa.2017.61.</p>
      <p>Topic Modeling in Python with Gensim. Machine Learning Plus, 16 Apr. 2020, URL:
www.machinelearningplus.com/nlp/topic-modeling-gensim-python.</p>
      <p>K. Aguilar, NLP Techniques with Shakespeare’s Plays: Cleaning and Classifying
Text with the Bard, 2020. URL:
https://medium.com/analytics-vidhya/nlp-techniques-withshakespeares-plays-d8843ba26a4f.</p>
      <p>O. Levchenko, M. Dilai, (2019) Attitudes Toward Feminism in Ukraine: A
Sentiment Analysis of Tweets. In: Shakhovska N., Medykovskyy M. (eds) Advances in
Intelligent Systems and Computing III. CSIT 2018. Advances in Intelligent Systems and
Computing, vol 871. Springer, Cham. doi:10.1007/978-3-030-01069-0_9</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Panych</surname>
          </string-name>
          , Seven Stories, Vancouver: Talonbooks,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M.</given-names>
            <surname>Panych</surname>
          </string-name>
          ,
          <volume>7</volume>
          <fpage>istorii</fpage>
          , [per. Z anhliiskoi Ivana Krychfalushiia],
          <source>Brusturiv: Dyskursus</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Laviosa</surname>
          </string-name>
          (Ed.),
          <source>Corpus-based Translation Studies: Theory</source>
          , Findings, Applications, Rodopy,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>K. H.</given-names>
            <surname>Chen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H. H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Aligning bilingual corpora especially for language pairs from different families</article-title>
          .
          <source>Information Sciences Applications</source>
          ,
          <year>1995</year>
          ,
          <volume>42</volume>
          , pp.
          <fpage>57</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Munday</surname>
          </string-name>
          ,
          <article-title>A Computer-assisted approach to the Analysis of Translation Shifts</article-title>
          , Meta,
          <year>1998</year>
          , XLIII, 4.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>F.</given-names>
            <surname>Zanettin</surname>
          </string-name>
          ,
          <article-title>Parallel corpora in translation studies: Issues in corpus design and analysis</article-title>
          .
          <source>In Intercultural Faultlines. Research Models in Translation Studies I: Textual and Cognitive Aspects</source>
          , ed. M. Olohan, pp.
          <fpage>105</fpage>
          -
          <lpage>118</lpage>
          . Manchester: St. Jerome,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Allen</surname>
          </string-name>
          , Natural Language Understanding. Cummings Publishing Company, Redwood City,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Barnard</surname>
          </string-name>
          , et al.
          <article-title>―SG-MBLased Markup for Literary Texts: Two Problems and Some Solutions.‖ Computers and the Humanities</article-title>
          , vol.
          <volume>22</volume>
          , no.
          <issue>4</issue>
          ,
          <issue>1988</issue>
          , pp.
          <fpage>265</fpage>
          -
          <lpage>276</lpage>
          . JSTOR, URL: www.jstor.org/stable/30200136. Accessed 28 Feb.
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Blackburn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kohlhase</surname>
          </string-name>
          , &amp;
          <string-name>
            <surname>H. De Nivelle</surname>
          </string-name>
          ,
          <article-title>Inference and computational semantics</article-title>
          .
          <source>In Computing Meaning</source>
          , Springer Netherlands,
          <year>2001</year>
          , pp.
          <fpage>11</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>I.</given-names>
            <surname>Dagan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>O.</given-names>
            <surname>Glickman</surname>
          </string-name>
          ,
          <article-title>Probabilistic textual entailment: generic applied modeling of language variability</article-title>
          .
          <source>In Proceedings of the PACAL Workshop on Learning Methods for Text Understanding and Mining</source>
          , Grenoble, France,
          <year>2004</year>
          , pp.
          <fpage>26</fpage>
          -
          <lpage>29</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Dale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Moisl</surname>
          </string-name>
          , H. Somers (Eds.),
          <article-title>Handbook of natural language processing</article-title>
          . CRC press,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>M.</given-names>
            <surname>Dilai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Levchenko</surname>
          </string-name>
          ,
          <source>Discourses Surrounding Feminism in Ukraine: A Sentiment Analysis of Twitter Data 2018 IEEE 13th International Scientific and Technical Conference on Computer Sciences and Information Technologies, CSIT 2018 - Proceedings 2018 | conference-paper doi: 10</source>
          .1109/STC-CSIT.
          <year>2018</year>
          .8526694
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Hogan</surname>
          </string-name>
          ,
          <source>The Web of Data</source>
          . Springer,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>V.</given-names>
            <surname>Lytvyn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Vysotska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hamon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Grabar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Sharonova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Cherednichenko</surname>
          </string-name>
          ,
          <string-name>
            <surname>O.</surname>
          </string-name>
          Kanishcheva (Eds.),
          <source>Computational Linguistics and Intelligent Systems. Proc. 4thInt. Conf. COLINS 2020</source>
          . Volume I:Workshop. Lviv, Ukraine,
          <source>April 23-24</source>
          ,
          <year>2020</year>
          , CEURWS.org, online
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Marcus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Santorini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Marcinkiewicz</surname>
          </string-name>
          ,
          <article-title>Building a Large Annotated Corpus of English: Penn TreeBank</article-title>
          . Computational linguistics: Special Issue on Using Large Corpora,
          <year>1993</year>
          ,
          <volume>19</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>313</fpage>
          -
          <lpage>330</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Matthews</surname>
          </string-name>
          ,
          <article-title>An Introduction to Natural Language Processing Through Prolog</article-title>
          , Routledge: London and New York,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Oakes</surname>
          </string-name>
          ,
          <article-title>Sentence and word alignment in the CARTER project</article-title>
          .
          <source>In Using Corpora for Language Research</source>
          , ed. J.
          <string-name>
            <surname>Thomas</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Short</surname>
          </string-name>
          , London: Longman,
          <year>1996</year>
          , pp.
          <fpage>211</fpage>
          -
          <lpage>233</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>P.</given-names>
            <surname>Pavis</surname>
          </string-name>
          , Theatre at the Crossroads of Culture, Routledge,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bassnett</surname>
          </string-name>
          ,
          <article-title>Translating for the Theatre: The Case Against Performability</article-title>
          . TTR : traduction, terminologie, rédaction,
          <year>1991</year>
          ,
          <volume>4</volume>
          (
          <issue>1</issue>
          ),pp.
          <fpage>99</fpage>
          -
          <lpage>111</lpage>
          . URL: https://doi.org/10.7202/037084ar.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bassnett</surname>
          </string-name>
          ,
          <article-title>Still Trapped in the Labyrinth: Further Reflections on Translation and Theatre</article-title>
          , Constructing Cultures: Essays on
          <string-name>
            <given-names>Literary</given-names>
            <surname>Translation</surname>
          </string-name>
          .-Multilingual
          <string-name>
            <surname>Matters</surname>
          </string-name>
          ,
          <year>1998</year>
          , pp.
          <fpage>90</fpage>
          -
          <lpage>108</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>T.H.</given-names>
            <surname>Howard-Hill</surname>
          </string-name>
          ,
          <article-title>Modern Textual Theories and the Editing of Plays. The Library, 6th ser</article-title>
          .,
          <year>1989</year>
          ,
          <volume>11</volume>
          , pp.
          <fpage>89</fpage>
          -
          <lpage>115</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>M.</given-names>
            <surname>Issacharoff</surname>
          </string-name>
          , F. Robin Jones (Eds.), Performing Texts. Philadelphia: University of Pennsylvania Press,
          <year>1988</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lavagnino</surname>
          </string-name>
          ,
          <string-name>
            <surname>E. Mylonas,</surname>
          </string-name>
          <article-title>The show must go on: Problems of tagging performance texts</article-title>
          .
          <source>Comput Hum</source>
          ,
          <year>1995</year>
          , pp.
          <fpage>113</fpage>
          -
          <lpage>121</lpage>
          . URL: https://doi.org/10.1007/BF01830705
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <article-title>Corpus-based Language Studies: An Advanced Resource Book</article-title>
          , ed. T.
          <string-name>
            <surname>McEnery</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Xiao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Tono</surname>
          </string-name>
          , Routledge,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>N.</given-names>
            <surname>Dershowitz</surname>
          </string-name>
          , E. Nissan (Eds.), Language, Culture, Computation:
          <article-title>Computing for the Humanities</article-title>
          ,
          <source>Law and Narratives</source>
          . Springer,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>O.</given-names>
            <surname>Levchenko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Tyshchenko</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Dilai</surname>
          </string-name>
          .
          <article-title>Associative Verbal Network of the Conceptual Domain БІДА (MISERY) in Ukrainian</article-title>
          .
          <source>Proceedings of the 4th Conference on Computational Linguistics and Intelligent Systems (COLINS</source>
          <year>2020</year>
          ). Volume I: Main Conference. URL: http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2604</volume>
          /
          <article-title>Associative Verbal Network of the Conceptual Domain БІДА (MISERY</article-title>
          ) in Ukrainian
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>N.</given-names>
            <surname>Shakhovska</surname>
          </string-name>
          , and M. Medykovskyy (Eds),
          <source>Advances in Intelligent Systems and Computing III: Selected papers from the International Conference on Computer Science and Information Technologies, CSIT 2018, September</source>
          <volume>11</volume>
          -14 Lviv, Ukraine. Springer: Springer Nature Switzerland,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>C.M. Sperberg-McQueen</surname>
          </string-name>
          ,
          <article-title>Text in the Electronic Age: Textual Study and Text Encoding, with Examples from Medieval texts</article-title>
          .
          <source>Literary and Linguistic Computing</source>
          ,
          <volume>6</volume>
          (
          <year>1991</year>
          ), pp.
          <fpage>34</fpage>
          -
          <lpage>46</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <surname>C.M. Sperberg-McQueen</surname>
          </string-name>
          ,
          <article-title>and B</article-title>
          .
          <string-name>
            <surname>Lou</surname>
          </string-name>
          (Eds.),
          <article-title>Guidelines for Electronic Text Encoding and Interchange (TEI P3)</article-title>
          .
          <source>Chicago and Oxford: Text Encoding Initiative</source>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>